Module 3
Challenges in ERM:
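1. Ill-Conditioning
The Hessian matrix of the cost function may have a large (poor)
condition number, meaning curvature differs sharply between directions.
The step size is then limited by the steepest direction, so progress
along the flatter directions is slow and convergence takes many steps.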
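As a rough illustration of the slow convergence described above, the sketch below runs plain gradient descent on a toy quadratic whose Hessian is diagonal. The function name `gd_steps`, the starting point, and the tolerance are assumptions chosen for the example, not part of the module.

```python
def gd_steps(curvatures, lr, tol=1e-6, max_steps=100_000):
    """Gradient descent on f(x) = 0.5 * sum(c_i * x_i**2), starting at x = (1, ..., 1).

    The Hessian is diag(curvatures), so its condition number is
    max(curvatures) / min(curvatures). Returns the number of steps taken
    until every coordinate is within `tol` of the minimum at 0.
    """
    x = [1.0] * len(curvatures)
    for step in range(max_steps):
        if max(abs(v) for v in x) < tol:
            return step
        # The gradient of the quadratic is c_i * x_i in each coordinate.
        x = [v - lr * c * v for v, c in zip(x, curvatures)]
    return max_steps


# The step size must stay below 2 / max(curvatures) to avoid divergence,
# so the steepest direction dictates how fast the flat direction can move.
well_conditioned = gd_steps([1.0, 1.0], lr=1.0)           # condition number 1
ill_conditioned = gd_steps([1.0, 100.0], lr=1.0 / 100.0)  # condition number 100
print(well_conditioned, ill_conditioned)
```

With the same tolerance, the ill-conditioned problem needs far more steps, because the flat direction shrinks by only a factor of 0.99 per step.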
2. Local Minima
3. Saddle Points
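A saddle point is a point where the gradient is zero but which is
neither a local minimum nor a local maximum: the cost curves upward in
some directions and downward in others. Saddle points become more
common relative to local minima in high-dimensional spaces, since a
minimum requires every Hessian eigenvalue to be positive.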
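A critical point can be classified from the signs of the Hessian eigenvalues. The sketch below does this for a symmetric 2x2 Hessian using the closed-form eigenvalues. The helper name `classify_critical_point` and the two test functions are illustrative assumptions.

```python
import math


def classify_critical_point(hessian):
    """Classify a critical point from a symmetric 2x2 Hessian [[a, b], [b, d]]."""
    (a, b), (_, d) = hessian
    # Eigenvalues of a symmetric 2x2 matrix: mean +/- radius.
    mean = (a + d) / 2
    radius = math.sqrt(((a - d) / 2) ** 2 + b ** 2)
    smallest, largest = mean - radius, mean + radius
    if smallest > 0:
        return "local minimum"
    if largest < 0:
        return "local maximum"
    if smallest < 0 < largest:
        return "saddle point"
    return "degenerate"  # at least one zero eigenvalue: second-order test is inconclusive


# f(x, y) = x**2 - y**2 has zero gradient at the origin and Hessian diag(2, -2).
saddle = classify_critical_point([[2.0, 0.0], [0.0, -2.0]])
# f(x, y) = x**2 + y**2 has zero gradient at the origin and Hessian diag(2, 2).
bowl = classify_critical_point([[2.0, 0.0], [0.0, 2.0]])
print(saddle, bowl)
```

Mixed-sign eigenvalues mark the saddle: the gradient vanishes at the origin in both cases, but only the second function has a minimum there.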
6. Vanishing Gradients
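7. Poor Correspondence Between Local and Global Structure
The direction that most improves the cost locally may not point toward
regions of much lower cost, so local optimization steps may not align
with the global cost structure.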
8. Inexact Gradients
9. Long-Term Dependencies
Key Features:
2. Application:
3. Limitations:
2. Incorporation of Momentum:
4. Bias Correction:
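The momentum and bias-correction items above can be sketched as a single Adam update for one scalar parameter. This is a minimal sketch of the commonly published form of Adam, not the module's own code. The function name `adam_step` and the default hyperparameters (`lr=1e-3`, `beta1=0.9`, `beta2=0.999`, `eps=1e-8`) are assumptions taken from common practice.

```python
import math


def adam_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a scalar parameter; `t` is the 1-based step count."""
    # Momentum: exponential moving average of the gradient (first moment).
    m = beta1 * m + (1 - beta1) * grad
    # Exponential moving average of the squared gradient (second moment).
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias correction: both averages start at 0, so early estimates are
    # biased toward 0; dividing by (1 - beta**t) compensates for this.
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


# First step from param = 1.0 with gradient 4.0 and zero-initialised moments.
new_param, m1, v1 = adam_step(param=1.0, grad=4.0, m=0.0, v=0.0, t=1)
print(new_param, m1, v1)
```

On the first step the bias-corrected moments equal the gradient and its square, so the parameter moves by roughly `lr` regardless of the gradient's magnitude. Without the correction, the step would be scaled by the near-zero initial averages.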
Advantages of Adam:
Limitations: