Introduction To Gradient Descent
by Amir Ali
Understanding the Concept
Iteration: Gradient descent works by taking small steps in the direction of the negative gradient of the cost function, the direction of steepest descent toward lower cost.

Learning Rate: The learning rate determines the size of each step, and it is crucial to find the right balance to ensure convergence.

Convergence: The algorithm continues iterating until the changes in the parameters are negligible, indicating that a minimum has been found.
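These three ingredients fit in a few lines. A minimal sketch in Python, using the hypothetical cost f(x) = (x - 3)^2, whose gradient is 2(x - 3), chosen purely for illustration:

# Gradient of the illustrative cost f(x) = (x - 3)^2.
def gradient(x):
    return 2.0 * (x - 3.0)

x = 0.0              # initial guess (an assumed starting point)
learning_rate = 0.1  # step size: too large diverges, too small is slow
tolerance = 1e-8     # stop once updates become negligible

for step in range(10_000):
    update = learning_rate * gradient(x)
    x -= update                  # step against the gradient
    if abs(update) < tolerance:  # convergence: change is negligible
        break

print(x)  # approaches 3.0, the minimizer of f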
Cost Function and Optimization
2. Chain Rule
When the model is complex, the gradient is computed using the chain rule to differentiate the composite functions.

3. Numerical Approximation
In some cases, the gradient may be difficult to compute analytically, so it can be approximated numerically, as the sketch after this list shows.
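Both ideas can be sketched on a hypothetical one-sample cost J(w) = (sigmoid(w * x) - y)^2, with all values below assumed purely for illustration: the chain rule gives the gradient analytically, and a central difference approximates it numerically.

import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Composite cost for a single sample (x, y).
def cost(w, x, y):
    return (sigmoid(w * x) - y) ** 2

# Chain rule: with s = sigmoid(w * x),
# dJ/dw = 2 * (s - y) * s * (1 - s) * x.
def analytic_grad(w, x, y):
    s = sigmoid(w * x)
    return 2.0 * (s - y) * s * (1.0 - s) * x

# Numerical approximation via central difference:
# dJ/dw ≈ (J(w + h) - J(w - h)) / (2h).
def numerical_grad(w, x, y, h=1e-6):
    return (cost(w + h, x, y) - cost(w - h, x, y)) / (2.0 * h)

w, x, y = 0.5, 2.0, 1.0          # assumed values for the demo
print(analytic_grad(w, x, y))    # about -0.2115
print(numerical_grad(w, x, y))   # about -0.2115, matching the chain rule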
Updating the Parameters
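The update itself is one line per step: each parameter moves against its gradient, scaled by the learning rate (theta <- theta - learning_rate * grad). A minimal sketch, assuming NumPy and hypothetical parameter and gradient values:

import numpy as np

theta = np.array([0.5, -1.2])  # current parameters (assumed values)
grad = np.array([0.8, -0.3])   # gradient of the cost at theta (assumed)
learning_rate = 0.01           # step size

theta = theta - learning_rate * grad  # one gradient descent update
print(theta)                          # [0.492, -1.197]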
The method's trade-offs are worth keeping in mind.

Advantages:
- Efficient for large-scale optimization problems
- Works well with high-dimensional data
- Converges to the global minimum for convex functions, given a suitable learning rate
- Can be parallelized for faster computation

Limitations:
- Can get stuck in local minima for non-convex functions
- Sensitive to the choice of learning rate (illustrated after this list)
- May require many iterations to converge, especially for complex models
- May not perform well with sparse or noisy data
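That sensitivity is easy to see on f(x) = x^2, whose gradient is 2x: each update multiplies x by (1 - 2 * learning_rate), so the iterates shrink only when that factor has magnitude below 1. A small illustrative sketch:

# Run gradient descent on f(x) = x^2 from x = 1.0 for a fixed step count.
def run(learning_rate, steps=20):
    x = 1.0
    for _ in range(steps):
        x -= learning_rate * 2.0 * x  # gradient of x^2 is 2x
    return x

print(run(0.1))   # ~0.0115: converges smoothly (factor 0.8)
print(run(0.6))   # ~1e-14: oscillates but still converges (factor -0.2)
print(run(1.1))   # ~38.3: diverges (factor -1.2 grows each step)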