Assignment Problem - Gradient Descent in Python
X1 (Feature)    Y (Target)
1               4.8
3               12.4
5               15.5
• The following plot shows these 3 datapoints as blue circles. Also shown is the red line (with squares), which we claim is the "best-fit line".
• The claim is that this best-fit line will have the minimum error for prediction (the predicted values are actually the red squares, hence the vertical difference is the error in prediction).
• This total difference (error) across all the datapoints is expressed as the Mean Squared Error function, which will be minimized using the Gradient Descent algorithm, discussed below.
The net objective is to find the equation of the best-fitting straight line through these 3 datapoints (listed in the table above and represented by the blue circles in the plot).
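As a quick aside, the plot described above can be reproduced with a minimal matplotlib sketch. Note the assumption here that the red best-fit line is the ordinary least-squares fit, obtained via np.polyfit:

```python
import numpy as np
import matplotlib.pyplot as plt

# The 3 datapoints from the table above
X1 = np.array([1.0, 3.0, 5.0])
Y = np.array([4.8, 12.4, 15.5])

# Least-squares best-fit line (assumed to match the red line in the plot)
w1_best, w0_best = np.polyfit(X1, Y, 1)
Y_hat = w0_best + w1_best * X1

plt.plot(X1, Y, "bo", label="Data (blue circles)")
plt.plot(X1, Y_hat, "rs-", label="Best-fit line (red squares)")
# Vertical segments: the prediction error at each datapoint
plt.vlines(X1, Y, Y_hat, colors="gray", linestyles="dashed", label="Errors")
plt.xlabel("X1 (Feature)")
plt.ylabel("Y (Target)")
plt.legend()
plt.show()
```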
$$\mathrm{MSE} = \frac{1}{N}\sum_{i=1}^{N}(\mathrm{Error}_i)^2 = \frac{1}{N}\sum_{i=1}^{N}\left(\hat{Y}_i - Y_i\right)^2 \qquad (3)$$
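Translated directly into Python, equation (3) might look like the following minimal sketch (the function name mse and its argument names are illustrative choices, not from the original):

```python
import numpy as np

# The 3 datapoints from the table above
X1 = np.array([1.0, 3.0, 5.0])
Y = np.array([4.8, 12.4, 15.5])

def mse(w0, w1, X, Y):
    """Mean Squared Error of the line Y_hat = w0 + w1*X, per equation (3)."""
    Y_hat = w0 + w1 * X           # predictions (the red squares)
    errors = Y_hat - Y            # per-point prediction errors
    return np.mean(errors ** 2)   # average of the squared errors
```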
Now the problem is to find the "optimal values" of the slope and intercept of this best-fit line, such that the Mean Squared Error (MSE) is minimum. You can easily see that the yellow line (a poor-fit line), which has non-optimal values of slope and intercept, fits the data very badly (the exact equation of the yellow line is y = x + 6, so the slope is 1 and the intercept is 6 units).
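Using the hypothetical mse helper from the sketch above, the badness of the yellow line can be quantified directly:

```python
# Yellow poor-fit line: Y_hat = 1*X1 + 6 (slope 1, intercept 6)
print(mse(w0=6.0, w1=1.0, X=X1, Y=Y))   # -> about 12.22
```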
How will we get the optimal values of the slope and intercept? They are obtained by iteratively updating the weights with the following rules:
$$w_0^{k+1} = w_0^{k} - \alpha \sum_{i=1}^{N} \left(\hat{Y}_i - Y_i\right) \qquad (4)$$

$$w_1^{k+1} = w_1^{k} - \alpha \sum_{i=1}^{N} \left[\left(\hat{Y}_i - Y_i\right) X_{1i}\right] \qquad (5)$$
where $w_0^k$ and $w_1^k$ represent the values of the intercept and the slope of the linear fit in the $k$th iteration, whereas $w_0^{k+1}$ and $w_1^{k+1}$ represent the values of the intercept and the slope in the $(k+1)$th iteration (the next iteration). $w_0$ and $w_1$ are also called the model weights or model coefficients.
$\alpha$ represents the learning rate.
The derivation of the above equations will be covered in the machine learning lessons.
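Pending that derivation, equations (4) and (5) translate into a single Python update step as sketched below (update_weights is an illustrative name; note that, as the equations are written, any 1/N or factor-of-2 term from the MSE gradient is absorbed into α):

```python
def update_weights(w0, w1, X, Y, alpha):
    """One gradient-descent step, per equations (4) and (5)."""
    Y_hat = w0 + w1 * X                       # current predictions
    errors = Y_hat - Y                        # (Y_hat_i - Y_i) terms
    w0_new = w0 - alpha * np.sum(errors)      # equation (4)
    w1_new = w1 - alpha * np.sum(errors * X)  # equation (5)
    return w0_new, w1_new
```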
Gradient Descent Algorithm
1. Initialize the algorithm with a choice of learning rate α and (random) initial values of the weights (w0, w1)
2. Calculate the predictions $\hat{Y} = w_0 + w_1 X_1$, as shown in equation (1)
3. Calculate the error terms and the MSE loss function (L): the error terms are $\sum_{i=1}^{N} (\hat{Y}_i - Y_i)$ and $\sum_{i=1}^{N} [(\hat{Y}_i - Y_i) X_{1i}]$ for the datapoints $i = 1$ to $N$ (here N is equal to 3)
4. Update the weights using equations (4) and (5), and repeat steps 2-4 until the loss stops decreasing
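Putting steps 1-4 together gives a short training loop, sketched below using the X1 and Y arrays defined earlier; running exactly two iterations (to match the tables that follow) is an illustrative choice:

```python
alpha = 0.01         # learning rate (step 1)
w0, w1 = 0.0, 0.0    # initial weights (step 1)

for k in range(2):
    Y_hat = w0 + w1 * X1                  # step 2: predictions
    errors = Y_hat - Y                    # step 3: error terms
    print(f"iteration {k+1}: sum(errors) = {np.sum(errors):.3f}, "
          f"sum(errors*X1) = {np.sum(errors * X1):.3f}, "
          f"MSE = {np.mean(errors ** 2):.3f}")
    w0 -= alpha * np.sum(errors)          # step 4: equation (4)
    w1 -= alpha * np.sum(errors * X1)     # step 4: equation (5)
```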
Based on the above-mentioned steps, we can calculate the weights by hand as well. Let the learning rate (α) be 0.01 and initialize the weights w0 and w1 to 0.
Iteration 1 (w0 = 0, w1 = 0):

X1     Y       Ŷ       Ŷ − Y      (Ŷ − Y)·X1
1      4.8     0.0     −4.8       −4.8
3      12.4    0.0     −12.4      −37.2
5      15.5    0.0     −15.5      −77.5
Sum                    −32.7      −119.5

Updated weights: w0 = 0 − 0.01·(−32.7) = 0.327 and w1 = 0 − 0.01·(−119.5) = 1.195

Iteration 2 (w0 = 0.327, w1 = 1.195):

X1     Y       Ŷ        Ŷ − Y       (Ŷ − Y)·X1
1      4.8     1.522    −3.278      −3.278
3      12.4    3.912    −8.488      −25.464
5      15.5    6.302    −9.198      −45.990
Sum                     −20.964     −74.732
As we can see, the sum of errors is decreasing as we update the weights. We can continue to update the weights in this manner until the sum of errors reaches its minimum (i.e., settles to an almost constant value).
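To see this convergence concretely, the same loop can be run until the loss essentially stops changing; the tolerance and iteration cap below are arbitrary sketch values. For this data the weights should settle near the least-squares solution, roughly w0 ≈ 2.875 and w1 ≈ 2.675:

```python
w0, w1 = 0.0, 0.0
prev_loss = float("inf")

for k in range(100_000):                 # generous iteration cap
    errors = (w0 + w1 * X1) - Y
    loss = np.mean(errors ** 2)          # MSE, equation (3)
    if abs(prev_loss - loss) < 1e-12:    # loss has flattened out
        break
    prev_loss = loss
    w0 -= alpha * np.sum(errors)         # equation (4)
    w1 -= alpha * np.sum(errors * X1)    # equation (5)

print(f"stopped after {k} iterations: w0 = {w0:.3f}, w1 = {w1:.3f}")
```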