Gradient Decent Calculation

The document describes the process of gradient descent for linear regression. It defines the variables used in gradient descent including X (training examples), y (labels), θ (parameters), α (learning rate), m (number of examples). It shows that θ is updated by subtracting a term containing the product of the learning rate, the transpose of X, and the difference between the predicted and actual values (errors). This minimizes the errors in the predictions to optimize the model.

Uploaded by

Arindam Sen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views2 pages

Gradient Decent Calculation

Uploaded by

Arindam Sen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

theta = theta - (alpha/m) * (X' * (X * theta - y))

Assume that the following values of X, y and θ are given:

 m = number of training examples

 n = number of features + 1

Here

 m = 5 (training examples)
 n = 4 (features+1)
 X = m x n matrix
 y = m x 1 vector matrix
 θ = n x 1 vector matrix
 xi is the ith training example
 xj is the jth feature in a given training example

Further,

 h(x) = ([X] * [θ]) (m x 1 matrix of predicted values for our training set)
 h(x)-y = ([X] * [θ] - [y]) (m x 1 matrix of Errors in our predictions)

whole objective of machine learning is to minimize Errors in predictions. Based on the above
corollary, our Errors matrix is m x 1 vector matrix as follows:
To calculate new value of θj, we have to get a summation of all errors (m rows) multiplied
by jth feature value of the training set X. That is, take all the values in E, individually multiply
them with jth feature of the corresponding training example, and add them all together. This will
help us in getting the new (and hopefully better) value of θj. Repeat this process for all j or the
number of features. In matrix form, this can be written as:

This can be simplified as:

 [E]' x [X] will give us a row vector matrix, since E' is 1 x m matrix and X is m x n
matrix. But we are interested in getting a column matrix, hence we transpose the resultant
matrix.

More succinctly, it can be written as:

Since (A * B)' = (B' * A'), and A'' = A, we can also write the above as

Lecture 3 - Machine Learning (Stanford) - HZ4cvaztQEs
No ratings yet
Lecture 3 - Machine Learning (Stanford) - HZ4cvaztQEs
64 pages
Programming Ex.1
No ratings yet
Programming Ex.1
6 pages
Machine Learning - Home - Week 2 - Notes - Coursera
No ratings yet
Machine Learning - Home - Week 2 - Notes - Coursera
10 pages
Updating Weight
No ratings yet
Updating Weight
9 pages
Lecture 1.2. Basics and Prerequisite
No ratings yet
Lecture 1.2. Basics and Prerequisite
34 pages
Lecture 1.2 Basics and Prerequisite
No ratings yet
Lecture 1.2 Basics and Prerequisite
35 pages
Ex4 Tutorial - Forward and Back-Propagation
No ratings yet
Ex4 Tutorial - Forward and Back-Propagation
20 pages
Homework2 - Tran Anh Vu
No ratings yet
Homework2 - Tran Anh Vu
3 pages
Week 2
No ratings yet
Week 2
5 pages
2022 Linear Regression
No ratings yet
2022 Linear Regression
34 pages
Linear Regression With Multiple Features
No ratings yet
Linear Regression With Multiple Features
7 pages
Week5-LectureNotes
No ratings yet
Week5-LectureNotes
15 pages
Solution Quiz 1
No ratings yet
Solution Quiz 1
5 pages
cs229.... Machine Language. Andrew NG
No ratings yet
cs229.... Machine Language. Andrew NG
17 pages
Deriving The Normal Equation Using Matrix Calculus
No ratings yet
Deriving The Normal Equation Using Matrix Calculus
18 pages
Assignment 1
No ratings yet
Assignment 1
14 pages
Week 5 Lecture Notes
No ratings yet
Week 5 Lecture Notes
15 pages
Tutorial Ex3
No ratings yet
Tutorial Ex3
2 pages
2IIG0 Cheat Sheet 1
No ratings yet
2IIG0 Cheat Sheet 1
2 pages
Lect5 Reg
No ratings yet
Lect5 Reg
16 pages
LogisticRegression ExercisesSolutions
No ratings yet
LogisticRegression ExercisesSolutions
5 pages
Regression Analysis
No ratings yet
Regression Analysis
54 pages
Stochastic Gradient Descent
No ratings yet
Stochastic Gradient Descent
7 pages
Cost Function
No ratings yet
Cost Function
17 pages
A Journey From Linear Algebra To Machine Learning
No ratings yet
A Journey From Linear Algebra To Machine Learning
50 pages
Slide 9 - SVM
No ratings yet
Slide 9 - SVM
27 pages
CS229 Lecture 2 PDF
100% (1)
CS229 Lecture 2 PDF
48 pages
Linear Regression With Multiple Variable
No ratings yet
Linear Regression With Multiple Variable
30 pages
1.1 ID5059 1.2 Tom Kelsey - Jan 2021: February 15, 2021
No ratings yet
1.1 ID5059 1.2 Tom Kelsey - Jan 2021: February 15, 2021
43 pages
Machine Learning: The Basics
No ratings yet
Machine Learning: The Basics
288 pages
Math Behind ML Algos
No ratings yet
Math Behind ML Algos
18 pages
Maths Behind ML Algos
No ratings yet
Maths Behind ML Algos
18 pages
03 Model Selection and Train-Validation-Test Sets 12 Min
No ratings yet
03 Model Selection and Train-Validation-Test Sets 12 Min
7 pages
Machine Learning (Summary)
No ratings yet
Machine Learning (Summary)
20 pages
Shifting Method
No ratings yet
Shifting Method
9 pages
Lecture-03 - Vectors and Matrices
No ratings yet
Lecture-03 - Vectors and Matrices
27 pages
TD Class - Solution
No ratings yet
TD Class - Solution
5 pages
(Chapman
No ratings yet
(Chapman
69 pages
Dda3020 2024F HW1
No ratings yet
Dda3020 2024F HW1
6 pages
Linear - Regression - SGD
No ratings yet
Linear - Regression - SGD
71 pages
Cours-1regression Lineaire PDF
No ratings yet
Cours-1regression Lineaire PDF
24 pages
Week 6 Lecture Notes
No ratings yet
Week 6 Lecture Notes
9 pages
Cheat Sheet For Exam
No ratings yet
Cheat Sheet For Exam
2 pages
Al3451 Ia 2 Answer Key
No ratings yet
Al3451 Ia 2 Answer Key
12 pages
W2M3-Linear Regression
No ratings yet
W2M3-Linear Regression
32 pages
ML Intro Numericals
No ratings yet
ML Intro Numericals
27 pages
Multilayer Perceptron: R - S - S - S Network
No ratings yet
Multilayer Perceptron: R - S - S - S Network
28 pages
HW 1
No ratings yet
HW 1
3 pages
Rui 4 Margin-En
No ratings yet
Rui 4 Margin-En
1 page
CPSC 540 Assignment 1 (Due January 19)
100% (1)
CPSC 540 Assignment 1 (Due January 19)
9 pages
Backpropagation Math
No ratings yet
Backpropagation Math
6 pages
Linear Regression
No ratings yet
Linear Regression
62 pages
BITS F464 ML Lecture Notes
No ratings yet
BITS F464 ML Lecture Notes
86 pages
Linear-Regression 231212 072619
No ratings yet
Linear-Regression 231212 072619
13 pages
Back Propagation LSN 4
No ratings yet
Back Propagation LSN 4
17 pages
5.2 Regression
No ratings yet
5.2 Regression
19 pages
Lec 07-08 - Final
No ratings yet
Lec 07-08 - Final
32 pages
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
2.5/5 (2)
An Introduction to Linear Algebra and Tensors
From Everand
An Introduction to Linear Algebra and Tensors
M. A. Akivis
1/5 (1)
AAL2 - AAL5 Alarms in RNC
No ratings yet
AAL2 - AAL5 Alarms in RNC
3 pages
Anaconda Installation
No ratings yet
Anaconda Installation
5 pages
Linear Algebra Review (Op3onal) : Matrices and Vectors
No ratings yet
Linear Algebra Review (Op3onal) : Matrices and Vectors
25 pages
Curso de Machine Learning Lectura 1
No ratings yet
Curso de Machine Learning Lectura 1
39 pages

Gradient Decent Calculation

Uploaded by

Gradient Decent Calculation

Uploaded by

theta = theta - (alpha/m) * (X' * (X * theta - y))

Assume that the following values of X, y and θ are given:

 m = number of training examples

This can be simplified as:

More succinctly, it can be written as:

You might also like