Linear Regression Using Batch Gradient Descent
http://www.dsplog.com/2011/10/29/batch-gradient-descent/
I happened to stumble on Prof. Andrew Ng's Machine Learning classes, which are available online as part of the Stanford Center for Professional Development. The first lecture in the series discusses fitting a data set using linear regression. For understanding this concept, I chose to take data from the top 50 articles of this blog, based on the page views in the month of September 2011.
Notations
Let $m$ be the number of training examples (in our case, the top 50 articles), $x$ be the input sequence (the page index), $y$ be the output sequence (the page views for each page index) and $n$ be the number of features/parameters ($n = 2$ for our example). The $i$-th example of the training set is denoted as $(x^{(i)}, y^{(i)})$.
Let us try to predict the number of page views for a given page index using a hypothesis

$h_\theta(x) = \theta_0 + \theta_1 x_1 = \theta^T x$,

where $\theta = [\theta_0\ \theta_1]^T$ is the parameter vector and $x = [1\ x_1]^T$ carries a constant 1 as its first feature (so $n = 2$). Define the cost function, measuring how well the hypothesis fits the training set, as

$J(\theta) = \frac{1}{2}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$.

The scaling by the fraction $\frac{1}{2}$ is just for notational convenience. Let us start with some parameter vector $\theta$, and keep changing $\theta$ to reduce the cost function $J(\theta)$, i.e.

$\theta_j := \theta_j - \alpha\,\frac{\partial}{\partial \theta_j}J(\theta) = \theta_j - \alpha\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)}$,

where $\alpha$ is the learning rate.
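As a concrete illustration of this notation, the training set and the cost function can be set up in Matlab/Octave roughly as follows. The page-view numbers here are made-up placeholders with a roughly exponential decay; the actual top-50 data from September 2011 is not reproduced in this capture.

% Hypothetical stand-in for the top-50 article data (page index vs page views).
m          = 50;                               % number of training examples
page_idx   = (1:m)';                           % input sequence x (page index)
page_views = round(5000*exp(-0.08*page_idx));  % made-up page views, exponential-ish trend
x = [ones(m,1) page_idx];                      % m x 2 matrix: constant feature + page index (n = 2)
y = page_views;                                % output sequence
theta_vec = [0 0]';                            % a candidate parameter vector [theta_0 theta_1]'
h_theta   = x*theta_vec;                       % hypothesis evaluated over the training set
j_theta   = 1/2*sum((h_theta - y).^2)          % cost function J(theta) for this candidate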
1. For each update of the parameter vector $\theta$, the algorithm processes the full training set. This algorithm is called Batch Gradient Descent.
2. For the given example with 50 training examples, going over the full training set is computationally feasible. However, when the training set is very large, we need to use a slight variant of the algorithm (stochastic gradient descent). We will discuss that in another post.
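The gradient-descent snippet from the original post is not visible in this capture, so the following is only a minimal sketch of the batch update, continuing from the hypothetical setup above. The value of alpha is assumed, and the extra 1/m scaling on the error term follows the discussion in the comments below (it can be absorbed into alpha).

alpha     = 0.001;          % learning rate (assumed value; see the notes on alpha below)
theta_vec = [0 0]';         % start from an all-zero parameter vector
num_steps = 10000;          % fixed number of iterations, as in the post
for k = 1:num_steps
    h_theta   = x*theta_vec;                     % hypothesis over the full training set
    grad      = x'*(h_theta - y);                % sum of errors over all m examples (batch)
    theta_vec = theta_vec - alpha*(1/m)*grad;    % batch gradient descent update
end
theta_vec                   % computed [theta_0 theta_1]'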
The computed $\theta$ values are:
With this hypothesis, the predicted page views are shown as the red curve in the plot below. In the Matlab code snippet, the number of gradient descent steps was blindly kept at 10000. One can probably stop the gradient descent when the cost function is sufficiently small and/or when its rate of change between iterations is small.
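One way to realize that stopping rule, instead of blindly running 10000 steps, is to exit once the cost changes by less than some tolerance between iterations. The tolerance and alpha below are assumed values, not taken from the post:

alpha = 0.001; tol = 1e-6;                % assumed learning rate and relative tolerance
theta_vec = [0 0]';
j_prev = inf;
for k = 1:10000                           % upper bound on the number of steps
    theta_vec = theta_vec - alpha*(1/m)*(x'*(x*theta_vec - y));
    j_curr = 1/(2*m)*sum((x*theta_vec - y).^2);   % cost after this update
    if abs(j_prev - j_curr) < tol*j_prev          % rate of change has become small
        break;
    end
    j_prev = j_curr;
end
plot(page_idx, y, 'b.', page_idx, x*theta_vec, 'r');   % measured points vs fitted line (red)
xlabel('page index'); ylabel('page views');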
Couple of things to note:
1. Given that the measured values show an exponential trend, trying to fit a straight line does not seem like a good idea. Anyhow, given that this is the first post in this series, I let it be.
2. The value of $\alpha$ controls the rate of convergence of the algorithm. If $\alpha$ is very small, the algorithm takes small steps and takes a longer time to converge. A higher value of $\alpha$ causes the algorithm to diverge (a small sketch after the plotting code below illustrates this).
3. Have not figured out how to select an $\alpha$ value suitable (fast convergence) for the data set under consideration. Will figure that out later.

Plotting the variation of the cost function $J(\theta)$ over a range of $\theta_0$ and $\theta_1$ values:
% x, y and m are the training data and example count defined earlier
j_theta = zeros(250, 250);                 % initialize j_theta
theta0_vals = linspace(-5000, 5000, 250);
theta1_vals = linspace(-200, 200, 250);
for i = 1:length(theta0_vals)
    for j = 1:length(theta1_vals)
        theta_val_vec = [theta0_vals(i) theta1_vals(j)]';
        h_theta = (x*theta_val_vec);
        j_theta(i,j) = 1/(2*m)*sum((h_theta - y).^2);
    end
end
figure;
surf(theta0_vals, theta1_vals, 10*log10(j_theta.'));
xlabel('theta_0'); ylabel('theta_1'); zlabel('10*log10(Jtheta)');
title('Cost function J(theta)');
figure;
contour(theta0_vals, theta1_vals, 10*log10(j_theta.'))
xlabel('theta_0'); ylabel('theta_1')
title('Cost function J(theta)');
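To see point 2 of the notes above in action, one can also track J(theta) over the iterations for two different step sizes (the alpha values here are assumed purely for illustration): a small alpha converges slowly, while a too-large alpha makes the cost blow up.

alphas  = [0.0005 0.005];                 % a "small" and a "too large" step size (assumed)
n_iter  = 50;
j_track = zeros(n_iter, length(alphas));
for a = 1:length(alphas)
    theta_vec = [0 0]';
    for k = 1:n_iter
        theta_vec    = theta_vec - alphas(a)*(1/m)*(x'*(x*theta_vec - y));
        j_track(k,a) = 1/(2*m)*sum((x*theta_vec - y).^2);
    end
end
figure; semilogy(j_track);
xlabel('iteration'); ylabel('J(theta)');
legend('small alpha (converges)', 'large alpha (diverges)');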
Given that the surf() plot is a bit unwieldy on my relatively slow desktop, using the contour() plot seems to be a much better choice. One can see that the minimum of this cost function lies near the $\theta$ values computed by the gradient descent.
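Rather than reading the minimum off the contour plot by eye, one can also pick it directly off the computed grid; this should land close to the $\theta$ returned by the gradient descent:

[j_min, idx]  = min(j_theta(:));              % smallest cost over the 250x250 grid
[i0, i1]      = ind2sub(size(j_theta), idx);  % row -> theta_0 index, column -> theta_1 index
theta0_at_min = theta0_vals(i0)
theta1_at_min = theta1_vals(i1)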
Please click here to SUBSCRIBE to the newsletter and download the FREE e-Book on probability of error in AWGN. Thanks for visiting! Happy learning.
Related posts:
Least Squares in Gaussian Noise

Tagged as: machine_learning
Comments

Bobaker Madi December 30, 2012 at 12:04 am
Thanks for this topic. I have the same question as Harneet. He meant: if we have random points, how can we apply this method? If we sort them, in this case we will get exactly a line or curve. Thanks again for your illustration.
Krishna Sankar January 2, 2013 at 6:15 am
@Bobaker: Ah, I understand. I have not played with random, unsorted data. Did you try making this data set random, and is it converging? I think it will still converge.
Paul October 6, 2012 at 5:06 am
Hi Krishna, Thank you very much for the article. In your Matlab/Octave code snippet, you have a 1/m factor in the expression for theta_vec. However, in the LaTeX formulae that precede it, this factor is missing. Could you clarify? Many thanks, Paul
Krishna Sankar October 6, 2012 at 6:39 am
@Paul: Nice observation. While toying with the Matlab code, I found that the gradient descent was not converging and hence put an additional scaling of 1/m on the error term. This 1/m term can be considered to be part of alpha and hence can be absent in the mathematical description. Agree?
Harneet Oberoi November 2, 2011 at 5:47 am
Hello, Thanks for sharing this. It is very helpful. My question is that the input and output data you have used is sorted from smallest to largest, whereas it follows an exponential distribution. Let me put it this way: out of the input (x) and output (y), only one is sorted and the other is random. Would really appreciate your help. Thanks, Harneet
Krishna Sankar November 7, 2011 at 4:56 am
@Harneet: Given my limited exposure to the topic, I am unable to understand your query. Can you please put across an example?
Deepak October 31, 2011 at 12:38 am
Thank you for a great article. Sir, I was wondering if you have planned any posts on SNR estimation techniques in the near future.