Lec 3

The document provides an overview of regression analysis, focusing on its purpose: to predict the value of a response variable from one or more attribute variables. It explains the concepts of simple and multiple linear regression, the Ordinary Least Squares (OLS) method for estimating parameters, and the gradient descent optimization technique. Additionally, it includes mathematical derivations and examples to illustrate the application of these concepts in predicting outcomes based on given data.

Dr. Supriyo Mandal
Ph.D. (IIT Patna)
Postdoc (ZBW, University of Kiel, Germany)

Course code: CS31002 (L-T-P-Cr: 3-1-0-4)
Course Name: Machine Learning
Credits: 4
What is Regression

Regression – predicting the value of a response variable from one or more attribute variables.

Variables – continuous numeric values.

Regression analysis – a set of statistical processes for estimating the relationships between a dependent variable and
one or more independent variables.
v The dependent variable is often called the 'predictand', 'outcome' or 'response' variable;
v Independent variables are often called 'predictors', 'covariates', 'explanatory variables' or 'features'.
v Regression analysis is a way of mathematically sorting out which of those variables actually has an
impact. It is also used for modeling the future relationship between the variables.

Statistical process – the science of collecting, exploring, organizing, analyzing and interpreting data, and of exploring patterns
and trends, to answer questions and make decisions (a broad area).

y = a + bx
; a = intercept
b = slope/gradient (steepness of the line)
y = dependent variable
x = independent variable
Basics of Regression Models

v Regression models predict a value of the Y variable given known values of the X variables.

v Prediction within the range of values in the dataset used for model-fitting is known as interpolation.

v Prediction outside this range of the data is known as extrapolation.

v First, a model to estimate the outcome needs to be chosen.

v Then the parameters of that model need to be estimated using a chosen method (e.g., least
squares).
Example..........

Hour (x)    Marks (y)
5           40
7           120
12          180
16          210
20          240

y = a + bx
; a = intercept
b = slope/gradient
y = dependent variable (marks)
x = independent variable (hours)

Calculate the gradient........
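As a sketch of this calculation in Python (using the least-squares formulas b = Σ(xᵢ − x̄)(yᵢ − ȳ) / Σ(xᵢ − x̄)² and a = ȳ − b·x̄ derived later in these notes; the function name `fit_line` is my own):

```python
# Least-squares gradient (slope) and intercept for the hour/marks data.
def fit_line(xs, ys):
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # b = sum((xi - x_bar)(yi - y_bar)) / sum((xi - x_bar)^2)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx
    a = y_bar - b * x_bar
    return a, b

hours = [5, 7, 12, 16, 20]
marks = [40, 120, 180, 210, 240]
a, b = fit_line(hours, marks)
print(round(b, 4), round(a, 4))  # gradient ≈ 12.2078, intercept ≈ 11.5065
```

For this table, x̄ = 12 and ȳ = 158, so the gradient works out to 1880/154 ≈ 12.21.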


Linear Regression with Gradient Descent....
v Gradient descent is an iterative optimization algorithm for finding the minimum of a function. Here that function is our
loss function. We will use the Mean Squared Error (MSE) as the loss function in this topic, shown below:

E = (1/n) Σ_{i=1}^{n} [y_i − (a + b x_i)]² = (1/n) Σ_{i=1}^{n} (y_i − ŷ_i)²

v Understanding Gradient Descent
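A minimal sketch of this loss function (the name `mse` is my own):

```python
# Mean Squared Error of the line y = a + b*x on data (xs, ys).
def mse(a, b, xs, ys):
    n = len(xs)
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys)) / n

# A perfect fit has zero error:
print(mse(1.0, 2.0, [0, 1, 2], [1, 3, 5]))  # 0.0
```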


Linear Regression with Gradient Descent....
v Mathematical derivation of gradient descent in simple linear regression:
v 1. Initially let a = 0 and b = 0. Let L be our learning rate. It controls how much the values of a and b change with each step. L
could be a small value like 0.0001 for good accuracy.
v 2. Calculate the partial derivatives of the loss function with respect to a and b, and plug the current values of x, y, a and b into
them to obtain the derivative values Da and Db.

Db = (1/n) Σ_{i=1}^{n} 2[y_i − (a + b x_i)](−x_i)

Db = (−2/n) Σ_{i=1}^{n} x_i [y_i − (a + b x_i)]

Db = (−2/n) Σ_{i=1}^{n} x_i (y_i − ŷ_i)

v Db is the value of the partial derivative with respect to b.

v Similarly, the partial derivative with respect to a is Da:

Da = (1/n) Σ_{i=1}^{n} 2[y_i − (a + b x_i)](−1)

Da = (−2/n) Σ_{i=1}^{n} [y_i − (a + b x_i)]

Da = (−2/n) Σ_{i=1}^{n} (y_i − ŷ_i)
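The derivative values Da and Db can be sketched as (the function name `gradients` is my own):

```python
# Partial derivatives of the MSE loss E = (1/n) * sum((y_i - (a + b*x_i))^2)
# with respect to a and b.
def gradients(a, b, xs, ys):
    n = len(xs)
    residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
    da = (-2.0 / n) * sum(residuals)
    db = (-2.0 / n) * sum(x * r for x, r in zip(xs, residuals))
    return da, db

# Both derivatives vanish at a perfect fit, as expected at a minimum:
da, db = gradients(1.0, 2.0, [0, 1, 2], [1, 3, 5])
```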

Linear Regression with Gradient Descent....
v Mathematical derivation of gradient descent in simple linear regression:
v 3. Now we update the current values of b and a using the following equations:
v b = b − L×Db

v a = a − L×Da

v 4. We repeat this process until our loss function is a very small value or ideally 0 (which means 0 error, or 100%
accuracy). The values of b and a that we are left with are the optimum values.

v In the analogy of a person walking down a valley, b can be considered the person's current position. D is equivalent to the
steepness of the slope and L to the speed with which he moves. The new value of b calculated
using the above equation is his next position, and L×D is the size of the step he takes.
v When the slope is steeper (D is larger) he takes longer steps, and when it is less steep (D is smaller) he takes smaller
steps.
v Finally he arrives at the bottom of the valley, which corresponds to the minimum of our loss.
v With the optimum values of b and a, our model is ready to make predictions!
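Steps 1–4 above can be sketched end to end on the hour/marks data from the example. Two assumptions in this sketch: a learning rate of 0.001 (larger than the 0.0001 mentioned in the text, purely so it converges in fewer iterations) and a fixed iteration count as the stopping rule instead of a loss threshold:

```python
# Gradient descent for y = a + b*x with MSE loss.
def gradient_descent(xs, ys, lr=0.001, steps=100_000):
    a, b = 0.0, 0.0                                         # step 1: initialize
    n = len(xs)
    for _ in range(steps):                                  # step 4: repeat
        residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
        da = (-2.0 / n) * sum(residuals)                    # step 2: derivatives
        db = (-2.0 / n) * sum(x * r for x, r in zip(xs, residuals))
        a, b = a - lr * da, b - lr * db                     # step 3: update
    return a, b

hours = [5, 7, 12, 16, 20]
marks = [40, 120, 180, 210, 240]
a, b = gradient_descent(hours, marks)
print(round(a, 2), round(b, 2))  # ≈ 11.51 and 12.21, matching the OLS solution
```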
Linear Regression with Gradient Descent....
Example..........

Hour (x)    Marks (y)
5           40
7           120
12          180
16          210
20          240

Find the regression line.....
Simple linear regression: There is only one continuous independent variable x, and the assumed relation
between the independent variable and the dependent variable y is
y = a + bx.

Detailed study.................
v Let x be the independent predictor variable and y the dependent variable.
v Assume that we have a set of observed values of x and y. A simple linear regression model defines the relationship between x
and y using a line defined by an equation of the following form:
y = a + bx
v To determine the optimal estimates of a and b, an estimation method known as Ordinary Least Squares (OLS) is used.

v The OLS method

v In the OLS method, the values of the y-intercept and slope are chosen such that they minimize the sum of the squared errors; that
is, the sum of the squares of the vertical distances between the predicted y-values and the actual y-values (see Figure 7.1). Let ŷ_i
be the predicted value of y_i.

v Then the sum of squares of errors is given by

E = Σ_{i=1}^{n} (y_i − ŷ_i)² = Σ_{i=1}^{n} [y_i − (a + b x_i)]²

v So we are required to find the values of a and b such that E is minimum.


Detailed study.................

E = Σ_{i=1}^{n} (y_i − ŷ_i)² = Σ_{i=1}^{n} [y_i − (a + b x_i)]²

v To solve the above equation we take the two partial derivatives and set them to zero:

∂E/∂a = 0 ------(i)   and   ∂E/∂b = 0 ------(ii)

v By solving eq. (i):

=> Σ_{i=1}^{n} 2[y_i − a − b x_i](−1) = 0
=> −2 Σ_{i=1}^{n} y_i + 2a Σ_{i=1}^{n} 1 + 2b Σ_{i=1}^{n} x_i = 0
=> − Σ_{i=1}^{n} y_i + na + b Σ_{i=1}^{n} x_i = 0
=> na = Σ_{i=1}^{n} y_i − b Σ_{i=1}^{n} x_i
=> a = (1/n) Σ_{i=1}^{n} y_i − b (1/n) Σ_{i=1}^{n} x_i
=> a = ȳ − b x̄
where ȳ = (1/n) Σ_{i=1}^{n} y_i (mean of the values of y) and x̄ = (1/n) Σ_{i=1}^{n} x_i (mean of the values of x).
Detailed study.................

E = Σ_{i=1}^{n} (y_i − ŷ_i)² = Σ_{i=1}^{n} [y_i − (a + b x_i)]²

v To solve the above equation we take the two partial derivatives and set them to zero:

∂E/∂a = 0 ------(i)   and   ∂E/∂b = 0 ------(ii)

v By solving eq. (ii):

=> Σ_{i=1}^{n} 2[y_i − a − b x_i](−x_i) = 0
=> −2 Σ_{i=1}^{n} x_i y_i + 2a Σ_{i=1}^{n} x_i + 2b Σ_{i=1}^{n} x_i² = 0
=> − Σ_{i=1}^{n} x_i y_i + a Σ_{i=1}^{n} x_i + b Σ_{i=1}^{n} x_i² = 0
=> − Σ_{i=1}^{n} x_i y_i + (ȳ − b x̄) Σ_{i=1}^{n} x_i + b Σ_{i=1}^{n} x_i² = 0     [substituting a = ȳ − b x̄ from eq. (i)]
=> b ( Σ_{i=1}^{n} x_i² − x̄ Σ_{i=1}^{n} x_i ) = Σ_{i=1}^{n} x_i y_i − ȳ Σ_{i=1}^{n} x_i
=> b Σ_{i=1}^{n} x_i (x_i − x̄) = Σ_{i=1}^{n} y_i (x_i − x̄)

=> b = Σ_{i=1}^{n} y_i (x_i − x̄) / Σ_{i=1}^{n} x_i (x_i − x̄)
     = Σ_{i=1}^{n} (x_i − x̄)(y_i − ȳ) / Σ_{i=1}^{n} (x_i − x̄)²

(using Σ_{i=1}^{n} (x_i − x̄) = 0 to replace y_i by (y_i − ȳ) in the numerator and x_i by (x_i − x̄) in the denominator)

By multiplying numerator and denominator of the RHS by 1/(n−1):

b = [ (1/(n−1)) Σ_{i=1}^{n} (x_i − x̄)(y_i − ȳ) ] / [ (1/(n−1)) Σ_{i=1}^{n} (x_i − x̄)² ] = Cov(x, y) / Var(x)

For problems and solutions:

https://www.ncl.ac.uk/webtemplate/ask-assets/external/maths-resources/statistics/regression-and-correlation/simple-linear-regression.html
v We assume that there are N independent variables x1, x2, ⋯ , xN. Let the dependent variable be y.
v Let there also be n observed values of these variables.

v The multiple linear regression model defines the relationship between the N independent variables and the
dependent variable by an equation of the following form:
y = β0 + β1x1 + ⋯ + βNxN
v As in simple linear regression, here also we use the ordinary least squares (OLS) method to obtain the
optimal estimates of β0, β1, ⋯ , βN. The method yields the following procedure for the computation of these
optimal estimates. Let Y be the n×1 vector of observed y-values, X the n×(N+1) matrix whose rows are
(1, x1, ⋯ , xN) for each observation, and B the (N+1)×1 vector of coefficients (β0, β1, ⋯ , βN).

v Then it can be shown that the regression coefficients are given by

B = (XT X)−1 XT Y
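A sketch of this formula with NumPy. The data below is made up for illustration; in practice `np.linalg.lstsq` is the numerically safer way to solve the same least-squares problem:

```python
import numpy as np

# Made-up observations: each row is (x1, x2); y holds the responses.
X_raw = np.array([[1.0, 1.0],
                  [2.0, 0.0],
                  [0.0, 2.0],
                  [3.0, 1.0]])
y = np.array([3.0, 1.0, 5.0, 2.0])

# Prepend a column of ones so that beta_0 acts as the intercept.
X = np.column_stack([np.ones(len(y)), X_raw])

# Textbook formula: B = (X^T X)^{-1} X^T Y
B = np.linalg.inv(X.T @ X) @ X.T @ y
print(B)  # [beta_0, beta_1, beta_2]
```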
v Example:
v Fit a multiple linear regression model to the following data:

v Solution:
v In this problem, there are two independent variables and four sets of values of the variables. Thus, in the notations
used above, we have N = 2 and n = 4. The multiple linear regression model for this problem has the form
y = β0 + β1x1 + β2x2.
v The computations are shown below.
y = 2.0625 − 2.3750x1 + 3.2500x2
