A07 Linear Regression v2 2up

The document provides an overview of linear regression, a fundamental machine learning technique used for predicting continuous outcomes based on input variables. It explains the concept of fitting a best fit line to data points, the least squares method for minimizing errors, and introduces polynomial and multivariate regression models. Additionally, it highlights the importance of understanding the relationship between variables and cautions against confusing correlation with causality.

Linear Regression

Mehul Motani
Electrical & Computer Engineering
National University of Singapore
Email: [email protected]


Machine Learning Taxonomy
• Machine learning is function approximation
• Supervised learning – Access to labeled dataset
• Unsupervised learning – Dataset is not labeled
• Classification – The output is categorical
• Regression – The output is continuous
• Many different ML models are available
• Simplest model is Linear Regression
– Find the best fit line which goes through the data points
– Linear regression is supervised learning
Example – Tire tread vs Mileage
How are tire tread wear and mileage related?

Clearly, tire tread wear increases with mileage! But how?

Example – Tire tread vs Mileage


Mileage (1000 miles)    Groove Depth (mils)
         0                   394.33
         4                   329.50
         8                   291.00
        12                   255.17
        16                   229.33
        20                   204.83
        24                   179.00
        28                   163.83
        32                   150.33

Tire tread wear vs. mileage. From: Statistics and Data Analysis; Tamhane and Dunlop; Prentice Hall.
Some Training Data

[Figure: scatter plot of training data; the dependent variable y (output) plotted against the independent variable x (input)]

Linear Regression - Best Fit Line
• Linear Regression – Fit the data with the "best" hyperplane (in two dimensions, a line) which goes through the data points

[Figure: scatter plot with the best fit line $\hat{y} = \beta_0 + \beta_1 x$; the dependent variable y (output) is plotted against the independent variable x (input)]

• Equivalently, $y = mx + b$, where $m$ is the slope of the line and $b$ is the y-intercept


Linear Regression - Best Fit Line
• For each point, the difference between the predicted value and the actual observation is the residual (error)
• The "best" fit line is the one which minimizes the overall error, defined as the sum of squared errors

[Figure: scatter plot with the best fit line $\hat{y} = \beta_0 + \beta_1 x$ and the vertical residuals from each data point to the line]

Polynomial Curve Fitting

A degree-$M$ polynomial:
$$\hat{y}(x) = w_0 + w_1 x + w_2 x^2 + \cdots + w_M x^M = \sum_{j=0}^{M} w_j x^j$$


0th Order Polynomial (Degree 0)
[Figure: degree-0 polynomial fitted to the data]

1st Order Polynomial (Degree 1)


[Figure: degree-1 polynomial fitted to the data]


3rd Order Polynomial (Degree 3)
[Figure: degree-3 polynomial fitted to the data]

9th Order Polynomial (Degree 9)


[Figure: degree-9 polynomial fitted to the data]


How good is the fit?

[Figure: three fits to the same data, labeled Bad Fit, Good Fit, and Over-Fitting]

Fit quality is measured by the root-mean-square (RMS) error:
$$E_{\mathrm{RMS}} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2}$$
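To make the over-fitting picture concrete, here is a minimal sketch (an addition, not from the slides) that fits polynomials of degree 0, 1, 3, and 9 with NumPy and compares their RMS errors; the noisy sine data is an illustrative assumption.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative synthetic data (assumption): a noisy sine curve.
x = np.linspace(0, 1, 10)
y = np.sin(2 * np.pi * x) + rng.normal(0, 0.2, size=x.shape)

for degree in (0, 1, 3, 9):
    coeffs = np.polyfit(x, y, deg=degree)     # least squares polynomial fit
    y_hat = np.polyval(coeffs, x)             # predictions at the training inputs
    rms = np.sqrt(np.mean((y - y_hat) ** 2))  # root-mean-square error
    print(f"degree {degree}: training RMS error = {rms:.4f}")

# Training RMS falls as the degree grows; the degree-9 fit drives it to
# (nearly) zero by passing through every point, which is over-fitting.
```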

Linear Regression
• Response / outcome / dependent variable: $y$
• Predictor / explanatory / independent variable: $x$
• Example 1: Estimate electricity demand for home cooling ($y$) from the average daily temperature ($x$)
• Example 2: Relationship between the head size and body size of a newborn
• Regression analysis: statistical methodology to estimate the relationship between $x$ and $y$
• Correlation analysis: statistical methodology used to assess the strength of the relationship between $x$ and $y$


Linear Regression
• One response variable and one explanatory variable
• We denote the explanatory variable as $x$ and the response variable as $Y$
• $n$ pairs of observations $(x_i, y_i)$, $i = 1, \ldots, n$
• $y_i$ is the observed value of the random variable $Y_i$ and is related to $x_i$ by:
$$Y_i = \beta_0 + \beta_1 x_i + \epsilon_i$$
• $\epsilon_i$: random error with $E[\epsilon_i] = 0$ and $\mathrm{Var}(\epsilon_i) = \sigma^2$
• The "true regression line" models the true but unknown mean of $Y_i$:
$$E[Y_i] = \mu_i = \beta_0 + \beta_1 x_i$$

Linear Regression
• The errors $\epsilon_i$ are assumed to be independent and identically distributed
• They arise from a variety of causes:
  – Measurement errors
  – Other variables affecting $Y_i$ that are not included in the model
• The assumption $E[\epsilon_i] = 0$ implies there is no systematic bias
• The usual model is $\epsilon_i \sim N(0, \sigma^2)$, justified by the Central Limit Theorem
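As a quick illustration (an addition, not from the slides), the model can be simulated directly; the parameter values below are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

beta0, beta1, sigma = 360.0, -7.0, 10.0      # assumed true parameters
x = np.linspace(0, 32, 9)
eps = rng.normal(0.0, sigma, size=x.shape)   # epsilon_i ~ N(0, sigma^2), i.i.d.
y = beta0 + beta1 * x + eps                  # observed responses Y_i

# E[eps] = 0: the sampled errors average to roughly zero (no systematic bias).
print(f"mean of sampled errors: {eps.mean():.3f}")
```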


Example – Tire tread vs Mileage
Inspect the data – Does the data have a linear trend?

Mileage (1000 miles)    Groove Depth (mils)
         x                      y
         0                   394.33
         4                   329.50
         8                   291.00
        12                   255.17
        16                   229.33
        20                   204.83
        24                   179.00
        28                   163.83
        32                   150.33

Simple Linear Regression Model


• In simple linear regression, the data is represented as:
$$y_i = \beta_0 + \beta_1 x_i + \epsilon_i \qquad (1)$$
  where $\epsilon_i \sim N(0, \sigma^2)$
• The candidate (fitted) model:
$$\hat{y} = \beta_0 + \beta_1 x \qquad (2)$$
  where
  – $\beta_0$: intercept
  – $\beta_1$: slope of the regression line


Least Squares Fitting
• $e_i$: difference between the data and the candidate line:
$$e_i = y_i - \hat{y}_i = y_i - (\beta_0 + \beta_1 x_i)$$

[Figure: data points with vertical residuals $e_i$ to the candidate line]

• Goal: Find the $\beta_0$ and $\beta_1$ which minimize the sum of squared errors:
$$Q = \sum_{i=1}^{n} \left[ y_i - (\beta_0 + \beta_1 x_i) \right]^2$$

Least Squares Fitting


• Obtain the values of $\beta_0$ and $\beta_1$ that minimize the sum of squared errors
• Setting the partial derivatives to zero:
$$\frac{\partial Q}{\partial \beta_0} = -2 \sum_{i=1}^{n} \left[ y_i - (\beta_0 + \beta_1 x_i) \right] = 0 \;\Rightarrow\; n\beta_0 + \beta_1 \sum_{i=1}^{n} x_i = \sum_{i=1}^{n} y_i$$
$$\frac{\partial Q}{\partial \beta_1} = -2 \sum_{i=1}^{n} x_i \left[ y_i - (\beta_0 + \beta_1 x_i) \right] = 0 \;\Rightarrow\; \beta_0 \sum_{i=1}^{n} x_i + \beta_1 \sum_{i=1}^{n} x_i^2 = \sum_{i=1}^{n} x_i y_i$$
• After some algebra:
$$\hat{\beta}_0 = \frac{\sum_{i=1}^{n} x_i^2 \sum_{i=1}^{n} y_i - \sum_{i=1}^{n} x_i \sum_{i=1}^{n} x_i y_i}{n \sum_{i=1}^{n} x_i^2 - \left( \sum_{i=1}^{n} x_i \right)^2}, \qquad
\hat{\beta}_1 = \frac{n \sum_{i=1}^{n} x_i y_i - \sum_{i=1}^{n} x_i \sum_{i=1}^{n} y_i}{n \sum_{i=1}^{n} x_i^2 - \left( \sum_{i=1}^{n} x_i \right)^2}$$
Least Squares Fitting
• To simplify, define:
$$S_{xy} = \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y}) = \sum_{i=1}^{n} x_i y_i - \frac{1}{n} \sum_{i=1}^{n} x_i \sum_{i=1}^{n} y_i$$
$$S_{xx} = \sum_{i=1}^{n} (x_i - \bar{x})^2 = \sum_{i=1}^{n} x_i^2 - \frac{1}{n} \left( \sum_{i=1}^{n} x_i \right)^2$$
$$S_{yy} = \sum_{i=1}^{n} (y_i - \bar{y})^2 = \sum_{i=1}^{n} y_i^2 - \frac{1}{n} \left( \sum_{i=1}^{n} y_i \right)^2$$
• Note: with $\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i$ and $\bar{y} = \frac{1}{n} \sum_{i=1}^{n} y_i$,
$$\hat{\beta}_1 = \frac{S_{xy}}{S_{xx}}, \qquad \hat{\beta}_0 = \bar{y} - \hat{\beta}_1 \bar{x}$$
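The shortcut formulas translate directly into code. Below is a small sketch (an addition, not from the slides) that computes $\hat{\beta}_0$ and $\hat{\beta}_1$ from $S_{xy}$ and $S_{xx}$ on synthetic data and cross-checks them against NumPy's built-in least squares fit; the data-generating parameters are assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.uniform(0, 10, size=50)
y = 3.0 + 2.0 * x + rng.normal(0, 1, size=50)  # assumed truth: beta0=3, beta1=2

n = len(x)
Sxy = np.sum(x * y) - np.sum(x) * np.sum(y) / n
Sxx = np.sum(x ** 2) - np.sum(x) ** 2 / n

beta1_hat = Sxy / Sxx                          # slope estimate
beta0_hat = y.mean() - beta1_hat * x.mean()    # intercept estimate

# Cross-check against NumPy's degree-1 least squares polynomial fit.
slope, intercept = np.polyfit(x, y, deg=1)
print(f"beta1_hat = {beta1_hat:.4f}  (polyfit: {slope:.4f})")
print(f"beta0_hat = {beta0_hat:.4f}  (polyfit: {intercept:.4f})")
```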

Example – Tire tread vs Mileage


Using the data tabulated above:
$n = 9$
$\sum x_i = 144, \quad \sum x_i^2 = 3264$
$\sum y_i = 2197.32, \quad \sum y_i^2 = 589887.08$
$\sum x_i y_i = 28167.72$
$\bar{x} = 16, \quad \bar{y} = 244.15$
$S_{xy} = -6989.40, \quad S_{xx} = 960$
$$\hat{\beta}_1 = \frac{S_{xy}}{S_{xx}} = -7.281, \qquad \hat{\beta}_0 = \bar{y} - \hat{\beta}_1 \bar{x} = 360.64$$
$$\hat{y} = 360.64 - 7.281\, x$$

Exercise: Write code in Python to compute the best fit line for the tire tread example and plot the best fit line overlaid with the data points.
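One possible solution to the exercise (a sketch; Matplotlib is assumed to be available):

```python
import numpy as np
import matplotlib.pyplot as plt

# Tire tread data from the slides.
x = np.array([0, 4, 8, 12, 16, 20, 24, 28, 32], dtype=float)  # mileage (1000 miles)
y = np.array([394.33, 329.50, 291.00, 255.17, 229.33,
              204.83, 179.00, 163.83, 150.33])                # groove depth (mils)

n = len(x)
Sxy = np.sum(x * y) - np.sum(x) * np.sum(y) / n
Sxx = np.sum(x ** 2) - np.sum(x) ** 2 / n
beta1 = Sxy / Sxx
beta0 = y.mean() - beta1 * x.mean()
print(f"y_hat = {beta0:.2f} + ({beta1:.3f}) x")   # ~ 360.64 - 7.281 x

plt.scatter(x, y, label="data")
plt.plot(x, beta0 + beta1 * x, color="red", label="best fit line")
plt.xlabel("Mileage (1000 miles)")
plt.ylabel("Groove depth (mils)")
plt.legend()
plt.show()
```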
Linear Regression – Tire tread vs Mileage
[Figure: tire tread data (tabulated above) with the fitted line $\hat{y} = 360.64 - 7.281x$ overlaid]

Tire tread wear vs. mileage. From: Statistics and Data Analysis; Tamhane and Dunlop; Prentice Hall.

Anscombe's Quartet
[Figure: Anscombe's quartet; four scatter plots (panels A, B, C, D) with nearly identical best fit lines but very different data patterns]

The best fit line can be misleading!
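The quartet is easy to verify in code. A short sketch (an addition, not from the slides), assuming seaborn is installed; `sns.load_dataset` fetches the example data over the network on first use.

```python
import numpy as np
import seaborn as sns

# Anscombe's quartet ships as a seaborn example dataset.
df = sns.load_dataset("anscombe")

for name, group in df.groupby("dataset"):
    slope, intercept = np.polyfit(group["x"], group["y"], deg=1)
    print(f"dataset {name}: y_hat = {intercept:.2f} + {slope:.3f} x")

# All four subsets yield nearly the same fitted line (about y_hat = 3.00 + 0.500 x)
# even though their scatter plots look completely different.
```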

Linear regression is easy to explain!


• One nice advantage of linear regression models (and linear classification) is that the coefficients can be inspected to give insight into which input variables are most important in predicting the output
• The variables whose coefficients have the largest magnitude have the strongest association with the output, at least when the inputs are on comparable scales; see the sketch after this list
  – A large positive coefficient implies that the output will increase when this input is increased (positively correlated)
  – A large negative coefficient implies that the output will decrease when this input is increased (negatively correlated)
  – A small or zero coefficient suggests that the input is uncorrelated with the output (at least to first order)
• Linear regression can be used to find the best "indicators"
• However, be careful not to confuse correlation with causality
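A hedged sketch (an addition, not from the slides) of reading coefficients as indicators, using scikit-learn on synthetic data; the feature names and true effects are illustrative assumptions. Standardizing the inputs first makes the coefficient magnitudes comparable.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(3)

# Synthetic data (assumption): three inputs with known effects on y.
X = rng.normal(size=(200, 3))
y = 2.0 * X[:, 0] - 1.5 * X[:, 1] + 0.0 * X[:, 2] + rng.normal(0, 0.5, 200)

# Standardize inputs so coefficient magnitudes are comparable.
X_std = StandardScaler().fit_transform(X)
model = LinearRegression().fit(X_std, y)

for name, coef in zip(["x1", "x2", "x3"], model.coef_):
    print(f"{name}: coefficient = {coef:+.3f}")

# Expect a large positive coefficient for x1, a large negative one for x2,
# and a near-zero one for x3.
```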

Multivariate Regression
• We have explored problems with one response variable $y$ and one explanatory variable $x$
• Sometimes a straight line (linear regression) is not adequate and a quadratic or cubic model is needed (polynomial regression)
• Sometimes there is more than one predictor variable and their simultaneous effect needs to be modeled
• $n$ observations $(y_i;\, x_{i1}, x_{i2}, \ldots, x_{ik})$, $i = 1, \ldots, n$
• Multiple regression model: $y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_k x_k + \epsilon$
• Polynomial regression is linear in $\beta$ and not necessarily in $x$: set $x_1 = x$, $x_2 = x^2$, $x_3 = x^3$ (see the sketch below)
• Simple linear regression: $y;\; x$
• Multiple linear regression: $y;\; x_1, x_2, \ldots, x_k$
• Multivariate regression: $y_1, y_2, \ldots, y_m;\; x_1, x_2, \ldots, x_k$
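A brief sketch (an addition, not from the slides) of that last point: polynomial regression is ordinary linear least squares on the constructed features $x, x^2, x^3$; the cubic ground truth below is an assumption.

```python
import numpy as np

rng = np.random.default_rng(4)
x = np.linspace(-2, 2, 40)
y = 1.0 - 0.5 * x + 0.8 * x**3 + rng.normal(0, 0.3, x.shape)  # assumed cubic truth

# Build the design matrix [1, x, x^2, x^3]: the model is linear in the
# coefficients beta even though it is cubic in x.
X = np.column_stack([np.ones_like(x), x, x**2, x**3])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
print("fitted coefficients:", np.round(beta, 3))  # ~ [1.0, -0.5, 0.0, 0.8]
```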
Multiple Linear Regression
• Least squares fit:
$$Q = \sum_{i=1}^{n} \left[ y_i - (\beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \cdots + \beta_k x_{ik}) \right]^2$$
• Taking the partial derivatives and equating to zero:
$$\frac{\partial Q}{\partial \beta_0} = -2 \sum_{i=1}^{n} \left[ y_i - (\beta_0 + \beta_1 x_{i1} + \cdots + \beta_k x_{ik}) \right] = 0$$
$$\frac{\partial Q}{\partial \beta_j} = -2 \sum_{i=1}^{n} \left[ y_i - (\beta_0 + \beta_1 x_{i1} + \cdots + \beta_k x_{ik}) \right] x_{ij} = 0 \quad \text{for } j = 1, 2, \ldots, k$$
• After simplification (for $j = 1, 2, \ldots, k$):
$$n\beta_0 + \beta_1 \sum_{i=1}^{n} x_{i1} + \cdots + \beta_k \sum_{i=1}^{n} x_{ik} = \sum_{i=1}^{n} y_i$$
$$\beta_0 \sum_{i=1}^{n} x_{ij} + \beta_1 \sum_{i=1}^{n} x_{i1} x_{ij} + \cdots + \beta_k \sum_{i=1}^{n} x_{ik} x_{ij} = \sum_{i=1}^{n} x_{ij} y_i$$
• These have to be solved simultaneously for $\hat{\beta}_0, \hat{\beta}_1, \ldots, \hat{\beta}_k$

Multiple Linear Regression


• Regression model in matrix form:
$$\mathbf{y} = \mathbf{X}\boldsymbol{\beta} + \boldsymbol{\epsilon}$$
where
$$\mathbf{y} = \begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{bmatrix}, \quad
\boldsymbol{\epsilon} = \begin{bmatrix} \epsilon_1 \\ \epsilon_2 \\ \vdots \\ \epsilon_n \end{bmatrix}, \quad
\mathbf{X} = \begin{bmatrix} 1 & x_{11} & x_{12} & \cdots & x_{1k} \\ 1 & x_{21} & x_{22} & \cdots & x_{2k} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 1 & x_{n1} & x_{n2} & \cdots & x_{nk} \end{bmatrix}, \quad
\boldsymbol{\beta} = \begin{bmatrix} \beta_0 \\ \beta_1 \\ \vdots \\ \beta_k \end{bmatrix}, \quad
\hat{\boldsymbol{\beta}} = \begin{bmatrix} \hat{\beta}_0 \\ \hat{\beta}_1 \\ \vdots \\ \hat{\beta}_k \end{bmatrix}$$
• The normal equations are simultaneous linear equations whose solution gives the least squares estimates:
$$\mathbf{X}^{\top}\mathbf{X}\boldsymbol{\beta} = \mathbf{X}^{\top}\mathbf{y}$$
• The solution is given by the pseudoinverse:
$$\hat{\boldsymbol{\beta}} = \left( \mathbf{X}^{\top}\mathbf{X} \right)^{-1} \mathbf{X}^{\top}\mathbf{y}$$
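A minimal sketch (an addition, not from the slides) of the matrix solution in NumPy; forming $(\mathbf{X}^{\top}\mathbf{X})^{-1}$ explicitly is numerically fragile, so the sketch also shows the preferred `np.linalg.lstsq` route. The synthetic data and true coefficients are assumptions.

```python
import numpy as np

rng = np.random.default_rng(5)
n, k = 100, 3
X_raw = rng.normal(size=(n, k))
X = np.hstack([np.ones((n, 1)), X_raw])      # prepend the intercept column
beta_true = np.array([1.0, 2.0, -3.0, 0.5])  # assumed true coefficients
y = X @ beta_true + rng.normal(0, 0.1, n)

# Normal equations: solve X^T X beta = X^T y (rather than inverting X^T X).
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Numerically preferable equivalent via least squares.
beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)

print(np.round(beta_hat, 3))
print(np.round(beta_lstsq, 3))  # both should be close to beta_true
```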
