Linear Regression
- θ = (w, b): parameters
- w: weights
- b: bias
b can be absorbed into w by defining w = [b, w1, ..., wD] and x = [1, x1, ..., xD], so that

f(x; θ) = wᵀx

x can also be replaced by a non-linear function of the inputs φ(x), called a basis expansion:

f(x; θ) = wᵀφ(x)
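A minimal NumPy sketch of this idea (the function name phi and the polynomial degree are illustrative assumptions, not from the slides):

```python
import numpy as np

def phi(x, degree=3):
    """Polynomial basis expansion of a scalar input:
    phi(x) = [1, x, x^2, ..., x^degree].
    The leading 1 absorbs the bias b into the weight vector w."""
    return np.array([x ** d for d in range(degree + 1)])

w = np.array([0.5, -1.0, 2.0, 0.3])   # example weights; w[0] plays the role of b
x = 1.5
y_hat = w @ phi(x)                    # f(x; θ) = wᵀ φ(x)
```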
The general form of the linear regression model, written for all observations at once:

ŷ = Xw + b

- N: number of observations
- D: number of features
- ŷ ∈ ℝ^N: predictions
- X ∈ ℝ^(N×D): inputs (design matrix)
- w ∈ ℝ^D: weights
- b ∈ ℝ: bias

When the bias b is absorbed into w (and a column of ones is prepended to X):

ŷ = Xw
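A short sketch of this matrix form with the bias absorbed (all data here is randomly generated purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
N, D = 5, 3
X = rng.normal(size=(N, D))            # design matrix: one row per observation
X = np.hstack([np.ones((N, 1)), X])    # prepend a column of ones to absorb b
w = rng.normal(size=D + 1)             # w = [b, w1, ..., wD]
y_hat = X @ w                          # ŷ = Xw, shape (N,)
```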
Loss function - Least squares
Goal:
Find the parameters w that minimize the residual sum of squares (loss)
RSS(w) = (1/2) Σ_{i=1}^{N} (yᵢ − f(xᵢ))² = (1/2) Σ_{i=1}^{N} (yᵢ − wᵀxᵢ)²
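In code, the RSS is a one-liner (a sketch assuming a design matrix X with the bias column and targets y as above):

```python
import numpy as np

def rss(w, X, y):
    """RSS(w) = 1/2 * sum_i (y_i - wᵀx_i)^2 = 1/2 * ||Xw - y||^2."""
    r = X @ w - y
    return 0.5 * (r @ r)
```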
Per observation, the model can also be written as

y = wᵀx + ε

where ε is the residual error between the predictions and the true response (unmodeled effects/random noise). We assume ε has a Gaussian distribution, ε ∼ N(0, σ²), so the parameters become θ = (w, σ²).
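A sketch of sampling synthetic data from this noise model (the true weights and σ below are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
N, D, sigma = 100, 2, 0.5
w_true = np.array([1.0, -2.0, 0.7])                       # [b, w1, w2], bias absorbed
X = np.hstack([np.ones((N, 1)), rng.normal(size=(N, D))])
y = X @ w_true + rng.normal(scale=sigma, size=N)          # y = Xw + ε, ε ~ N(0, σ²)
```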
We estimate the parameters using Maximum Likelihood Estimation: we look for the parameters that maximize the likelihood ∏_{i=1}^{N} p(yᵢ | xᵢ; θ). It is easier to minimize the negative log likelihood (NLL)

NLL(θ) = − Σ_{i=1}^{N} log p(yᵢ | xᵢ; θ)
It can be shown that minimizing the NLL is equivalent to minimizing the RSS.
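Sketch of why: under the Gaussian noise assumption, p(yᵢ | xᵢ; θ) = N(yᵢ | wᵀxᵢ, σ²), so

NLL(θ) = (N/2) log(2πσ²) + (1/(2σ²)) Σ_{i=1}^{N} (yᵢ − wᵀxᵢ)² = (N/2) log(2πσ²) + (1/σ²) RSS(w)

The first term does not depend on w, so the w that minimizes the NLL is exactly the w that minimizes RSS(w).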
Ordinary Least Squares
Our loss function is
J(w) = RSS(w) = (1/2) Σ_{i=1}^{N} (yᵢ − wᵀxᵢ)² = (1/2) ||Xw − y||₂² = (1/2) (Xw − y)ᵀ(Xw − y)
Setting the gradient ∇J(w) = Xᵀ(Xw − y) to zero gives the normal equations XᵀXw = Xᵀy, whose solution is ŵ = (XᵀX)⁻¹Xᵀy. The inverse should not be computed directly, since XᵀX can be singular or ill-conditioned. There are better alternatives:
- SVD
- QR decomposition
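A sketch comparing these approaches in NumPy (the data is synthetic; np.linalg.lstsq is SVD-based, and the explicit normal-equations solve is shown only for contrast):

```python
import numpy as np

rng = np.random.default_rng(2)
X = np.hstack([np.ones((50, 1)), rng.normal(size=(50, 2))])
y = X @ np.array([1.0, -2.0, 0.7]) + rng.normal(scale=0.1, size=50)

# SVD-based least squares
w_svd, *_ = np.linalg.lstsq(X, y, rcond=None)

# QR decomposition: X = QR, then solve the triangular system R w = Qᵀ y
Q, R = np.linalg.qr(X)
w_qr = np.linalg.solve(R, Q.T @ y)

# Normal equations with an explicit solve of XᵀX w = Xᵀ y (avoid when XᵀX is ill-conditioned)
w_ne = np.linalg.solve(X.T @ X, X.T @ y)

print(np.allclose(w_svd, w_qr), np.allclose(w_svd, w_ne))
```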
Explore further
- Polynomial regression (other basis expansions)
- Weighted linear regression
- Bayesian linear regression