Simple Linear Regression Model
The Data Generating Process (DGP), or the population, is described by the following linear model:
$$Y_j=\beta_0+\beta_1 X_j+\varepsilon_j$$
Y_j is the j-th observation of the dependent variable Y (it is known)
X_j is the j-th observation of the independent variable X (it is known)
β_0 is the intercept term (it is unknown)
β_1 is the slope parameter (it is unknown)
ε_j is the j-th error, the j-th unobserved factor that, besides X, affects Y (it is unknown)
Since the values of X_j and Y_j are known but the values of β_0, β_1, and ε_j are unknown, the regression model that describes the relationship between X and Y is also unknown.
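As a concrete illustration of a DGP, the following Python sketch simulates data from such a model; the parameter values β_0 = 2, β_1 = 0.5, the uniform distribution of X, and the normal errors are arbitrary choices made only for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical population parameters, chosen only for illustration
beta0, beta1 = 2.0, 0.5
n = 100

X = rng.uniform(0, 10, size=n)    # observed regressor X_j
eps = rng.normal(0, 1, size=n)    # unobserved errors eps_j
Y = beta0 + beta1 * X + eps       # observed dependent variable Y_j
```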
Graphically, the errors are the vertical distances between the observed data points and the values predicted by the linear regression model.
The OLS estimators β̂_0 and β̂_1 minimize the sum of squared deviations
$$O(\beta_0,\beta_1)=\sum_{j=1}^{n}\bigl(Y_j-\beta_0-\beta_1 X_j\bigr)^2$$
The 1st of the first order conditions (FOC) is
$$\frac{\partial O}{\partial\beta_0}=0 \iff \sum_{j=1}^{n}2\bigl(Y_j-\hat\beta_0-\hat\beta_1 X_j\bigr)(-1)=0$$
⇓
$$\hat\beta_0=\bar Y_n-\hat\beta_1\bar X_n$$
where:
$$\bar Y_n=\frac{1}{n}\sum_{j=1}^{n}Y_j \quad\text{is the sample average of } Y$$
$$\bar X_n=\frac{1}{n}\sum_{j=1}^{n}X_j \quad\text{is the sample average of } X$$
The 2nd of the first order conditions (FOC) is
$$\frac{\partial O}{\partial\beta_1}=0 \iff \sum_{j=1}^{n}2\bigl(Y_j-\hat\beta_0-\hat\beta_1 X_j\bigr)(-X_j)=0$$
⇓
$$\hat\beta_1=\frac{\sum_{j=1}^{n}(X_j-\bar X_n)(Y_j-\bar Y_n)}{\sum_{j=1}^{n}(X_j-\bar X_n)^2}=\frac{\frac{1}{n}\sum_{j=1}^{n}(X_j-\bar X_n)(Y_j-\bar Y_n)}{\frac{1}{n}\sum_{j=1}^{n}(X_j-\bar X_n)^2}=\frac{\text{sample covariance between }X\text{ and }Y}{\text{sample variance of }X}$$
If β̂_0 and β̂_1 are the OLS estimators, the predicted (fitted) values are defined as
$$\hat Y_j=\hat\beta_0+\hat\beta_1 X_j$$
while the residuals are defined as
$$\hat\varepsilon_j=Y_j-\hat Y_j=Y_j-\bigl(\hat\beta_0+\hat\beta_1 X_j\bigr)$$
The closer the residuals ε̂_j are to 0, the better the quality of the regression.
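A minimal numpy sketch of these formulas, assuming X and Y are one-dimensional arrays of equal length; the small dataset is hypothetical and used only to show the mechanics.

```python
import numpy as np

def ols_simple(X, Y):
    """OLS estimates of the intercept and slope via the covariance/variance formulas above."""
    b1 = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)
    b0 = Y.mean() - b1 * X.mean()
    return b0, b1

# Hypothetical example data
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])

b0, b1 = ols_simple(X, Y)
Y_hat = b0 + b1 * X    # predicted (fitted) values
resid = Y - Y_hat      # residuals
```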
Under the assumption that the conditional mean of Y is linear in X, i.e. E[Y | X] = β_0 + β_1 X, we have
$$E[\varepsilon\mid X]=E\bigl[Y-(\beta_0+\beta_1 X)\mid X\bigr]=E[Y\mid X]-E[\beta_0+\beta_1 X\mid X]=E[Y\mid X]-\{\beta_0+\beta_1 X\}=0$$
and, by the law of iterated expectations,
$$E[\varepsilon]=E\bigl[E[\varepsilon\mid X]\bigr]=E[0]=0$$
In conclusion, the expectation of the unobserved factors is not influenced by X and is equal to 0:
$$E[\varepsilon\mid X]=E[\varepsilon]=0$$
This implies that
$$E[X\varepsilon]=0$$
because, by the law of iterated expectations, E[Xε] = E[E[Xε | X]] = E[X·E[ε | X]] = E[X·0] = 0.
Moreover,
$$E[\hat\varepsilon_j]=0$$
3 important properties
We can derive 3 properties from the previous conclusions:
Property 1: the sample average of the fitted values Ŷ coincides with the sample average of Y
$$\bar{\hat Y}_n=\bar Y_n$$
Property 2: the sample average of the OLS residuals is always 0
$$\bar{\hat\varepsilon}_n=\frac{1}{n}\sum_{j=1}^{n}\hat\varepsilon_j=0$$
Property 3: the sample covariance between the regressors and the OLS residuals is always 0
$$\frac{1}{n}\sum_{j=1}^{n}\bigl(X_j-\bar X_n\bigr)\bigl(\hat\varepsilon_j-\bar{\hat\varepsilon}_n\bigr)=0$$
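The three properties can be checked numerically with a short sketch (the dataset is again hypothetical; the checks hold up to floating-point error):

```python
import numpy as np

X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])

b1 = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)
b0 = Y.mean() - b1 * X.mean()
Y_hat = b0 + b1 * X
resid = Y - Y_hat

print(np.isclose(Y_hat.mean(), Y.mean()))   # Property 1: average of fitted values = average of Y
print(np.isclose(resid.mean(), 0.0))        # Property 2: average of residuals = 0
print(np.isclose(np.mean((X - X.mean()) * (resid - resid.mean())), 0.0))  # Property 3: cov(X, resid) = 0
```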
The explained sum of squares (SSE) measures the dispersion of the fitted values (the variance explained by the regression):
$$SSE=\sum_{j=1}^{n}\bigl(\hat Y_j-\bar Y_n\bigr)^2=\sum_{j=1}^{n}\bigl(\hat Y_j-\bar{\hat Y}_n\bigr)^2$$
The sum of squared residuals (SSR) measures the dispersion of the residuals (the variance explained by the residuals):
$$SSR=\sum_{j=1}^{n}\bigl(Y_j-\hat Y_j\bigr)^2=\sum_{j=1}^{n}\hat\varepsilon_j^{\,2}$$
Theorem: the total variance of the data is given by the sum of the variance explained by the regression and the variance explained by the residuals:
$$SST=SSE+SSR$$
where $SST=\sum_{j=1}^{n}(Y_j-\bar Y_n)^2$ is the total sum of squares of Y.
Of course:
- the smaller the SSR, the better the fit of the regression to the data:
$$SSR\ll SST \;\Rightarrow\; R^2\approx 1 \;(\text{good fit})$$
- the larger the SSR, the worse the fit of the regression to the data:
$$SSR\approx SST \;\Rightarrow\; R^2\approx 0 \;(\text{poor fit})$$
where R² = SSE/SST = 1 − SSR/SST is the coefficient of determination.
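A sketch of the decomposition and of R² on the same hypothetical data; np.isclose confirms SST = SSE + SSR up to floating-point error.

```python
import numpy as np

X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])

b1 = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)
b0 = Y.mean() - b1 * X.mean()
Y_hat = b0 + b1 * X

SSE = np.sum((Y_hat - Y.mean()) ** 2)   # explained sum of squares
SSR = np.sum((Y - Y_hat) ** 2)          # sum of squared residuals
SST = np.sum((Y - Y.mean()) ** 2)       # total sum of squares

print(np.isclose(SST, SSE + SSR))       # decomposition theorem
R2 = 1 - SSR / SST                      # coefficient of determination
print(R2)
```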
β̂_1 can also be expressed as
$$\hat\beta_1=\beta_1+\frac{\sum_{j=1}^{n}(X_j-\bar X_n)\,\varepsilon_j}{SST_X}$$
where $SST_X=\sum_{j=1}^{n}(X_j-\bar X_n)^2$ is the total sum of squares of X.
Then, by the law of iterated expectations, β̂_1 is an unbiased estimator of β_1, in the sense that
$$E[\hat\beta_1]=\beta_1$$
because E[β̂_1 | X] = β_1 (the numerator of the term above has conditional expectation 0, since E[ε_j | X] = 0), and hence E[β̂_1] = E[E[β̂_1 | X]] = E[β_1] = β_1.
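Unbiasedness can be illustrated with a small Monte Carlo sketch: averaging the OLS slope over many simulated samples (with hypothetical parameter values) should come close to the true β_1.

```python
import numpy as np

rng = np.random.default_rng(1)
beta0, beta1, n, reps = 2.0, 0.5, 50, 5000   # hypothetical values for the experiment

X = rng.uniform(0, 10, size=n)               # regressors held fixed across replications
slopes = np.empty(reps)
for r in range(reps):
    eps = rng.normal(0, 1, size=n)
    Y = beta0 + beta1 * X + eps
    slopes[r] = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)

print(slopes.mean())   # should be close to beta1 = 0.5
```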
β̂_0 can also be expressed as
$$\hat\beta_0=\beta_0+(\beta_1-\hat\beta_1)\,\bar X_n+\bar\varepsilon_n$$
where ε̄_n = (1/n) Σ_{j=1}^n ε_j is the sample average of the errors.
Then, by the law of iterated expectations, β̂_0 is an unbiased estimator of β_0, in the sense that
$$E[\hat\beta_0]=\beta_0$$
because E[β̂_0] = E[E[β̂_0 | X]] = E[β_0] = β_0.
Variance of the OLS estimators
Assume that
1) the DGP of (X_j, Y_j) is Y_j = β_0 + β_1 X_j + ε_j
2) E[ε_j] = 0
3) E[ε_j | X_1, …, X_n] = E[ε_j] = 0
4) the ε_j are independent
5) V[ε_j | X_1, …, X_n] = E[ε_j² | X_1, …, X_n] − (E[ε_j | X_1, …, X_n])² = E[ε_j² | X_1, …, X_n] = E[ε_j²] = σ_ε²
Then E[β̂_1² | X] = β_1² + σ_ε²/SST_X, so
$$V[\hat\beta_1\mid X]=\frac{\sigma_\varepsilon^2}{SST_X}$$
because
$$V[\hat\beta_1\mid X]=E[\hat\beta_1^2\mid X]-\bigl(E[\hat\beta_1\mid X]\bigr)^2=\beta_1^2+\frac{\sigma_\varepsilon^2}{SST_X}-\beta_1^2=\frac{\sigma_\varepsilon^2}{SST_X}$$
Note that:
- as the variance of the errors goes to 0, the variance of the estimator goes to 0, so the estimator becomes more and more precise: σ_ε² → 0 ⇒ V[β̂_1 | X] → 0
- as the sample size goes to +∞, the variance of the estimator goes to 0, so the estimator becomes more and more precise: n → +∞ ⇒ SST_X → +∞ ⇒ V[β̂_1 | X] → 0
Then E[β̂_0² | X] = β_0² + (σ_ε²/SST_X)·X̄_n² + σ_ε²/n, so
$$V[\hat\beta_0\mid X]=\frac{\sigma_\varepsilon^2\sum_{j=1}^{n}X_j^2}{n\,SST_X}$$
because
$$V[\hat\beta_0\mid X]=E[\hat\beta_0^2\mid X]-\bigl(E[\hat\beta_0\mid X]\bigr)^2=\beta_0^2+\frac{\sigma_\varepsilon^2}{SST_X}\bar X_n^2+\frac{\sigma_\varepsilon^2}{n}-\beta_0^2=\sigma_\varepsilon^2\left(\frac{\bar X_n^2}{SST_X}+\frac{1}{n}\right)=\frac{\sigma_\varepsilon^2\sum_{j=1}^{n}X_j^2}{n\,SST_X}$$
Note that:
- as the variance of the errors goes to 0, the variance of the estimator goes to 0, so the estimator becomes more and more precise: σ_ε² → 0 ⇒ V[β̂_0 | X] → 0
- as the sample size goes to +∞, the variance of the estimator goes to 0, so the estimator becomes more and more precise: n → +∞ ⇒ SST_X → +∞ ⇒ V[β̂_0 | X] → 0
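Both conditional-variance formulas can be checked with a Monte Carlo sketch that holds X fixed and resamples the errors (parameter values are hypothetical): the simulated variances of β̂_1 and β̂_0 should be close to σ_ε²/SST_X and σ_ε²·ΣX_j²/(n·SST_X).

```python
import numpy as np

rng = np.random.default_rng(2)
beta0, beta1, n, reps, sigma = 2.0, 0.5, 50, 5000, 1.0   # hypothetical values

X = rng.uniform(0, 10, size=n)            # condition on a fixed sample of regressors
SST_X = np.sum((X - X.mean()) ** 2)

b1s, b0s = np.empty(reps), np.empty(reps)
for r in range(reps):
    Y = beta0 + beta1 * X + rng.normal(0, sigma, size=n)
    b1s[r] = np.sum((X - X.mean()) * (Y - Y.mean())) / SST_X
    b0s[r] = Y.mean() - b1s[r] * X.mean()

print(b1s.var(), sigma**2 / SST_X)                         # V[b1 | X] vs formula
print(b0s.var(), sigma**2 * np.sum(X**2) / (n * SST_X))    # V[b0 | X] vs formula
```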
Assume that
1) the DGP of (X_j, Y_j) is Y_j = β_0 + β_1 X_j + ε_j
2) E[ε_j] = 0
3) E[ε_j | X_1, …, X_n] = E[ε_j] = 0
4) the ε_j are independent
5) V[ε_j | X_1, …, X_n] = E[ε_j² | X_1, …, X_n] − (E[ε_j | X_1, …, X_n])² = E[ε_j² | X_1, …, X_n] = E[ε_j²] = σ_ε²
Then the estimator σ̂_ε² of the error variance is unbiased:
$$E[\hat\sigma_\varepsilon^2]=\sigma_\varepsilon^2$$
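The notes do not restate the formula for σ̂_ε² here; a standard choice in the simple regression model is the degrees-of-freedom-corrected estimator σ̂_ε² = SSR/(n − 2), which the following sketch assumes.

```python
import numpy as np

X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])
n = len(X)

b1 = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)
b0 = Y.mean() - b1 * X.mean()
resid = Y - (b0 + b1 * X)

# Assumed estimator: SSR / (n - 2), the usual degrees-of-freedom correction
# for a simple regression with two estimated parameters (b0 and b1)
sigma2_hat = np.sum(resid ** 2) / (n - 2)
print(sigma2_hat)
```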