Chapter 2
Chapter 2
L/O/G/O
Chapter 2
THE SIMPLE REGRESSION MODEL
Wooldridge: Introductory Econometrics:
A Modern Approach, 5e
Content
1. Definition of the simple linear regression model
2. The ordinary least squares estimates (OLS)
3. Properties of OLS on any sample
4. Goodness-of-Fit
5. Assumptions for the simple linear regression model
6. Units of measurement and functional form
www.themegallery.com
1
10/22/2021
www.themegallery.com
as long as
1 shows how much the dependent The interpretation is only correct if all
variable (y) changes if the independent other things remain unchanged when the
variable (x) is increased by 1 unit independent variable is increased by 1 unit
• The simple linear regression model is rarely applicable in practice but its
discussion is useful for pedagogical reasons.
www.themegallery.com
2
10/22/2021
www.themegallery.com
3
10/22/2021
www.themegallery.com
4
10/22/2021
L n i 1
2. xi ( yi ˆ0 ˆ1 xi ) 0
ˆ
1 i 1 ˆ0 y ˆ1 x
• Example: Find the sample regression function: (million dong/year)
x (Income) 100 80 98 95 75 79 78 69 81 88
90 75 78 88 62 69 65 55 60 70
1076 4309
ˆ1 0.9881; ˆ0 12.0937
1089 363
www.themegallery.com
( SRF ) : y 12.0937 0.9881x uˆ
9
• Fitted regression:
963.191 18.501 roe
salary
Fitted value (predicted value) When the return on equity increases by 1 percent,
of salary then salary is predicted to change by 18.501$, if all
other things remain unchanged.
• Causal interpretation of 𝜷𝟏 ?
www.themegallery.com
10
5
10/22/2021
Unknown population
regression line
www.themegallery.com
11
L n n
2. xi ( yi 0 1 xi ) xi uˆi 0
ˆ ˆ
ˆ
1 i 1 i 1
www.themegallery.com
12
6
10/22/2021
4. Goodness-of-Fit
• Aim: How well does the explanatory variable explain the dependent variable?
• Total sum of squares: represents total variation in dependent variable.
n
SST ( yi y ) 2
i 1
• Explained sum of squares: represents variation explained by regression.
n
SSE ( yˆ i y ) 2
i 1
• Residual sum of squares: represents variation that is not explained by regression.
n n
SSR ( yi yˆ i ) uˆi2
2
i 1 i 1
• Decomposition of total variation: SST = SSE + SSR
• SSR should be as small as possible.
www.themegallery.com
13
4. Goodness-of-Fit
Coefficient of determination:
SSR SSE
R2 1
SST SST
Properties:
• 0 R2 1
• R2 0: OLS provides a poor fit to the data.
• R2 1: OLS provides a perfect fit to the data.
• R2 is interpreted as the fraction of the sample variation in y that is explained by
the regression.
• Caution: A high R2 does not necessarily mean that the regression has a causal
interpretation!
www.themegallery.com
14
7
10/22/2021
4. Goodness-of-Fit
Example: CEO Salary and return on equity
963.191 18.501 roe
salary
The regression explains only 1.3%
n 209; R 2 0.0132 of the total variation in salaries
15
The value of the explanatory variable must contain no information about the
variability
www.themegallery.com
of the unobserved factors.
16
8
10/22/2021
www.themegallery.com
17
www.themegallery.com
18
9
10/22/2021
19
uˆ 2
i
SSR
ˆ 2 i 1
n2 n2
Theorem 2.3: Unbiased estimator of 2
Under Assumptions SLR.1 through SLR.5, 𝜎 is an unbiased estimator of 2:
E (ˆ 2 ) 2
𝜎 = 𝜎 is called the standard error of the regression (SER) or the standard
error of the estimate.
www.themegallery.com
20
10
10/22/2021
If 𝜎 is plugged in for the unknown 2, then we have the estimated variances:
n n
ˆ . x
2 2
x 2
ˆ 2
ˆ 2 i
ˆ 2 i
V
ar( ˆ1 ) n
V
ar( ˆ0 ) n
i 1
. i 1
SSTx n SSTx
(x x)
i 1
i
2
n. ( xi x ) 2
i 1
www.themegallery.com
21
x 2
i
sd ( ˆ1 ) Var( ˆ1 ) sd ( ˆ0 ) Var( ˆ0 ) . i 1
SSTx n SSTx
If 𝜎 is plugged in for the unknown , then we have the estimated standard deviations
of OLS estimators:
n
ˆ ˆ x 2
se( ˆ1 ) V
i
ar( ˆ1 ) se( ˆ0 ) V
ar( ˆ0 ) . i 1
SSTx n SSTx
The estimated standard deviations of the regression coefficients are called “standard
errors”. They measure how precisely the regression coefficients are estimated.
www.themegallery.com
22
11
10/22/2021
23
24
12