The Simple Regression Model

This document provides an overview of the simple linear regression model. It discusses (1) defining the model as relating a dependent variable y to an independent variable x, plus an error term, (2) estimating the model parameters β0 and β1 using the method of moments and ordinary least squares, and (3) the key assumptions that the error term has a mean of zero and is uncorrelated with the independent variable. Examples are provided to illustrate key concepts such as the population regression function.


Department of Finance & Banking, University of Malaya

The Simple Regression Model

Dr. Aidil Rizal Shahrin


[email protected]

October 10, 2020


Contents

1 Simple Regression Model

2 Estimating β0 & β1
2.1 Method of Moments
2.2 Ordinary Least Squares

3 Sample Regression Function (SRF)


3.1 Example...

4 Goodness of Fit

5 Units of Measurement and Functional Form

5.1 Changing Units of Measurement
5.2 Incorporating Nonlinearities
5.3 Meaning of ’Linear’ Regression
Contents

6 Expected Values and Variances of OLS Estimator


6.1 Unbiasedness of OLS
6.2 Variances of the OLS Estimators
6.2.1 Sampling Variances of the OLS Estimators
6.2.2 Estimating the Error Variance

7 Regression through Origin



Simple Regression Model

i. Most of the time in econometric analysis, we start from this premise: y and x are two variables representing some population (the most common scenario involves more than two random variables).
ii. We are interested in ‘explaining y in terms of x’.
iii. In modeling ‘y in terms of x’, we confront three issues:
1. There is never an exact relationship between two variables, so how do we allow for other factors to affect y? (Notice that in theory the relationship is always exact.)
2. What is the functional relationship between y and x? Linear, quadratic, cubic, or even exponential? If you have economic theory, it will give guidance.
3. How sure are we that we are capturing ‘ceteris paribus’? Remember, we are interested in causality!



Simple Regression Model

iv. The simple linear regression model (also known as the two-variable regression model or bivariate linear regression model) relates the two variables x and y:

y = β0 + β1 x + u (1)

(If a term such as β2 x2 were added, the model would become a multiple regression. If β1 = 0, x does not explain y; if β1 > 0, y increases with x.)

v. The variables y and x have several names, as shown in Tab.1.
Y X
Dependent variable Independent variable
Explained variable Explanatory variable
Response variable Control variable
Predicted variable Predictor variable
Regressand Regressor
Table 1: Terminology for simple regression



Simple Regression Model

vi. The variable u in Eq.1, called the error term or disturbance in the relationship, represents factors other than x that affect y. Simply put, u stands for all unobserved factors that affect y.
vii. Eq.1 also addresses the functional relationship between y and x. If the other factors in u are held fixed (remember ceteris paribus?), so that the change in u is zero, ∆u = 0, then x has a linear effect on y:

∆y/∆x = β1 if ∆u = 0 (2)

viii. Thus, β1 in Eq.2 is the slope parameter in Eq.1, and it is of primary interest in applied economics. β0 in Eq.1 is the intercept parameter, sometimes called the constant term. It also has its uses, although it is rarely central to the analysis.



Simple Regression Model

ix. Linearity of Eq.1 implies that a one-unit change in x has the same effect on y regardless of the initial value of x, as shown in Eq.2. Why does this not work for the idea of increasing returns in economics?
x. The most difficult issue to address is whether Eq.1 really allows us to draw ceteris paribus conclusions about how x affects y. β1 does measure the effect of x on y holding the other factors in u fixed. If any of the unobservables in u is related to x, we will not get reliable estimators of β0 and β1 in Eq.1 (more on this later).
xi. One key assumption about u is that, as long as we have the intercept β0 in Eq.1, the following always holds:

E(u) = 0 (3)



Simple Regression Model

xii. A natural measure of how x and u are related is the correlation coefficient. If u and x are uncorrelated, then, as random variables, they are not linearly related.
xiii. However, correlation only measures linear dependence between variables: u can be uncorrelated with x while being correlated with x², which correlation fails to capture.
xiv. A much stronger and better assumption concerns the expected value of u given x, the conditional expectation of u given x:

E(u|x) = E(u) (4)

Remember that this holds when u and x are statistically independent. Eq.4 says the average value of u does not depend on the value of x.



Simple Regression Model

xv. Thus, we can combine Eq.4 and Eq.3 to have

E(u|x) = E(u) = 0 (5)

If we take the conditional expectation of Eq.1 given x, we have:

E(y|x) = E(β0 + β1 x + u|x)
= E(β0 |x) + E(β1 x|x) + E(u|x)
= β0 + β1 E(x|x) + E(u|x)   (6)
= β0 + β1 x

where Eq.6 is called the population regression function (PRF).

Remember, Eq.6 states that the average value of y changes



Simple Regression Model

with x; it does not say y equals β0 + β1 x for all units in the population. Compare:

y = β0 + β1 x + u
E(y|x) = β0 + β1 x

Notice the difference? Refer to Fig.1; can you relate it to Eq.6?



Simple Regression Model

Figure 1: E(y|x) as a linear function of x (this line is called the population regression function)



Estimating β0 & β1

i. Now we address how to estimate β0 and β1 in Eq.1. (An estimator must be a function of the sample.)
ii. In order to do this, we need a sample from the population of interest.
iii. Let i = 1, . . . , n index the observations of a sample of size n on both y and x, where we can now rewrite Eq.1 for each i as:

yi = β0 + β1 xi + ui (7)

where Eq.7 can be any of these:

y1 = β0 + β1 x1 + u1
y2 = β0 + β1 x2 + u2
⋮
yn = β0 + β1 xn + un
Estimating β0 & β1

iv. For example, xi is annual income and yi is annual savings for family i during a particular year, and we have collected data on 15 families, so n = 15.
v. Two key equations in estimating β0 and β1 of Eq.1:

E(u) = 0, holds with an intercept (8)
Cov(x, u) = E(xu) = 0, x and u are not correlated (9)

Eq.8 holds when we have an intercept. For Eq.9, if E(u|x) = E(u), then cov(x, u) and corr(x, u) will equal zero. Why? Since E(u) = 0,

cov(x, u) = E(xu) − E(x)E(u) = E(xu)



Estimating β0 & β1

And, by the law of iterated expectations, we have

E(xu) = E[E(xu|x)]

But
E(xu|x) = xE(u|x)
Thus

E(xu) = E[E(xu|x)]
= E[xE(u|x)]
= E[x0]
=0

we have proved Eq.9.



Method of Moments

i. From Eq.1, u = y − β0 − β1 x, and inserting this into Eq.8 and Eq.9, we have

E(u) = E(y − β0 − β1 x) = 0 (10)
E(xu) = E[x(y − β0 − β1 x)] = 0 (11)

ii. Remember from before (refer to the mathematical statistics notes) that the method of moments estimates the population mean of Y, E(Y) = µ, by its sample counterpart ∑ᵢ₌₁ⁿ Yi /n. Applying this to Eq.10 and Eq.11, we have (the ‘hat’ ˆ denotes an estimate):

n⁻¹ ∑ᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi ) = 0 (12)
n⁻¹ ∑ᵢ₌₁ⁿ xi (yi − β̂0 − β̂1 xi ) = 0 (13)
Method of Moments
iii. Eq.12 can be rewritten as (using properties of summation):

ȳ = β̂0 + β̂1 x̄, or β̂0 = ȳ − β̂1 x̄ (14)

iv. Inserting Eq.14 into Eq.13, we have

∑ᵢ₌₁ⁿ xi [yi − (ȳ − β̂1 x̄) − β̂1 xi ] = 0

v. Finally, after manipulation of the above (refer to your textbook),

β̂1 = ∑ᵢ₌₁ⁿ (xi − x̄)(yi − ȳ) / ∑ᵢ₌₁ⁿ (xi − x̄)² (15)

With ∑ᵢ₌₁ⁿ (xi − x̄)² > 0, there must be variation in x. Using the same x for all observations will not work: no variation. Why?
Method of Moments
Because then (xi − x̄) = 0 for every i! The numerator can happen to be zero, but the denominator must not be. Thus, Eq.14 and Eq.15 are estimators of β0 and β1, respectively, based on the method of moments.
vi. Using simple algebra, Eq.15 can be rewritten as

β̂1 = ρ̂xy · (σ̂y / σ̂x) (16)

where ρ̂xy is the sample correlation between xi and yi, and σ̂y and σ̂x denote the sample standard deviations of y and x, respectively.
vii. If xi and yi are positively correlated in the sample, β̂1 > 0, and vice versa (standard deviations are always positive, so β̂1 takes the sign of ρ̂xy).
viii. Thus, simple regression is an analysis of correlation between two variables, and so one must be careful in inferring causality.
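The estimators in Eq.14 and Eq.15 are easy to compute directly. Below is a minimal Python sketch, not from the slides; the income/savings numbers are made up purely for illustration:

```python
# Method-of-moments estimates of beta0 and beta1 (Eq.14 and Eq.15).
# The income/savings numbers are made up for illustration only.
x = [20.0, 35.0, 50.0, 65.0, 80.0]   # annual income (thousands)
y = [1.5, 2.4, 4.1, 5.0, 6.8]        # annual savings (thousands)

n = len(x)
xbar = sum(x) / n
ybar = sum(y) / n

# Eq.15: slope = sum((xi - xbar)(yi - ybar)) / sum((xi - xbar)^2)
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) \
     / sum((xi - xbar) ** 2 for xi in x)
# Eq.14: intercept = ybar - b1 * xbar
b0 = ybar - b1 * xbar

print(b0, b1)
```

On this toy sample the formulas give β̂1 = 0.088 and β̂0 = −0.44; a different sample would of course give different estimates.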
Ordinary Least Squares

i. We can also estimate β0 and β1 in Eq.1 using OLS.
ii. Firstly, the residual for observation i is the difference between the actual yi and its fitted value ŷi, or

ûi = yi − ŷi = yi − β̂0 − β̂1 xi (17)

where the fitted value ŷi equals β̂0 + β̂1 xi.


iii. The idea of OLS is to minimize the sum of squared residuals with respect to β̂0 and β̂1, or

min ∑ᵢ₌₁ⁿ ûᵢ² w.r.t. β̂0, β̂1 (18)



Ordinary Least Squares

Thus, we have

∂/∂β̂0 ∑ᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi )² = 0 (19)
∂/∂β̂1 ∑ᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi )² = 0 (20)

By solving Eq.19 and Eq.20, we have

−2 ∑ᵢ₌₁ⁿ (yi − β̂0 − β̂1 xi ) = 0 (21)
−2 ∑ᵢ₌₁ⁿ xi (yi − β̂0 − β̂1 xi ) = 0 (22)



Ordinary Least Squares

iv. From Eq.21, solving for β̂0 gives exactly Eq.14. Then inserting this solution into Eq.22, with some algebraic manipulation, we end up with Eq.15.
v. Thus, the method of moments and OLS produce the same estimators for β0 and β1, namely Eq.14 and Eq.15, respectively.
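One way to see what the first-order conditions Eq.21 and Eq.22 mean is to check them at the closed-form estimates: the residuals sum to zero and are orthogonal to x in the sample. A quick Python check with made-up data:

```python
# Check the OLS first-order conditions Eq.21 and Eq.22: at the closed-form
# estimates, the residuals sum to zero and are orthogonal to x.
# Toy data, made up for illustration.
x = [1.0, 2.0, 3.0, 4.0]
y = [2.1, 3.9, 6.2, 7.8]

n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) \
     / sum((xi - xbar) ** 2 for xi in x)       # Eq.15
b0 = ybar - b1 * xbar                          # Eq.14

u_hat = [yi - b0 - b1 * xi for xi, yi in zip(x, y)]  # residuals, Eq.17

print(sum(u_hat))                                    # Eq.21: ~ 0
print(sum(xi * ui for xi, ui in zip(x, u_hat)))      # Eq.22: ~ 0
```

Both sums come out as zero up to floating-point noise, confirming that the closed-form estimates solve Eq.21 and Eq.22.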



Sample Regression Function (SRF)

i. Previously, we discussed the PRF in Eq.6, or

E(y|x) = β0 + β1 x

which is unknown and fixed in the population (β0 and β1 are unknown and fixed)!
ii. With estimates of β0 and β1 from Eq.14 and Eq.15 (either by MM or OLS), we have

ŷ = β̂0 + β̂1 x (23)

the sample regression function (SRF), which is not fixed. Since the SRF is obtained from a given sample of data, a new sample will give different estimates β̂0 and β̂1. Refer to Fig.2.
Sample Regression Function (SRF)

Figure 2: Fitted values and residuals



Example...

i. Say you are interested in studying whether return on equity (roe) influences CEO salary.
ii. The model you have in mind is (you assume it is linear)

salary = β0 + β1 roe

iii. While the econometric model is

salary = β0 + β1 roe + u



Example...

iv. And the PRF is

E(salary|roe) = β0 + β1 roe

Why do we need the PRF, even though it is unknown? Refer to Fig.1, which shows the PRF line. The line is unknown because it belongs to the population, and it is fixed. Say roe = 20%; the PRF gives E(salary|roe = 20%).
v. And the SRF is the estimate of the PRF, or

ŝalary = β̂0 + β̂1 roe

vi. Based on n = 209, we have

ŝalary = 963.191 + 18.501 roe
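The fitted SRF can be used directly for prediction. A minimal Python sketch using the estimates reported above (the evaluation at roe = 20% follows the earlier PRF discussion; salary is in thousands of dollars):

```python
# Prediction from the fitted SRF in the text:
# salary_hat = 963.191 + 18.501 * roe, with salary in thousands of
# dollars and roe in percent (n = 209 sample from the text).
b0_hat, b1_hat = 963.191, 18.501

def salary_hat(roe):
    return b0_hat + b1_hat * roe

print(salary_hat(20.0))   # predicted salary (thousands) when roe = 20%
```

For roe = 20%, the SRF predicts a salary of about 1,333.211 thousand dollars, i.e., roughly $1.33 million.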



Example...

Figure 3: The OLS regression line ŝalary = 963.191 + 18.501 roe and the (unknown) PRF



Goodness of Fit

i. Define:
a. The total sum of squares (SST) as:

SST ≡ ∑ᵢ₌₁ⁿ (yi − ȳ)² (24)

which measures the total sample variation in the yi: how spread out the yi are in the sample.
b. The explained sum of squares (SSE) as:

SSE ≡ ∑ᵢ₌₁ⁿ (ŷi − ȳ)² (25)

which measures the total sample variation in the ŷi.



Goodness of Fit

c. The residual sum of squares (SSR) as:

SSR ≡ ∑ᵢ₌₁ⁿ ûᵢ² (26)

which measures the total sample variation in the ûi.
ii. The total variation in y can always be expressed as the sum of the explained variation SSE and the unexplained variation SSR, or (see the notes for the proof; different textbooks use different abbreviations)

SST = SSE + SSR (27)

iii. Dividing Eq.27 by SST gives

1 = SSE/SST + SSR/SST
Goodness of Fit

iv. The R-squared of the regression, or coefficient of determination, is defined as:

R² ≡ SSE/SST = 1 − SSR/SST (28)

v. R² is the ratio of the explained variation to the total variation; thus it is interpreted as the fraction of the sample variation in y that is explained by x.
vi. The value of R² is always between zero and one, because SSE ≤ SST. We normally multiply R² by 100 when interpreting it, putting it in percentage form.
vii. In cross-sectional analysis, a low R² is common.
viii. Beware that using R² as the main gauge of success for an econometric analysis can lead to trouble. A low R² does not mean that the regression result is useless.
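The decomposition in Eq.27 and the R² of Eq.28 can be verified numerically. A Python sketch with made-up data:

```python
# SST, SSE, SSR and R-squared (Eq.24-28), plus the identity SST = SSE + SSR
# (Eq.27). The data are made up for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [1.2, 1.9, 3.3, 3.8, 5.1]

n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) \
     / sum((xi - xbar) ** 2 for xi in x)
b0 = ybar - b1 * xbar
y_hat = [b0 + b1 * xi for xi in x]                     # fitted values

sst = sum((yi - ybar) ** 2 for yi in y)                # Eq.24
sse = sum((yh - ybar) ** 2 for yh in y_hat)            # Eq.25
ssr = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))  # Eq.26
r2 = sse / sst                                         # Eq.28

print(r2, sst, sse + ssr)   # the last two agree, confirming Eq.27
```

The two forms of R² in Eq.28, SSE/SST and 1 − SSR/SST, agree exactly because of the identity Eq.27.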
Goodness of Fit

ix. A word of caution! All the discussion above assumes that we have an intercept, as in Eq.1. More on this later.



Changing Units of Measurement

i. In the CEO salary regression, we obtained the following:

ŝalary = 963.191 + 18.501 roe
n = 209, R² = 0.0132 (29)

where salary is in thousands of dollars and roe is in percentage form.
ii. If we convert salary from thousands of dollars to dollars (we multiply the salary data by 1,000) and call it salardol, we have

ŝalardol = 963,191 + 18,501 roe (30)


Changing Units of Measurement

iii. So, when the dependent variable is multiplied (divided) by a constant c, the intercept and slope estimates are both multiplied (divided) by c, or

(ŷ × c) = (β̂0 × c) + (β̂1 × c) × x (31)

iv. If we convert roe to decimal form (we divide the roe data by 100) and call it roedec, we have

ŝalary = 963.191 + 1,850.1 roedec (32)

v. So when the independent variable is divided (multiplied) by a constant c, the slope estimate is multiplied (divided) by c:

ŷ = β̂0 + (β̂1 × c)(x/c) (33)

vi. The R² is not affected by changes in the units of y or x.
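The scaling rules in Eq.31 and Eq.33 are easy to confirm by re-running OLS on rescaled data. A Python sketch (the roe/salary numbers here are made up, not the n = 209 sample from the text):

```python
# Rescaling and OLS estimates (Eq.31 and Eq.33): multiplying y by c scales
# both coefficients by c; dividing x by c multiplies the slope by c.
def ols(x, y):
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    b1 = sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) \
         / sum((a - xbar) ** 2 for a in x)
    return ybar - b1 * xbar, b1

roe = [10.0, 15.0, 20.0, 25.0]            # percent; made-up data
salary = [900.0, 1100.0, 1300.0, 1600.0]  # thousands of dollars; made-up

b0, b1 = ols(roe, salary)
b0_dol, b1_dol = ols(roe, [s * 1000 for s in salary])  # y in dollars
b0_dec, b1_dec = ols([r / 100 for r in roe], salary)   # x in decimals

print(b0_dol / b0, b1_dol / b1)   # both 1000: Eq.31
print(b0_dec - b0, b1_dec / b1)   # intercept unchanged, slope x100: Eq.33
```

The intercept is untouched by rescaling x, exactly as Eq.33 says.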
Incorporating Nonlinearities

i. Why, in applied work in the social sciences, do we encounter regression equations where the dependent variable appears in logarithmic form?
ii. Refer to the wage equation below:

ŵage = −0.90 + 0.54 educ (34)

where wage is in dollars per hour and educ denotes years of schooling. In Eq.34, an additional year of education is predicted to increase the hourly wage by 54 cents, no matter the level of education. This is not reasonable, since we expect an extra year of schooling at higher education levels to raise the predicted wage by more than at lower levels (the constant effect arises from the linearity of Eq.34).



Incorporating Nonlinearities

iii. A more realistic model is:

log(wage) = β0 + β1 educ + u (35)

where log(·) is the natural logarithm. In Eq.35, with ∆u = 0, we have

∆ log(wage) = β1 × ∆educ (36)

The key relationship is this approximation:

log(y1 ) − log(y0 ) ≈ (y1 − y0 )/y0 = ∆y/y0 , or
log(wage1 ) − log(wage0 ) ≈ ∆wage/wage0 (37)
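How good is the approximation in Eq.37? A short Python check comparing the exact log change with ∆y/y0 for small and large percentage changes:

```python
import math

# Quality of the log approximation in Eq.37: log(y1) - log(y0) ~ (y1 - y0)/y0.
# Accurate for small percentage changes, noticeably off for large ones.
y0 = 10.0
for pct in (0.01, 0.05, 0.30):           # 1%, 5%, 30% increases in y
    y1 = y0 * (1 + pct)
    exact = math.log(y1) - math.log(y0)  # exact log change
    approx = (y1 - y0) / y0              # the approximation; equals pct
    print(f"{pct:.2f}: exact={exact:.5f} approx={approx:.5f}")
```

For a 1% change the two agree to about four decimal places; for a 30% change the approximation overstates the log change by roughly 0.04, which is why the percentage interpretation below is best used for small changes.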



Incorporating Nonlinearities

where the approximation works well for small changes in y. Substituting Eq.37 into Eq.36 and multiplying both sides by 100, we have (dropping the subscript)

(∆wage/wage) × 100 = (β1 × 100) ∆educ (38)

where the LHS of Eq.38 is the percentage change in wage. The estimated version of Eq.35 is

log(wage)^ = 0.584 + 0.083 educ (39)

where each year of education (∆educ = 1) increases wage by a constant percentage of 8.3% (0.083 × 100). Since the percentage change in wage is the same for each additional year of educ, the dollar change in wage for an extra year of educ grows as education increases; Eq.35 implies an increasing return to
Incorporating Nonlinearities

education. This can easily be seen by exponentiating Eq.35, which gives

wage = exp(β0 + β1 educ + u) (40)

With u = 0, the graph of Eq.40 is shown in Fig.4.

Figure 4: wage = exp(β0 + β1 educ), with β1 > 0



Incorporating Nonlinearities

Those with higher education are predicted to see much larger wage increases than those with lower education.
iv. Below is a summary of functional forms involving logarithms (more on this later).

Model D.V. I.V. Interpretation of β1


Level-level y x ∆y = β1 ∆x
Level-log y log(x) ∆y = (β1 /100)%∆x
Log-level (semi-elasticity) log(y) x %∆y = (100β1 )∆x
Log-log (elasticity) log(y) log(x) %∆y = β1 %∆x
Table 2: Summary of functional forms involving logarithms



Meaning of ’Linear’ Regression

i. The simple linear regression model is

y = β0 + β1 x + u

where ‘linear’ refers to linearity in the parameters β0 and β1.
ii. There are no restrictions on how y and x relate to the original explained and explanatory variables of interest. For example, we can use simple regression to estimate a model such as:

cons = β0 + β1 √inc + u



Meaning of ’Linear’ Regression

iii. The following model is not linear in the parameters, so we cannot rely on linear regression:

cons = 1/(β0 + β1 inc) + u

It requires a nonlinear regression model.



Unbiasedness of OLS

i. Previously, we claimed that the key assumption of simple regression analysis is E(u|x) = 0.
ii. Remember, β̂0 and β̂1 are estimators of the population parameters β0 and β1, respectively, of Eq.1.
iii. The focus of this section is to study the properties of the distributions of β̂0 and β̂1 over different random samples from the population.
iv. We first establish an important property of the OLS estimators, unbiasedness, which depends on a simple set of assumptions (SLR stands for simple linear regression).



Unbiasedness of OLS

Assumption SLR.1: Linear in Parameters


In the population model, the dependent variable y is related to
independent variable x, and the error (or disturbance), u as

y = β0 + β1 x + u (41)

where β0 and β1 are the population intercept and slope


parameters, respectively.

v. Eq.41 is not restrictive; by choosing y and x appropriately (e.g., taking logs), we can obtain interesting nonlinear relationships within the linear model of Eq.41.

Assumption SLR.2: Random Sampling


We have a random sample of size n, {(xi , yi ) : i = 1, 2, . . . , n},
following the population model in Eq.41.
Unbiasedness of OLS

vi. Failure of random sampling is normally related to time series analysis and sample-selection problems. Not all cross-sectional samples can be viewed as outcomes of random samples, but many can.
vii. We can write Eq.41 in terms of the random sample as

yi = β0 + β1 xi + ui , i = 1, 2, . . . , n, (42)

where ui is the error or disturbance for observation i. Refer to Fig.5:



Unbiasedness of OLS

Figure 5: Graph of yi = β0 + β1 xi + ui



Unbiasedness of OLS

Assumption SLR.3: Sample Variation in X


The sample outcomes on x, namely, {xi , i = 1, . . . , n}, are not all
the same value.
viii. This is easily fulfilled unless the variation of x in the population is minimal or the sample size is small.
ix. Assumption SLR.3 fails if the sample standard deviation of the xi is zero.

Assumption SLR.4: Zero Conditional Mean
The error u has an expected value of zero given any value of the explanatory variable. In other words,

E(u|x) = 0



Unbiasedness of OLS
x. For a random sample, assumption SLR.4 implies that E(ui |xi ) = 0 for all i = 1, 2, . . . , n.
xi. In statistical derivations, conditioning on the sample values of the independent variable is the same as treating the xi as fixed in repeated samples. Say we choose n sample values x1 , x2 , . . . , xn ; then we obtain a sample on y. Next, another sample of y is obtained using the same values x1 , x2 , . . . , xn . Then another y is obtained, again using the same values x1 , x2 , . . . , xn , and so on.
xii. This fixed-in-repeated-samples scenario is not very realistic in non-experimental contexts. For example, in studying the relationship between consumption (y) and income (x), it would require choosing values of income ahead of time and then sampling individuals with those particular income levels. In reality, individual income and consumption are both sampled randomly.
Unbiasedness of OLS

Fixed-in-repeated-samples assumption
The danger of the fixed-in-repeated-samples assumption is that it always implies that ui and xi are independent.

xiii. To show that the OLS estimators are unbiased, the estimator in Eq.15 can be rewritten as (refer to the textbook for the detailed derivation)

β̂1 = β1 + (1/SSTx ) ∑ᵢ₌₁ⁿ di ui (43)

where:

SSTx = ∑ᵢ₌₁ⁿ (xi − x̄)² (44)
di = xi − x̄ (45)
Unbiasedness of OLS

Taking the expected value of Eq.43 (the derivation implicitly conditions on {x1 , x2 , . . . , xn }; as a result, SSTx and the di are nonrandom):

E(β̂1 ) = β1 + E[(1/SSTx ) ∑ᵢ₌₁ⁿ di ui ]
= β1 + (1/SSTx ) ∑ᵢ₌₁ⁿ E(di ui )
= β1 + (1/SSTx ) ∑ᵢ₌₁ⁿ di E(ui )   (46)
= β1 + (1/SSTx ) ∑ᵢ₌₁ⁿ di · 0
= β1
Unbiasedness of OLS

For β̂0 , Eq.14 can be rewritten as (average Eq.42 across i to get ȳ = β0 + β1 x̄ + ū, and insert this into Eq.14):

β̂0 = ȳ − β̂1 x̄
= β0 + β1 x̄ + ū − β̂1 x̄   (47)
= β0 + (β1 − β̂1 )x̄ + ū

xiv. Taking the expected value of Eq.47,

E(β̂0 ) = β0 + E[(β1 − β̂1 )x̄] + E(ū)
= β0 + E[(β1 − β̂1 )]x̄   (48)
= β0

since E(ū) = 0 by assumptions SLR.2 and SLR.4, and E(β̂1 ) = β1 , which implies that E[(β1 − β̂1 )] = 0.
Unbiasedness of OLS

Unbiasedness of the OLS Estimators
When assumptions SLR.1 through SLR.4 hold, β̂0 (Eq.14) and β̂1 (Eq.15) are unbiased estimators of β0 and β1 in Eq.1.

xv. Remember that unbiasedness is a feature of the sampling distributions of β̂0 and β̂1.
xvi. If the sample we obtain is somehow ‘typical’, then our estimates should be ‘near’ the population values. Unfortunately, we might be unlucky: the point estimate could be far from β1, and we can never know for sure whether this is the case.
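Unbiasedness can be illustrated by simulation: draw many random samples from a population with known β1 and average the resulting β̂1. A Python sketch (the population values β0 = 1.0, β1 = 0.5 and the error distribution are arbitrary choices for the demo, not from the slides):

```python
import random

# Monte Carlo illustration of unbiasedness: the average of beta1_hat over
# many random samples is close to the true beta1. Population values and
# the error distribution are arbitrary choices for this demo.
random.seed(0)
beta0, beta1 = 1.0, 0.5
reps, n = 2000, 50

est = []
for _ in range(reps):
    x = [random.uniform(0.0, 10.0) for _ in range(n)]              # SLR.3
    y = [beta0 + beta1 * xi + random.gauss(0.0, 1.0) for xi in x]  # SLR.1, SLR.4
    xbar, ybar = sum(x) / n, sum(y) / n
    b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) \
         / sum((xi - xbar) ** 2 for xi in x)                       # Eq.15
    est.append(b1)

print(sum(est) / reps)   # close to beta1 = 0.5
```

Individual estimates vary from sample to sample, as point xvi warns, but their average across samples is very close to β1.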



Variances of the OLS Estimators

i. It is important to know how far we can expect β̂1 to be from β1 on average.
ii. The easiest measure of spread in the distribution of β̂1 (and β̂0) to work with is the variance or its square root, the standard deviation.

Assumption SLR.5: Homoskedasticity
The error u has the same variance given any value of the explanatory variable. In other words,

Var(u|x) = σ²

iii. The homoskedasticity assumption plays no role in showing that β̂0 and β̂1 are unbiased.
Variances of the OLS Estimators

iv. σ² is often called the error variance or disturbance variance (see the textbook, pg. 45, for the proof).
v. The square root of σ²,

√σ² = σ (49)

is the standard deviation of the error. A larger σ means that the distribution of the unobservables affecting y is more spread out.



Variances of the OLS Estimators

vi. Assumptions SLR.4 and SLR.5 can also be stated as

E(y|x) = β0 + β1 x, (SLR.4)
Var(y|x) = σ². (SLR.5)

In other words, the conditional expectation of y given x is linear, but the variance of y given x is constant. This situation is graphed in Fig.6:



Variances of the OLS Estimators

Figure 6: The simple regression model under homoskedasticity

vii. When Var(u|x) depends on x (is not constant), the error term exhibits heteroskedasticity (nonconstant variance).
Variances of the OLS Estimators

viii. Because Var(u|x) = Var(y|x), heteroskedasticity is present whenever Var(y|x) is a function of x. In Fig.7 we have a heteroskedasticity problem, where the variance increases as the educ level increases.

Figure 7: Var(wage|educ) increasing with educ



Sampling Variances of the OLS Estimators

i. Under assumptions SLR.1 through SLR.5,

Var(β̂0 ) = σ² n⁻¹ ∑ᵢ₌₁ⁿ xᵢ² / ∑ᵢ₌₁ⁿ (xi − x̄)² = σ² n⁻¹ ∑ᵢ₌₁ⁿ xᵢ² / SSTx (50)
Var(β̂1 ) = σ² / ∑ᵢ₌₁ⁿ (xi − x̄)² = σ² / SSTx (51)

where these are conditional on the sample values {x1 , . . . , xn } (for the proof, refer to the textbook).
ii. Eq.50 and Eq.51 are invalid in the presence of heteroskedasticity.



Sampling Variances of the OLS Estimators

iii. Most of the time we are interested in Eq.51. It depends on the error variance σ² and on the total variation in {x1 , . . . , xn }. The larger the error variance, the larger is Var(β̂1). This makes sense, since more variation in the unobservables affecting y makes it more difficult to estimate β1 precisely. More variability in the independent variable xi, by contrast, is preferred, and the total variation in the xi increases with the sample size.



Estimating the Error Variance

i. The problem with Eq.50 and Eq.51 is that σ² is unknown. So we use data to estimate σ², which then allows us to estimate Var(β̂0) and Var(β̂1).
ii. To estimate σ², we use the property of the conditional variance, which states that

Var(u|x) = E(u²|x) − [E(u|x)]² = E(u²|x) = σ² (52)

And using the law of total variance,

Var(u) = E[Var(u|x)] + Var[E(u|x)]
= E[σ²] + Var[0]   (53)
= σ²


Estimating the Error Variance

Thus

Var(u|x) = Var(u) = σ² (54)

And

Var(u) = E[u − E(u)]²
= E[u²]   (55)
= σ²

iii. So, an unbiased estimator for σ² is n⁻¹ ∑ᵢ₌₁ⁿ uᵢ².
iv. However, we do not observe the errors ui, but we do have estimates of them: the OLS residuals ûi. If we replace the errors with the residuals, we have

n⁻¹ ∑ᵢ₌₁ⁿ ûᵢ² = SSR/n (56)
Estimating the Error Variance

v. However, Eq.56 is still biased, because it fails to account for two restrictions on the residuals (refer to your textbook). The unbiased estimator of σ² is

σ̂² = (1/(n − 2)) ∑ᵢ₌₁ⁿ ûᵢ² = SSR/(n − 2) (57)

where n − 2 is the degrees of freedom: 2 is deducted from n because of the two restrictions mentioned earlier.
vi. If we replace σ² in Eq.50 and Eq.51 with Eq.57, we have unbiased estimators of Var(β̂0) and Var(β̂1).



Estimating the Error Variance

vii. Later on, we will need estimators of the standard deviations of β̂0 and β̂1, and this requires estimating σ. Its natural estimator is

σ̂ = √σ̂² (58)

called the standard error of the regression (SER) (also known as the standard error of the estimate or the root mean squared error). The estimator in Eq.58 is a biased but consistent estimator of σ.
viii. Since our focus is on β̂1, the natural estimator of the standard deviation of β̂1, sd(β̂1), obtained by taking the square root of Eq.51, is

se(β̂1 ) = σ̂/√SSTx = σ̂/[∑ᵢ₌₁ⁿ (xi − x̄)²]^(1/2) (59)

called the standard error of β̂1.
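Putting Eq.57 and Eq.59 together, σ̂² and se(β̂1) follow mechanically from the residuals. A Python sketch on made-up data:

```python
# Error-variance estimate sigma_hat^2 = SSR/(n-2) (Eq.57) and the standard
# error of beta1_hat (Eq.59), on a small made-up sample.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.1, 2.3, 2.8, 4.2, 4.9, 6.3]

n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
sst_x = sum((xi - xbar) ** 2 for xi in x)                     # Eq.44
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sst_x
b0 = ybar - b1 * xbar
ssr = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))   # sum of squared residuals

sigma2_hat = ssr / (n - 2)                 # Eq.57: n - 2 degrees of freedom
se_b1 = sigma2_hat ** 0.5 / sst_x ** 0.5   # Eq.59: standard error of beta1_hat

print(sigma2_hat, se_b1)
```

Note the n − 2 divisor in `sigma2_hat`, reflecting the two restrictions Eq.21 and Eq.22 impose on the residuals.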


Regression through Origin

i. A regression through the origin:

ỹ = β̃1 x (60)

Eq.60 passes through the point x = 0, ỹ = 0, the origin.
ii. The slope estimate of Eq.60 (refer to the textbook for the derivation) is

β̃1 = ∑ᵢ₌₁ⁿ xi yi / ∑ᵢ₌₁ⁿ xᵢ² (61)



Regression through Origin

iii. When regression through the origin is used, one must be careful in interpreting the R² typically reported by software (unless stated otherwise), because this R² is obtained without removing the sample average of {yi : i = 1, . . . , n} when computing SST. In other words,

R² = 1 − ∑ᵢ₌₁ⁿ (yi − β̃1 xi )² / ∑ᵢ₌₁ⁿ yᵢ² (62)

where the denominator acts as if we knew the average value of y in the population to be zero.
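Eq.61 and the origin-regression R² of Eq.62 in a short Python sketch (the data are made up for illustration):

```python
# Regression through the origin: slope from Eq.61 and the uncentered
# R-squared of Eq.62. Made-up data.
x = [1.0, 2.0, 3.0, 4.0]
y = [2.2, 3.9, 6.1, 8.3]

b1_tilde = sum(xi * yi for xi, yi in zip(x, y)) / sum(xi ** 2 for xi in x)  # Eq.61
ssr0 = sum((yi - b1_tilde * xi) ** 2 for xi, yi in zip(x, y))
r2_origin = 1 - ssr0 / sum(yi ** 2 for yi in y)   # Eq.62: y is NOT demeaned

print(b1_tilde, r2_origin)
```

Because the denominator in Eq.62 uses ∑yᵢ² rather than ∑(yi − ȳ)², this uncentered R² is not comparable to the R² of Eq.28 from a regression with an intercept.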

