Topic 3b Simple Linear Regression
Topic 3b Simple Linear Regression
The OLS is also referred to as the classical least squares (CLS) and it
has the following advantages;
Firstly, the parameter estimates obtained by OLS have some optimal
properties.
Secondly, the computational procedure of OLS is fairly simple.
Thirdly, OLS has been applied in a wide range of economic
relationship with fairly satisfactory results.
Fourthly, the mechanics of OLS are simple to understand.
Fifthly, OLS is an essential component of most other econometrics
techniques.
The Simple Linear Regession Model
Y = f (X )
Yi = b0 + b1 Xi
The above speci…cation shows that price is the only factor in‡uencing
the level of quantity supplied and this implies that there is a one way
causation between variables Y and X.
Our aim is to estimate the numerical values of b̂0 and b̂1 .
b̂0 = 0 it implies that quantity supplied is zero when price is zero.
If b̂0 > 0 it means that some quantity were supplied when the price
drops or falls.
If b̂0 < 0 ignore this because a negative quantity does not make sense
in economics.
Model Speci…cation (Cont’d)
The sign of b̂0 is essential in determining the price elasticity of supply.
b̂1 > 0 because it is expected to be expected to be positive since a
supply curve is upward sloping.
dQ P dY X
Recall that ηp = . = .
dP Q dX Y
dY
From the supply function = b1
dX
To compute the elasticity from a regression line, we use the estimate
of b̂1 and the mean values of price (X̄) and quantity (Ȳ)
X̄
) ηp = b̂1 .
Ȳ
Ȳ = b̂0 + b̂1 X̄
b̂1 X̄
) ηp =
b̂0 + b̂1 X̄
Model Speci…cation (Cont’d)
Yi = b0 + b1 Xi + µ
Y
Y we're trying to
predict ^β 11X ii
^
^ = β 00 +
ε Expected
(Mean) Y
Y ii
E(Y) = β 0 + β 1X
Prediction, ^
Y
X
XP
Explained and Unexplained Variation (Cont’d)
Yi = b0 + b1 Xi +µ
| {z } | {z } |{z}
[Variations [Systematic/ [Random
in Yi ] Explained Unexplained
Variation] Variation]
The second component in the bracket is the variations in Y explained
by changes in X and the third bracket is the variation not explained
by any speci…c factor.
Assumptions of Error Term
n n
Σ ei2 = Σ (Yi b̂0 b̂1 Xi )2
i =1 i =1
∂Σe 2 ∂Σe 2
= 0 and =0
∂b̂0 ∂b̂1
Using Chain rule; if y = f (w ) and w = f (x )
dy dy dw
) = .
dx dw dx
Where w = Σ(Yi b̂0 b̂1 Xi )
∂Σe 2 ∂w
Σe 2 = w 2, ) = 2w , = 1
∂w ∂b̂0
∂Σe 2
= 2Σ(Yi b̂0 b̂1 Xi ). ( 1) = 0
∂b̂0
ΣYi = nb̂0 + b̂1 ΣXi
The Normal Equations of OLS (Cont’d)
∂Σe 2 ∂w
= 2w , = Xi
∂w ∂b̂1
∂Σe 2
= 2Σ(Yi b̂0 b̂1 Xi ). ( Xi ) = 0
∂b̂1
Σ(Xi Yi b̂0 Xi b̂1 Xi2 ) = 0
ΣXi Yi = b̂0 ΣXi b̂1 ΣXi2
The normal equation gives; ΣYi = nb̂0 + b̂1 ΣXi (1)
ΣXi Yi = b̂0 ΣXi b̂1 ΣXi2
Expressing in matrix of the form AX = B
n ΣXi b̂0 ΣYi
=
ΣXi ΣXi2 b̂1 ΣXi Yi
The Normal Equations of OLS (Cont’d)
b̂0 = Ȳ b̂1 X̄
Σxi yi
b̂1 =
Σxi2
Recall that Σxi yi = fnΣXY (ΣX )(ΣY )g/n
Σxi2 = fnΣX 2 (ΣX )2 g/n
Numerical Values of the Correlation Coe¢ cient
Using the data in our previous example on the relation between price
and quantity supplied estimate the regression line.
n = 10, ΣX = 110, ΣY = 610, Σx 2 = 330, Σxy = 1810, ΣX 2 =
1540, ΣXY = 8520
X̄
) ηp = b̂1 .
Ȳ
11
5.484 = 0.989
61
ηp is inelastic.