Multiple Regression Model
Multiple Regression Model
• Assumption M.1:
We will keep the basic least squares assumption - We will as-
sume that the error term is mean independent of all regressors
(loosely speaking - all Xs are uncorrelated with the error term,
i.e.
E(ui |X1 , X2 , . . . , Xk ) = E(ui |X) = 0
Yi = β0 + β1 Xi + β2 Di + ui
Y Yi = β0 + β1 Xi + β2
6
Yi = β0 + β1 Xi
β0 + β2
β0
Yi = β0 + β1 Xi + β2 Xi Di + ui
Y
6 Y = β0 + (β1 + β2 )X
Y = β0 + β1 X
β0
-
X
interpretation:
• When income goes up by 1%, WTP goes up by 0.14%.
• low education is the reference group (we have omitted this
dummy variable). Medium educated individuals have a WTP
47% higher than the low educated ones and high educated 58%
more.
•
Yi = β0 + β1 Xi1 + β2 Xi2 + ui
Yi = β0 + β1 Xi1 + νi
νi = β2 Xi2 + ui
• We can substitute this into the expression for the OLS estima-
tor to obtain
N
N
(Xi1 − X̄1 )β2 Xi2 + (Xi1 − X̄1 )ui
i=1 i=1
β̂1 = β1 +
N
(Xi1 − X̄1 )2
i=1
cov(X1 , X2 )
E[β̂1 |X] = β1 + β2
V ar(X1 )
• The bias will be zero in two cases:
– When the coefficient β2 is zero. In this case the regressor
X2 obviously does not belong to the regression.
– When the covariance between the two regressors X1 and
X2 is zero.
• Thus in general omitting regressors which have an impact on
Y (β2 non-zero) will bias the OLS estimator of the coefficients
on the included regressors unless the omitted regressors are
uncorrelated with the included ones.
Y̌i = β0 + β1 Xi + ui
Yi = β0 + β1 Xi + ui − νi
= β0 + β1 Xi + wi
N
¯ )(Y − Ȳ )
(X̌i − X̌ i
i=1
β̂1 =
N
¯ )2
(X̌i − X̌
i=1
N
(Xi + νi − X̄)(β0 + β1 Xi + ui − Ȳ )
i=1
=
N
(Xi + νi − X̄)2
i=1
N
β1 (Xi − X̄)2
i=1
=
N
(Xi − X̄)2 + νi2 − 2νi (Xi − X̄)
i=1
β1 V ar(Xi )
E(βˆ1 ) = ≤ β1
V ar(Xi ) + V ar(νi )
• Measurement error on Xi leads to a biased OLS estimate,
biased towards zero. This is also called attenuation bias.