Lec2 Ase Iev
Lec2 Ase Iev
Karim Nchare
January 2020
Correlation does not imply causation!!
Correlation does not imply causation!!
Correlation does not imply causation!!
Definitions of internal and external validity
Yi = β0 + β1 X1i + ui
Yi = β0 + β1 X1i + ui , ui = β2 X2i + wi
I we then have
cov (Yi , X1i ) cov (ui , X1i )
plim β̂1 = = β1 +
n→∞ var (X 1i ) var (X1i )
cov (β2 X2i + wi , X1i ) cov (X2i , X1i )
= β1 + = β1 + β2
var (X1i ) var (X1i )
cov (X2i , X1i )
plim β̂1 = β1 + β2
n→∞ var (X1i )
Omitted variables
I An omitted variable X2i leads to an inconsistent OLS estimate
of the causal effect of X1i if:
I The omitted variable X2i is a determinant of the dependent
variable Yi (β2 6= 0), and
I he omitted variable X2i is correlated with the regressor of
interest X1i (cov (X2i , X1i ) 6= 0)
I If there exists 1 or more variables that satisfy both conditions,
I The OLS estimator does not provide a unbiased and consistent
estimate of the causal effect of X1i
I the OLS regression is not internally valid
I The sign of the bias is given by the sign of β2 cov (X2i , X1i )
I If β2 cov (X2i , X1i ) < 0 we say that OLS underestimates the
causal effect of X1i
I If β2 cov (X2i , X1i ) > 0 we say that OLS overestimates the
causal effect of X1i
I Example: In the returns to education, ability is an omitted
variable.
Functional form misspecification
I Suppose that the true population regression model is
Yi = β0 + β1 X1i + ui , ui = β2 X1i2 + wi
I we then have
cov (Yi , X1i ) cov (ui , X1i )
plim β̂1 = = β1 +
n→∞ var (X1i ) var (X1i )
2
cov (β2 X1i + wi , X1i )
= β1 +
var (X1i )
cov (X1i2 , X1i )
plim β̂1 = β1 + β2
n→∞ var (X1i )
X̃1i = X1i + ωi
Yi = β0 + β1 X̃1i − β1 ωi + ui
Ỹi = Yi + ηi
I and we estimate
Ỹi = β0 + β1 X1i + ei , ei = ui + ηi
Cov (ui , ui ) 6= 0, i 6= j