Chapter 6 Econometrics
Chapter 6 Econometrics
Where:
(6.3)
• Hence, the explanatory variables in the estimated regression (6.2) are not
independent of the error term (unless the omitted variable is
uncorrelated with all the included variables—something which is very
unlikely)
• What happens if we estimate Equation 6.2 when Equation 6.1 is the truth?
• We get bias!
• What this means is that:
(6.4)
• Instead of having an expected value equal to the true β1 the estimate will compensate
for the fact that X2 is missing from the equation.
• If X1 and X2 are correlated and X2 is omitted from the equation, then the OLS
estimation procedure will attribute to X1 variations in Y actually caused by X2, and a
biased estimate of β1 will result.
• To see how a left-out variable can cause bias, picture a production function
that states that output depends on the amount of labor and capital used.
Y=f(K,L)
• What would happen if data on capital were unavailable for some reason and
K was omitted from the equation?
• In this case, we would be leaving out the impact of capital on output in our
model.
• This omission would almost surely bias the estimate of the coefficient of
labor because it is likely that capital and labor are positively correlated.
• As a result, the OLS program would attribute to labor the increase in output
actually caused by capital to the extent that labor and capital were
correlated.
• Thus the bias would be a function of the impact of capital on output and the
correlation between capital and labor.
© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨
The Consequences of an Omitted Variable (cont.)
• To generalize for a model with two independent variables, the expected value of the
coefficient of an included variable (X1) when a relevant variable (X2) is omitted from the
equation equals:
Where α1 is the slope coefficient of the secondary regression that relates X2 to X1:
In a nutshell
The amount of bias is a function of the impact of the omitted variable on the
dependent variable times a function of the correlation between the included and
the omitted variable
• So, the bias exists unless:
1. the true coefficient equals zero, or
2. the included and omitted variables are uncorrelated
• What if:
– You have an unexpected result, which leads you to believe that you have
an omitted variable
– You have two or more theoretically sound explanatory variables as
potential “candidates” for inclusion as the omitted variable to the equation is
to use
• How do you choose between these variables?
• One possibility is expected bias analysis
– Expected bias: the likely bias that omitting a particular variable would
have caused in the estimated coefficient of one of the included variables