Lecture 16
Lecture 16
Perfect multicollinearity is the violation of Assumption 6 (no explanatory variable is a
perfect linear function of any other explanatory variables).
If two or more independent variables have an exact linear relationship between them then
we have perfect multicollinearity.
Examples: including the same information twice (weight in pounds and weight in
kilograms), not using dummy variables correctly (falling into the dummy
variable trap), etc.
Page 1 of 10
CHAPTER 8: MULTICOLLINEARITY
Why? OLS cannot estimate the marginal effect of on while holding constant
because moves exactly when moves!
Page 2 of 10
CHAPTER 8: MULTICOLLINEARITY
Imperfect (or Near) Multicollinearity
When we use the word multicollinearity we are usually talking about severe imperfect
multicollinearity.
Page 3 of 10
CHAPTER 8: MULTICOLLINEARITY
The Consequences of Multicollinearity
2. The variances and the standard errors of the regression coefficient estimates will
increase. This means lower t-statistics.
Page 4 of 10
CHAPTER 8: MULTICOLLINEARITY
The Detection of Multicollinearity
Pairwise correlations among independent variables might be high (in absolute value).
Rule of thumb: If the correlation > 0.8 then severe multicollinearity may be present.
Possible for individual regression coefficients to be insignificant but for the overall fit of
the equation to be high.
A VIF measures the extent to which multicollinearity has increased the variance of an
estimated coefficient. It looks at the extent to which an explanatory variable can be
explained by all the other explanatory variables in the equation.
Page 5 of 10
CHAPTER 8: MULTICOLLINEARITY
Suppose our regression is equation includes k explanatory variables:
… .
Step 1: Run the OLS regression for each X variable. For example for :
1
VIF( )
1
No single solution exists that will eliminate multicollinearity. Certain approaches may be
useful:
1. Do Nothing
If a variable is redundant, it should have never been included in the model in the
first place. So dropping it actually is just correcting for a specification error. Use
economic theory to guide your choice of which variable to drop.
Page 7 of 10
CHAPTER 8: MULTICOLLINEARITY
4. Increase the Sample Size
Increasing the sample size improves the precision of an estimator and reduces the
adverse effects of multicollinearity. Usually adding data though is not feasible.
Page 8 of 10
CHAPTER 8: MULTICOLLINEARITY
Example
What else?
Advertising
Election surveys
Office expenses
Salaries
Other
Page 9 of 10
CHAPTER 8: MULTICOLLINEARITY
What if a researcher were interested in the individual effect of each of these
expenditures?
Even if you correct the model there still may be an imperfect multicollinearity between
the components of campaign expenditures.
Page 10 of 10