Multicollinearity
Multicollinearity
• Effects of multicollinearity
• Detecting multicollinearity
• Remedies for multicollinearity
The nature of Multicollinearity
Perfect multicollinearity:
When there are some functional relationships existing
among independent variables, that is iXi = 0
0 =0
(Why?) –
uncorrelated
Example: Production function
(log form)
Yi = 0 + 1X1i + 2X2i + 3X3i + i
Y X1 X2 X3 Y: Output
Unaffected:
a. OLS estimators are still BLUE.
b. The overall fit of the equation
c. The estimation of the coefficients of
non-multicollinear variables
The variances of OLS estimators
increase with the degree of
multicollinearity
Regression model:
Yi = 0 + 1X1i + 2X2i + i
coefficients
Closer relation between X1 and X2
b. se ˆ k
2
tends to be large
ˆ k t df , / 2 se ˆ k
CORRELATION MATRIX
X1 X2 X3 X4 X5
X1 1.00000 0.99686 0.99135 0.52583 0.97214
X2 0.99686 1.00000 0.99127 0.54331 0.96524
X3 0.99135 0.99127 1.00000 0.46144 0.97262
X4 0.52583 0.54331 0.46144 1.00000 0.53618
X5 0.97214 0.96524 0.97262 0.53618 1.00000
Auxiliary Regressions
Results:
High R2 and adjust R2
Less significant t-values
OLS estimates and SE’s can be
sensitive to specification and small
changes in data
Specification changes:
Add or drop variables
Small changes:
Add or drop some observations
Change some data values
High Simple
Correlation
Coefficients
rij
X X X X
i i j j
X X X X
i i
2
j j
2
(3)
VIF ˆk
1
1 Rk
2
Insignificant
Since M1 and M2
are highly related
VIFPF = 43.4
Signs are unexpected VIFlnYd =23.3
Most t-values are insignificant
VIFPB = 18.9
VIFN =18.5
VIFP =4.4
Drop N, but not improved
Improved
6. Doing Nothing