Lecture For 111424
Lecture For 111424
𝑜
𝑟
3 9 27 81 1 1 1 1
4 16 64 256 2 4 8 16
hi = xi(X'X)-1xi'
Cutoff: >
2p/n
Studentized residual
𝑖
𝑖
𝑠
1−h
𝑠
𝑡
𝑢
𝑑
𝑒
𝑛
𝑡
𝑟
()
𝑖
𝑟
Cutoff: > 2
Covariance ratio
^ −^
()
𝑖
𝑖
𝑠
=
𝐷
𝐹
𝐹
𝐼
𝑇
𝑆
() h( )
𝑖
𝑖
𝑦
𝑦
Cutoff: 
Difference in betas: DFBETAS
(Mike’s favorite)
− ()
𝑖
𝑗
𝑗
=
𝑠
𝑋
𝑋
𝑇
( )
𝐷
𝐹
𝐵
𝐸
𝑇
𝐴
𝑆
()
𝑗
𝑖
𝑗
𝑏
𝑏
Cutoff: 
Cook’s Distance
2 [ (1 − h )2 ]
h
𝑖
𝑖
𝑝
𝑠
=
𝑖
𝐷
𝑖
𝑖
𝑖
𝑟
Cutoff: 
Multicollinearity
Tolerance (Tol)
Variance Inflation Factor (VIF)
Eigenvalues
Condition numbers
Tolerance and Variance Inflation
Factor
2
=1−
𝑘
𝑇
𝑜
𝑙
𝑒
𝑟
𝑎
𝑛
𝑐
𝑒
𝑅
Rough cutoff 1: < .1
Rough cutoff 2: < 1-
R2
1 1
𝑘
𝑇
𝑜
𝑙
𝑅
= =
𝑉
𝐼
𝐹
1− 2
𝐶
𝑜
𝑣
𝑎
𝑟
𝑖
𝑎
𝑛
𝑐
𝑒
𝑚
𝑎
𝑡
𝑟
𝑖
𝑥
𝐴
[0 1]
1 0
𝐸
: − =0 h =
𝑖
𝑔
𝑒
𝑛
𝑣
𝑎
𝑙
𝑢
𝑒
𝑒
𝑞
𝑢
𝑎
𝑡
𝑖
𝑜
𝑛
𝐴
𝜆
𝐼
𝑤
𝑒
𝑟
𝑒
𝐼
3− 1
𝜆
− = =0
𝐴
𝜆
𝐼
1 3−
𝜆
(3 − )2 − 1 = 0
𝜆
2
9−6 + −1=0
𝜆
𝜆
( − 4)( − 2) = 0
𝜆
𝜆
= 4, 2
𝜆
Interpretation
Eigenvalues roughly equal or none particularly large indicate no
multicollinearity
Turn into condition number

Criteria
>50
10 beginning; 100 problematic (Belsley, Kuh, & Welsch, 1980)
For high condition number check which independent variable has largest
proportion of variance associated with that eigenvalue and consider
removing
Star data
stardata.toclass.R
stardata.forR.dat
stardata.dat