Lecture5 Appendix
Lecture5 Appendix
Heng Peng
= λ1 a∗T PP T a∗ = λ1 ka∗ k2 = λ1
Hence when a = e1
aT Σa
max = λ1
a6=0 aT a
= Trace(ΛP T P) = Trace(Λ) = λ1 + · · · + λp .
Hence
√
Cov(Yi , Xk ) λi eik eik λi
ρYi ,Xk = p p = √ √ = √
Var(Yi ) Var(Xk ) λi σkk σkk
I Example 5.1.
λ1 = 5.83, eT1 = [.383, −.924, 0]
λ2 = 2.00, eT2 = [0, 0, 1]
λ3 = 0.17, eT3 = [.924, .383, 0]
Therefore, the principal components are
Y2 = eT2 X = X3
Y3 = eT3 X = .924X1 + .383X2
λ2 = λ3 = · · · = λp = 1 − ρ
I Continuous: and their eigenvectors are
1 −1
eT2 = √ ,√ , 0, . . . , 0
1×2 1×2
1 1 −2
eT3 = √ ,√ ,√ 0, . . . , 0
2×3 2×3 2×3
...... ......
" #
1 1 −(i − 1)
eTi = p ,··· , p ,p , 0, . . . , 0
(i − 1) × i (i − 1) × i (i − 1) × i
...... ......
" #
T 1 1 −(p − 1)
ep = p ,··· , p ,p
(p − 1) × p (p − 1) × p (p − 1) × p
The first principal component
p
1 X
Y1 = eT1 Z = √ Zi
p i=1
explaining a proportion
λ1 1 + (p − 1)ρ 1−ρ
= =ρ+
p p p
of the total population variation.
P16. Example 5.4. The natural logarithms of the dimension of 24 male turtles
have sample mean vector x̄T = [4.725, 4.478, 3.703] and covariance matri
11.072 8.019 8.160
−3
S = 10 8.019 6.417 6.005
8.160 6.005 6.773
and
1.000 .632 .511 .115 .155
.632 1.000 .574 .322 .213
R=
.511 .574 1.000 .183 .146
.115 .322 .183 1.000 .683
.155 .213 .146 .683 1.000
We note that R is the covariance matrix of the standardized observations
x1 − x̄1 x2 − x̄2 x5 − x̄5
z1 = √ , z2 = √ , . . . , z1 = √
s11 s22 s55
I Example 5.5. Continuous. The eigenvalues and corresponding normalized
eigenvectors of R, determined by a computer, are
Under the standardized variables, we obtain the first two sample principal
components:
I Communality:
Specific variance: ψi
P27. Example 5.7.
19 30 2 12 4 1 2 0 0 0
30 57 5 23 7 2 0 4 0 0
Σ= = + = LLT +Ψ
2 5 38 47 −1 6 0 0 1 0
12 23 47 68 1 8 0 0 0 3
The Communality of X1 is
19 = 42 + 12 + 2 = 17 + 2
P28. Example 5.8. If Σ can be factored by a factor analysis model with
m = 1, then
X1 − µ1 = `11 F1 + ε1
X2 − µ2 = `21 F1 + ε2
X3 − µ3 = `31 F1 + ε3
or
1 = `211 + ψ1 , .90 = `11 `21 , .70 = `11 `31
1 = `221 + ψ2 , .40 = `21 `31 , 1 = `231 + ψ3
The pair of equations
.70 = `11 `31 , .40 = `21 `31
implies that
.40
`21 = `11
.70
Substituting this result for `21 in the equation .90 = `11 `21 , yields
`211 = 1.575
or `11 = ±1.255.
I Example 5.8. Continuous.
Since Var(F1 ) = 1 by assumption and Var(X1 ) = 1,
`11 = Cov(X1 , F1 ) = Corr(X1 , F1 ) which cannot be greater than
unity. So from this point of view |`11 | = 1.225 is too large. Also,
the equation
1 = `211 + ψ1
gives
ψ1 = 1 − 1.575 = −.575
which is unsatisfactory, since it gives a negative value for
Var(ε) = ψ1 . So the solution is not consistent, and is not a proper
solution.
P29. if λ̂1 , . . . , λ̂m are relative large compared to λ̂m+1 , . . . , λ̂p
m
X
ψ̃i = sii − `˜2ij
j=1
P30. q q
`˜211 + `˜221 + · · · + `˜2p1 = ( λ̂1 ê1 )T ( λ̂1 ê1 ) = λ̂1 .
P31 Example 5.9.
.56 .82
.78 −.53
.56
T
.65 .78 .65 .94 .80
L̃L̃ + Ψ̃ = .75
.82 −.53 .75 −.10 −.54
.94 −.10
.80 −.54
.02 0 0 0 0 1 .01 .97 .44 .00
0 .12 0 0 0 1 .11 .79 .91
+ 0 0 .02 0 0 = 1 .53 .11
0 0 0 .11 0 1 .81
0 0 0 0 .07 1
P33. Example 5.10.
0 −.099 −.185 −.025 .056
−.099 0 −.134 .014 −.054
R − L̃L̃T − Ψ̃ =
−.185 −.134 0 .003 .006
−.025 .014 .003 0 −.156
.056 −.054 .006 −.156 0
P35. Corrected:
L̂z = V̂−1/2 L̂, Ψ̂z = V̂−1/2 Ψ̂V̂−1/2
Or given the estimated loadings L̂z and specific variance Ψ̂z
obtained from R, the resulting maximum likelihood estimates for a
factor analysis of the covariance matrix [(n − 1)/n]S are
or
`ˆij = `ˆz,ij
p
σ̂ij and ψ̂i = ψ̂z,i σ̂ii
P33. Example 5.11.
0 .001 −.002 .000 .052
.001 0 .002 .000 −.033
T
R − L̃L̃ − Ψ̃ =
−.002 .002 0 .000 .001
.000 .000 .000 0 .000
.052 −.033 .001 .000 0
Σ = LLT + Ψ L
∗
Σ =
LT I
Then
and
f̂j∗ = Tf̂j , j = 1, 2, . . . , n.
P51. Example 5.16.
.763 .024 .42 0 0 0 0
.821 .227 0 .27 0 0 0
L̂∗z =
.669 .104 and Ψ̂z = 0 0 .54 0 0
.118 .993 0 0 0 .00 0
.113 .675 0 0 0 0 .53