Statistical Methods For Bioinformatics Lecture 4
Statistical Methods For Bioinformatics Lecture 4
Extraversion
Underlying ideas
The dimension reduction may reduce variance component in
the error
The variability in the data is relevant for the response
Procedural description:
1 Find linear set of φj1 so that:
ŷ = UU T y
Ridge is given by:
si2
ŷ = Udiag 2
UT y
si + λ
PCR by:
ŷ = Udiag {1, . . . , 1, 0, . . . , 0} U T y
Statistical Methods for Bioinformatics
The connection between Ridge and PCR
si2
ŷ = Udiag UT y
si2 + λ
What is s? Remember the PCA: formulation
1
C = n−1 X T X = VDV T . Together with X = USV T we can
S2
derive that D = n−1 . Hence the singular values s are related to the
eigenvalues of the covariance matrix as follows:
si2
di =
n−1
Exercises
Finish labs of Chapter 6
and Exercise below.