Σ = UΛUT (7.37)
where the covariance matrix has been factorized into the orthogonal matrix U con-
taining its eigenvectors, and a diagonal matrix Λ containing its eigenvalues (sorted
in decreasing order). SVD generalizes the above factorization to any matrix. In
particular, for an n × d data matrix D with n points and d columns, SVD factorizes
D as follows
D = L∆RT (7.38)
where L is an orthogonal n × n matrix whose columns are the left singular vectors of D,
R is an orthogonal d × d matrix whose columns are the right singular vectors of D, and
∆ is an n × d diagonal matrix whose diagonal entries are the singular values of D, sorted
in decreasing order. If D has rank r, then only the first r singular values are positive,
that is,
δ1 ≥ δ2 ≥ · · · ≥ δr > 0
while the remaining singular values are zero.
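As a quick numerical illustration (not part of the text's examples), the factorization in (7.38) can be computed with NumPy's np.linalg.svd; the small random matrix below is an arbitrary stand-in for D, and the variable names are ours:

```python
import numpy as np

# A small stand-in data matrix with n = 5 points and d = 3 columns.
rng = np.random.default_rng(0)
D = rng.normal(size=(5, 3))

# Full SVD: L is n x n, R is d x d, and the singular values are returned
# already sorted in decreasing order, as in delta_1 >= delta_2 >= ... > 0.
L, deltas, Rt = np.linalg.svd(D, full_matrices=True)   # Rt holds R^T
print(L.shape, deltas.shape, Rt.shape)                  # (5, 5) (3,) (3, 3)

# Rebuild the n x d "diagonal" matrix Delta and check that D = L Delta R^T.
Delta = np.zeros_like(D)
np.fill_diagonal(Delta, deltas)
assert np.allclose(D, L @ Delta @ Rt)
```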
One can discard those left and right singular vectors that correspond to zero singular
values, to obtain the reduced SVD as
D = Lr ∆r RTr (7.39)
where Lr is the n × r matrix of the left singular vectors, Rr is the d × r matrix of the
right singular vectors, and ∆r is the r × r diagonal matrix containing the positive
singular values. The reduced SVD leads directly to the spectral decomposition of
D, given as
D = Lr ∆r RTr
  = δ1 l1 rT1 + δ2 l2 rT2 + · · · + δr lr rTr
  = ∑_{i=1}^{r} δi li rTi
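The reduced SVD and the rank-one expansion above can be checked numerically; the following sketch, on an arbitrary small matrix, verifies both forms of the decomposition:

```python
import numpy as np

rng = np.random.default_rng(0)
D = rng.normal(size=(5, 3))

# Thin SVD already drops the singular vectors beyond min(n, d).
L, deltas, Rt = np.linalg.svd(D, full_matrices=False)
r = int(np.sum(deltas > 1e-12))          # numerical rank of D

# Reduced SVD: keep only the r positive singular values and their vectors.
Lr, Delta_r, Rr = L[:, :r], np.diag(deltas[:r]), Rt[:r, :].T
assert np.allclose(D, Lr @ Delta_r @ Rr.T)

# Spectral decomposition: D as a sum of rank-one terms delta_i l_i r_i^T.
D_sum = sum(deltas[i] * np.outer(L[:, i], Rt[i, :]) for i in range(r))
assert np.allclose(D, D_sum)
```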
The spectral decomposition represents D as a sum of rank one matrices of the form
δi li rTi . By selecting the q largest singular values δ1 , δ2 , · · · , δq and the corresponding
left and right singular vectors, we obtain the best rank q approximation to the
original matrix D. That is, if Dq is the matrix defined as
Dq = ∑_{i=1}^{q} δi li rTi
then it can be shown that Dq is the rank q matrix that minimizes the expression
‖D − Dq ‖F
where ‖·‖F denotes the Frobenius norm.
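As an illustration of this best rank-q approximation (often called the Eckart–Young result), the following NumPy sketch builds Dq from the q leading singular values and vectors and evaluates the Frobenius error; the matrix and the choice q = 3 are arbitrary:

```python
import numpy as np

rng = np.random.default_rng(1)
D = rng.normal(size=(100, 10))
L, deltas, Rt = np.linalg.svd(D, full_matrices=False)

q = 3   # number of leading singular values/vectors to keep
Dq = (L[:, :q] * deltas[:q]) @ Rt[:q, :]          # rank-q approximation of D

# Frobenius error of the rank-q approximation; it equals the square root
# of the sum of the discarded squared singular values.
err = np.linalg.norm(D - Dq, ord='fro')
print(err, np.sqrt(np.sum(deltas[q:] ** 2)))      # the two values agree
```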
Viewed as a linear transformation, D maps any vector r ∈ Rd to a vector l ∈ Rn via
Dr = l
The set of all vectors l ∈ Rn such that Dr = l over all possible r ∈ Rd is called the
column space of D, and the set of all vectors r ∈ Rd such that DT l = r over all
l ∈ Rn is called the row space of D, which is equivalent to the column space of DT .
In other words, the column space of D is the set of all vectors that can be obtained
as linear combinations of the columns of D, and the row space of D is the set of all
vectors that can be obtained as linear combinations of the rows of D (or columns
of DT ). Also note that the set of all vectors r ∈ Rd , such that Dr = 0 is called the
null space of D, and finally, the set of all vectors l ∈ Rn , such that DT l = 0 is called
the left null space of D.
One of the main properties of SVD is that it gives a basis for each of the four
fundamental spaces associated with the matrix D. If D has rank r, it means that
it has only r independent columns, and also only r independent rows. Thus, the r
left singular vectors l1 , l2 , · · · , lr corresponding to the r non-zero singular values of
D in (7.38) represent a basis for the column space of D. The remaining n − r left
singular vectors lr+1 , · · · , ln represent a basis for the left null space of D. For the
row space, the r right singular vectors r1 , r2 , · · · , rr corresponding to the r non-zero
singular values represent a basis for the row space of D, and the remaining d − r
right singular vectors rr+1 , · · · , rd represent a basis for the null space of D.
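These bases can be read off directly from the full SVD; the sketch below (on an arbitrary rank-deficient matrix of our choosing) extracts a basis for each of the four fundamental subspaces and checks the two null-space conditions:

```python
import numpy as np

# A 5 x 4 matrix of rank 2, so that all four subspaces are non-trivial.
rng = np.random.default_rng(2)
D = rng.normal(size=(5, 2)) @ rng.normal(size=(2, 4))

L, deltas, Rt = np.linalg.svd(D, full_matrices=True)
r = int(np.sum(deltas > 1e-10))          # rank of D

col_space  = L[:, :r]      # l_1, ..., l_r     : basis of the column space
left_null  = L[:, r:]      # l_{r+1}, ..., l_n : basis of the left null space
row_space  = Rt[:r, :].T   # r_1, ..., r_r     : basis of the row space
null_space = Rt[r:, :].T   # r_{r+1}, ..., r_d : basis of the null space

# D sends the null space to 0, and D^T sends the left null space to 0.
assert np.allclose(D @ null_space, 0)
assert np.allclose(D.T @ left_null, 0)
```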
Consider the reduced SVD expression in (7.39). Right multiplying both sides
of the equation by Rr and noting that RTr Rr = Ir , where Ir is the r × r identity
matrix, we have
DRr = Lr ∆r RTr Rr
DRr = Lr ∆r
D (r1 r2 · · · rr ) = (δ1 l1 δ2 l2 · · · δr lr )
that is, Dri = δi li for each i = 1, · · · , r.
In other words, SVD is a special factorization of the matrix D, such that any basis
vector ri for the row space is mapped to the corresponding basis vector li in the
column space, scaled by the singular value δi . As such, we can think of the SVD as
a mapping from an orthonormal basis (r1 , r2 , · · · , rr ) in Rd (the row space) to an
orthonormal basis (l1 , l2 , · · · , lr ) in Rn (the column space), with the corresponding
axes scaled according to the singular values δ1 , δ2 , · · · , δr .
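A small numerical check of this mapping, on an arbitrary matrix, confirms that each right singular vector ri is sent to δi li:

```python
import numpy as np

rng = np.random.default_rng(3)
D = rng.normal(size=(6, 3))
L, deltas, Rt = np.linalg.svd(D, full_matrices=False)

# Each basis vector r_i of the row space is mapped by D to delta_i * l_i.
for i in range(len(deltas)):
    assert np.allclose(D @ Rt[i, :], deltas[i] * L[:, i])
```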
Consider next the matrix DT D. Using the SVD of D, we have
DT D = (L∆RT )T (L∆RT )
     = R∆T LT L∆RT
     = R(∆T ∆)RT
     = R∆2d RT (7.40)
where ∆2d is the d × d diagonal matrix defined as ∆2d (i, i) = δi2 , for i = 1, · · · , d.
Only r ≤ min(d, n) of these eigenvalues are positive, whereas the rest are all zeros.
Since the covariance matrix of the centered D is given as Σ = (1/n) DT D, and since it
can be decomposed as Σ = UΛUT via PCA (7.37), we have
DT D = nΣ
= nUΛUT
= U(nΛ)UT (7.41)
Equating (7.40) and (7.41), we conclude that the right singular vectors R are the
same as the eigenvectors of Σ. Furthermore, the corresponding singular values of D
are related to the eigenvalues of Σ by the expression
nλi = δi2 , or equivalently λi = δi2 /n, for i = 1, · · · , d (7.42)
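The relationship in (7.42) is easy to verify numerically; the sketch below, on an arbitrary centered matrix, compares the eigenvalues of Σ with the squared singular values of D divided by n:

```python
import numpy as np

rng = np.random.default_rng(4)
D = rng.normal(size=(50, 4))
D = D - D.mean(axis=0)                  # center D, so Sigma = (1/n) D^T D
n = D.shape[0]

Sigma = (D.T @ D) / n
lambdas, U = np.linalg.eigh(Sigma)      # eigh returns ascending eigenvalues
lambdas = lambdas[::-1]                 # reorder to decreasing, as in PCA

_, deltas, _ = np.linalg.svd(D, full_matrices=False)
assert np.allclose(lambdas, deltas ** 2 / n)    # lambda_i = delta_i^2 / n
```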
Let us now consider the matrix DDT . We have
DDT = (L∆RT )(L∆RT )T = L∆RT R∆T LT = L(∆∆T )LT = L∆2n LT
where ∆2n is the n × n diagonal matrix given as ∆2n (i, i) = δi2 , for i = 1, · · · , n.
Only r of these eigenvalues are positive, whereas the rest are all zeros. Thus, the
left singular vectors L are the eigenvectors of the n × n matrix DDT , and the
corresponding eigenvalues are given as δi2 .
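Similarly, one can confirm numerically that the leading eigenvalues of DDT are the squared singular values; the following sketch uses an arbitrary small matrix:

```python
import numpy as np

rng = np.random.default_rng(5)
D = rng.normal(size=(6, 3))
L, deltas, _ = np.linalg.svd(D, full_matrices=False)

# Eigen-decompose the n x n matrix D D^T: its r largest eigenvalues are the
# squared singular values, and the matching eigenvectors coincide with the
# left singular vectors (up to sign).
evals, evecs = np.linalg.eigh(D @ D.T)
evals, evecs = evals[::-1], evecs[:, ::-1]       # sort in decreasing order
assert np.allclose(evals[:len(deltas)], deltas ** 2)
```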
Example 7.9: Let us consider the n × d centered Iris data matrix D from Exam-
ple 7.1, with n = 150 and d = 3. In Example 7.5 we computed the eigenvectors and
eigenvalues of the covariance matrix Σ.
Computing the SVD of D yields the following non-zero singular values and the
corresponding right singular vectors
We do not show the left singular vectors l1 , l2 , l3 since they lie in R150 . Using (7.42)
one can verify that λi = δi2 /n.
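The omitted numbers can be reproduced along the following lines; this is a sketch that assumes the data of Example 7.1 are the first three attributes of the standard Iris dataset as loaded by scikit-learn, which may differ slightly from the copy used in the book:

```python
import numpy as np
from sklearn.datasets import load_iris

# Assumption: Example 7.1 uses the first three Iris attributes (n = 150, d = 3).
X = load_iris().data[:, :3]
D = X - X.mean(axis=0)                  # centered data matrix
n = D.shape[0]

_, deltas, Rt = np.linalg.svd(D, full_matrices=False)
print(deltas)            # the three non-zero singular values
print(Rt.T)              # right singular vectors r_1, r_2, r_3 as columns
print(deltas**2 / n)     # should match the PCA eigenvalues lambda_i of Sigma
```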
7.6 Exercises
Q1. Consider the data matrix D given below:
X1 X2
8 -20
0 -1
10 -19
10 -20
2 0
Q3. Compute the singular values and the left and right singular vectors of the fol-
lowing matrix
A = [ 1 1 0 ]
    [ 0 0 1 ]
Q4. Consider the data in Table 7.1. Define the kernel function as follows: K(xi , xj ) =
‖xi − xj ‖2 . Answer the following questions.
i xi
x1 (4,2.9)
x4 (2.5,1)
x7 (3.5,4)
x9 (2,2.1)
Q5. Given the two points x1 = (1, 2), and x2 = (2, 1), use the kernel function
K(xi , xj ) = (xTi xj )2