
SINGULAR VALUE DECOMPOSITION (SVD)/

PRINCIPAL COMPONENTS ANALYSIS (PCA)

!1
SVD - EXAMPLE

U, S, VT = numpy.linalg.svd(img)

!2
SVD - EXAMPLE
[Figure: an image reconstructed from its rank-k SVD approximation, U[:, :k] S[:k] VT[:k, :], for k = 600 (full rank), 300, 100, 50, 20, and 10.]
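A minimal sketch of the rank-k reconstruction, assuming img is the 2-D grayscale image array from the previous slide (a random stand-in is used here so the snippet runs on its own); full_matrices=False keeps the truncated factors compatible for multiplication:

import numpy as np

img = np.random.rand(600, 800)   # stand-in for a 2-D grayscale image array

U, S, VT = np.linalg.svd(img, full_matrices=False)

k = 50                                            # number of singular values/vectors to keep
approx = U[:, :k] @ np.diag(S[:k]) @ VT[:k, :]    # rank-k approximation of img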

!3
PCA - INTRODUCTION

$X = \begin{bmatrix} 1 & 2 & 4 \\ 2 & 1 & 5 \\ 3 & 4 & 10 \\ 4 & 3 & 11 \end{bmatrix}$

!4
PCA - INTRODUCTION

!5
PCA - INTRODUCTION

!6
PCA - INTRODUCTION

!7
PRINCIPAL COMPONENT ANALYSIS

• A technique to find the directions along which the points (set of tuples) in high-dimensional data line up best.

• Treat a set of tuples as a matrix M and find the eigenvectors for MM^T or M^TM.

• The matrix of these eigenvectors can be thought of as a rigid rotation in a high-dimensional space.

• When this transformation is applied to the original data, the axis corresponding to the principal eigenvector is the one along which the points are most “spread out”.
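A small check of the "rigid rotation" point (a sketch; the matrix M below simply reuses the small example that appears later in the deck): the eigenvectors of the symmetric matrix M^T M are orthonormal, so stacking them as columns gives an orthogonal matrix.

import numpy as np

M = np.array([[1., 2.], [2., 1.], [3., 4.], [4., 3.]])  # small example data matrix

# Columns of V are the eigenvectors of the symmetric matrix M^T M
eigvals, V = np.linalg.eigh(M.T @ M)

# V is orthogonal (V^T V = I), so applying it is a rigid rotation (up to reflection)
print(np.allclose(V.T @ V, np.eye(2)))  # True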

!8
PRINCIPAL COMPONENT ANALYSIS

• When this transformation is applied to the original data, the axis corresponding to the principal eigenvector is the one along which the points are most “spread out”.

• This axis is the one along which the variance of the data is maximized.

• Points can best be viewed as lying along this axis with small deviations from this axis.

• Likewise, the axis corresponding to the second eigenvector is the axis along which the variance of distances from the first axis is greatest, and so on.

!9
PRINCIPAL COMPONENT ANALYSIS
• Principal Component Analysis (PCA) is a dimensionality reduction method.

• The goal is to embed data from a high-dimensional space onto a small number of dimensions.

• Its most frequent use is in exploratory data analysis and visualization.

• It can also be helpful in regression (linear or logistic), where we can transform input variables into a smaller number of predictors for modeling.

!10
PRINCIPAL COMPONENT ANALYSIS
• Mathematically,

Given: data set $\{x_1, x_2, \ldots, x_n\}$, where $x_i$ is the vector of $p$ variable values for the $i$-th observation.

Return: matrix $[\phi_1, \phi_2, \ldots, \phi_p]$ of linear transformations that retain maximal variance.

• You can think of the first vector $\phi_1$ as a linear transformation that embeds observations into 1 dimension:

$Z_1 = \phi_{11}X_1 + \phi_{21}X_2 + \cdots + \phi_{p1}X_p$

!11
PRINCIPAL COMPONENT ANALYSIS
• You can think of the first vector $\phi_1$ as a linear transformation that embeds observations into 1 dimension:

$Z_1 = \phi_{11}X_1 + \phi_{21}X_2 + \cdots + \phi_{p1}X_p$

where $\phi_1$ is selected so that the resulting dataset $\{z_1, \ldots, z_n\}$ has maximum variance.

• In order for this to make sense, mathematically, the data has to be centered:

• Each variable $X_j$ has zero mean.

• The transformation vector $\phi_1$ has to be normalized, i.e., $\sum_{j=1}^{p} \phi_{j1}^2 = 1$.
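A small numeric sketch of both conditions, reusing the 4×2 example matrix from the later slides: the variables are centered to zero mean, ϕ1 is checked to be unit length, and Z1 is the resulting one-dimensional embedding.

import numpy as np

X = np.array([[1., 2.], [2., 1.], [3., 4.], [4., 3.]])  # example data: n = 4 observations, p = 2

Xc = X - X.mean(axis=0)                   # centering: each variable now has zero mean
phi1 = np.array([1., 1.]) / np.sqrt(2)    # candidate first transformation vector

print(np.isclose(np.sum(phi1 ** 2), 1.0))  # normalization constraint holds
z1 = Xc @ phi1                             # Z1 = phi_11 X_1 + phi_21 X_2
print(z1.var())                            # the variance that PCA maximizes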

!12
PRINCIPAL COMPONENT ANALYSIS
• In order for this to make sense, mathematically, the data has to be centered:

• Each variable $X_j$ has zero mean.

• The transformation vector $\phi_1$ has to be normalized, i.e., $\sum_{j=1}^{p} \phi_{j1}^2 = 1$.

• We can find $\phi_1$ by solving an optimization problem:

$\max_{\phi_{11},\phi_{21},\ldots,\phi_{p1}} \; \frac{1}{n}\sum_{i=1}^{n}\left(\sum_{j=1}^{p}\phi_{j1}x_{ij}\right)^{2} \quad \text{s.t.} \quad \sum_{j=1}^{p}\phi_{j1}^{2} = 1$

Maximize variance, subject to the normalization constraint.



!13
PRINCIPAL COMPONENT ANALYSIS

• We can find $\phi_1$ by solving an optimization problem:

$\max_{\phi_{11},\phi_{21},\ldots,\phi_{p1}} \; \frac{1}{n}\sum_{i=1}^{n}\left(\sum_{j=1}^{p}\phi_{j1}x_{ij}\right)^{2} \quad \text{s.t.} \quad \sum_{j=1}^{p}\phi_{j1}^{2} = 1$

Maximize variance, subject to the normalization constraint.

• The second transformation, $\phi_2$, is obtained similarly, with the added constraint that $\phi_2$ is orthogonal to $\phi_1$.

• Taken together, $[\phi_1, \phi_2]$ define a pair of linear transformations of the data into 2-dimensional space:

$Z_{n\times 2} = X_{n\times p}[\phi_1, \phi_2]_{p\times 2}$
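In practice this constrained maximization is not handed to a generic optimizer: the maximizers ϕ1, ϕ2, ... are the eigenvectors of the covariance matrix of the centered data, ordered by decreasing eigenvalue. A minimal sketch, assuming X is an n × p numpy array:

import numpy as np

def pca_components(X, k=2):
    """Return the top-k transformation vectors [phi_1, ..., phi_k] and the projection Z."""
    Xc = X - X.mean(axis=0)                # center each variable
    C = (Xc.T @ Xc) / Xc.shape[0]          # p x p covariance matrix
    eigvals, eigvecs = np.linalg.eigh(C)   # eigh returns eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]      # sort directions by variance explained
    phi = eigvecs[:, order[:k]]            # p x k matrix [phi_1, ..., phi_k]
    Z = Xc @ phi                           # n x k, i.e., Z_{n x k} = X_{n x p} phi_{p x k}
    return phi, Z

Using np.linalg.svd on the centered matrix Xc would give the same directions and is usually preferred numerically.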

!14
PRINCIPAL COMPONENT ANALYSIS

• Taken together, $[\phi_1, \phi_2]$ define a pair of linear transformations of the data into 2-dimensional space:

$Z_{n\times 2} = X_{n\times p}[\phi_1, \phi_2]_{p\times 2}$

• Each of the columns of the $Z$ matrix is called a principal component.

• The units of the PCs are meaningless.

• In practice we may also scale $X_j$ to have unit variance.

• In general, if the variables $X_j$ are measured in different units (e.g., miles vs. liters vs. dollars), they should be scaled to have unit variance.

!15
SPECTRAL THEOREM
Using the spectral theorem:

$(X^T X)\phi = \lambda\phi \implies XX^T(X\phi) = \lambda(X\phi)$

Conclusion:

The matrices $XX^T$ and $X^T X$ share the same nonzero eigenvalues.

To get an eigenvector of $XX^T$ from one of $X^T X$, multiply $\phi$ on the left by $X$.

This is very powerful, particularly when the number of observations, m, and the number of predictors, n, are drastically different in size.

For PCA: $\mathrm{Cov}(X, X) = XX^T$
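A quick numerical check of this fact (a sketch with an arbitrary small random matrix): the nonzero eigenvalues of X^T X and X X^T coincide, and multiplying an eigenvector ϕ of X^T X on the left by X gives an eigenvector of X X^T with the same eigenvalue.

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 3))                # 5 observations, 3 predictors (arbitrary example)

lam_small, Phi = np.linalg.eigh(X.T @ X)   # eigenpairs of the 3 x 3 matrix
lam_big, _ = np.linalg.eigh(X @ X.T)       # eigenpairs of the 5 x 5 matrix

# The nonzero eigenvalues match; the 5 x 5 matrix has two extra (near-)zero eigenvalues
print(np.allclose(np.sort(lam_small), np.sort(lam_big)[-3:]))

# X @ phi is an eigenvector of X X^T with the same eigenvalue
phi, lam = Phi[:, -1], lam_small[-1]
print(np.allclose((X @ X.T) @ (X @ phi), lam * (X @ phi)))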

!16
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix}$

Eigenvalues and eigenvectors?

!17
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix}$

From the spectral theorem:

$(X^T X)\phi = \lambda\phi$

$X^T X = \begin{bmatrix} 1 & 2 & 3 & 4 \\ 2 & 1 & 4 & 3 \end{bmatrix}\begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix} = \begin{bmatrix} 30 & 28 \\ 28 & 30 \end{bmatrix}$

!18
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix}$

From the spectral theorem:

$(X^T X)\phi = \lambda\phi \implies (X^T X)\phi - \lambda I\phi = 0$

$\left((X^T X) - \lambda I\right)\phi = 0$

$\det\begin{bmatrix} 30 - \lambda & 28 \\ 28 & 30 - \lambda \end{bmatrix} = 0 \implies \lambda = 58 \text{ and } \lambda = 2$

!19
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix}$

From the spectral theorem:

$(X^T X)\phi = \lambda\phi$

$\begin{bmatrix} 30 & 28 \\ 28 & 30 \end{bmatrix}\begin{bmatrix} \phi_{11} \\ \phi_{12} \end{bmatrix} = 58\begin{bmatrix} \phi_{11} \\ \phi_{12} \end{bmatrix} \implies \phi_1 = \begin{bmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix}$

!20
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix}$

From the spectral theorem: $(X^T X)\phi = \lambda\phi$

$\begin{bmatrix} 30 & 28 \\ 28 & 30 \end{bmatrix}\begin{bmatrix} \phi_{11} \\ \phi_{12} \end{bmatrix} = 58\begin{bmatrix} \phi_{11} \\ \phi_{12} \end{bmatrix} \implies \phi_1 = \begin{bmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix}$

$\begin{bmatrix} 30 & 28 \\ 28 & 30 \end{bmatrix}\begin{bmatrix} \phi_{21} \\ \phi_{22} \end{bmatrix} = 2\begin{bmatrix} \phi_{21} \\ \phi_{22} \end{bmatrix} \implies \phi_2 = \begin{bmatrix} -1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix}$

!21
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix}$

From the spectral theorem: $(X^T X)\phi = \lambda\phi$

$\phi_1 = \begin{bmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix},\ \lambda_1 = 58 \qquad \phi_2 = \begin{bmatrix} -1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix},\ \lambda_2 = 2$

$\phi = \begin{bmatrix} 1/\sqrt{2} & -1/\sqrt{2} \\ 1/\sqrt{2} & 1/\sqrt{2} \end{bmatrix}$

!22
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix} \qquad \phi_1 = \begin{bmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix},\ \lambda_1 = 58 \qquad \phi_2 = \begin{bmatrix} -1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix},\ \lambda_2 = 2$

$Z = X\phi = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix}\begin{bmatrix} 1/\sqrt{2} & -1/\sqrt{2} \\ 1/\sqrt{2} & 1/\sqrt{2} \end{bmatrix} = \begin{bmatrix} 3/\sqrt{2} & 1/\sqrt{2} \\ 3/\sqrt{2} & -1/\sqrt{2} \\ 7/\sqrt{2} & 1/\sqrt{2} \\ 7/\sqrt{2} & -1/\sqrt{2} \end{bmatrix}$
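The worked example can be checked with numpy (a sketch; note that numpy may return the eigenvectors in a different order or with flipped signs):

import numpy as np

X = np.array([[1., 2.], [2., 1.], [3., 4.], [4., 3.]])

lam, phi = np.linalg.eigh(X.T @ X)   # eigenvalues in ascending order: [2, 58]
print(lam)
print(phi)                           # columns are (up to sign) [-1/sqrt(2), 1/sqrt(2)] and [1/sqrt(2), 1/sqrt(2)]

Z = X @ phi[:, ::-1]                 # reorder so the lambda = 58 direction comes first
print(Z)                             # rows: [3/sqrt(2), ±1/sqrt(2)] and [7/sqrt(2), ±1/sqrt(2)]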

!23
EXAMPLE - PCA
$X = \begin{bmatrix} 1 & 2 \\ 2 & 1 \\ 3 & 4 \\ 4 & 3 \end{bmatrix} \qquad Z = \begin{bmatrix} 3/\sqrt{2} & 1/\sqrt{2} \\ 3/\sqrt{2} & -1/\sqrt{2} \\ 7/\sqrt{2} & 1/\sqrt{2} \\ 7/\sqrt{2} & -1/\sqrt{2} \end{bmatrix}$

[Figure: scatter plot of the points (1,2), (2,1), (3,4), (4,3) with the first principal axis; (1,2) and (2,1) project onto the axis at (1.5, 1.5), and (3,4) and (4,3) at (3.5, 3.5), giving the rotated coordinates (3/√2, ±1/√2) and (7/√2, ±1/√2).]

!24
PCA STEPS -
STEP 1 MEAN SUBTRACTION

!25
PCA STEPS -
STEP 2 COVARIANCE MATRIX

[Figure: top 5 rows of the mean-centered data, and the resulting covariance matrix.]

!26
PCA STEPS -
STEP 3 EIGENVALUES & EIGENVECTORS OF COVARIANCE MATRIX

!27
PCA STEPS -
STEP 4 - PRINCIPAL COMPONENTS

Multiply each eigenvector by its corresponding eigenvalue (usually its square root).

Plot them on top of the data.

!28
PCA STEPS -
STEP 5 - PROJECT DATA ALONG
DOMINANT PC

newData = PC1 × oldData
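Steps 1-5 can be written as a short numpy sketch (the data array below is a made-up example; rows are observations, so the projection is written as centered @ PC1 rather than the slide's PC1 × oldData matrix layout):

import numpy as np

# Made-up (n observations) x (p variables) data array
data = np.array([[2.5, 2.4], [0.5, 0.7], [2.2, 2.9], [1.9, 2.2], [3.1, 3.0]])

# Step 1: mean subtraction
centered = data - data.mean(axis=0)

# Step 2: covariance matrix of the mean-centered data
cov = np.cov(centered, rowvar=False)

# Step 3: eigenvalues and eigenvectors of the covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov)

# Step 4: principal components; the dominant PC has the largest eigenvalue
pc1 = eigvecs[:, np.argmax(eigvals)]

# Step 5: project the data along the dominant PC (one coordinate per observation)
new_data = centered @ pc1
print(new_data)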

!29
HOW MANY PRINCIPAL COMPONENTS ?

• How many PCs should we consider in post-hoc analysis?

• One result of PCA is a measure of the variance attributed to each PC relative to the total variance of the dataset.

• We can calculate the percentage of variance explained for the m-th PC:

$\mathrm{PVE}_m = \dfrac{\sum_{i=1}^{n} z_{im}^2}{\sum_{j=1}^{p}\sum_{i=1}^{n} x_{ij}^2}$
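A sketch of this calculation, assuming X has already been mean-centered (and scaled, if desired) and Z = Xϕ holds the principal components as columns:

import numpy as np

def pve(X, Z):
    """Proportion of variance explained by each principal component (column of Z)."""
    total_var = np.sum(X ** 2)                 # denominator: sum over j and i of x_ij^2
    return np.sum(Z ** 2, axis=0) / total_var  # numerator per component: sum over i of z_im^2

When all p components are kept, the returned values sum to 1.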

!30
HOW MANY PRINCIPAL COMPONENTS ?

• We can calculate the percentage of variance explained for the m-th PC:

$\mathrm{PVE}_m = \dfrac{\sum_{i=1}^{n} z_{im}^2}{\sum_{j=1}^{p}\sum_{i=1}^{n} x_{ij}^2}$

!31
