
Principal Components Analysis

This technique reduces the dimensionality of a data set containing a large number
of (interrelated) variables. It was initially developed by Pearson (1901), although
it was not until 1933, with Hotelling's work, that it obtained its algebraic formulation.
To this end, the original variables are transformed into a new set of variables (called
principal components) which are uncorrelated, have orthogonal coefficient vectors, and
can be ordered according to the proportion of the variance of the original variables that they explain.

First Principal Component


Definition. Let $X = [x_1, \cdots, x_p]^T$ be a multidimensional stochastic variable which
has the variance-covariance (or dispersion) matrix $\Sigma$. Without loss of generality
we can assume it has mean zero, $\mu = 0$. We define the first principal axis
$\alpha_1$, with $\alpha_1^T\alpha_1 = 1$ (normed coefficients), as the coefficient vector for which the
linear combination of the original variables $\alpha_1^T X$ has the largest variance. The random
variable $Y_1 = \alpha_1^T X$ is called the first principal component.

Derivation. We want to find $\alpha_1$ that maximizes $\mathrm{Var}(\alpha_1^T X)$. We know that

$$\mathrm{Var}(\alpha_1^T X) = \alpha_1^T D(X)\,\alpha_1 = \alpha_1^T \Sigma\,\alpha_1,$$

which has no maximum, as $\alpha_1$ is not bounded; therefore we impose the normalization
constraint $\alpha_1^T\alpha_1 = 1$. So, the optimization problem becomes

$$\max_{\alpha_1}\; \alpha_1^T \Sigma\,\alpha_1, \quad \text{s.t. } \alpha_1^T\alpha_1 = 1.$$
Using Lagrange multipliers

$$\max_{\alpha_1}\; \alpha_1^T \Sigma\,\alpha_1 - \lambda(\alpha_1^T\alpha_1 - 1).$$

Differentiating with respect to $\alpha_1$ and setting the derivative to zero,

$$(\Sigma + \Sigma^T)\alpha_1 - \lambda(I + I^T)\alpha_1 = 2\Sigma\alpha_1 - 2\lambda\alpha_1 = 0 \;\Rightarrow\; \Sigma\alpha_1 = \lambda\alpha_1.$$

Then, $\alpha_1$ is an eigenvector of $\Sigma$ and $\lambda$ an eigenvalue. But which eigenvalue?

$$\max_{\alpha_1}\; \alpha_1^T \Sigma\,\alpha_1 = \max_{\alpha_1^T\alpha_1 = 1}\; \alpha_1^T \lambda\,\alpha_1 = \max\; \lambda.$$

Hence $\lambda$ is the largest eigenvalue of $\Sigma$.
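As a quick numerical check (a hypothetical NumPy sketch, not part of the original notes; the simulated data and variable names are ours), the first principal axis can be obtained as the eigenvector of the sample covariance matrix associated with its largest eigenvalue:

```python
import numpy as np

# Hypothetical example: simulated correlated data, centered so that mu = 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3)) @ np.array([[3.0, 0.0, 0.0],
                                          [1.0, 1.0, 0.0],
                                          [0.5, 0.2, 0.3]])
X = X - X.mean(axis=0)

Sigma = np.cov(X, rowvar=False)            # sample dispersion matrix
eigvals, eigvecs = np.linalg.eigh(Sigma)   # eigenvalues in ascending order
alpha1 = eigvecs[:, -1]                    # first principal axis (largest eigenvalue)
Y1 = X @ alpha1                            # scores of the first principal component

# Var(Y1) equals the largest eigenvalue of Sigma
print(np.isclose(Y1.var(ddof=1), eigvals[-1]))
```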

The m-th principal component


Definition. The m-th principal axis $\alpha_m$, with $\alpha_m^T\alpha_m = 1$ (normed coefficients),
is defined as the coefficient vector such that the random variable $Y_m = \alpha_m^T X$
has maximum variance and $\mathrm{cov}(Y_m, Y_k) = 0$, $\forall k = 1, \cdots, m-1$. The random
variable $Y_m$ is called the m-th principal component.

Derivation. Let’s start with the second principal component:

$$\max_{\alpha_2}\; \mathrm{Var}(\alpha_2^T X), \quad \text{s.t. } \alpha_2^T\alpha_2 = 1, \;\; \mathrm{cov}(\alpha_1^T X, \alpha_2^T X) = 0,$$

which is equivalent to

$$\max_{\alpha_2}\; \alpha_2^T \Sigma\,\alpha_2, \quad \text{s.t. } \alpha_2^T\alpha_2 = 1, \;\; \alpha_1^T\alpha_2 = 0.$$
In fact, $\mathrm{cov}(\alpha_2^T X, \alpha_1^T X) = \alpha_2^T \Sigma\,\alpha_1 = \alpha_2^T \lambda\,\alpha_1 = \lambda\,\alpha_2^T\alpha_1 = 0 \;\Leftrightarrow\; \alpha_2^T\alpha_1 = 0$.
Using Lagrange multipliers,

$$\max_{\alpha_2}\; \alpha_2^T \Sigma\,\alpha_2 - \lambda(\alpha_2^T\alpha_2 - 1) - \phi(\alpha_1^T\alpha_2 - 0).$$

Now, differentiating with respect to $\alpha_2$,

$$2\Sigma\alpha_2 - 2\lambda\alpha_2 - \phi\alpha_1 = 0,$$

multiplying on the left by $\alpha_1^T$,

$$2\underbrace{\alpha_1^T\Sigma\alpha_2}_{0} - 2\lambda\underbrace{\alpha_1^T\alpha_2}_{0} - \phi\underbrace{\alpha_1^T\alpha_1}_{1} = 0 \;\Rightarrow\; \phi = 0.$$

Then $\Sigma\alpha_2 = \lambda\alpha_2$; in consequence $\alpha_2$ is an eigenvector of $\Sigma$ and $\lambda$ an eigenvalue.
As before, $\lambda = \max_{\alpha_2}\alpha_2^T\Sigma\alpha_2$, and, assuming distinct eigenvalues, it has to be
the second largest one: if $\alpha_2 = \alpha_1$, then $\alpha_2^T\alpha_1 \neq 0$, violating the orthogonality constraint.

The m-th principal component


Now we can generalize to the m-th principal component:

$$\max_{\alpha_m}\; \alpha_m^T \Sigma\,\alpha_m, \quad \text{s.t. } \alpha_m^T\alpha_m = 1, \;\; \alpha_i^T\alpha_m = 0, \;\forall i = 1, \cdots, m-1.$$
Using Lagrange multipliers,

$$\max_{\alpha_m}\; \alpha_m^T \Sigma\,\alpha_m - \lambda(\alpha_m^T\alpha_m - 1) - \sum_{i=1}^{m-1}\phi_i(\alpha_m^T\alpha_i - 0).$$

Now, differentiating with respect to $\alpha_m$,


$$2\Sigma\alpha_m - 2\lambda\alpha_m - \sum_{i=1}^{m-1}\phi_i\alpha_i = 0.$$

Multiplying on the left by $\alpha_j^T$, $j = 1, \cdots, m-1$, then

$$2\underbrace{\alpha_j^T\Sigma\alpha_m}_{0} - 2\lambda\underbrace{\alpha_j^T\alpha_m}_{0} - \underbrace{\sum_{i=1}^{m-1}\phi_i\,\alpha_j^T\alpha_i}_{\phi_j} = 0 \;\Rightarrow\; \phi_j = 0.$$

As before, $\Sigma\alpha_m = \lambda\alpha_m$, so $\lambda$ is an eigenvalue associated with the eigenvector $\alpha_m$.
Some important observations:
Let $\lambda_1, \cdots, \lambda_p$ be the eigenvalues of $\Sigma$, and $\alpha_1, \cdots, \alpha_p$ its eigenvectors; then the
principal components are

$$Y_1 = \alpha_1^T X, \quad \cdots, \quad Y_m = \alpha_m^T X, \quad \cdots, \quad Y_p = \alpha_p^T X,$$

and $\mathrm{Var}(Y_i) = \mathrm{Var}(\alpha_i^T X) = \alpha_i^T\Sigma\,\alpha_i = \lambda_i\,\alpha_i^T\alpha_i = \lambda_i$. Defining $P$ as the matrix
whose columns are the eigenvectors, $P = [\alpha_1, \cdots, \alpha_p]$, we can write

$$Y = [Y_1, \cdots, Y_p]^T = [\alpha_1^T X, \cdots, \alpha_p^T X]^T = P^T X.$$

Also, we saw that

$$\Sigma\alpha_i = \lambda_i\alpha_i, \quad \forall\, i = 1, \cdots, p,$$

so, defining $\Lambda = \mathrm{diag}(\lambda_1, \cdots, \lambda_p)$, we can write $\Sigma P = P\Lambda$, which, by
observing that $P^{-1} = P^T$, is equivalent to $P^T\Sigma P = \Lambda$. On the basis of the
above results we have that

$$D(Y) = D(P^T X) = P^T D(X)\,P = P^T\Sigma P = \Lambda.$$
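Continuing the hypothetical NumPy sketch from above (names are ours, not from the notes), one can verify numerically that the covariance matrix of the components $Y = P^T X$ is indeed $\Lambda$:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 4)) @ rng.normal(size=(4, 4))  # arbitrary correlated data
X = X - X.mean(axis=0)

Sigma = np.cov(X, rowvar=False)
lam, P = np.linalg.eigh(Sigma)      # columns of P are the orthonormal alpha_i
Y = X @ P                           # each row holds the p component scores

print(np.allclose(np.cov(Y, rowvar=False), np.diag(lam)))  # D(Y) = Lambda
print(np.allclose(P.T @ Sigma @ P, np.diag(lam)))          # P^T Sigma P = Lambda
```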

Theorem. The sum of the variances of the original variables is equal to the sum
of the variances of the principal components.
Proof.

$$\sum_{i=1}^{p}\mathrm{Var}(X_i) = \mathrm{tr}(\Sigma) = \mathrm{tr}(P\Lambda P^T) = \mathrm{tr}(P^T P\Lambda) = \mathrm{tr}(\Lambda) = \sum_{i=1}^{p}\lambda_i = \sum_{i=1}^{p}\mathrm{Var}(Y_i),$$

where we use that the trace is invariant under cyclic permutations.

Consequence: The proportion of the variance explained by the i-th principal
component is given by

$$\frac{\lambda_i}{\lambda_1 + \cdots + \lambda_p},$$

and the proportion explained by the first k principal components is given by

$$\frac{\lambda_1 + \cdots + \lambda_k}{\lambda_1 + \cdots + \lambda_p}, \quad k \le p.$$
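In code these proportions are just the normalized eigenvalues; below is a hypothetical snippet with made-up eigenvalues, only to illustrate the formulas:

```python
import numpy as np

lam = np.array([4.2, 1.5, 0.8, 0.5])   # illustrative eigenvalues, sorted decreasingly

explained = lam / lam.sum()            # lambda_i / (lambda_1 + ... + lambda_p)
cumulative = np.cumsum(explained)      # proportion explained by the first k components
print(explained)
print(cumulative)                      # e.g. pick k where this exceeds a chosen threshold
```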

The above theorem establishes a very useful relationship between the variance
of the original variables and the variance of the principal components. However,
it is possible to establish a more general relationship that links the dispersion
matrix of the original variables with the principal axes.

Theorem (Spectral Decomposition).

$$\Sigma = \lambda_1\alpha_1\alpha_1^T + \cdots + \lambda_p\alpha_p\alpha_p^T.$$

Proof. It is enough to note that $\Sigma = P\Lambda P^T$.
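A short numerical check of the spectral decomposition (again a hypothetical NumPy snippet with our own names):

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.normal(size=(5, 3))
Sigma = A.T @ A                        # some symmetric positive semi-definite matrix

lam, P = np.linalg.eigh(Sigma)
# Rebuild Sigma as the sum of the rank-one terms lambda_i * alpha_i alpha_i^T
Sigma_rebuilt = sum(l * np.outer(a, a) for l, a in zip(lam, P.T))
print(np.allclose(Sigma, Sigma_rebuilt))
```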


Let us now determine the correlation between the original variables and the principal
components:

$$\mathrm{cov}(X, Y) = \mathrm{cov}(X, P^T X) = \mathrm{cov}(X, X)P = \Sigma P = P\Lambda P^T P = P\Lambda,$$

so $\mathrm{cov}(X_i, Y_j) = p_{ij}\lambda_j$ and

$$\mathrm{corr}(X_i, Y_j) = \frac{\mathrm{cov}(X_i, Y_j)}{\sqrt{\mathrm{Var}(X_i)}\sqrt{\mathrm{Var}(Y_j)}} = \frac{p_{ij}\lambda_j}{\sqrt{\sigma_{ii}}\sqrt{\lambda_j}} = p_{ij}\left(\frac{\lambda_j}{\sigma_{ii}}\right)^{1/2}.$$
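The correlation formula can be verified numerically as well (hypothetical sketch; data and names are ours):

```python
import numpy as np

rng = np.random.default_rng(3)
X = rng.normal(size=(1000, 3)) @ rng.normal(size=(3, 3))
X = X - X.mean(axis=0)

Sigma = np.cov(X, rowvar=False)
lam, P = np.linalg.eigh(Sigma)
Y = X @ P

# corr(X_i, Y_j) = p_ij * sqrt(lambda_j / sigma_ii)
corr_formula = P * np.sqrt(lam)[None, :] / np.sqrt(np.diag(Sigma))[:, None]

# empirical correlation between each original variable and each component
corr_empirical = np.array([[np.corrcoef(X[:, i], Y[:, j])[0, 1]
                            for j in range(3)] for i in range(3)])
print(np.allclose(corr_formula, corr_empirical))
```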

Calculating the eigenvectors when n ≪ p


How can we calculate the principal components of a data set in which the number of
variables $p$ is much greater than the number of observations $n$? Note that when $p$
is large, $\Sigma$ is a very large $p \times p$ matrix, and its eigenvalues and eigenvectors cannot
be calculated directly due to computational problems.
To solve this, we first observe that a matrix and its transpose have the same
eigenvalues, since the characteristic polynomial of $A^T$ is given by

$$|A^T - \lambda I| = |A^T - \lambda I^T| = |(A - \lambda I)^T| = |A - \lambda I|.$$

More importantly for our purposes, the non-zero eigenvalues of $G^TG$ and $GG^T$
also coincide, as the derivation below shows, so we can work with the much smaller
of the two matrices. To do this, let $G$ be the matrix containing our $n$ observations
with the mean subtracted, $G = [g_{ij}] = [x_{ij} - \mu_j]$. Let us define

$$\Sigma_l = \frac{1}{n-1}G^TG \quad (l = \text{long}),$$
$$\Sigma_s = \frac{1}{n-1}GG^T \quad (s = \text{short}),$$

and let $\Lambda_s$ and $\Lambda_l$ be the diagonal matrices of their non-zero eigenvalues, $\Lambda_s = \Lambda_l = \Lambda$. We have that

$$\Sigma_s\phi_s = \phi_s\Lambda \;\Rightarrow\; \frac{1}{n-1}GG^T\phi_s = \phi_s\Lambda$$
$$\Rightarrow\; \frac{1}{n-1}G^TGG^T\phi_s = G^T\phi_s\Lambda$$
$$\Rightarrow\; \Sigma_l(G^T\phi_s) = (G^T\phi_s)\Lambda,$$

so the columns of $G^T\phi_s$ are the eigenvectors of $\Sigma_l$ associated with its non-zero eigenvalues.
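A hypothetical NumPy illustration of this trick for an $n \ll p$ data matrix (names are ours; note that the transferred eigenvectors $G^T\phi_s$ must be renormalized to unit length):

```python
import numpy as np

rng = np.random.default_rng(4)
n, p = 20, 1000                          # far more variables than observations
G = rng.normal(size=(n, p))
G = G - G.mean(axis=0)                   # mean-subtracted data matrix

Sigma_s = G @ G.T / (n - 1)              # small n x n matrix
lam_s, phi_s = np.linalg.eigh(Sigma_s)   # cheap eigendecomposition

keep = lam_s > 1e-10                     # keep only the non-zero eigenvalues
lam_s, phi_s = lam_s[keep], phi_s[:, keep]

V = G.T @ phi_s                          # candidate eigenvectors of Sigma_l = G^T G / (n-1)
V = V / np.linalg.norm(V, axis=0)        # renormalize each column to unit length

# Check Sigma_l v = lambda v without ever forming the p x p matrix Sigma_l
print(np.allclose(G.T @ (G @ V) / (n - 1), V * lam_s[None, :]))
```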
