
Principal Component Analysis (PCA) applied to images

Václav Hlaváč

Czech Technical University in Prague


Czech Institute of Informatics, Robotics and Cybernetics
160 00 Prague 6, Jugoslávských partyzánů 1580/3, Czech Republic
http://people.ciirc.cvut.cz/hlavac, [email protected]
also Center for Machine Perception, http://cmp.felk.cvut.cz
Courtesy: Václav Voráček jr.

Outline of the lecture:


Principal components, informal idea. PCA derivation, PCA for images.
Needed linear algebra. Drawbacks. Interesting behaviors live in manifolds.
Least-squares approximation. Subspace methods, LDA, CCA, . . .



PCA, the instance of the eigen-analysis
2/27

PCA seeks to represent observations (or signals, images, and general data) in a form that
enhances the mutual independence of contributory components.

One observation is assumed to be a point in a p-dimensional linear space.

This linear space has some 'natural' orthogonal basis vectors. It is advantageous to express an
observation as a linear combination with regard to this 'natural' basis (given by eigen-vectors,
as we will see later).

PCA is mathematically defined as an orthogonal linear transformation that transforms the
data to a new coordinate system such that the greatest variance by some projection of the
data comes to lie on the first coordinate (called the first principal component), the second
greatest variance on the second coordinate, and so on.
Geometric rationale of PCA
3/27

The PCA objective is to rotate rigidly the coordinate axes of the p-dimensional linear space to
new 'natural' positions (principal axes) such that:

Coordinate axes are ordered such that principal axis 1 corresponds to the highest variance in
the data, axis 2 has the next highest variance, . . . , and axis p has the lowest variance.

The covariance among each pair of principal axes is zero, i.e. they are uncorrelated.


Geometric motivation, principal components (1)
4/27

Consider a two-dimensional vector space of observations, (x1, x2).

Each observation corresponds to a single point in the vector space.

The goal: find another basis of the vector space which treats the variations of the data better.

We will see later: data points (observations) are represented in a rotated orthogonal
coordinate system. The origin is the mean of the data points and the axes are provided by
the eigen-vectors.
Geometric motivation, principal components (2)
5/27

Assume a single straight line approximating best the observations in the (total) least-squares
sense, i.e. by minimizing the sum of perpendicular distances between the data points and
the line.

The first principal direction (component) is the direction of this line. Let it be a new basis
vector z1.

The second principal direction (component, basis vector) z2 is a direction perpendicular to z1
that minimizes the distances of the data points to a corresponding straight line.

For higher-dimensional observation spaces, this construction is repeated.
Eigen-values, eigen-vectors of matrices
6/27

Assume a finite-dimensional vector space and a square, regular n × n matrix A.

Eigen-vectors are solutions of the eigen-equation A v = λ v, where the (column) eigen-vector v
is one of the eigen-vectors of the matrix A and λ is one of its eigen-values (which may be
complex). The matrix A has n eigen-values λi and n eigen-vectors vi, i = 1, . . . , n.

Let us derive: A v = λ v ⇒ A v − λ v = 0 ⇒ (A − λ I) v = 0, where I is the identity
matrix. The equation (A − λ I) v = 0 has a non-zero solution v if and only if
det(A − λ I) = 0.

The polynomial det(A − λ I) is called the characteristic polynomial of the matrix A. The
fundamental theorem of algebra implies that the characteristic polynomial can be factored,
i.e. det(A − λ I) = (λ1 − λ)(λ2 − λ) . . . (λn − λ).

Eigen-values λi are not necessarily distinct. Multiple eigen-values arise from multiple roots of
the characteristic polynomial.
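The eigen-equation and the characteristic polynomial can be checked numerically. Below is a minimal Python/NumPy sketch (not part of the slides; the 2 × 2 example matrix is made up): it verifies A v = λ v for each eigen-pair and that the eigen-values are the roots of det(A − λ I).

```python
# A rough numerical check, assuming NumPy; the 2x2 matrix is made up for illustration.
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])

# np.linalg.eig returns the eigen-values and the eigen-vectors (as columns).
eigvals, eigvecs = np.linalg.eig(A)

for i in range(A.shape[0]):
    v, lam = eigvecs[:, i], eigvals[i]
    assert np.allclose(A @ v, lam * v)        # the eigen-equation A v = lambda v holds

# The eigen-values are exactly the roots of the characteristic polynomial det(A - lambda I).
char_poly = np.poly(A)                        # coefficients of det(A - lambda I)
print(np.sort(eigvals), np.sort(np.roots(char_poly)))
```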


Deterministic view first, statistical view later
7/27

We start by reviewing eigen-analysis from a deterministic, linear-algebra standpoint.

Later, we will develop a statistical view based on covariance matrices and principal component
analysis.
A system of linear equations, a reminder
8/27
A system of linear equations can be expressed in matrix form as A x = b, where A is the
matrix of the system.

Example:

x + 3y − 2z = 5
3x + 5y + 6z = 7
2x + 4y + 3z = 8

⟹  A = \begin{pmatrix} 1 & 3 & -2 \\ 3 & 5 & 6 \\ 2 & 4 & 3 \end{pmatrix}, \qquad
    b = \begin{pmatrix} 5 \\ 7 \\ 8 \end{pmatrix}.

The augmented matrix of the system is created by concatenating the column vector b to the
matrix A, i.e., [A|b].

Example: [A|b] = \begin{pmatrix} 1 & 3 & -2 & 5 \\ 3 & 5 & 6 & 7 \\ 2 & 4 & 3 & 8 \end{pmatrix}.

This system has a solution if and only if the rank of the matrix A is equal to the rank of the
augmented matrix [A|b]. The solution is unique if the rank of A equals the number of
unknowns, or equivalently if the null space of A is trivial, null(A) = {0}.
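As a small aside (not in the slides), the rank condition can be checked numerically for the example system above; this sketch assumes NumPy and uses only its standard matrix_rank and solve routines.

```python
# A minimal sketch: checking solvability of A x = b via ranks of A and [A|b].
import numpy as np

A = np.array([[1.0, 3.0, -2.0],
              [3.0, 5.0,  6.0],
              [2.0, 4.0,  3.0]])
b = np.array([5.0, 7.0, 8.0])

Ab = np.column_stack([A, b])          # augmented matrix [A|b]
rank_A = np.linalg.matrix_rank(A)
rank_Ab = np.linalg.matrix_rank(Ab)

if rank_A == rank_Ab == A.shape[1]:   # equal ranks and full column rank -> unique solution
    x = np.linalg.solve(A, b)
    print("unique solution:", x)
elif rank_A == rank_Ab:
    print("infinitely many solutions")
else:
    print("no exact solution (system is inconsistent)")
```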
Similarity transformations of a matrix
9/27

Let A be a regular matrix.

Matrices A and B with real or complex entries are called similar if there exists an invertible
square matrix P such that P⁻¹ A P = B.

The matrix P is called the change-of-basis matrix.

A similarity transformation refers to a matrix transformation that results in similar matrices.

Similar matrices have useful properties: they have the same rank, determinant, trace,
characteristic polynomial, minimal polynomial and eigen-values (but not necessarily the same
eigen-vectors).

Similarity transformations allow us to express regular matrices in several useful forms, e.g.,
the Jordan canonical form or the Frobenius normal form (also called the rational canonical
form).
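A quick numerical illustration (not from the slides; the random matrices are placeholders): a similarity transformation preserves eigen-values, trace and determinant, while the eigen-vectors change.

```python
# A minimal sketch: similar matrices B = P^{-1} A P share eigen-values.
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(4, 4))
P = rng.normal(size=(4, 4))           # a random matrix is invertible with probability 1

B = np.linalg.inv(P) @ A @ P          # similarity transformation

eig_A = np.sort_complex(np.linalg.eigvals(A))
eig_B = np.sort_complex(np.linalg.eigvals(B))
print(np.allclose(eig_A, eig_B))      # True: the spectra coincide
print(np.isclose(np.trace(A), np.trace(B)),
      np.isclose(np.linalg.det(A), np.linalg.det(B)))
```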
Jordan canonical form of a matrix
10/27

Any complex square matrix is similar to a matrix in the Jordan canonical form

J = \begin{pmatrix} J_1 & & 0 \\ & \ddots & \\ 0 & & J_p \end{pmatrix},
\qquad \text{where the Jordan blocks are} \qquad
J_i = \begin{pmatrix} \lambda_i & 1 & & 0 \\ & \lambda_i & \ddots & \\ & & \ddots & 1 \\ 0 & & & \lambda_i \end{pmatrix},

in which λi are the (possibly multiple) eigen-values.

The multiplicity of the eigen-value gives the size of the Jordan block.

If the eigen-value is not multiple, then the Jordan block degenerates to the eigen-value itself.

Least-square approximation
11/27
Assume that abundant data comes from many observations or measurements. This case is
very common in practice.

We intend to approximate the data by a linear model, i.e. a system of linear equations, e.g. a
straight line in particular.

Strictly speaking, the observations are likely to be in contradiction with respect to the
system of linear equations.

In the deterministic world, the conclusion would be that the system of linear equations has no
solution.

There is an interest in finding a solution to the system which is in some sense 'closest' to
the observations, perhaps compensating for noise in the observations.

We will usually adopt a statistical approach by minimizing the least-squares error.
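A minimal sketch of such an over-determined system (not from the slides; the noisy data are synthetic): a straight line y = a x + c is fitted to many observations in the least-squares sense with NumPy's standard lstsq routine.

```python
# Fit a line to noisy observations: each observation gives one equation a*x_i + c = y_i.
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(0.0, 10.0, 50)
y = 2.0 * x + 1.0 + rng.normal(scale=0.5, size=x.size)   # noisy observations

A = np.column_stack([x, np.ones_like(x)])                # system matrix, unknowns p = (a, c)
p, residuals, rank, _ = np.linalg.lstsq(A, y, rcond=None)
a, c = p
print(f"estimated a = {a:.3f}, c = {c:.3f}")             # close to the true 2.0 and 1.0
```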

Principal component analysis, introduction
12/27
PCA is a powerful and widely used linear technique in statistics, signal processing, image
processing, and elsewhere.

It goes by several names: the (discrete) Karhunen-Loève transform (KLT, after Kari Karhunen,
1915-1992, and Michel Loève, 1907-1979) or the Hotelling transform (after Harold Hotelling,
1895-1973). It was invented by Pearson (1901) and H. Hotelling (1933).

In statistics, PCA is a method for simplifying a multidimensional dataset to lower dimensions
for analysis, visualization or data compression.

PCA represents the data in a new coordinate system in which the basis vectors follow the modes
of greatest variance in the data.

Thus, new basis vectors are calculated for the particular data set.

The price to be paid for PCA's flexibility is higher computational requirements as compared
to, e.g., the fast Fourier transform.


Derivation, M -dimensional case (1)
13/27

Suppose a data set comprising N observations, each of M variables (dimensions). Usually
N ≫ M.

The aim: to reduce the dimensionality of the data so that each observation can be usefully
represented with only L variables, 1 ≤ L < M.

Data are arranged as a set of N column data vectors, each representing a single observation
of M variables: the n-th observation is a column vector xn = (x1, . . . , xM)⊤, n = 1, . . . , N.

We thus have an M × N data matrix X. Such matrices are often huge because N may be
very large; this is in fact good, since many observations imply better statistics.
Data normalization is needed first
14/27

This procedure is not applied to the raw data R but to normalized data X, obtained as follows.

The raw observed data are arranged in a matrix R and the empirical mean is calculated along
each row of R. The result is stored in a vector u, the elements of which are the scalars

u(m) = \frac{1}{N} \sum_{n=1}^{N} R(m, n) , \qquad m = 1, \ldots, M .

The empirical mean is subtracted from each column of R: if e is a 1 × N row vector of ones,
we write

X = R − u e .
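A small sketch of this normalization step (not from the slides, assuming NumPy; the random matrix R stands in for real measurements):

```python
# Arrange data as an M x N matrix R and subtract the per-row empirical mean, X = R - u e.
import numpy as np

M, N = 5, 100                          # M variables, N observations
rng = np.random.default_rng(2)
R = rng.normal(loc=3.0, size=(M, N))   # raw data, one observation per column

u = R.mean(axis=1, keepdims=True)      # empirical mean of each row, shape (M, 1)
e = np.ones((1, N))                    # row vector of ones
X = R - u @ e                          # equivalently: X = R - u (broadcasting)

print(np.allclose(X.mean(axis=1), 0.0))   # each row of X now has zero mean
```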
Derivation, M -dimensional case (2)
15/27
If we approximate the higher-dimensional data X (of dimension M) by a lower-dimensional
representation Y (of dimension L), then the mean square error ε² of this approximation is
given by

\varepsilon^2 = \frac{1}{N} \sum_{n=1}^{N} |x_n|^2 - \sum_{i=1}^{L} b_i^\top \left( \frac{1}{N} \sum_{n=1}^{N} x_n x_n^\top \right) b_i ,

where bi, i = 1, . . . , L, are the basis vectors of the linear space of dimension L.

If ε² is to be minimal, then the following term has to be maximal:

\sum_{i=1}^{L} b_i^\top \operatorname{cov}(x)\, b_i , \qquad \text{where} \quad \operatorname{cov}(x) = \frac{1}{N} \sum_{n=1}^{N} x_n x_n^\top

is the covariance matrix.
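The following NumPy sketch (not from the slides; the data are synthetic) illustrates the claim: choosing the bi as the leading eigen-vectors of cov(x) maximizes the sum above, and the maximal value equals the sum of the L largest eigen-values.

```python
# Maximizing sum_i b_i^T cov(x) b_i by taking the leading eigen-vectors of cov(x).
import numpy as np

rng = np.random.default_rng(3)
M, N, L = 10, 500, 3
X = rng.normal(size=(M, N))
X -= X.mean(axis=1, keepdims=True)      # normalized data, zero mean in every row

cov = (X @ X.T) / N                     # cov(x) = (1/N) sum_n x_n x_n^T
eigvals, eigvecs = np.linalg.eigh(cov)  # eigh: real symmetric matrix, ascending eigen-values
order = np.argsort(eigvals)[::-1]
B = eigvecs[:, order[:L]]               # b_1, ..., b_L: the L principal directions

captured = sum(B[:, i] @ cov @ B[:, i] for i in range(L))
print(captured, eigvals[order[:L]].sum())   # equal: the maximal attainable value
```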


Approximation error
16/27
The covariance matrix cov(x) has special properties: it is real, symmetric and positive
semi-definite.

The covariance matrix is therefore guaranteed to have real eigen-values.

Matrix theory tells us that these eigen-values may be sorted (largest to smallest) and the
associated eigen-vectors taken as the basis vectors that provide the maximum we seek.

In the data approximation, the dimensions corresponding to the smallest eigen-values are
omitted. The mean square error ε² of keeping only the first L components is given by

\varepsilon^2 = \operatorname{trace}\bigl(\operatorname{cov}(x)\bigr) - \sum_{i=1}^{L} \lambda_i = \sum_{i=L+1}^{M} \lambda_i ,

where trace(A) is the trace of the matrix A, i.e. the sum of its diagonal elements. The trace
equals the sum of all eigen-values.
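A numerical check of this identity (a sketch, not from the slides; the zero-mean data are synthetic):

```python
# The approximation error equals the sum of the discarded eigen-values.
import numpy as np

rng = np.random.default_rng(4)
M, N, L = 8, 400, 2
X = rng.normal(size=(M, N))
X -= X.mean(axis=1, keepdims=True)

cov = (X @ X.T) / N
eigvals, eigvecs = np.linalg.eigh(cov)
order = np.argsort(eigvals)[::-1]
B = eigvecs[:, order[:L]]                        # keep the L leading eigen-vectors

X_hat = B @ (B.T @ X)                            # rank-L approximation of the data
mse = np.mean(np.sum((X - X_hat) ** 2, axis=0))  # empirical mean square error

print(mse)
print(np.trace(cov) - eigvals[order[:L]].sum())  # trace(cov) minus the kept eigen-values
print(eigvals[order[L:]].sum())                  # sum of discarded eigen-values; all equal
```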
Can we use PCA for images?
17/27
It took a while to realize (Turk, Pentland, 1991), but yes.

Let us consider a 321 × 261 image.

The image is considered as a very long 1D vector obtained by concatenating the image pixels
column by column (or alternatively row by row), i.e. a vector of length 321 × 261 = 83781.

The huge number 83781 is the dimensionality of our vector space.

The intensity is assumed to vary in each pixel of the image.
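A minimal sketch of this vectorization (not from the slides; a random array stands in for the 321 × 261 grey-level image), assuming NumPy:

```python
# Turn an image into a long 1D column vector and back.
import numpy as np

H, W = 321, 261
image = np.random.default_rng(5).integers(0, 256, size=(H, W)).astype(float)

x = image.flatten(order="F")              # column-by-column concatenation, length 83781
print(x.shape)                            # (83781,): the dimensionality of the vector space

restored = x.reshape((H, W), order="F")   # rearranging the long vector back to an image
print(np.array_equal(image, restored))    # True
```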



What if we have 32 instances of images?
18/27
Fewer observations than unknowns, and what?
19/27

We have only 32 observations and 83781 unknowns in our example!

The induced system of linear equations is not over-constrained but under-constrained.

PCA is still applicable.

The number of principal components is less than or equal to the number of observations
available (32 in our particular case). This is because the (square) covariance matrix has a size
corresponding to the number of observations.

The eigen-vectors we derive are called eigen-images, after rearranging the 1D vectors back
into rectangular images.

Let us perform the dimensionality reduction from 32 to 4 in our example.
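The remark about the size of the covariance matrix is commonly exploited through the small N × N matrix X⊤X instead of the huge M × M covariance matrix. The sketch below (not from the slides; random data stand in for the 32 mean-subtracted image vectors) follows that standard trick as one possible realization:

```python
# Eigen-images from the small N x N matrix X^T X (if X^T X v = mu v, then X v is an
# eigen-vector of X X^T with the same eigen-value).
import numpy as np

M, N, L = 83781, 32, 4                      # pixels, images, kept components
rng = np.random.default_rng(6)
X = rng.normal(size=(M, N))                 # columns: vectorized images (stand-ins)
X -= X.mean(axis=1, keepdims=True)          # subtract the mean image

G = (X.T @ X) / N                           # small N x N matrix, cheap to decompose
eigvals, V = np.linalg.eigh(G)
order = np.argsort(eigvals)[::-1][:L]       # indices of the L largest eigen-values

B = X @ V[:, order]                         # map back: eigen-images of length M
B /= np.linalg.norm(B, axis=0)              # normalize each eigen-image

Q = B.T @ X                                 # L coefficients for each of the N images
print(B.shape, Q.shape)                     # (83781, 4) and (4, 32)
```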

PCA, graphical illustration
20/27

[Figure: the data matrix of N observed images (one image per column) is approximated in the
PCA representation by L basis vectors shared by all N images; each observed image is then
represented by its L PCA coefficients.]
Approximation by 4 principal components only
21/27

Reconstruction of the image from four basis vectors bi, i = 1, . . . , 4, which can be displayed
as images by rearranging the (long) vectors back to matrix form.

The linear combination was computed as q1 b1 + q2 b2 + q3 b3 + q4 b4 = 0.078 b1 +
0.062 b2 − 0.182 b3 + 0.179 b4.

The mean image, which was subtracted when the data were normalized earlier, has to be added
back, cf. slide 14.

[Figure: the reconstructed image displayed as the weighted sum q1 b1 + q2 b2 + q3 b3 + q4 b4
of the four eigen-images.]
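A sketch of this reconstruction step (not from the slides; the eigen-images and mean image below are random stand-ins, only the four coefficients are the ones quoted above):

```python
# Reconstruct one image from L = 4 eigen-images, adding the mean image back.
import numpy as np

M, L = 83781, 4
rng = np.random.default_rng(7)
B = rng.normal(size=(M, L))                   # eigen-images as columns (stand-ins)
mean_image = rng.normal(size=M)               # mean of the training images (stand-in)

q = np.array([0.078, 0.062, -0.182, 0.179])   # coefficients quoted on the slide
reconstruction = mean_image + B @ q           # mean + q1 b1 + q2 b2 + q3 b3 + q4 b4

image_2d = reconstruction.reshape((321, 261), order="F")   # back to a displayable image
print(image_2d.shape)
```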
Reconstruction fidelity, 4 components
22/27
Reconstruction fidelity, original
23/27
PCA drawbacks, the images case
24/27

By rearranging pixels column by column into a 1D vector, relations of a given pixel to pixels in
neighboring rows are not taken into account.

Another disadvantage is the global nature of the representation; a small change or error in
the input images influences the whole eigen-representation. However, this property is inherent
in all linear integral transforms.
Data (images) representations
25/27

Reconstructive (also generative) representation

Enables (partial) reconstruction of the input images (hallucinations).
It is general; it is not tuned to a specific task.
Enables closing the feedback loop, i.e. bidirectional processing.

Discriminative representation

Does not allow partial reconstruction.
Less general; specific to a particular task.
Stores only the information needed for the decision task.



Dimensionality issues, low-dimensional manifolds
26/27
Images, as we saw, lead to enormous dimensionality.

The data of interest often live in a much lower-dimensional subspace called the manifold.

Example (courtesy Thomas Brox): a 100 × 100 image of the digit 3 that is only shifted and
rotated, i.e. there are only 3 degrees of variation.

All data points live in a 3-dimensional manifold of the 10,000-dimensional observation space.

The difficulty of the task is to find out empirically from the data in which manifold the data
vary.
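To make the example concrete, here is a rough sketch (not from the lecture; it assumes scipy.ndimage and a crude stand-in pattern rather than an actual digit): every generated image is fully determined by three parameters, even though it lives in the 10,000-dimensional pixel space.

```python
# Images that only shift in x, y and rotate form a 3-parameter family,
# i.e. a 3-dimensional manifold inside the 10,000-dimensional pixel space.
import numpy as np
from scipy import ndimage

base = np.zeros((100, 100))
base[40:60, 45:55] = 1.0                      # a crude stand-in for the digit "3"

def generate(dx, dy, angle_deg):
    """One observation, fully determined by the 3 parameters (dx, dy, angle)."""
    img = ndimage.rotate(base, angle_deg, reshape=False, order=1)
    img = ndimage.shift(img, (dy, dx), order=1)
    return img.flatten()                      # a point in the 10,000-dimensional space

samples = np.stack([generate(dx, dy, a)
                    for dx in (-5, 0, 5) for dy in (-5, 0, 5) for a in (0, 15, 30)])
print(samples.shape)                          # (27, 10000): 27 points on a 3-D manifold
```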
Subspace methods
27/27
Subspace methods exploit the fact that data (images) can be represented in a subspace of the
original vector space in which the data live.

Examples of different methods:

Principal Component Analysis (PCA): reconstructive, unsupervised, optimal reconstruction;
minimizes the squared reconstruction error, maximizes the variance of the projected input
vectors.
Linear Discriminant Analysis (LDA): discriminative, supervised, optimal separation; maximizes
the distance between projection vectors.
Canonical Correlation Analysis (CCA): supervised, optimal correlation; motivated by a
regression task, e.g. robot localization.
Independent Component Analysis (ICA): independent factors.
Non-negative Matrix Factorization (NMF): non-negative factors.
Kernel methods for nonlinear extension: local straightening by kernel functions.
