The document discusses dimensionality reduction techniques in advanced data mining, focusing on linear methods like Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA), as well as nonlinear methods such as Locally Linear Embedding (LLE). It explains the concepts of parametric and nonparametric learning, comparing their effectiveness based on data distribution and sample size. Additionally, it introduces the concept of manifolds and their role in modeling complex data distributions.


CIS 530—Advanced Data Mining

9 - Dimensionality Reduction
Computer and Information Science
University of Massachusetts Dartmouth
Attribute Dimensions and Orders

• Dimensions
  • 1D: scalar
  • 2D: two-dimensional vector
  • 3D: three-dimensional vector
  • >3D: multi-dimensional vector
• Orders
  • scalars
  • vectors (1st order)
  • matrices (2nd order)
  • tensors (higher order)
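
As a quick illustration (not from the original slides), these orders map directly onto NumPy array ranks:

```python
import numpy as np

# Orders illustrated as NumPy array ranks (ndim).
scalar = np.float64(3.0)                     # 0th order: a single value
vector = np.array([1.0, 2.0, 3.0])           # 1st order: 1-D array
matrix = np.array([[1.0, 2.0],
                   [3.0, 4.0]])              # 2nd order: 2-D array
tensor = np.zeros((2, 3, 4))                 # higher order: 3-D array

print(scalar.ndim, vector.ndim, matrix.ndim, tensor.ndim)  # 0 1 2 3
```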
Bivariate Data Representations

Courtesy of Prof. Hanspeter Pfister, Harvard University.


Original figures were from the slides of Stasko
Trivariate Data Representations

Courtesy of Prof. Hanspeter Pfister, Harvard University.


Original figures were from the slides of Stasko
Multi-Dimensional Data

Courtesy of Prof. Hanspeter Pfister, Harvard University.


Original figures were from the slides of Stasko
What if the dimension of the data is 4, 5, 6, or even more?
Dimensionality Reduction
• Linear Methods
  • Principal Component Analysis (PCA), M.A. Turk & A.P. Pentland
  • Linear Discriminant Analysis (LDA), R. Fisher
• Nonlinear Methods
  • Locally Linear Embedding (LLE), S.T. Roweis & L.K. Saul
Parametric vs. Nonparametric Learning

• Parametric Model
  • Uses a parameterized family of probability distributions to describe the nature of a set of data (Moghaddam & Pentland, 1997).
  • The data distribution is empirically assumed or estimated.
  • Learning is conducted by estimating a set of fixed parameters, such as the mean and variance.
  • Effective for large samples, but degrades for complicated data distributions.

• Nonparametric Model
  • Distribution-free.
  • Learning is conducted by measuring the pair-wise data relationships in both global and local manners.
  • Effective and robust due to the reliance on fewer assumptions and parameters.
  • Works for cases with small samples, high dimensionality, and complicated data distributions.
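
A minimal sketch (not from the slides) contrasting the two models: a parametric Gaussian fit estimates only a mean and variance, while a nonparametric kernel density estimate adapts to the shape of the sample. The bimodal toy data and the choice of SciPy's gaussian_kde are illustrative assumptions.

```python
import numpy as np
from scipy.stats import norm, gaussian_kde

rng = np.random.default_rng(0)
# Bimodal toy data: a single Gaussian is a poor parametric assumption here.
data = np.concatenate([rng.normal(-2.0, 0.5, 500), rng.normal(3.0, 1.0, 500)])

# Parametric model: learn a fixed set of parameters (mean, variance).
mu, sigma = data.mean(), data.std()

# Nonparametric model: distribution-free kernel density estimate.
kde = gaussian_kde(data)

x = 0.0  # a point between the two modes, where the true density is low
print("parametric density at 0:   ", norm.pdf(x, loc=mu, scale=sigma))
print("nonparametric density at 0:", kde([x])[0])
```

The single Gaussian places substantial probability mass between the two modes, while the kernel estimate correctly reports a low density there, illustrating how the parametric assumption degrades for complicated distributions.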
Linear Models
• Two representative models
  • Principal Component Analysis (PCA)
  • Linear Discriminant Analysis (LDA)
• PCA tries to capture the "principal" variations in the data.
• It is computed by finding the eigenvectors of the covariance matrix of the data.
• Geometrically, PCA finds the directions of largest variation in the underlying data.
• LDA, on the other hand, uses the label information: it maximizes the distance between classes and minimizes the distance within each class.
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Principal Component Analysis
• Two views:
  • Variance: maximize the data variance in the lower-dimensional space.
  • Reconstruction: find the projections that minimize the reconstruction error.
• Assuming zero mean, the projection line is represented as x = s·w, where w is the basis vector, s.t. ||w|| = 1.

Figure: face reconstruction using different numbers of eigenvectors.
PCA: View 1
• Given a dataset X = {x_1, ..., x_n}, we first centralize the dataset by subtracting the mean: x_i ← x_i − μ, where μ = (1/n) Σ_i x_i.
• We want to find a low-dimensional space, spanned by a unit vector w, such that the variance of the data in this new space is maximized.
• Let y_i = w^T x_i be the new representation in this space; then we should maximize the following:
  J(w) = (1/n) Σ_i (w^T x_i)^2 = w^T C w,  where C = (1/n) Σ_i x_i x_i^T is the covariance matrix of the centered data.
PCA: View 1

• The maximum of J(w) is affected by the magnitude of the vector w.
• To mitigate this effect, we introduce another constraint: w^T w = 1.
• Using the Lagrange multiplier method, the problem turns into
  L(w, λ) = w^T C w − λ (w^T w − 1),  and setting the gradient to zero gives C w = λ w.
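
A minimal NumPy sketch of this derivation (an illustration, not the course's code): center the data, form the covariance matrix C, and take the eigenvectors of C with the largest eigenvalues as the principal directions.

```python
import numpy as np

def pca(X, k):
    """Project the rows of X (n_samples x n_features) onto the top-k principal directions."""
    # Centralize the dataset: subtract the mean of each feature.
    Xc = X - X.mean(axis=0)
    # Covariance matrix C = (1/n) * Xc^T Xc.
    C = Xc.T @ Xc / Xc.shape[0]
    # Solve C w = lambda w; eigh returns eigenvalues in ascending order.
    eigvals, eigvecs = np.linalg.eigh(C)
    W = eigvecs[:, np.argsort(eigvals)[::-1][:k]]  # columns are unit-norm directions w
    return Xc @ W, W                               # projected data and the basis

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))
Y, W = pca(X, 2)
print(Y.shape, W.shape)  # (200, 2) (5, 2)
```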


Eigenvectors
• For a square matrix A, if there exists a nonzero vector v such that A v = λ v, then
• v is an eigenvector and λ is the eigenvalue associated with this eigenvector.
• For an eigenvector v, the transform A is just a scaling function.
• Example (figure): a matrix, its computed eigenvalues, and the corresponding eigenvectors.

Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.


http://en.wikipedia.org/wiki/Eigenvalue,_eigenvector_and_eigenspace
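
To make the definition concrete, a small numeric check (illustrative; the matrix is arbitrary) that A v = λ v holds for each eigenpair returned by NumPy:

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])              # an arbitrary symmetric 2x2 matrix
eigvals, eigvecs = np.linalg.eig(A)     # columns of eigvecs are the eigenvectors

for lam, v in zip(eigvals, eigvecs.T):
    # Applying A to an eigenvector only scales it by the eigenvalue lambda.
    print(lam, np.allclose(A @ v, lam * v))   # 3.0 True, then 1.0 True
```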
PCA: View 2
• Preliminary:
  • A subspace is represented by an orthogonal basis W = [w_1, ..., w_k] of this space, with W^T W = I.
  • Projection – from high- to low-dimensional space: y = W^T x.
  • Reconstruction – projecting back: x̂ = W y = W W^T x.
• Objective:
  • Find a subspace spanned by W that minimizes the reconstruction error Σ_i ||x_i − W W^T x_i||^2.
  • We add an additional constraint, W^T W = I, to make this problem tractable.

Figure: face reconstruction using different numbers of eigenvectors.
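
A short sketch (illustrative) of the projection and reconstruction maps for an orthonormal basis W, here taken as the top-3 eigenvectors of the sample covariance:

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 10))
Xc = X - X.mean(axis=0)                          # zero-mean data

# Orthonormal basis W of a 3-D subspace: top-3 eigenvectors of the covariance.
C = Xc.T @ Xc / Xc.shape[0]
eigvals, eigvecs = np.linalg.eigh(C)
W = eigvecs[:, np.argsort(eigvals)[::-1][:3]]    # W^T W = I

Y = Xc @ W        # projection:     y = W^T x (applied row-wise)
X_hat = Y @ W.T   # reconstruction: x_hat = W W^T x

print("reconstruction error:", np.sum((Xc - X_hat) ** 2))
```

PCA chooses W to make this reconstruction error as small as possible among all orthonormal bases of the same dimension, which is the "View 2" objective above.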
Principal Component Analysis

Figure: eigenvectors visualized as eigenfaces.
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Linear Discriminant Analysis

Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.


Linear Discriminant Analysis

• Unlike PCA, LDA finds the discriminant subspace by including class label information in the subspace modeling (supervised learning).
  – Compute the within-class scatter.
  – Compute the between-class scatter.
  – Maximize the between-class scatter and minimize the within-class scatter.

Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.


LDA Problem Definition
• In LDA, we want to find a projection w that achieves two goals in one shot:
  • Make samples from the same class compact.
  • Make samples from different classes far apart.
• Assume μ_i is the center of class i; w is the projection to be optimized.
• We hope that in the projected 1-D space, each sample w^T x_j stays close to its own class center w^T μ_i, while different class centers stay far apart.


Figure: LDA projection of samples x_j and class centers μ_i onto w.
Within-Class Scatter Matrix
• For all samples from class i, we add up their scatter around the class center:
  S_i = Σ_{x_j ∈ class i} (x_j − μ_i)(x_j − μ_i)^T   (the within-class scatter matrix of class i).
• If we have more than one class (most likely!), the within-class scatter matrix is
  S_w = Σ_i S_i.
Between-Class Scatter Matrix
• Only two classes:
  (w^T (μ_1 − μ_2))^2 = w^T (μ_1 − μ_2)(μ_1 − μ_2)^T w = w^T S_b w,
  where S_b = (μ_1 − μ_2)(μ_1 − μ_2)^T is the between-class scatter matrix.
• More than two classes: S_b sums the scatter of each class mean around the overall mean, S_b = Σ_i n_i (μ_i − μ)(μ_i − μ)^T.
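
A tiny numeric check (illustrative; the vectors are random) of the two-class identity (w^T(μ_1 − μ_2))^2 = w^T S_b w:

```python
import numpy as np

rng = np.random.default_rng(3)
mu1, mu2, w = rng.normal(size=3), rng.normal(size=3), rng.normal(size=3)

d = mu1 - mu2
Sb = np.outer(d, d)            # between-class scatter for the two-class case

lhs = (w @ d) ** 2             # (w^T (mu1 - mu2))^2
rhs = w @ Sb @ w               # w^T S_b w
print(np.isclose(lhs, rhs))    # True
```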
Learning Objective
• To achieve the two goals, we maximize the Fisher criterion
  J(w) = (w^T S_b w) / (w^T S_w w).
• This is equivalent to the following problem:
  max_w w^T S_b w  subject to  w^T S_w w = 1.
• Again, using the Lagrange multiplier method, we have
  S_b w = λ S_w w,  i.e.,  S_w^{-1} S_b w = λ w.
• This is a typical eigen-decomposition problem.
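
A compact NumPy sketch of the whole LDA pipeline (an illustration under the standard formulation above, not the course's code): build S_w and S_b from labeled data, then solve S_w^{-1} S_b w = λ w.

```python
import numpy as np

def lda(X, y, k):
    """Return the top-k LDA projection directions for data X (n x d) and integer labels y."""
    d = X.shape[1]
    mu = X.mean(axis=0)                      # overall mean
    Sw = np.zeros((d, d))                    # within-class scatter
    Sb = np.zeros((d, d))                    # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        mu_c = Xc.mean(axis=0)
        Sw += (Xc - mu_c).T @ (Xc - mu_c)
        diff = (mu_c - mu).reshape(-1, 1)
        Sb += Xc.shape[0] * (diff @ diff.T)
    # Generalized eigenproblem S_b w = lambda S_w w  <=>  eig(S_w^{-1} S_b).
    eigvals, eigvecs = np.linalg.eig(np.linalg.pinv(Sw) @ Sb)
    order = np.argsort(eigvals.real)[::-1][:k]
    return eigvecs[:, order].real

rng = np.random.default_rng(4)
X = np.vstack([rng.normal(0.0, 1.0, (50, 4)), rng.normal(3.0, 1.0, (50, 4))])
y = np.array([0] * 50 + [1] * 50)
print(lda(X, y, 1).shape)  # (4, 1)
```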


Different Subspace Base Vectors
• Different subspace base vectors correspond to different projection directions.
• Each subspace base vector forms a Fisherface.
PCA vs. LDA

Figure panels: PCA, LDA.
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
PCA vs. LDA
• PCA performs worse under this condition.
• LDA (FLD, Fisher Linear Discriminant) provides a better low-dimensional representation.

Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.


When LDA Fails
• LDA fails in the case shown on the right (w is the projection direction). Think about why…

Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.


Manifold
• "A manifold is an abstract mathematical space in which every point has a neighborhood which resembles Euclidean space, but in which the global structure may be more complicated." ---from Wikipedia
• "A manifold is a topological space that is locally Euclidean." ---from MathWorld
• e.g., a 2D map of the 3D Earth is a manifold.
• A manifold can be obtained by projecting the original data to a low-dimensional representation via subspace learning.
• Manifold criteria can provide more effective ways to model the data distribution than conventional learning methods based on the Gaussian distribution.
Manifold

http://en.wikipedia.org/wiki/Manifold
Manifold Learning
Figure: the Swiss roll dataset, unrolled by dimensionality reduction.

Courtesy of Sam T. Roweis and Lawrence K. Saul, Science 2000.


Locally Linear Embedding

http://www.cs.toronto.edu/~roweis/lle/
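
For a hands-on reference, a minimal Swiss-roll example (illustrative; it uses scikit-learn's implementation rather than the authors' original code):

```python
from sklearn.datasets import make_swiss_roll
from sklearn.manifold import LocallyLinearEmbedding

# 3-D Swiss roll: the points lie on a 2-D manifold embedded in 3-D space.
X, _ = make_swiss_roll(n_samples=1500, random_state=0)

# LLE unrolls the manifold by preserving each point's local linear
# reconstruction weights from its nearest neighbors.
lle = LocallyLinearEmbedding(n_neighbors=12, n_components=2, random_state=0)
Y = lle.fit_transform(X)

print(X.shape, "->", Y.shape)  # (1500, 3) -> (1500, 2)
```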
LEA for Pose Manifold

Yun Fu et al., "Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval," CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
LEA for Expression Manifold

Yun Fu et al., "Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval," CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
