
Dimensionality Reduction using Principal Component Analysis


Topics to be covered:
• Introduction to PCA
• Basics of statistics
• PCA algorithm
• Application of PCA in Face Recognition
• Limitations of PCA
Data Reduction Using PCA

Reduce space dimensionality with minimum loss of descriptive information.

Data Reduction: Example of an Ideal Case

The projection reduces the number of dimensions without much loss of information.
Principal Component Analysis
• Find an orthogonal coordinate system such that the greatest variance by any projection of the data comes to lie on the first coordinate (the first principal component).
Basics
• The standard deviation (SD) of a data set is a measure of how spread out the data is.
• Variance is another measure of the spread of data in a data set.
• Covariance says how much two dimensions vary from the mean with respect to each other.
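A minimal sketch of these quantities with NumPy (an assumption; the slides name no library), on a toy one-dimensional sample:

import numpy as np

data = np.array([1.0, 2.0, 4.0, 5.0])

mean = data.mean()                                   # sample mean
var = ((data - mean) ** 2).sum() / (len(data) - 1)   # sample variance
sd = np.sqrt(var)                                    # standard deviation

print(mean, var, sd)                                 # 3.0  3.33...  1.82...
# np.var(data, ddof=1) and np.std(data, ddof=1) give the same results.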
Basics
• Covariance is always measured between two dimensions.
• If the covariance is positive, both dimensions increase together.
• If it is negative, when one dimension increases the other decreases.
• If it is zero, the two dimensions are uncorrelated (no linear relationship).
Basics
• The covariance matrix: if we have a data set with more than two dimensions, there is more than one covariance that can be calculated. The covariance matrix collects the covariances between all pairs of dimensions.
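A short NumPy sketch of the covariance matrix for a three-dimensional data set (the numbers are illustrative only):

import numpy as np

# 5 samples, 3 dimensions (rows = samples, columns = dimensions)
X = np.array([[2.5, 2.4, 0.5],
              [0.5, 0.7, 2.1],
              [2.2, 2.9, 0.3],
              [1.9, 2.2, 1.0],
              [3.1, 3.0, 0.1]])

C = np.cov(X, rowvar=False)   # 3 x 3 covariance matrix
print(C)
# C[i, j] > 0: dimensions i and j increase together
# C[i, j] < 0: one increases while the other decreases
# C[i, j] = 0: the two dimensions are uncorrelated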
Eigenvalues and Eigenvectors
• If A is a linear transformation, a non-null vector x is an eigenvector of A if there is a scalar λ such that Ax = λx.
• The scalar λ is said to be an eigenvalue of A corresponding to the eigenvector x.
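A quick NumPy sketch verifying Ax = λx on a small symmetric matrix (illustrative values):

import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])

eigvals, eigvecs = np.linalg.eig(A)   # columns of eigvecs are eigenvectors

x = eigvecs[:, 0]                     # first eigenvector
lam = eigvals[0]                      # its eigenvalue

print(np.allclose(A @ x, lam * x))    # True: Ax = λx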
Physical Significance
• A transformation matrix acts on certain vectors by changing only their magnitude and leaving their direction unchanged. These vectors are its eigenvectors.
• The matrix acts on an eigenvector by multiplying its magnitude by a factor (positive or negative). This value is the eigenvalue associated with that eigenvector.
Eigenvalues and Eigenvectors
[Figure: the picture is deformed in such a way that its central vertical axis (red vector) has not changed direction, but the diagonal vector (blue) has changed direction. Hence the red vector is an eigenvector of the transformation and the blue vector is not.]

Each eigenvalue represents the total variance along its eigenvector's direction.

Throwing away the least significant eigenvectors means throwing away only the least significant variance information (the smallest eigenvalues)!
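For example (illustrative eigenvalues), the fraction of variance kept after discarding the smallest eigenvalues is the ratio of the retained eigenvalues to their total:

import numpy as np

eigvals = np.array([4.2, 1.5, 0.2, 0.1])   # sorted, largest first
kept = eigvals[:2]                          # keep the 2 most significant
print(kept.sum() / eigvals.sum())           # 0.95: 95% of the variance kept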
Principal Component Analysis
− PCA projects the data along the directions where the data varies the most.
− These directions are determined by the eigenvectors of the covariance matrix corresponding to the largest eigenvalues.
− The magnitude of the eigenvalues corresponds to the variance of the data along the eigenvector directions.
PCA-Steps
• Calculate the mean sample.
• Subtract it from the samples (centering the data).
• Calculate the covariance matrix.
• Find the set of eigenvectors of the covariance matrix.
• Keep the eigenvectors corresponding to the "largest" eigenvalues, also called the "principal components" (a sketch of these steps follows below).
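A minimal NumPy sketch of the steps above (the function name and the rows-as-samples layout are assumptions, not from the slides):

import numpy as np

def pca(X, k):
    """Project X (rows = samples) onto its k principal components."""
    mean = X.mean(axis=0)                 # 1. mean sample
    Xc = X - mean                         # 2. subtract the mean
    C = np.cov(Xc, rowvar=False)          # 3. covariance matrix
    eigvals, eigvecs = np.linalg.eigh(C)  # 4. eigenvectors (C is symmetric)
    order = np.argsort(eigvals)[::-1]     #    sort by decreasing eigenvalue
    W = eigvecs[:, order[:k]]             # 5. k principal components
    return Xc @ W, W, mean                # projected data, basis, mean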
Dimensionality reduction

The goal of PCA is to reduce the dimensionality of the data while retaining as much as possible of the variation present in the original dataset.
Principal Component Analysis (PCA)
• Lower-dimensionality basis
− Approximate vectors by finding a basis in an appropriate lower-dimensional space.

(1) Higher-dimensional space representation: x = a1 v1 + a2 v2 + … + aN vN, where v1, …, vN is a basis of the original N-dimensional space.

(2) Lower-dimensional space representation: x̂ = b1 u1 + b2 u2 + … + bK uK, where u1, …, uK is a basis of a K-dimensional subspace, K < N.
Principal Component Analysis (PCA)
• Information loss
− Dimensionality reduction implies information loss!
− We want to preserve as much information as possible, that is, to minimize the reconstruction error ||x − x̂||.
• How to determine the best lower-dimensional subspace?
− The "best" low-dimensional space is spanned by the best eigenvectors of the covariance matrix of x, i.e. the eigenvectors corresponding to the "largest" eigenvalues, also called the "principal components".
Principal Component Analysis (PCA)
Steps:
− Suppose x1, x2, …, xM are N × 1 vectors; the mean, centering, covariance, and eigen-decomposition steps then proceed as listed earlier.
• What is the error due to dimensionality reduction?
− original vector x can be reconstructed using its principal components:

− It can be shown that the low-dimensional basis based on principal


components minimizes the reconstruction error:

− It can be shown that the error is equal to:

19
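A small numerical check of that identity, sketched with NumPy on synthetic data (all names, the seed, and the sizes are illustrative):

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
Xc = X - X.mean(axis=0)                     # centered data

C = np.cov(Xc, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(C)
order = np.argsort(eigvals)[::-1]           # largest eigenvalues first
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

K = 2
W = eigvecs[:, :K]
X_hat = (Xc @ W) @ W.T                      # reconstruct from K components

err = ((Xc - X_hat) ** 2).sum(axis=1).mean()
print(err, eigvals[K:].sum())               # nearly identical; they differ only
                                            # by the N vs N-1 normalization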
Face Detection
Samples at different orientations, under different illuminations, and with different expressions.
Face Recognition using PCA
• Acquire an initial set of face images (the training set).
• Calculate the eigenfaces from the training set, keeping only the M eigenfaces corresponding to the highest eigenvalues. These M images define the face space.
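A hedged NumPy sketch of this training step (the function and variable names are illustrative; the slides give no code). It uses the standard eigenfaces shortcut of diagonalizing the small M × M matrix A Aᵀ instead of the huge pixel-space covariance matrix:

import numpy as np

def train_eigenfaces(faces, num_eigenfaces):
    """faces: (num_images, num_pixels) array of flattened face images."""
    mean_face = faces.mean(axis=0)
    A = faces - mean_face                       # centered images, M x N
    # Eigenvectors of the small M x M matrix A A^T map to eigenvectors
    # of the large N x N covariance matrix A^T A (Turk & Pentland).
    eigvals, V = np.linalg.eigh(A @ A.T)
    order = np.argsort(eigvals)[::-1][:num_eigenfaces]
    eigenfaces = (A.T @ V[:, order]).T          # back to pixel space, one per row
    eigenfaces /= np.linalg.norm(eigenfaces, axis=1, keepdims=True)
    return mean_face, eigenfaces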
Recognition
• Calculate a set of weights based on the input image and the M eigenfaces by projecting the input image onto each of the eigenfaces.
• Determine whether the image is a face at all by checking whether it lies close to 'face space'.
• If it is a face, classify the weight pattern as a known or unknown person.
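A sketch of these recognition steps, reusing mean_face and eigenfaces from the training sketch above (the thresholds and names are illustrative assumptions):

import numpy as np

def recognize(image, mean_face, eigenfaces, known_weights, labels,
              face_threshold=1e4, identity_threshold=1e3):
    w = eigenfaces @ (image - mean_face)             # weights: projection
    reconstruction = mean_face + eigenfaces.T @ w    # back-projection
    if np.linalg.norm(image - reconstruction) > face_threshold:
        return None                                  # not close to face space
    distances = np.linalg.norm(known_weights - w, axis=1)
    best = int(np.argmin(distances))
    if distances[best] > identity_threshold:
        return "unknown face"
    return labels[best]                              # known person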
Principal Component Analysis (PCA)
• The linear transformation R^N → R^K that performs the dimensionality reduction is y = Uᵀ(x − x̄), where the columns of U are the K chosen eigenvectors.
• How to choose the principal components? A common criterion is to choose the smallest K that retains a given fraction of the total variance, e.g. (λ1 + … + λK) / (λ1 + … + λN) > 0.9.
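That criterion, sketched in NumPy (the 0.9 threshold is an illustrative choice):

import numpy as np

def choose_k(eigvals, threshold=0.9):
    """eigvals sorted in decreasing order; return the smallest K whose
    leading components retain at least `threshold` of the total variance."""
    ratios = np.cumsum(eigvals) / np.sum(eigvals)
    return int(np.searchsorted(ratios, threshold)) + 1

print(choose_k(np.array([4.2, 1.5, 0.2, 0.1])))   # 2 (keeps 0.95 of the variance)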
Principal Component Analysis (PCA)
• Representing faces in this basis: each face is described by its vector of weights (b1, …, bK), i.e. its projection onto the eigenfaces.
Limitations of PCA
• PCA is a linear method. It fails when the largest variance does not lie along a single vector but along a non-linear path.
• Traditional (static) PCA assumes that the monitored process is stationary. Many industrial processes do not display stationary behavior because the operational conditions change.
• When PCA is used for clustering, its main limitation is that it does not account for class separability, since it makes no use of the class labels of the feature vectors.
Thank You
