
HOW DO YOU DO A PRINCIPAL COMPONENT ANALYSIS?

1. Standardize the range of the continuous initial variables (a short sketch follows this list)
2. Compute the covariance matrix to identify correlations
3. Compute the eigenvectors and eigenvalues of the covariance matrix to identify the principal components
4. Create a feature vector to decide which principal components to keep
5. Recast the data along the principal component axes
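Step 1 amounts to a z-score per variable. Here is a minimal sketch, assuming NumPy and using a small made-up table with one row per observation and one column per variable:

```python
import numpy as np

# Hypothetical raw data: 4 observations of 2 variables (values invented for illustration)
X = np.array([[2.5, 2.4],
              [0.5, 0.7],
              [2.2, 2.9],
              [1.9, 2.2]])

# Standardize each variable to zero mean and unit variance
X_std = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
```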
 PCA condenses the information in a large set of variables into a smaller set of new variables by applying a linear transformation to them. The transformation is chosen so that linearly correlated variables are turned into uncorrelated variables.
 Correlation tells us that there is redundancy of information, and if this redundancy can be reduced, the information can be compressed. For example, if two variables in the set are highly correlated, we gain little extra information by retaining both, because one can be nearly expressed as a linear function of the other.
 In such cases, PCA transfers the variance of the second variable onto the first by translating and rotating the original axes and projecting the data onto the new axes. The directions of projection are determined using the eigenvalues and eigenvectors of the covariance matrix. As a result, the first few transformed features (termed principal components) are rich in information, whereas the last features contain mostly noise and negligible information.
 This transfer of variance allows us to retain only the first few principal components, reducing the number of variables significantly with minimal loss of information.
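To illustrate this concentration of variance, here is a minimal, self-contained sketch (assuming NumPy and scikit-learn are available; the correlated data is invented for the example):

```python
import numpy as np
from sklearn.decomposition import PCA

# Two strongly correlated variables: the second is ~2x the first plus a little noise
rng = np.random.default_rng(0)
x = rng.normal(size=500)
data = np.column_stack([x, 2 * x + rng.normal(scale=0.1, size=500)])

pca = PCA(n_components=2).fit(data)

# Nearly all of the variance is captured by the first principal component
print(pca.explained_variance_ratio_)   # roughly [0.999, 0.001]
```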
1. Assemble a data matrix: The first step is to assemble all the data points into a matrix in which each column is one data point. A data matrix D of n 3D points is therefore a 3 x n matrix, with one column per point and its x, y and z coordinates in the rows.
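A minimal sketch of such a matrix in NumPy (the four points are made up for illustration):

```python
import numpy as np

# Hypothetical data matrix D: each column is one 3D data point, so D is 3 x n
D = np.array([[2.5, 0.5, 2.2, 1.9],   # x coordinates
              [2.4, 0.7, 2.9, 2.2],   # y coordinates
              [3.1, 1.1, 2.8, 2.6]])  # z coordinates
```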

2. Calculate the mean: The next step is to calculate the mean (average) of all data points. Note that if the data is 3D, the mean is also a 3D point with x, y and z coordinates; similarly, if the data is m-dimensional, the mean is also m-dimensional. The mean is calculated as the average of the columns of D, i.e. mean = (1/n) * (sum of all data points).

3. Subtract the mean from the data matrix: We next create another matrix M by subtracting the mean from every data point (every column) of D.
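Continuing the sketch above (reusing the hypothetical matrix D), steps 2 and 3 take only a couple of lines:

```python
# Step 2: the mean is one 3D point (the average of the columns of D)
mean = D.mean(axis=1, keepdims=True)   # shape (3, 1)

# Step 3: subtract the mean from every data point to get the centred matrix M
M = D - mean                           # same shape as D
```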

4. Calculate the covariance matrix: Remember that we want to find the direction of maximum variance. The covariance matrix captures the information about the spread of the data. The diagonal elements of a covariance matrix are the variances along the X, Y and Z axes; the off-diagonal elements are the covariances between pairs of dimensions (X and Y, Y and Z, Z and X). The covariance matrix C is calculated using the following product:

C = (1 / (n - 1)) * M * M^T

where T denotes the transpose operation. The matrix C is of size m x m, where m is the number of dimensions (3 in our example). The figure below shows how the covariance matrix changes depending on the spread of the data in different directions.
Figure: Left: When the data is evenly spread in all directions, the covariance matrix has equal diagonal elements and zero off-diagonal elements. Center: When the data spread is elongated along one of the axes, the diagonal elements are unequal, but the off-diagonal elements are zero. Right: In general, the covariance matrix has both diagonal and off-diagonal elements.
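Continuing the running sketch, the covariance matrix of the centred data can be computed as follows (the 1/(n - 1) normalization matches NumPy's np.cov default):

```python
# Step 4: covariance matrix of the centred data (columns of M are data points)
n = M.shape[1]
C = (M @ M.T) / (n - 1)        # shape (3, 3)

# Sanity check against NumPy's estimator (np.cov treats rows as variables)
assert np.allclose(C, np.cov(M))
```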
Variance can only be used to explain the spread of the data in the directions parallel to the axes of the feature space. For a variable x with mean x̄, the variance is

σ(x, x) = (1 / (n − 1)) Σᵢ (xᵢ − x̄)²
For this data, we could calculate the variance in the x-direction and the variance in the y-direction. However, the horizontal spread and the vertical spread of the data do not explain the clear diagonal correlation. The figure clearly shows that, on average, if the x-value of a data point increases, the y-value increases as well, resulting in a positive correlation. This correlation can be captured by extending the notion of variance to what is called the 'covariance' of the data:
σ(x, y) = (1 / (n − 1)) Σᵢ (xᵢ − x̄)(yᵢ − ȳ)

For 2D data, we thus obtain σ(x, x), σ(y, y), σ(x, y) and σ(y, x). These four values can be summarized in a matrix, called the covariance matrix:

[ σ(x, x)  σ(x, y) ]
[ σ(y, x)  σ(y, y) ]

The covariance matrix is always a symmetric matrix with the variances on its diagonal and the covariances off-diagonal.
So the covariance matrix defines both the spread (variance) and the orientation (covariance) of our data. If we want to represent the covariance matrix by a single vector and its magnitude, we should simply find the vector that points in the direction of the largest spread of the data and whose magnitude equals the spread (variance) in that direction.
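That vector turns out to be the eigenvector of the covariance matrix with the largest eigenvalue (as described in step 5 below). A small illustration with an invented 2 x 2 covariance matrix of positively correlated data:

```python
import numpy as np

# Hypothetical covariance matrix of positively correlated 2D data
Sigma = np.array([[2.0, 1.5],
                  [1.5, 2.0]])

# eigh is meant for symmetric matrices; eigenvalues are returned in ascending order
eigvals, eigvecs = np.linalg.eigh(Sigma)

direction = eigvecs[:, -1]   # direction of largest spread: ~[0.707, 0.707]
spread = eigvals[-1]         # variance along that direction: 3.5
print(direction, spread)
```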
5. Calculate the eigenvectors and eigenvalues of the covariance matrix: The principal components are the eigenvectors of the covariance matrix. The first principal component is the eigenvector corresponding to the largest eigenvalue, the second principal component is the eigenvector corresponding to the second-largest eigenvalue, and so on.
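Continuing the running sketch (using the covariance matrix C from step 4), a sorted eigen-decomposition and a feature vector that keeps the first two components might look like this:

```python
# Step 5: eigenvectors and eigenvalues of the covariance matrix
eigvals, eigvecs = np.linalg.eigh(C)

# Sort both in descending order of eigenvalue
order = np.argsort(eigvals)[::-1]
eigvals = eigvals[order]
eigvecs = eigvecs[:, order]          # column i is the i-th principal component

# Feature vector: keep the first two principal components (eig1, eig2)
feature_vector = eigvecs[:, :2]      # shape (3, 2)
```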

Feature Vector = (eig1, eig2), i.e. a matrix whose columns are the eigenvectors we decide to keep.


 Forming Principal Components:

This is the final step, where we actually form the principal components using all of the math we did so far. To do so, we take the transpose of the feature vector and left-multiply it with the transpose of the scaled version of the original dataset.

NewData = FeatureVector^T x ScaledData^T

where:
NewData is the matrix consisting of the principal components,
FeatureVector is the matrix formed from the eigenvectors we chose to keep, and
ScaledData is the scaled version of the original dataset.
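In the running sketch, M already stores each data point as a column, so it plays the role of ScaledData^T (here the data was only mean-centred rather than fully standardized):

```python
# Final step: project the centred data onto the chosen principal components
new_data = feature_vector.T @ M   # shape (2, n): each column is a point in PC space
```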
