Dimensionality Reduction
Step 1: Standardization
The aim of this step is to standardize the range of the continuous initial variables so that each of them
contributes equally to the analysis.
If there are large differences between the ranges of the initial variables, the variables with larger ranges will
dominate those with smaller ranges (for example, a variable that ranges between 0 and 100 will
dominate a variable that ranges between 0 and 1), which leads to biased results. Transforming
the data to comparable scales prevents this problem.
Mathematically, this can be done by subtracting the mean and dividing by the standard deviation for each
value of each variable.
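As a quick illustration, here is a minimal sketch of this standardization in Python using NumPy; the array X and its values are made up for the example.

```python
# Minimal sketch of Step 1 (standardization / z-scoring) with NumPy.
# The array X and its contents are illustrative, not taken from the article.
import numpy as np

X = np.array([[90.0, 0.2],   # two variables with very different ranges
              [60.0, 0.9],
              [75.0, 0.4]])

# Subtract each column's mean and divide by its standard deviation
X_std = (X - X.mean(axis=0)) / X.std(axis=0)

print(X_std.mean(axis=0))  # approximately 0 for every variable
print(X_std.std(axis=0))   # 1 for every variable
```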
Step 2: Covariance Matrix Computation
The aim of this step is to understand how the variables of the input data set vary from the mean with respect
to each other, or in other words, to see whether there is any relationship between them. Sometimes
variables are so highly correlated that they contain redundant information. In order to
identify these correlations, we compute the covariance matrix.
For example, for a 3-dimensional data set with 3 variables x, y, and z, the covariance matrix is a 3×3
matrix of this form:
Cov(x,x)  Cov(x,y)  Cov(x,z)
Cov(y,x)  Cov(y,y)  Cov(y,z)
Cov(z,x)  Cov(z,y)  Cov(z,z)
Since the covariance of a variable with itself is its variance (Cov(a,a) = Var(a)), the main diagonal (top left
to bottom right) actually contains the variances of each initial variable. And since covariance is
commutative (Cov(a,b) = Cov(b,a)), the entries of the covariance matrix are symmetric with respect to the
main diagonal, which means that the upper and lower triangular portions are equal.
What do the covariances that we have as entries of the matrix tell us about the correlations between
the variables? It is the sign of the covariance that matters:
If positive then: the two variables increase or decrease together (correlated)
If negative then: one increases when the other decreases (inversely correlated)
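Here is a minimal sketch of computing the covariance matrix with NumPy; the synthetic data stands in for an already-standardized data set with 3 variables.

```python
# Minimal sketch of Step 2: computing the covariance matrix.
# Synthetic data is used as a stand-in for standardized variables x, y, z.
import numpy as np

rng = np.random.default_rng(0)
X_std = rng.standard_normal((100, 3))      # shape (n_samples, n_variables)

cov_matrix = np.cov(X_std, rowvar=False)   # 3x3 covariance matrix

print(cov_matrix)                               # variances on the main diagonal
print(np.allclose(cov_matrix, cov_matrix.T))    # True: the matrix is symmetric
```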
Step 3: Compute the eigenvectors and eigenvalues of the covariance matrix to identify the principal
components
Eigenvectors and eigenvalues are the linear algebra concepts that we need to compute from the
covariance matrix in order to determine the principal components of the data.
What you first need to know about eigenvectors and eigenvalues is that they always come in pairs, so that
every eigenvector has an eigenvalue. Also, their number is equal to the number of dimensions of the data.
For example, for a 3-dimensional data set, there are 3 variables, therefore there are 3 eigenvectors with
3 corresponding eigenvalues.
The eigenvectors of the covariance matrix are actually the directions of the axes along which there is the most
variance (the most information), and these are what we call the principal components.
The eigenvalues are simply the coefficients attached to the eigenvectors, and they give the amount of variance
carried by each principal component.
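The following sketch (again with synthetic, standardized data) shows how the eigenvalues and eigenvectors of the covariance matrix can be obtained with NumPy and sorted so that the first column corresponds to PC1.

```python
# Minimal sketch of Step 3: eigenvectors/eigenvalues of the covariance matrix.
import numpy as np

rng = np.random.default_rng(0)
X_std = rng.standard_normal((100, 3))      # standardized 3-variable data (synthetic)
cov_matrix = np.cov(X_std, rowvar=False)

# eigh is appropriate because the covariance matrix is symmetric
eigenvalues, eigenvectors = np.linalg.eigh(cov_matrix)

# Sort by descending eigenvalue so column 0 is PC1, column 1 is PC2, ...
order = np.argsort(eigenvalues)[::-1]
eigenvalues = eigenvalues[order]
eigenvectors = eigenvectors[:, order]

print(eigenvalues)   # amount of variance carried by each principal component
print(eigenvectors)  # each column is the direction (axis) of one component
```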
Principal Component Analysis Example:
Let’s suppose that our data set is 2-dimensional with 2 variables x and y, and that we have computed the
two eigenvalue/eigenvector pairs (λ1, v1) and (λ2, v2) of its covariance matrix.
If we rank the eigenvalues in descending order, we get λ1>λ2, which means that the eigenvector that
corresponds to the first principal component (PC1) is v1 and the one that corresponds to the second
principal component (PC2) is v2.
After having the principal components, to compute the percentage of variance (information) accounted
for by each component, we divide the eigenvalue of each component by the sum of the eigenvalues. If we
apply this to the example above, we find that PC1 and PC2 carry 96 percent and 4 percent of
the variance of the data, respectively.
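As a quick check of those percentages, here is the same calculation with illustrative eigenvalues chosen only because they reproduce the 96/4 split quoted above (the example's actual numbers are not shown in this excerpt).

```python
# Worked check of the explained-variance percentages.
# lambda1 and lambda2 are assumed, illustrative values for PC1 and PC2.
lambda1, lambda2 = 1.284, 0.049
total = lambda1 + lambda2

print(lambda1 / total)  # ~0.96 -> about 96 percent of the variance (PC1)
print(lambda2 / total)  # ~0.04 -> about 4 percent of the variance (PC2)
```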
Step 4: Feature Vector
In this step, we choose whether to keep all of these components or to discard those of lesser
significance (the ones with low eigenvalues), and form with the remaining ones a matrix of vectors that we
call the feature vector.
So, the feature vector is simply a matrix that has as columns the eigenvectors of the components that we
decide to keep. This makes it the first step towards dimensionality reduction, because if we choose to
keep only p eigenvectors (components) out of n, the final data set will have only p dimensions.
Continuing with the example from the previous step, we can either form a feature vector with both
eigenvectors v1 and v2 as its columns, or discard the eigenvector v2, which is the one of lesser significance,
and form a feature vector with v1 only.
Discarding the eigenvector v2 will reduce dimensionality by 1, and will consequently cause a loss of
information in the final data set. But given that v2 was carrying only 4 percent of the information, the loss
is therefore not important, and we will still have the 96 percent of the information that is carried by v1.
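Putting the last two steps together, here is a minimal sketch of forming the feature vector by keeping only the top-p eigenvectors; the data and names are illustrative.

```python
# Minimal sketch of Step 4: keep the top-p eigenvectors (sorted by eigenvalue)
# as the columns of the feature vector. Data and names are illustrative.
import numpy as np

rng = np.random.default_rng(0)
X_std = rng.standard_normal((100, 2))        # standardized 2-variable data (synthetic)
cov_matrix = np.cov(X_std, rowvar=False)

eigenvalues, eigenvectors = np.linalg.eigh(cov_matrix)
order = np.argsort(eigenvalues)[::-1]        # descending eigenvalues
eigenvectors = eigenvectors[:, order]

p = 1                                        # keep only PC1, discard PC2
feature_vector = eigenvectors[:, :p]         # shape (2, 1): one column per kept component
print(feature_vector)
```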
Reference link:
Dimensionality Reduction: https://medium.com/nerd-for-tech/dimensionality-reduction-techniques-pca-lca-and-svd-f2a56b097f7c