
Dimensionality Reduction

• Most real-world datasets have thousands, or even millions, of dimensions.

Curse of Dimensionality

• Problems of having high-dimensional data:
• The error increases with the increase in the number of features.
• The computational cost of data mining/machine learning techniques increases exponentially.
• The data becomes very sparse in a high-dimensional dataset, making machine learning/data mining algorithms ineffective.
• Overfitting problems in the predictive models.
Dimensionality Reduction

• Usually, the data can be described with fewer dimensions, without losing much of the meaning of the data.
• The data reside in a space of lower dimensionality.
Why Reduce Dimension?

• Visualization: Projection of high-dimensional data onto 2D or 3D.
• Data Compression: Efficient storage and retrieval.
• Noise Removal: Positive effect on the accuracy of the built model.
• Remove Redundant Features: Positive effect on the performance of the model.
• Hidden Correlations: May find hidden correlations among features.
Covariance

• Variance: a measure of the deviation from the mean for points in one dimension, e.g., heights.
• Covariance: a measure of how much each of the dimensions varies from the mean with respect to the others.
• Covariance is measured between two dimensions to see if there is a relationship between them, e.g., number of hours studied and marks obtained.
• The covariance between one dimension and itself is the variance, as the sketch below illustrates.
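To make these definitions concrete, here is a minimal sketch in Python/NumPy using made-up hours-studied and marks values (the numbers are purely illustrative). It applies the sample formula cov(x, y) = sum((xi - mean(x)) * (yi - mean(y))) / (n - 1) and checks that the covariance of a dimension with itself equals its variance.

```python
import numpy as np

# Hypothetical data: hours studied and marks obtained (illustrative values only)
hours = np.array([2.0, 4.0, 6.0, 8.0, 10.0])
marks = np.array([50.0, 55.0, 65.0, 70.0, 85.0])

def cov(a, b):
    """Sample covariance: average product of deviations from the mean."""
    return np.sum((a - a.mean()) * (b - b.mean())) / (len(a) - 1)

print(cov(hours, hours))       # covariance of a dimension with itself ...
print(np.var(hours, ddof=1))   # ... equals its variance
print(cov(hours, marks))       # positive: hours and marks tend to increase together
```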
Covariance Matrix

For a 3-dimensional dataset with dimensions x, y and z, the covariance matrix is:

        | var(x)    cov(x,y)  cov(x,z) |
    C = | cov(y,x)  var(y)    cov(y,z) |
        | cov(z,x)  cov(z,y)  var(z)   |

• The diagonal holds the variances of x, y and z.
• cov(x,y) = cov(y,x), hence the matrix is symmetrical about the diagonal.
• N-dimensional data will result in an N x N covariance matrix (see the sketch below).
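A minimal NumPy sketch of this, using a small made-up three-dimensional dataset: np.cov builds the full covariance matrix, whose diagonal holds the variances and which is symmetric about the diagonal.

```python
import numpy as np

# Made-up 3-dimensional dataset: each row is one dimension (x, y, z), columns are samples
data = np.array([
    [2.1, 2.5, 3.6, 4.0],     # x
    [8.0, 10.0, 12.0, 14.0],  # y
    [1.0, 0.8, 0.6, 0.5],     # z
])

C = np.cov(data)            # rows are treated as variables, so C is 3 x 3
print(C.shape)              # (3, 3): N-dimensional data gives an N x N matrix
print(np.allclose(C, C.T))  # True: cov(x, y) == cov(y, x), so C is symmetric
print(np.diag(C))           # the diagonal entries are the variances of x, y and z
```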
Covariance Examples
Covariance

• A positive value of covariance indicates that both dimensions increase or decrease together, e.g., as the number of hours studied increases, the marks in that subject increase.
• A negative value indicates that while one increases the other decreases, or vice versa.
• If the covariance is zero, the two dimensions are independent of each other, e.g., height of students vs. marks obtained in a subject.
Principal Component Analysis

• PCA is a technique to reduce the dimensions of a dataset without losing much of the information it contains.
• It is a linear transformation that chooses a new coordinate system for the dataset such that:
• The greatest variance by any projection of the dataset comes to lie on the first axis (called the first principal component).
• The second greatest variance lies on the second axis, and so on.
• PCA can be used for reducing dimensionality by eliminating the later principal components, as in the sketch below.
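As a usage sketch of this idea (assuming scikit-learn is available; the data here is made up), the PCA class keeps the requested number of leading principal components and drops the later, low-variance ones.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))          # made-up dataset: 100 samples, 5 dimensions

pca = PCA(n_components=2)              # keep only the first two principal components
X_reduced = pca.fit_transform(X)       # project the data onto those components

print(X_reduced.shape)                 # (100, 2): dimensionality reduced from 5 to 2
print(pca.explained_variance_ratio_)   # share of the total variance each kept axis explains
```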
Geometrical Interpretation

View each point in 3D space.

In this example, all the points happen to belong to a line: a 1D subspace of the original 3D space.
Geometrical Interpretation

Consider a new coordinate system where one of the axes is along the direction of the line.

Here every point has only one non-zero coordinate.
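A small numeric sketch of this picture, using made-up 3D points that lie exactly on a line through the origin: after changing to an orthonormal coordinate system whose first axis points along the line, every point has a single non-zero coordinate.

```python
import numpy as np

d = np.array([1.0, 2.0, 2.0])                  # direction of the line (made up)
t = np.array([-2.0, -1.0, 0.5, 3.0])
points = np.outer(t, d)                        # 3D points lying on the line t * d

# Build an orthonormal basis whose first axis is along d (QR of [d, e1, e2])
Q, _ = np.linalg.qr(np.column_stack([d, [1, 0, 0], [0, 1, 0]]))

coords = points @ Q                            # coordinates of each point in the new system
print(np.round(coords, 6))                     # only the first column is non-zero
```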


PCA-Concept

• Given a set of points, how do we know if they can be compressed like in the previous example?
• We have to look into the correlation between the points.
• By finding the eigenvalues and eigenvectors of the covariance matrix, we find that the eigenvectors with the largest eigenvalues correspond to the dimensions that have the strongest correlation in the dataset.
• This is the principal component (see the sketch below).
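A minimal sketch of this concept on a made-up, strongly correlated 2D dataset: compute the covariance matrix, take its eigenvalues and eigenvectors, and sort them so that the eigenvector with the largest eigenvalue, the first principal component, comes first.

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=200)
y = 0.8 * x + rng.normal(scale=0.3, size=200)   # y is strongly correlated with x
X = np.column_stack([x, y])                     # made-up 2D dataset

C = np.cov(X, rowvar=False)                     # 2 x 2 covariance matrix
eigvals, eigvecs = np.linalg.eigh(C)            # eigh: symmetric matrices, ascending eigenvalues

order = np.argsort(eigvals)[::-1]               # sort eigenvectors by decreasing eigenvalue
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

print(eigvals)           # the first eigenvalue dominates: that direction carries most variance
print(eigvecs[:, 0])     # the corresponding eigenvector is the first principal component
```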
PCA-Theorem

Generally:
1. Q is square
2. Q is symmetric
3. Q is the covariance matrix
PCA-Theorem

Each data point x can be written as a linear combination of the eigenvectors:

    x = a_1 e_1 + a_2 e_2 + ... + a_n e_n

where e_1, ..., e_n are the n eigenvectors of Q with non-zero eigenvalues.

Note:

1. The eigenvectors e_1, ..., e_n span an eigenspace.
2. They are N x 1 orthogonal vectors (directions in N-dimensional space).
3. The scalars a_i are the coordinates of x in that space.
Using PCA to Compress Data

• Expressing x in terms of e_1, ..., e_n has not changed the size of the data.
• If the points are highly correlated, many of the coordinates a_i of x will be zero or close to zero.
• Sort the eigenvectors according to their eigenvalues, as in the sketch below.
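A minimal sketch of this compression step on a made-up, highly correlated 3D dataset: after sorting the eigenvectors, keep only the top k of them, store each mean-centred point by its k coordinates in that basis, and reconstruct an approximation from those coordinates.

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(size=500)
X = np.column_stack([x, 2 * x, -x]) + rng.normal(scale=0.05, size=(500, 3))  # correlated 3D data

mean = X.mean(axis=0)
C = np.cov(X - mean, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(C)
order = np.argsort(eigvals)[::-1]
eigvecs = eigvecs[:, order]                     # eigenvectors sorted by decreasing eigenvalue

k = 1                                           # keep only the top-k eigenvectors
coords = (X - mean) @ eigvecs[:, :k]            # each point now stored as k numbers instead of 3
X_approx = coords @ eigvecs[:, :k].T + mean     # reconstruction from the compressed coordinates

print(np.abs(X - X_approx).mean())              # small: little information lost for correlated data
```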
PCA Example-Step 1
PCA Example-Step 2

Calculate the covariance matrix.

Since the non-diagonal elements in this covariance matrix are positive, we should expect that both the x and y variables increase together.
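The data table from Step 1 is not reproduced in this text, so the sketch below uses hypothetical (x, y) values purely for illustration; it subtracts the means (Step 1) and computes the covariance matrix (Step 2), whose positive off-diagonal entries indicate that x and y increase together.

```python
import numpy as np

# Hypothetical (x, y) data standing in for the missing Step 1 table
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([1.2, 1.9, 3.2, 3.8, 5.1, 6.1])

X_centered = np.column_stack([x - x.mean(), y - y.mean()])  # Step 1: subtract the means
C = np.cov(X_centered, rowvar=False)                        # Step 2: covariance matrix

print(C)        # off-diagonal entries are positive, so x and y increase together
```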
PCA Example-Step 3
Calculate the eigenvectors and eigenvalues of the covariance matrix.
PCA Example-Step 3

• Eigenvectors are plotted as diagonal dotted lines on the plot.
• They are perpendicular to each other.
• One of the eigenvectors goes through the middle of the points, like drawing a line of best fit.
PCA Example-Step 4
PCA Example-Step 5
PCA Example-Step 5
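The figures for Steps 3 to 5 are likewise not reproduced here, so this sketch carries the hypothetical data from the Step 2 sketch through the remaining steps, on the assumption that the missing slides follow the usual procedure: compute the eigenvectors (Step 3), keep the one with the largest eigenvalue as the feature vector (Step 4), and project the centred data onto it to derive the new one-dimensional dataset (Step 5).

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])        # hypothetical data from the Step 2 sketch
y = np.array([1.2, 1.9, 3.2, 3.8, 5.1, 6.1])
X_centered = np.column_stack([x - x.mean(), y - y.mean()])

C = np.cov(X_centered, rowvar=False)

# Step 3: eigenvectors and eigenvalues of the covariance matrix
eigvals, eigvecs = np.linalg.eigh(C)
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Step 4: choose the component with the largest eigenvalue as the feature vector
feature_vector = eigvecs[:, :1]

# Step 5: derive the new (1D) dataset by projecting the centred data onto the feature vector
new_data = X_centered @ feature_vector
print(new_data.ravel())
```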
Thank You
