
Machine Learning for Signal Processing (MLSP)
Dimensionality Reduction
Course Instructor: Prof. Jyotsna Singh
Curse of Dimensionality
• In practice, the curse of dimensionality means that, for a
given sample size, there is a maximum number of features
above which the performance of our classifier will degrade
rather than improve.
• In most cases, the additional information that is lost by
discarding some features is compensated for by a more accurate
mapping in the lower-dimensional space.
• How do we beat the curse of dimensionality?
• By incorporating prior knowledge
• By increasing the smoothness of the target function
• By reducing the dimensionality
Dimensionality Reduction
• In pattern recognition, dimensionality reduction is
defined as follows:
• It is the process of converting a data set with a very
large number of dimensions into a data set with fewer
dimensions.
• It ensures that the converted data set conveys
similar information more concisely.
Dimensionality Reduction: signal representation versus classification
Linear Discriminant Analysis
• LDA is a dimensionality reduction technique used
as a pre-processing step in pattern recognition and
machine learning applications.
• LDA is similar to PCA, but in addition LDA finds the
axes that maximise the separation between
multiple classes.
• It projects the N-dimensional feature space onto a
smaller M-dimensional subspace (M ≤ N), while maintaining
the class-discriminatory information (a minimal example
is sketched below).
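For concreteness, here is a minimal sketch of such a projection using scikit-learn's LinearDiscriminantAnalysis; the toy data and variable names are ours, not from the slides.

import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Toy data: six samples in a 3-dimensional feature space, two classes (0 and 1).
X = np.array([[2.0, 1.0, 0.5],
              [3.0, 5.0, 1.0],
              [4.0, 3.0, 0.8],
              [5.0, 6.0, 2.0],
              [6.0, 7.0, 2.2],
              [7.0, 8.0, 2.5]])
y = np.array([0, 0, 0, 1, 1, 1])

# With C classes LDA can keep at most C - 1 discriminant axes, so with two
# classes the 3-D data is projected onto a single axis that separates them.
lda = LinearDiscriminantAnalysis(n_components=1)
X_projected = lda.fit_transform(X, y)   # shape (6, 1)
print(X_projected)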
Linear Discriminant Analysis, two classes: the Fisher linear discriminant
Fisher LDA for C classes
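The two-class derivation on these slides leads to the standard closed-form Fisher direction. A minimal NumPy sketch of that result, assuming the within-class scatter matrix S_W is invertible (function and variable names are ours):

import numpy as np

def fisher_direction(X0, X1):
    # Two-class Fisher LDA: w maximises (w' S_B w) / (w' S_W w), and the
    # maximiser is proportional to S_W^{-1} (m0 - m1).
    m0, m1 = X0.mean(axis=0), X1.mean(axis=0)
    # Within-class scatter: sum of the scatter matrices of the two classes.
    S_W = (X0 - m0).T @ (X0 - m0) + (X1 - m1).T @ (X1 - m1)
    w = np.linalg.solve(S_W, m0 - m1)
    return w / np.linalg.norm(w)        # unit-length projection axis

# Example: two small 2-D clusters.
X0 = np.array([[2.0, 1.0], [3.0, 5.0], [4.0, 3.0]])
X1 = np.array([[5.0, 6.0], [6.0, 7.0], [7.0, 8.0]])
w = fisher_direction(X0, X1)
print(w, X0 @ w, X1 @ w)                # the 1-D projections of each class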
PCA Algorithm



How does PCA achieve dimension
reduction?
• 1. Principal Component Analysis reduces high dimensions
into a low-dimensional subspace by creating a new set of
components that carry most of the essential information
of all the features.
• 2. These new components are linear combinations of all
the features, and the components thus formed are nothing
but the eigenvectors of the data covariance matrix, which
are now called the principal components.
• 3. The eigenvalue corresponding to each of these
eigenvectors tells us how much of the variation in the
data has been captured by that particular principal
component.
• 4. Principal components are orthogonal to each other and
uncorrelated, and each successive component captures the
maximum remaining variance. Because they are uncorrelated,
they also remove the problem of multicollinearity (both
properties are checked numerically in the sketch below).
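A minimal NumPy check of points 3 and 4; the randomly generated data and the variable names are ours, purely for illustration.

import numpy as np

rng = np.random.default_rng(0)
# Correlated 3-D data: 200 samples mixed through a fixed matrix.
X = rng.normal(size=(200, 3)) @ np.array([[3.0, 0.5, 0.1],
                                          [0.5, 2.0, 0.2],
                                          [0.1, 0.2, 1.0]])

Xc = X - X.mean(axis=0)                 # centre the data
C = np.cov(Xc, rowvar=False)            # covariance matrix of the features
eigvals, eigvecs = np.linalg.eigh(C)    # symmetric matrix, ascending order
eigvals, eigvecs = eigvals[::-1], eigvecs[:, ::-1]   # sort descending

# Point 3: each eigenvalue is the variance captured by its principal component.
print("fraction of variance explained:", eigvals / eigvals.sum())

# Point 4: the projected scores are uncorrelated, so their covariance is diagonal.
scores = Xc @ eigvecs
print(np.round(np.cov(scores, rowvar=False), 6))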
The steps involved in the PCA algorithm are as follows
(a NumPy sketch that follows these steps appears after the list):

• Step-01: Get the data.
• Step-02: Compute the mean vector (µ).
• Step-03: Subtract the mean from the given data.
• Step-04: Calculate the covariance matrix.
• Step-05: Calculate the eigen vectors and eigen values
of the covariance matrix.
• Step-06: Choose components and form a feature vector.
• Step-07: Derive the new data set.
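A minimal sketch of these seven steps in NumPy, following the convention used in Problem-01 below (the covariance matrix divides by N rather than N - 1); the function and variable names are ours.

import numpy as np

def pca(X, n_components):
    # Step-01: get the data (X holds one pattern per row).
    # Step-02: compute the mean vector.
    mu = X.mean(axis=0)
    # Step-03: subtract the mean from the given data.
    Xc = X - mu
    # Step-04: calculate the covariance matrix (dividing by N, as in Problem-01).
    C = (Xc.T @ Xc) / X.shape[0]
    # Step-05: eigen values and eigen vectors of the covariance matrix.
    eigvals, eigvecs = np.linalg.eigh(C)     # ascending order for a symmetric matrix
    order = np.argsort(eigvals)[::-1]        # re-sort in descending order
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # Step-06: choose the leading components and form the feature (projection) matrix.
    W = eigvecs[:, :n_components]
    # Step-07: derive the new data set by projecting onto the chosen components.
    return Xc @ W, eigvals, W

Calling pca(...) on the six patterns of Problem-01 reproduces, up to rounding and the sign of the eigen vector, the eigen values and projections worked out by hand below.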
• Problem-01:

• Consider the two-dimensional patterns (2, 1),
(3, 5), (4, 3), (5, 6), (6, 7), (7, 8).
• Compute the principal component using the
PCA algorithm.
• Solution-

• We use the PCA algorithm discussed above.

• Step-01:

• Get data. The given feature vectors are-


• x1 = (2, 1)
• x2 = (3, 5)
• x3 = (4, 3)
• x4 = (5, 6)
• x5 = (6, 7)
• x6 = (7, 8)
• Step-02:

• Calculate the mean vector (µ).


• Mean vector (µ)
= ((2 + 3 + 4 + 5 + 6 + 7) / 6, (1 + 5 + 3 + 6 + 7 + 8) / 6)
= (4.5, 5)
• Step-03:

• Subtract mean vector (µ) from the given feature
vectors.
• x1 – µ = (2 – 4.5, 1 – 5) = (-2.5, -4)
• x2 – µ = (3 – 4.5, 5 – 5) = (-1.5, 0)
• x3 – µ = (4 – 4.5, 3 – 5) = (-0.5, -2)
• x4 – µ = (5 – 4.5, 6 – 5) = (0.5, 1)
• x5 – µ = (6 – 4.5, 7 – 5) = (1.5, 2)
• x6 – µ = (7 – 4.5, 8 – 5) = (2.5, 3)
• Step-04:

• The covariance matrix is given by
Covariance matrix = (m1 + m2 + m3 + m4 + m5 + m6) / 6,
where mi = (xi – µ)(xi – µ)^T is the scatter matrix of the i-th
mean-subtracted pattern.
• Carrying out this sum, the covariance matrix works out to

        | 2.92   3.67 |
        | 3.67   5.67 |
• Step-05:
• Calculate the eigen values and eigen vectors of
the covariance matrix.
• λ is an eigen value for a matrix M if it is a
solution of the characteristic equation |M – λI| =
0.
• So, we have

        | 2.92 – λ    3.67     |
        | 3.67        5.67 – λ |  = 0

• From here,
• (2.92 – λ)(5.67 – λ) – (3.67 × 3.67) = 0
• 16.56 – 2.92λ – 5.67λ + λ² – 13.47 = 0
• λ² – 8.59λ + 3.09 = 0
• Solving this quadratic equation, we get λ = 8.22, 0.38
• Thus, the two eigen values are λ1 = 8.22 and λ2 = 0.38.

• Clearly, the second eigen value is very small compared to
the first eigen value.
• So, the second eigen vector can be left out.

• The eigen vector corresponding to the greatest eigen value is
the principal component for the given data set.
• So, we find the eigen vector corresponding to the eigen value
λ1.
• We use the following equation to find the eigen
vector-
• MX = λX
• where-
• M = Covariance Matrix
• X = Eigen vector
• λ = Eigen value

• Substituting the values in the above equation, we get

        | 2.92   3.67 | | X1 |          | X1 |
        | 3.67   5.67 | | X2 |  =  8.22 | X2 |

• Solving these, we get-
• 2.92X1 + 3.67X2 = 8.22X1
• 3.67X1 + 5.67X2 = 8.22X2
• On simplification, we get-
• 5.3X1 = 3.67X2 ………(1)
• 3.67X1 = 2.55X2 ………(2)

• From (1) and (2), X1 = 0.69X2.

• From (2), taking X2 = 3.67 gives X1 = 2.55, so the
(unnormalised) eigen vector is

        | X1 |   | 2.55 |
        | X2 | = | 3.67 |

• Lastly, we project each data point onto the new
subspace as yi = (eigen vector)^T (xi – µ), for i = 1, …, 6.
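As a sanity check (not part of the original worked example), the hand calculation can be reproduced with NumPy; the exact results differ slightly in the second decimal place because the slides round the covariance entries before solving.

import numpy as np

X = np.array([[2, 1], [3, 5], [4, 3], [5, 6], [6, 7], [7, 8]], dtype=float)
mu = X.mean(axis=0)                      # (4.5, 5.0)
Xc = X - mu
C = (Xc.T @ Xc) / len(X)                 # divide by N, as in Step-04
eigvals, eigvecs = np.linalg.eigh(C)     # ascending order
print(np.round(C, 2))                    # [[2.92 3.67] [3.67 5.67]]
print(np.round(eigvals[::-1], 2))        # approximately [8.21 0.38]
e = eigvecs[:, -1]                       # eigen vector of the largest eigen value
print(np.round(e / e[1] * 3.67, 2))      # close to the hand-computed (2.55, 3.67)
print(np.round(Xc @ e, 2))               # 1-D projections of the six patterns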
• Problem-02:
• Use the PCA algorithm to transform the pattern (2, 1) onto the
eigen vector found in the previous question.
• Solution-
• The given feature vector is (2, 1).
• The feature vector gets transformed to
= Transpose of eigen vector × (feature vector – mean vector)
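Carrying the arithmetic through with the unnormalised eigen vector (2.55, 3.67) found above (the final number is our reconstruction, since the original slide leaves it to an image):

Transformed value = [2.55  3.67] [2 – 4.5, 1 – 5]^T
                  = 2.55 × (–2.5) + 3.67 × (–4)
                  = –6.375 – 14.68
                  ≈ –21.06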
