
Dimensionality Reduction

Course: Artificial Intelligence Fundamentals
Instructor: Marco Bonzanini


Machine Learning Tasks

                   Supervised              Unsupervised
Discrete Data      Classification          Clustering
                   (predict a label)       (group similar items)
Continuous Data    Regression              Dimensionality Reduction
                   (predict a quantity)    (reduce n. of variables)

Section Agenda

• Introduction to Dimensionality Reduction

• Dimensionality Reduction with PCA


Introduction to Dimensionality Reduction
Motivation

• Too many dimensions! Do we need all of them?

• Some ML algorithms cannot handle a large number of variables well:
  - they become too slow
  - they become inaccurate

• How do you visualise N dimensions on a 2D screen?


Dimensionality Reduction

• Statistical techniques to reduce the number of dimensions have been
  developed since the early 1900s

• Still extremely valuable in ML nowadays

• Different strategies:
- Feature Selection
- Feature Projection
Feature Selection
• A subset of the original variables is used

• Filter methods: the least interesting variables are suppressed
  (regardless of the model), e.g. using information gain or
  correlation (see the sketch below)

• Wrapper methods: a subset of variables is used to train a model,
  and features are added or removed from this subset iteratively
  (watch out for overfitting)

• Embedded methods: similar to wrapper methods, but an intrinsic
  metric is used when building the model
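As an illustration of a filter method, here is a minimal Python sketch (assuming scikit-learn is available; the iris data set and k=2 are just placeholder choices) that scores each variable with mutual information and keeps the top-scoring ones:

from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, mutual_info_classif

# Load a small example data set (4 variables).
X, y = load_iris(return_X_y=True)

# Filter method: score each variable independently of any model, here with
# mutual information (an information-gain-style measure), and keep the k best.
selector = SelectKBest(score_func=mutual_info_classif, k=2)
X_reduced = selector.fit_transform(X, y)

print(X.shape, "->", X_reduced.shape)                       # (150, 4) -> (150, 2)
print("kept variables:", selector.get_support(indices=True))
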
Feature Projection

• Transform the data from a high-dimensional space into a space with
  fewer dimensions

• The extracted set of dimensions is intended to be informative and
  non-redundant

• Example: Principal Component Analysis (PCA), sketched below
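As a quick illustration of feature projection (using PCA, discussed in the next section), a minimal scikit-learn sketch that projects a 64-dimensional example data set down to 2 dimensions, e.g. for plotting; the digits data set is just a convenient stand-in:

from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

# Project 64-dimensional data onto 2 new, non-redundant dimensions.
X, y = load_digits(return_X_y=True)
X_2d = PCA(n_components=2).fit_transform(X)

print(X.shape, "->", X_2d.shape)   # (1797, 64) -> (1797, 2)
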


Example

From: [figure: the original high-dimensional data]
To: [figure: the same data projected onto fewer dimensions]
Dimensionality Reduction Algorithms
Principal Component Analysis

• Converts a set of observations (with a number of variables) into a
  set of values of linearly uncorrelated variables called principal
  components

• With N samples and P variables there are at most min(N-1, P)
  principal components (see the quick check below)

• But how do we find these variables / dimensions?
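A quick numerical check of the min(N-1, P) bound, sketched with scikit-learn on random data (the shapes are only for illustration):

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)

# N = 5 samples, P = 10 variables.
X = rng.normal(size=(5, 10))

pca = PCA().fit(X)

# scikit-learn keeps min(N, P) = 5 directions by default, but after centring
# only min(N-1, P) = 4 of them can carry any variance: the last explained
# variance is numerically zero.
print(pca.components_.shape)                 # (5, 10): each row is a linear combination of the P variables
print(np.round(pca.explained_variance_, 6))  # last value is approximately 0
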


PCA by Example
Suppose we have a data set:
N samples with 2 variables

VAR 1   VAR 2
12      32
54      56
34      34
…       …

at most min(N-1, 2) principal components
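To make the walkthrough below concrete, here is the toy data set as a NumPy array; only the first three rows come from the slide, and the remaining rows are made-up values added purely so that later snippets have something to run on:

import numpy as np

# N samples with 2 variables (var 1, var 2).
X = np.array([
    [12, 32],
    [54, 56],
    [34, 34],
    [20, 25],   # hypothetical row
    [45, 48],   # hypothetical row
], dtype=float)

N, P = X.shape
print("at most", min(N - 1, P), "principal components")   # 2
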


PCA by Example
[figure: scatter plot of the data set, var 1 on the x-axis and var 2 on the y-axis]

PCA by Example
Take the average for the first dimension
[figure: scatter plot with the average of var 1 marked]

PCA by Example
and for the other dimension
[figure: scatter plot with the average of var 2 marked]

PCA by Example
centre of the data set
[figure: scatter plot with the point (average of var 1, average of var 2) marked]

PCA by Example
shift the data set so that the centre corresponds to the origin
[figure: the data set translated so that its centre sits at the origin]

PCA by Example
Relative positions: still the same
[figure: the centred scatter plot, identical to the original up to translation]
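The centring step in code, continuing the sketch above (X is the toy array defined earlier):

import numpy as np

# Subtract the per-variable mean: the centre of the data set moves to the
# origin, while the relative positions of the points stay the same.
centre = X.mean(axis=0)
X_centred = X - centre

print(np.allclose(X_centred.mean(axis=0), 0.0))   # True
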
PCA by Example
Fit a line that goes through the origin
[figure: centred data with a candidate line through the origin]

PCA by Example
Start random
[figure: an initial, randomly oriented line through the origin]

PCA by Example
Rotate the line
[figure: the line rotating around the origin]

PCA by Example
until you find the best fit
[figure: the best-fitting line through the origin]

PCA by Example

How to decide what’s a good fit?


PCA by Example
Find the projections
[figure: each data point projected onto the candidate line]

PCA by Example
- minimise: d(point, line)
- maximise: d(projection, origin)
[figure: for each point, its distance to the line and the distance of its projection from the origin]

PCA by Example
• Minimising the distance from the point to the line and maximising
  the distance from the projection to the origin are equivalent
  (why? the distance from each point to the origin is fixed, and by
  Pythagoras its square is the sum of the other two squared distances,
  so one sum goes down exactly when the other goes up)

• Intuitively, it makes sense to minimise the point-to-line distance,
  but in practice it can be easier to calculate the
  projection-to-origin distance

• Use the sum of squared distances (the individual signed distances
  could be negative)
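A small numerical check of this equivalence, continuing the sketch above (X_centred is the centred toy data; the 30-degree direction is an arbitrary candidate line):

import numpy as np

theta = np.deg2rad(30.0)                       # arbitrary candidate direction
u = np.array([np.cos(theta), np.sin(theta)])   # unit vector along the line

proj = X_centred @ u                           # signed projection-to-origin distances
dist = X_centred @ np.array([-u[1], u[0]])     # signed point-to-line distances

# Pythagoras: point-to-origin^2 = projection-to-origin^2 + point-to-line^2.
# The left-hand side does not depend on the line, so maximising the sum of
# squared projections is the same as minimising the sum of squared
# point-to-line distances.
print(np.allclose(proj**2 + dist**2, (X_centred**2).sum(axis=1)))   # True
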
PCA by Example
This line maximises the sum of squared projection-to-origin distances
[figure: the best-fitting line through the centred data]

PCA by Example
This line represents the first principal component (or PC1)
[figure: the same line, labelled PC1]
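Putting the "rotate until best fit" idea into code: a brute-force sketch that tries many directions and keeps the one with the largest sum of squared projection-to-origin distances, which is (up to sign) the PC1 direction; X_centred is reused from the sketch above:

import numpy as np

angles = np.linspace(0.0, np.pi, 1800)
directions = np.column_stack([np.cos(angles), np.sin(angles)])   # candidate unit vectors

ssd = ((X_centred @ directions.T) ** 2).sum(axis=0)   # one score per candidate line
pc1_approx = directions[np.argmax(ssd)]

print("approximate PC1 direction:", pc1_approx)
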
PCA by Example

• A Principal Component is a linear combination of the original
  variables

• We are essentially maximising the spread (variance) of the
  projection

• Note: principal components are orthogonal to each other
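In practice these directions can be read off a singular value decomposition of the centred data; a minimal sketch checking that the components are orthogonal unit vectors (X_centred as above):

import numpy as np

# Rows of Vt are the principal directions, ordered by the variance of the
# projections onto them.
U, S, Vt = np.linalg.svd(X_centred, full_matrices=False)
pc1, pc2 = Vt[0], Vt[1]

print(np.isclose(pc1 @ pc2, 0.0))            # True: orthogonal
print(np.isclose(np.linalg.norm(pc1), 1.0))  # True: unit length

# Each component is a linear combination of the original variables:
print("PC1 =", round(pc1[0], 3), "* var1 +", round(pc1[1], 3), "* var2")
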


PCA by Example
In 2D, it's easy to find PC2 (do we really need it?)
[figure: the centred data with PC1 and PC2 drawn as orthogonal lines]

PCA by Example
Finding the values: rotate the PCs so that PC1 is horizontal
[figure: the data and both principal components rotated step by step until PC1 lies along the horizontal axis]
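These values are just the coordinates of each point along the principal components; a sketch of computing them directly, checked against scikit-learn's transform (X, X_centred and Vt come from the earlier sketches):

import numpy as np
from sklearn.decomposition import PCA

# "Rotate so that PC1 is horizontal" expressed as a projection onto the PCs.
scores_manual = X_centred @ Vt.T

pca = PCA(n_components=2).fit(X)
scores_sklearn = pca.transform(X)

# The two agree up to the (arbitrary) sign of each component.
print(np.allclose(np.abs(scores_manual), np.abs(scores_sklearn)))   # True
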
PCA Discussion

• The components are ordered by variance, i.e. the first component is
  the one with the highest variance

• How many components should we keep? (see notebook, and the sketch
  below)

• What do the new dimensions mean?

• Objects that are similar in high-dimensional space should also be
  similar in the transformed space
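One common way to answer the "how many components?" question is to look at the cumulative explained variance ratio; a minimal scikit-learn sketch (the digits data set and the 95% threshold are just illustrative choices):

import numpy as np
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X, _ = load_digits(return_X_y=True)            # 64-dimensional example data

pca = PCA().fit(X)

# Cumulative share of the total variance captured by the first k components.
cumulative = np.cumsum(pca.explained_variance_ratio_)

# Keep enough components to explain, say, 95% of the variance.
k = int(np.argmax(cumulative >= 0.95)) + 1
print("components needed for 95% of the variance:", k)

# Equivalently, PCA(n_components=0.95) selects this number automatically.
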
Questions?
