
Principal Component Analysis (PCA)
Introduction
Principal component analysis (PCA) is a standard tool in modern
data analysis, used in diverse fields from neuroscience to
computer graphics.

It is a very useful method for extracting relevant information from
confusing data sets.

PCA is “an orthogonal linear transformation that transforms
the data to a new coordinate system such that the greatest
variance by any projection of the data comes to lie on the
first coordinate (first principal component), the second
greatest variance lies on the second coordinate (second
principal component), and so on.”
Definition
Principal component analysis (PCA) is a statistical procedure that
uses an orthogonal transformation to convert a set of observations
of possibly correlated variables into a set of values of linearly
uncorrelated variables called principal components.

The number of principal components is less than or equal to the
number of original variables.

PCA is used to reduce the dimensionality of the data without much
loss of information.
Goals

• The main goal of PCA is to identify patterns in data.
• PCA aims to detect the correlation between variables.
• It attempts to reduce the dimensionality.
• If the covariance of two dimensions is positive, both increase
together; if it is negative, as one increases the other decreases;
if it is zero, the two are uncorrelated (a short NumPy illustration
follows this list).
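
As a quick illustration of reading the sign of the covariance, here is a
minimal NumPy sketch (the arrays are made-up values, not from the slides):

import numpy as np

# Two variables that tend to increase together -> positive covariance
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([1.1, 1.9, 3.2, 3.9])

# np.cov returns the 2x2 covariance matrix; the off-diagonal entry
# is cov(x, y), which is positive for this data.
print(np.cov(x, y))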
Dimensionality Reduction

It reduces the dimensions of a d-dimensional dataset by
projecting it onto a k-dimensional subspace (where k < d)
in order to increase computational efficiency while retaining
most of the information.
Transformation

This transformation is defined in such a way that the first
principal component has the largest possible variance and each
succeeding component in turn has the next highest possible
variance.
Worked Example

Data (10 observations of two variables x and y):

x:  2.5  0.5  2.2  1.9  3.1  2.3  2.0  1.0  1.5  1.1
y:  2.4  0.7  2.9  2.2  3.0  2.7  1.6  1.1  1.6  0.9

Means: x̄ = 1.81, ȳ = 1.91

Covariance matrix:

C = | cov(x,x)  cov(x,y) | = | 0.6165  0.6154 |
    | cov(y,x)  cov(y,y) |   | 0.6154  0.7165 |

The eigenvalues λ satisfy det(C − λI) = 0, where I is the 2 × 2
identity matrix. Expanding the determinant gives the quadratic
(characteristic) equation

λ² − 1.333λ + 0.0630 = 0

Eigenvalues: λ1 = 0.0490, λ2 = 1.2840

Each eigenvector v = [X1, Y1]ᵀ satisfies C v = λ v, i.e.
(C − λI) v = 0. For λ1 = 0.0490 this reads:

0.6165 X1 + 0.6154 Y1 = 0.0490 X1
0.6154 X1 + 0.7165 Y1 = 0.0490 Y1

Normalized eigenvectors (as columns, the first for λ1, the second for λ2):

| −0.735  −0.678 |
|  0.678  −0.735 |

Since λ2 > λ1, the eigenvector for λ2 is the first principal component.
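
The numbers in this example can be verified with a few lines of NumPy
(a checking sketch; the variable names are mine):

import numpy as np

x = np.array([2.5, 0.5, 2.2, 1.9, 3.1, 2.3, 2.0, 1.0, 1.5, 1.1])
y = np.array([2.4, 0.7, 2.9, 2.2, 3.0, 2.7, 1.6, 1.1, 1.6, 0.9])

# Covariance matrix (np.cov uses the unbiased n-1 denominator)
C = np.cov(x, y)
print(C)                 # [[0.6165 0.6154] [0.6154 0.7165]]

# Eigen-decomposition of the symmetric matrix C
eigvals, eigvecs = np.linalg.eigh(C)
print(eigvals)           # [0.0490 1.2840]
print(eigvecs)           # columns are the normalized eigenvectors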
The process of obtaining principal
components from a raw dataset
can be simplified into six steps:
1. Take the whole dataset consisting of d+1 dimensions and
ignore the labels such that our new dataset becomes d
dimensional.
2. Compute the mean for every dimension of the whole
dataset.
3. Compute the covariance matrix of the whole dataset.
4. Compute eigenvalues and the corresponding eigenvectors.
5. Sort the eigenvectors by decreasing eigenvalues and
choose k eigenvectors with the largest eigenvalues to form
a d × k dimensional matrix W.
6. Use this d × k eigenvector matrix to transform the samples
onto the new subspace.
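
The six steps translate almost line for line into NumPy. The sketch
below is illustrative (the function and variable names are mine); it
uses the row-per-sample convention, so the final projection is
X_adj @ W rather than a left-multiplication:

import numpy as np

def pca(X, k):
    """Project an n x d data matrix X onto a k-dimensional subspace."""
    # Step 2: compute the mean of every dimension
    mean = X.mean(axis=0)
    X_adj = X - mean
    # Step 3: covariance matrix of the whole dataset (d x d)
    C = np.cov(X_adj, rowvar=False)
    # Step 4: eigenvalues and corresponding eigenvectors
    eigvals, eigvecs = np.linalg.eigh(C)
    # Step 5: sort by decreasing eigenvalue, keep k eigenvectors -> d x k matrix W
    order = np.argsort(eigvals)[::-1]
    W = eigvecs[:, order[:k]]
    # Step 6: transform the samples onto the new subspace
    return X_adj @ W

# The 2-D data from the worked example, reduced to one dimension
X = np.array([[2.5, 2.4], [0.5, 0.7], [2.2, 2.9], [1.9, 2.2], [3.1, 3.0],
              [2.3, 2.7], [2.0, 1.6], [1.0, 1.1], [1.5, 1.6], [1.1, 0.9]])
print(pca(X, k=1))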
Applied to the two-dimensional example:
1. Given the original data set S = {x1, ..., xk}, produce the
mean-adjusted set DataAdjust by subtracting the mean of each
attribute Ai from every xi.
2. Project each adjusted point onto the first eigenvector:
zi = ⟨xi, yi⟩ · v1.
3. Project each adjusted point onto the second eigenvector:
zi = ⟨xi, yi⟩ · v2.
Reconstructing the original data

We did:
TransformedData = RowFeatureVector × RowDataAdjust

so we can do:
RowDataAdjust = RowFeatureVector⁻¹ × TransformedData

and:
RowDataOriginal = RowDataAdjust + OriginalMean

Because the rows of RowFeatureVector are orthonormal eigenvectors,
its inverse is simply its transpose.
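
In code, the reconstruction is one extra line on top of the sketch above
(again an illustrative sketch in the row-per-sample convention, so the
transpose of W plays the role of RowFeatureVector⁻¹):

import numpy as np

X = np.array([[2.5, 2.4], [0.5, 0.7], [2.2, 2.9], [1.9, 2.2], [3.1, 3.0],
              [2.3, 2.7], [2.0, 1.6], [1.0, 1.1], [1.5, 1.6], [1.1, 0.9]])
mean = X.mean(axis=0)
X_adj = X - mean

C = np.cov(X_adj, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(C)
W = eigvecs[:, [np.argmax(eigvals)]]   # keep only the top component (k = 1)

transformed = X_adj @ W                # TransformedData
data_adjust = transformed @ W.T        # RowDataAdjust: W.T acts as the inverse
reconstructed = data_adjust + mean     # RowDataOriginal (approximate, since k < d)
print(reconstructed)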
PCA Approach

• Standardize the data.
• Perform Singular Value Decomposition to get the
eigenvectors and eigenvalues.
• Sort the eigenvalues in descending order and choose
the top k eigenvectors.
• Construct the projection matrix W from the
selected k eigenvectors.
• Transform the original dataset via W to obtain
a k-dimensional feature subspace.
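
This SVD route can also be sketched in a few lines of NumPy
(illustrative; the standardization here uses the population standard
deviation from np.std, a simplifying choice):

import numpy as np

X = np.array([[2.5, 2.4], [0.5, 0.7], [2.2, 2.9], [1.9, 2.2], [3.1, 3.0],
              [2.3, 2.7], [2.0, 1.6], [1.0, 1.1], [1.5, 1.6], [1.1, 0.9]])

# Standardize: zero mean and unit variance per column
Z = (X - X.mean(axis=0)) / X.std(axis=0)

# SVD of the standardized data: the columns of Vt.T are the eigenvectors
# of the covariance matrix, and the eigenvalues are s**2 / (n - 1).
# np.linalg.svd returns the singular values already sorted in descending order.
U, s, Vt = np.linalg.svd(Z, full_matrices=False)
eigvals = s ** 2 / (len(X) - 1)

k = 1
W = Vt[:k].T                           # projection matrix from the top-k eigenvectors
print(Z @ W)                           # k-dimensional feature subspace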
Limitation of PCA

The results of PCA depend on the scaling of the variables.
A scale-invariant form of PCA has been developed.

Applications of PCA:

• Interest Rate Derivatives Portfolios
• Neuroscience
• Image Processing
Thank You
