MLPDF 2

The document provides steps for performing principal component analysis (PCA) and applying it to reduce the dimensions of a dataset. It describes 1) representing the dataset as a matrix, 2) standardizing the data, 3) calculating the covariance matrix and eigenvectors/eigenvalues, 4) using the eigenvalues to select principal components and reduce dimensions, and 5) an example of applying PCA to reduce a 2D dataset to 1D. Singular value decomposition (SVD) is also discussed as an alternative technique for dimensionality reduction.


Steps for PCA

• 1. Getting the dataset: take the input dataset and divide it into two subparts X and Y, where X is the training set and Y is the validation set.

• 2. Representing data into a structure: represent the dataset as a two-dimensional matrix of the independent variables X. Here each row corresponds to a data item, and each column corresponds to a feature.

• The number of columns is the dimension of the dataset. A small illustration of this layout is sketched below.
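A minimal sketch of step 2 in Python with NumPy, using the four-sample, two-feature data from the worked example later in these notes:

```python
import numpy as np

# Step 2: rows are data items, columns are features (here N = 4, n = 2).
X = np.array([[4.0, 11.0],
              [8.0, 4.0],
              [13.0, 5.0],
              [7.0, 14.0]])

print(X.shape)   # (4, 2) -> 4 samples, 2 features
```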

Steps for PCA...

• 3. Standardizing the data: in a particular column of X, the features with high variance are more important than the features with lower variance. If the importance of features is independent of the variance of the features, divide each data item in a column by the standard deviation of that column.

• The resulting matrix is Z.

• 4. Calculating the covariance of Z: find the transpose of Z and multiply it by Z; i.e. ZᵀZ is the covariance matrix of Z. A sketch of both steps is given below.
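A minimal sketch of steps 3-4 in Python. Two assumptions not spelled out in the slides: the columns are mean-centered before dividing by the standard deviation (so that ZᵀZ really measures covariance), and the product is scaled by 1/(N-1) to match the usual sample-covariance convention:

```python
import numpy as np

def standardize(X):
    # Step 3: mean-center (an added assumption) and divide each
    # column by its standard deviation.
    Z = X - X.mean(axis=0)
    return Z / Z.std(axis=0, ddof=1)

def covariance(Z):
    # Step 4: Z'Z, scaled by 1/(N-1) (an added convention) so the
    # result matches np.cov(Z, rowvar=False).
    return (Z.T @ Z) / (Z.shape[0] - 1)
```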

Steps for PCA...

• 5. Calculating the eigenvalues and eigenvectors for the covariance matrix of Z: eigenvectors are the directions of the axes with high information, and the eigenvalues associated with these eigenvectors give the amount of variance along those directions.

• 6. Sorting the eigenvectors: sort all the eigenvalues in decreasing order, and sort the eigenvectors accordingly into a matrix P of eigenvectors. The resulting matrix is named P*.

Steps for PCA...

• 7. Calculating the new features, or principal components: multiply Z by the P* matrix; the result is the matrix Z*. Each column of Z* is independent of the others.

• 8. Remove the less important features from the new dataset Z*. A sketch covering steps 5-8 is given below.
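A minimal sketch of steps 5-8 in Python, assuming Z holds one sample per row, so the projection is written Z @ P_star (the slides' "multiply the P* matrix to Z" is the same operation up to this layout convention):

```python
import numpy as np

def pca_reduce(Z, k):
    S = (Z.T @ Z) / (Z.shape[0] - 1)    # step 4: covariance of Z
    eigvals, P = np.linalg.eigh(S)      # step 5: eigenpairs (ascending)
    order = np.argsort(eigvals)[::-1]   # step 6: decreasing eigenvalues
    P_star = P[:, order]
    Z_star = Z @ P_star                 # step 7: principal components
    return Z_star[:, :k]                # step 8: keep the top k features
```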

Example

• Given the following data, use PCA to reduce the dimension from 2 to 1

Step 1: Given Data Set


Feature   Example 1   Example 2   Example 3   Example 4
x         4           8           13          7
y         11          4           5           14

No. of features, n = 2

No. of samples, N = 4

Step 2: Computation of the mean of the variables

• x̄ = (4 + 8 + 13 + 7) / 4 = 8

• ȳ = (11 + 4 + 5 + 14) / 4 = 8.5

Step 3: Computation of the covariance matrix

Example

• The ordered pairs of (x, y) are

(x,x), (x,y), (y,x), (y,y)

i) Find the covariance of all ordered pairs:

Cov(xi, xj) = (1/(N-1)) Σk=1..N (xik - x̄i)(xjk - x̄j)

Cov(x,x) = (1/(N-1)) Σk=1..N (xk - x̄)²  (both variables are the same "x")

• Cov(x,x) = (1/(4-1))((4-8)² + (8-8)² + (13-8)² + (7-8)²) = 14

• Cov(x,y) = (1/(4-1))((4-8)(11-8.5) + (8-8)(4-8.5) + (13-8)(5-8.5) + (7-8)(14-8.5)) = -11

• Cov(y,x) = Cov(x,y) = -11

• Cov(y,y) = (1/(4-1))((11-8.5)² + (4-8.5)² + (5-8.5)² + (14-8.5)²) = 23

ii) Covariance matrix, n × n (2 × 2):

S = | cov(x,x)  cov(x,y) |  =  |  14  -11 |
    | cov(y,x)  cov(y,y) |     | -11   23 |
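These numbers can be checked in a few lines of Python (np.cov uses the same 1/(N-1) convention as above):

```python
import numpy as np

x = np.array([4.0, 8.0, 13.0, 7.0])
y = np.array([11.0, 4.0, 5.0, 14.0])

S = np.cov(np.stack([x, y]))   # rows are variables, columns are samples
print(S)                       # [[ 14. -11.]
                               #  [-11.  23.]]
```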

Step 4: Eigenvalues, eigenvectors and normalized eigenvectors

• i) Eigenvalues

• Solve det(S - λI) = 0, where S is the covariance matrix, I is the identity matrix and λ is an eigenvalue.

det(S - λI) = det | 14-λ   -11  |  = 0
                  | -11   23-λ |

(14 - λ)(23 - λ) - (-11)(-11) = λ² - 37λ + 201 = 0

λ = 30.3849, 6.6151

• Since λ1 > λ2:

• λ1 = 30.3849 (first principal component) and λ2 = 6.6151

• ii) Normalized eigenvectors: solving (S - λI)e = 0 for each eigenvalue and normalizing gives e1 = [0.5574, -0.8303] for λ1 and e2 = [0.8303, 0.5574] for λ2; these are the vectors used in Step 6 below.
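A quick check of the characteristic polynomial and the eigenpairs in Python (np.linalg.eigh returns eigenvalues in ascending order and may flip the sign of an eigenvector, which is harmless):

```python
import numpy as np

# Roots of the characteristic polynomial: lambda^2 - 37*lambda + 201 = 0
print(np.roots([1.0, -37.0, 201.0]))   # approx [30.3849, 6.6151]

# The same values straight from the covariance matrix:
S = np.array([[14.0, -11.0], [-11.0, 23.0]])
eigvals, eigvecs = np.linalg.eigh(S)
print(eigvals)    # approx [6.6151, 30.3849]
print(eigvecs)    # columns are the normalized eigenvectors
```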


Step 6: Coordinate system for the principal components

• Select the mean of (x, y), i.e. (8, 8.5), as the new origin; then select e1 and e2:

e1 = [ 0.5574, -0.8303 ],  e2 = [ 0.8303, 0.5574 ]

• Then draw the lines e1 and e2 through the mean.

[Figure: the four data points plotted in the (x, y) plane, with the axes e1 and e2 drawn through the mean point (8, 8.5).]

• Then place the table values on e1.

PCA...

• If every value lies on e1, i.e. on a single dimension, computation becomes easy compared to two dimensions. A sketch of this projection is given below.
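A minimal sketch of the projection in Python, centering the data at the mean (8, 8.5) and projecting each sample onto e1 (e1 taken from Step 6; np.linalg.eigh would produce it up to sign):

```python
import numpy as np

X = np.array([[4.0, 11.0],
              [8.0, 4.0],
              [13.0, 5.0],
              [7.0, 14.0]])
e1 = np.array([0.5574, -0.8303])    # first principal component (Step 6)

# Center at the mean (8, 8.5) and project every sample onto e1:
Z_star = (X - X.mean(axis=0)) @ e1  # the 2-D data reduced to 1-D
print(Z_star)                       # one coordinate per sample along e1
```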

Applications of PCA

• PCA is mainly used as a dimensionality reduction technique in various AI applications such as computer vision, image compression, etc.

• It can also be used for finding hidden patterns when the data has high dimensions. Some fields where PCA is used are finance, data mining, psychology, etc.

Singular value decomposition (SVD)

• Singular value decomposition (SVD) is a matrix factorization technique commonly used in linear algebra. The SVD of a matrix A (m × n) is a factorization of the form:

A = U Σ Vᵀ

where U and V are orthonormal matrices:

• U is an m × m unitary matrix,

• V is an n × n unitary matrix, and

• Σ is an m × n rectangular diagonal matrix.

• The diagonal entries of Σ are known as the singular values of matrix A.

• The columns of U and V are called the left-singular and right-singular vectors of matrix A, respectively.

Singular Value Decomposition (SVD)...

• SVD of a data matrix has the following properties:

• 1. Patterns in the attributes are captured by the right-singular vectors, i.e. the columns of V.

• 2. Patterns among the instances are captured by the left-singular vectors, i.e. the columns of U.

• 3. The larger a singular value, the larger the part of the matrix A that it and its associated vectors account for.

• 4. A new data matrix with k attributes is obtained using the equation

D' = D × [v1, v2, …, vk]

• Thus, the dimensionality gets reduced to k; a sketch of this reduction is given below.

• SVD is often used in the context of text data.
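A minimal sketch of this reduction in Python, on a hypothetical 5 × 4 data matrix (the shape and values are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
D = rng.normal(size=(5, 4))    # 5 instances x 4 attributes (toy data)

# Thin SVD: D = U @ diag(s) @ Vt, singular values s in decreasing order.
U, s, Vt = np.linalg.svd(D, full_matrices=False)

# Keep the k right-singular vectors with the largest singular values
# and project: D' = D @ [v1 ... vk], as in property 4 above.
k = 2
V_k = Vt[:k, :].T              # columns of V = right-singular vectors
D_k = D @ V_k                  # new data matrix with k attributes
print(D_k.shape)               # (5, 2)
```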
