Principal Component Analysis
PCA is commonly used as one step in a series of analyses. One can use principal components
analysis to reduce the number of variables and avoid multicollinearity, or when you have too
many predictors relative to the number of observations.
How PCA works:
For example, a consumer products company wants to analyze customer responses to several
characteristics of a new shampoo: color, smell, texture, cleanliness, shine, volume, amount
needed to lather, and price. They conduct a principal components analysis to see if they can
form a smaller number of uncorrelated variables that are easier to interpret and analyze.
In PCA, one first finds the set of orthogonal eigenvectors of the correlation or covariance
matrix of the variables. The matrix of principal component scores is the product of the
(centered) data matrix with the eigenvector matrix. The first principal component accounts for
the largest percentage of the total data variation, the second principal component accounts for
the second largest percentage, and so on. The goal of principal components analysis is to
explain the maximum amount of variance with the fewest components.
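The procedure just described can be sketched in a few lines of NumPy; the data here is random and purely illustrative, and all variable names are assumptions, not from the text:

```python
# PCA via eigendecomposition of the covariance matrix (illustrative sketch).
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))          # 100 observations, 4 variables
Xc = X - X.mean(axis=0)                # center each variable

cov = np.cov(Xc, rowvar=False)         # 4 x 4 covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov) # orthogonal eigenvectors

# Sort components by descending eigenvalue so PC1 explains the most variance.
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

scores = Xc @ eigvecs                  # principal component scores
explained = eigvals / eigvals.sum()    # fraction of variance per component
```

The score columns are uncorrelated, and the variance of each score column equals the corresponding eigenvalue, which is exactly the "largest percent, second largest percent" ordering described above.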
Eigenvectors
Eigenvectors consist of the coefficients corresponding to each variable; they are the
weights used to calculate the principal component scores.
Scores
The linear combinations of the original variables that account for the variance in the data.
Eigenvalue
The amount of variance explained by each principal component; the eigenvalue of a
component equals the variance of its scores.
The method involves decomposing a data matrix X into a structure part and
a noise part. The PC model is the matrix product TPᵀ (the structure):
Scores: T
The scores are the structure part of the PCA: a summary of the original variables
in X that describes how the different rows of X (observations) relate to each
other. Column 1 of the T-matrix (t1) contains the scores of the first PC, the
second column contains the scores of the second PC, and so on.
Loadings: P
The loadings are the structure part of the PCA that gives the weights (influence)
of the variables in X on the scores T. From the loadings we can see which
variables are responsible for the patterns found in the scores T, using the
loadings plot. This plot is simply the loadings of one PC plotted against the
loadings of another PC. It shows how the scores and loadings relate, which is
the key value of this plot; the loadings plot can be thought of as a map of
the variables.
Residuals: E
The residuals (the E-matrix) are the noise part of the PCA, an n × p matrix.
E is not part of the model; it is the part of X that is not explained by the
model TPᵀ, i.e. E = X − TPᵀ.
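The split of X into the structure TPᵀ and the noise E can be illustrated with a short NumPy sketch. SVD is used here only as a convenient way to estimate T and P for the sketch, and all names are illustrative:

```python
# Structure/noise decomposition X = T P^T + E for a truncated PCA model.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 6))            # 50 observations, 6 variables
Xc = X - X.mean(axis=0)                 # center each variable

k = 2                                   # number of components kept in the model
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
T = U[:, :k] * s[:k]                    # scores (n x k), structure part
P = Vt[:k].T                            # loadings (p x k), structure part

E = Xc - T @ P.T                        # residuals: the part of X not explained
```

Note that the residual matrix E is orthogonal to the loading space: projecting E onto P gives (numerically) zero, which is what "not explained by the model TPᵀ" means.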
NIPALS algorithm
The NIPALS ("Nonlinear Iterative Partial Least Squares") algorithm is employed for
estimating the parameters of the PCA model. The steps of the algorithm are listed below:
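In code, the iteration looks roughly like this; a minimal sketch of the standard NIPALS formulation, where the function name, tolerance, and starting-vector choice are illustrative assumptions:

```python
# NIPALS for PCA: extract one component at a time, then deflate.
import numpy as np

def nipals_pca(X, n_components, tol=1e-11, max_iter=1000):
    """Estimate scores T and loadings P one component at a time."""
    X = X - X.mean(axis=0)                # work on centered data
    n, p = X.shape
    T = np.zeros((n, n_components))
    P = np.zeros((p, n_components))
    for a in range(n_components):
        t = X[:, 0].copy()                # starting score vector (illustrative choice)
        for _ in range(max_iter):
            p_vec = X.T @ t / (t @ t)     # regress X on t -> candidate loadings
            p_vec /= np.linalg.norm(p_vec)
            t_new = X @ p_vec             # regress X on p -> new scores
            if np.linalg.norm(t_new - t) < tol * np.linalg.norm(t_new):
                t = t_new
                break                     # scores stopped changing: converged
            t = t_new
        T[:, a], P[:, a] = t, p_vec
        X = X - np.outer(t, p_vec)        # deflate: subtract this component
    return T, P
```

Each pass is a power-method-style iteration that converges to the dominant component of the current (deflated) X, so the components come out in order of explained variance, matching the T and P matrices described above.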