
Principal Component Analysis

Principal Component Analysis is an unsupervised learning algorithm that is used for dimensionality reduction in machine learning. It is a statistical process that converts observations of correlated features into a set of linearly uncorrelated features with the help of an orthogonal transformation. These new transformed features are called the Principal Components. PCA is one of the popular tools used for exploratory data analysis and predictive modeling. It draws out the strong patterns in a given dataset by reducing the number of dimensions while retaining as much of the variance as possible.
PCA generally tries to find a lower-dimensional surface onto which the high-dimensional data can be projected.
Some real-world applications of PCA are image processing, movie recommendation systems, and optimizing the power allocation in various communication channels. Because PCA is a feature extraction technique, it keeps the important variables and drops the least important ones.
The PCA algorithm is based on mathematical concepts such as:
o Variance and covariance
o Eigenvalues and eigenvectors
Some common terms used in the PCA algorithm:
o Dimensionality: The number of features or variables present in the given dataset; more simply, the number of columns in the dataset.
o Correlation: How strongly two variables are related to each other, i.e., how much one variable changes when the other changes. The correlation value ranges from -1 to +1: -1 means the variables are perfectly inversely related, and +1 means they are perfectly directly related.
o Orthogonal: Variables are orthogonal when they are uncorrelated with each other, i.e., the correlation between each pair of variables is zero.
o Eigenvectors: Given a square matrix M and a non-zero vector v, v is an eigenvector of M if Mv is a scalar multiple of v, i.e., Mv = λv for some scalar λ (the corresponding eigenvalue). See the sketch after this list.
o Covariance Matrix: A matrix containing the covariances between each pair of variables.
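To make the last two terms concrete, here is a minimal NumPy sketch (the data values are made up for illustration): it builds a covariance matrix M and checks that an eigenvector v of M satisfies Mv = λv.

```python
import numpy as np

# Made-up dataset: 4 observations (rows) of 2 features (columns)
X = np.array([[2.0, 1.6],
              [0.5, 0.7],
              [2.2, 2.9],
              [1.9, 2.2]])

# Covariance matrix: covariances between each pair of variables
M = np.cov(X, rowvar=False)  # shape (2, 2)

# Eigen-decomposition of M; eigh suits symmetric matrices
eigvals, eigvecs = np.linalg.eigh(M)
v, lam = eigvecs[:, 0], eigvals[0]
print(np.allclose(M @ v, lam * v))  # True: M v is a scalar multiple of v
```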
Principal Components in PCA
As described above, the transformed new features, i.e., the output of PCA, are the Principal Components. The number of these PCs is less than or equal to the number of original features present in the dataset. Some properties of these principal components are given below (the sketch after this list demonstrates the last two):
o Each principal component is a linear combination of the original features.
o The components are orthogonal, i.e., the correlation between each pair of them is zero.
o The importance of each component decreases from 1 to n: the first PC has the most importance (it captures the most variance), and the nth PC has the least.
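Both properties can be checked directly. The sketch below assumes scikit-learn is available and uses random numbers as a stand-in for a real dataset; it shows that the components are mutually orthogonal and that the explained variance decreases from the first PC to the last.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
# Stand-in dataset: random features mixed together so they are correlated
X = rng.normal(size=(200, 4)) @ rng.normal(size=(4, 4))

pca = PCA().fit(X)

# Orthogonality: the pairwise dot products of the components form the identity
print(np.allclose(pca.components_ @ pca.components_.T, np.eye(4)))  # True

# Importance decreases from PC 1 to PC n
print(pca.explained_variance_ratio_)  # sorted from largest to smallest
```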
Steps for PCA algorithm
1. Getting the dataset
Firstly, we take the input dataset and divide it into two subparts, X and Y, where X is the training set and Y is the validation set.
2. Representing data in a structure
Now we represent our dataset in a structure: a two-dimensional matrix of the independent variable X, in which each row corresponds to a data item and each column corresponds to a feature. The number of columns is the dimensionality of the dataset.
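A hypothetical version of this structure in NumPy might look as follows (the values are arbitrary placeholders):

```python
import numpy as np

# Hypothetical dataset: rows are data items, columns are features
X = np.array([[2.5, 2.4, 0.5],
              [0.5, 0.7, 1.1],
              [2.2, 2.9, 0.4],
              [1.9, 2.2, 0.6],
              [3.1, 3.0, 0.2]])
print(X.shape)  # (5, 3): 5 data items, dimensionality 3
```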
3. Standardizing the data
In this step, we standardize the dataset. Otherwise, the features with high variance would count as more important than the features with lower variance, regardless of their actual relevance. If the importance of a feature should be independent of its variance, we divide each data item in a column by the standard deviation of that column (after first centering each column on its mean, as is standard). We name the resulting matrix Z.
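A minimal sketch of this step, with random numbers standing in for a real dataset:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3)) @ rng.normal(size=(3, 3))  # stand-in dataset

# Center each column, then divide it by its standard deviation
Z = (X - X.mean(axis=0)) / X.std(axis=0)
print(Z.mean(axis=0).round(6))  # columns now have (near-)zero mean
print(Z.std(axis=0))            # ...and unit standard deviation
```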
4. Calculating the Covariance of Z
To calculate the covariance of Z, we take the matrix Z and transpose it, and then multiply the transposed matrix by Z. The output is the covariance matrix of Z (up to the usual 1/(n-1) scaling factor, which does not change the directions of the eigenvectors).
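Continuing the same kind of sketch (the 1/(n-1) factor makes the result match NumPy's np.cov):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3)) @ rng.normal(size=(3, 3))  # stand-in dataset
Z = (X - X.mean(axis=0)) / X.std(axis=0)                 # step 3

# Transpose Z and multiply by Z, scaled by n-1
cov = (Z.T @ Z) / (len(Z) - 1)
print(np.allclose(cov, np.cov(Z, rowvar=False)))  # True: matches np.cov
```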
5. Calculating the Eigen Values and Eigen Vectors
Now we calculate the eigenvalues and eigenvectors of the resulting covariance matrix. The eigenvectors of the covariance matrix are the directions of the axes that carry the most information (variance), and the corresponding eigenvalues give the amount of variance along each of those directions.
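A sketch of this step; np.linalg.eigh is used because a covariance matrix is symmetric:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3)) @ rng.normal(size=(3, 3))  # stand-in dataset
Z = (X - X.mean(axis=0)) / X.std(axis=0)                 # step 3
cov = (Z.T @ Z) / (len(Z) - 1)                           # step 4

# eigh returns eigenvalues in ascending order for a symmetric matrix
eigvals, eigvecs = np.linalg.eigh(cov)
print(eigvals)         # variance along each eigenvector's direction
print(eigvecs[:, -1])  # direction with the most variance (largest eigenvalue)
```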
6. Sorting the Eigen Vectors
In this step, we take all the eigenvalues and sort them in decreasing order, i.e., from largest to smallest, and we simultaneously sort the eigenvectors accordingly into a matrix P. The resulting sorted matrix is named P*.
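A sketch of the sorting step:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3)) @ rng.normal(size=(3, 3))  # stand-in dataset
Z = (X - X.mean(axis=0)) / X.std(axis=0)                 # step 3
cov = (Z.T @ Z) / (len(Z) - 1)                           # step 4
eigvals, eigvecs = np.linalg.eigh(cov)                   # step 5

# Sort eigenvalues from largest to smallest; reorder eigenvectors to match
order = np.argsort(eigvals)[::-1]
eigvals_sorted = eigvals[order]
P_star = eigvecs[:, order]  # columns are eigenvectors, most important first
```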
7. Calculating the new features or Principal Components
Here we calculate the new features. To do this, we multiply the matrix Z by P*. In the resulting matrix Z*, each new feature (column) is a linear combination of the original features; these columns are the principal components, and they are uncorrelated with each other.
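A sketch of the projection step; printing the covariance matrix of Z* shows it is diagonal, i.e., the new columns are uncorrelated:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3)) @ rng.normal(size=(3, 3))  # stand-in dataset
Z = (X - X.mean(axis=0)) / X.std(axis=0)                 # step 3
cov = (Z.T @ Z) / (len(Z) - 1)                           # step 4
eigvals, eigvecs = np.linalg.eigh(cov)                   # step 5
order = np.argsort(eigvals)[::-1]                        # step 6
P_star = eigvecs[:, order]

# Project the standardized data onto the sorted eigenvectors
Z_star = Z @ P_star  # columns of Z_star are the principal components

# Covariance matrix of Z* is diagonal: the new columns are uncorrelated
print(np.cov(Z_star, rowvar=False).round(6))
```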
8. Removing less important features from the new dataset
The new feature set is now available, so we decide what to keep and what to remove: we keep only the relevant or important features in the new dataset, and the unimportant features are removed.
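Putting all the steps together, a complete end-to-end sketch might look like this (random data stands in for a real dataset, and keeping k = 2 components is an arbitrary illustrative choice):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3)) @ rng.normal(size=(3, 3))  # stand-in dataset

Z = (X - X.mean(axis=0)) / X.std(axis=0)                 # step 3: standardize
cov = (Z.T @ Z) / (len(Z) - 1)                           # step 4: covariance
eigvals, eigvecs = np.linalg.eigh(cov)                   # step 5: eigenpairs
order = np.argsort(eigvals)[::-1]                        # step 6: sort
eigvals, P_star = eigvals[order], eigvecs[:, order]
Z_star = Z @ P_star                                      # step 7: project

# Step 8: keep only the top-k components (k = 2 here, for illustration)
k = 2
Z_reduced = Z_star[:, :k]
print(eigvals / eigvals.sum())  # fraction of variance explained by each PC
print(Z_reduced.shape)          # (100, 2): dimensionality reduced from 3 to 2
```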
Applications of Principal Component Analysis
o PCA is mainly used as a dimensionality reduction technique in various AI applications such as computer vision and image compression.
o It can also be used to find hidden patterns in high-dimensional data. Some fields where PCA is used are finance, data mining, and psychology.
