Principal Component Analysis
Key Ideas:
1. Dimensionality Reduction:
o Think of PCA like organizing a messy room. You have too many things (data points),
and it’s hard to find what’s important.
o PCA helps by "cleaning up" and showing you the few key items (important patterns)
that matter the most.
2. Principal Components:
o These are the new, simpler pieces of information PCA gives you.
o The first principal component shows the most important pattern or trend in your
data.
o The second one shows the next most important pattern, and so on.
3. Why Use PCA:
o Simplifying Data: If you have too many features or measurements, PCA helps by
summarizing them into just a few that still capture the essence of your data.
o Making Patterns Clearer: It helps reveal patterns in the data that might not be
obvious at first glance.
4. How It Works:
o Standardize Your Data: First, you make sure all your measurements are on the same
scale.
o Find Patterns: PCA finds the main directions (patterns) where your data varies the
most.
o Create New Variables: These main directions become new variables, called principal
components, that you can use instead of your original data.
5. When to Use PCA:
o Too Much Data: When you have too many features and want to focus on the most
important ones.
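The "How It Works" steps above can be sketched with NumPy (a minimal illustration; the toy data and variable names are mine, not from the text):

```python
import numpy as np

# Toy data: 5 samples, 3 features (hypothetical measurements)
X = np.array([
    [2.5, 2.4, 0.5],
    [0.5, 0.7, 1.9],
    [2.2, 2.9, 0.4],
    [1.9, 2.2, 0.8],
    [3.1, 3.0, 0.2],
])

# Step 1: standardize (zero mean, unit variance per feature)
X_std = (X - X.mean(axis=0)) / X.std(axis=0)

# Step 2: find the main directions of variation via the
# eigendecomposition of the covariance matrix
cov = np.cov(X_std, rowvar=False)
eigenvalues, eigenvectors = np.linalg.eigh(cov)

# eigh returns eigenvalues in ascending order; sort descending so the
# first principal component explains the most variance
order = np.argsort(eigenvalues)[::-1]
eigenvalues = eigenvalues[order]
eigenvectors = eigenvectors[:, order]

# Step 3: create new variables by projecting onto the top k directions
k = 2
principal_components = X_std @ eigenvectors[:, :k]
print(principal_components.shape)  # (5, 2)
```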
Simple Example:
Imagine you’re looking at a lot of different types of fruit. Each fruit has different
measurements: size, weight, color, etc. PCA would help you figure out which measurements
are the most important to identify the type of fruit, like "big and heavy" might be a key
pattern. It then lets you focus on just those key patterns, making it easier to categorize or
visualize your fruit.
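The fruit example can be sketched with scikit-learn (the measurements below are made up for illustration):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Hypothetical fruit measurements: size (cm), weight (g), color score
fruit = np.array([
    [7.0, 150.0, 0.80],    # apple-like
    [7.5, 160.0, 0.70],
    [12.0, 1200.0, 0.30],  # melon-like
    [11.5, 1100.0, 0.40],
    [2.0, 5.0, 0.90],      # grape-like
    [2.2, 6.0, 0.85],
])

# Standardize first so weight (large range) does not dominate
scaled = StandardScaler().fit_transform(fruit)

pca = PCA(n_components=2)
reduced = pca.fit_transform(scaled)

# Each fruit is now described by two "key patterns" instead of three
# measurements; PC1 might capture something like "big and heavy"
print(reduced.shape)                  # (6, 2)
print(pca.explained_variance_ratio_)  # share of variance per component
```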
PCA is like taking a big, complicated puzzle and finding the main pieces that give you a clear
picture of what’s going on.
More specifically, standardization is critical prior to PCA because PCA is quite sensitive to the
variances of the initial variables. If there are large differences between the ranges of the initial
variables, the variables with larger ranges will dominate over those with smaller ranges (for
example, a variable that ranges between 0 and 100 will dominate over a variable that ranges
between 0 and 1), which will lead to biased results. Transforming the data to comparable scales
prevents this problem.
Mathematically, this can be done by subtracting the mean and dividing by the standard deviation for
each value of each variable.
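A quick illustration of this scaling step (the values are arbitrary):

```python
import numpy as np

# Two variables with very different ranges: one in [0, 100], one in [0, 1]
data = np.array([
    [90.0, 0.2],
    [10.0, 0.9],
    [50.0, 0.5],
    [70.0, 0.1],
])

print(data.var(axis=0))  # the first column's variance dwarfs the second's

# z = (value - mean) / standard deviation, applied per variable
z = (data - data.mean(axis=0)) / data.std(axis=0)

# After standardization each variable has mean 0 and variance 1,
# so neither dominates the covariance matrix
print(z.mean(axis=0))  # approximately zero for both columns
print(z.var(axis=0))   # both equal 1 (up to floating-point error)
```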
Let’s suppose that our data set is 2-dimensional with two variables, x and y, and that
the eigenvectors and eigenvalues of the covariance matrix are as follows: