Principal Component Analysis
Reduce Dimensionality: When dealing with datasets that have many features, reducing
the number of dimensions simplifies the dataset without losing significant information.
Remove Multicollinearity: PCA can remove multicollinearity (high correlation between
features) by transforming correlated variables into uncorrelated principal components.
Data Visualization: PCA helps visualize high-dimensional data in 2D or 3D space,
making it easier to interpret patterns and relationships.
Speed Up Algorithms: By reducing the number of features, PCA can improve the speed
and performance of machine learning algorithms.
PCA finds new axes (principal components) in the data space such that:
1. The first principal component accounts for the maximum variance in the data.
2. The second principal component is orthogonal (uncorrelated) to the first and accounts for
the maximum remaining variance, and so on.
Steps in PCA:
1. Standardization: Since PCA is sensitive to the scales of the features, the data is first standardized so that each feature has a mean of 0 and a variance of 1.
2. Covariance Matrix Calculation: The covariance matrix shows how the features vary with respect to each other. For the standardized data matrix X with n samples, it is calculated as C = (1/(n-1)) XᵀX, where entry C_ij is the covariance between features i and j.
3. Eigenvectors and Eigenvalues Calculation: The eigenvectors represent the directions
(principal components) in which the data varies the most. The eigenvalues tell you how
much variance is explained by each principal component.
4. Sorting Eigenvectors by Eigenvalues: The eigenvectors are sorted by their
corresponding eigenvalues in descending order. The top eigenvectors form the new axes.
5. Projecting Data onto Principal Components: The original dataset is transformed by
projecting it onto the selected principal components to obtain the reduced-dimensional
dataset.
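These five steps can be sketched directly in NumPy. The code below is a minimal illustration via eigendecomposition of the covariance matrix; the function name pca_from_scratch and the small demo matrix are made up for this sketch, and the scikit-learn implementation used later in this section wraps the same steps.
# A from-scratch sketch of steps 1-5 using NumPy
import numpy as np

def pca_from_scratch(X, n_components):
    # Step 1: center each feature (scaling to unit variance is omitted for brevity)
    X_centered = X - X.mean(axis=0)
    # Step 2: covariance matrix of the centered data
    cov = np.cov(X_centered, rowvar=False)
    # Step 3: eigenvalues and eigenvectors (eigh is suited to symmetric matrices)
    eigenvalues, eigenvectors = np.linalg.eigh(cov)
    # Step 4: sort eigenvectors by eigenvalue, largest first
    order = np.argsort(eigenvalues)[::-1]
    eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]
    # Step 5: project the centered data onto the top components
    return X_centered @ eigenvectors[:, :n_components], eigenvalues

# Hypothetical 5 x 3 data matrix, purely to exercise the function
X_demo = np.array([[2.5, 2.4, 0.5],
                   [0.5, 0.7, 1.2],
                   [2.2, 2.9, 0.3],
                   [1.9, 2.2, 0.8],
                   [3.1, 3.0, 0.1]])
X_reduced, eigvals = pca_from_scratch(X_demo, n_components=2)
print(X_reduced.shape)   # (5, 2)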
Mathematically, each principal component has an associated explained variance, given by its eigenvalue. This represents the amount of the total dataset's variance that is captured by that component. The explained variance ratio of the i-th component is λi / (λ1 + λ2 + ... + λp), where p is the number of features, and it is used to decide how many components to retain.
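Continuing the NumPy sketch above, the explained variance ratios fall straight out of the eigenvalues it returns:
# Explained variance ratio of each component (eigvals comes from the sketch above)
ratios = eigvals / eigvals.sum()
print(ratios)   # the i-th entry is λi divided by the sum of all eigenvalues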
Let's implement PCA using Python on a standard dataset, such as the Iris dataset.
1. Import the Required Libraries We start by importing NumPy, pandas, Matplotlib, and the scikit-learn modules used below.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
2. Load the Dataset We’ll use the famous Iris dataset, which contains 150 samples with 4
features: sepal length, sepal width, petal length, and petal width.
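A minimal sketch of this step (continuing from the imports above), producing the X and y arrays used in the later steps:
# Load the Iris dataset into a feature matrix and a target vector
iris = load_iris()
X = iris.data        # shape (150, 4)
y = iris.target      # class labels: 0, 1, 2
print(iris.feature_names)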
3. Standardize the Data Since PCA is sensitive to the scales of the features, we standardize
the dataset to have a mean of 0 and a variance of 1.
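A sketch of the standardization step, assuming the X array from step 2 and producing the X_scaled array used by the PCA code in step 4:
# Standardize features to zero mean and unit variance
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)
print(X_scaled.mean(axis=0).round(6))   # approximately 0 for every feature
print(X_scaled.std(axis=0).round(6))    # approximately 1 for every feature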
4. Apply PCA We will apply PCA to reduce the dataset to 2 principal components for
visualization.
# Applying PCA
pca = PCA(n_components=2)
X_pca = pca.fit_transform(X_scaled)
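5. Visualize the Results We plot the data in the space of the first two principal components, coloring each point by its class. A minimal sketch, assuming the X_pca, y, and iris variables from the earlier steps:
# Scatter plot of the data projected onto the first two principal components
plt.figure(figsize=(6, 5))
for label, name in enumerate(iris.target_names):
    mask = (y == label)
    plt.scatter(X_pca[mask, 0], X_pca[mask, 1], label=name)
plt.xlabel("First principal component")
plt.ylabel("Second principal component")
plt.legend()
plt.title("Iris data projected onto the first two principal components")
plt.show()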
6. Determine the Explained Variance It’s important to check how much variance is
captured by the principal components.
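A sketch of this check, using the pca object fitted in step 4 (for the standardized Iris data, the first two components together capture roughly 95-96% of the total variance):
# Fraction of the total variance captured by each retained component
print(pca.explained_variance_ratio_)
print("Total:", pca.explained_variance_ratio_.sum())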
Explained Variance Ratio: This shows how much variance each of the principal
components captures. For example, if the first two components explain 95% of the
variance, they are sufficient to represent the data.
PCA Plot: The scatter plot shows the data points projected onto the first two principal
components. If PCA successfully separates the classes, different class clusters will be
visible in the plot.
Advantages:
Dimensionality Reduction: Reduces the number of features while retaining most of the
information.
Improved Performance: Reduces computation time and improves algorithm
performance.
Reduces Multicollinearity: By creating new uncorrelated principal components, PCA
removes multicollinearity issues.
Limitations:
Loss of Interpretability: The principal components are linear combinations of the original features, so they are harder to interpret than the original variables.
Assumption of Linearity: PCA only captures linear relationships between features; non-linear structure in the data may be lost.
Sensitivity to Scaling: Because PCA is variance-based, features must be standardized first, or features with large scales will dominate the components.
Revision
PCA is a powerful tool for dimensionality reduction, especially when dealing with high-
dimensional data. By transforming the data into principal components, PCA helps simplify
models, reduce noise, and improve computation times, all while preserving the most important
features of the dataset. However, it is important to carefully consider its limitations, especially
regarding interpretability and the assumption of linearity.
Let's go step by step through a simple numerical example to illustrate the mathematical concepts.
Step-by-Step Example:
PCA works best when the data is standardized, meaning each feature is centered around zero: the mean of each feature is calculated and subtracted from every observation of that feature.
To find the principal components, we calculate the eigenvalues and eigenvectors of the
covariance matrix. Eigenvectors give us the directions (principal components), and eigenvalues
give us the magnitude (importance) of these components.
The larger eigenvalue (λ1=2.8) corresponds to the first principal component, which captures the most variance: 2.8 / (2.8 + 0.025) ≈ 0.99, or about 99% of the total. The smaller eigenvalue (λ2=0.025) corresponds to the second principal component.
Next, we compute the eigenvectors by solving (C - λI)v = 0 for each eigenvalue. These unit-length vectors indicate the directions of the principal components.
Now, project the original data onto the principal components. To do this, multiply the centered data matrix X_centered by the matrix of eigenvectors: the dot product of each row of the centered data with the first eigenvector gives that observation's coordinate along the first principal component, and the dot product with the second eigenvector gives its coordinate along the second. This yields the new representation of the data in the principal component space.
Step 6: Final Results (Projected Data)
The final projected data lives in the space defined by the two principal components: each observation is now described by its coordinates along PC1 and PC2.
We can now reduce the dimensionality by keeping only the principal component(s) with the
largest eigenvalue(s). In this case, we might choose to keep just the first principal component
(corresponding to λ1) to reduce the 2D dataset to 1D while still capturing most of the variance.
In this example, PCA reduced the original 2D dataset to a 1D dataset by projecting the data onto the first principal component. The key steps involved centering the data, computing the covariance matrix, finding its eigenvalues and eigenvectors, sorting them by eigenvalue, and projecting the centered data onto the leading eigenvector. By keeping only the most important principal components, we can reduce the dimensionality of the dataset while preserving most of its variance.
Note that the projection onto two principal components still results in two dimensions. The idea of dimensionality reduction with PCA is that you choose how many dimensions (principal components) to keep based on the amount of variance each component explains.
In this example, we kept both principal components (PC1 and PC2), which keeps the data in 2D
space. If we want to reduce the dimensionality from 2D to 1D, we can choose to keep only the
first principal component (PC1), which explains the majority of the variance in the data.
The eigenvalues (λ1=2.8 and λ2=0.025) represent the amount of variance explained by each principal component. The larger the eigenvalue, the more variance that principal component captures.
The first principal component (PC1) explains much more variance (2.8) than the
second one (0.025), meaning it captures most of the important information in the data.
Reducing to 1D:
To reduce the data from 2D to 1D, we only keep PC1 (the first principal component) and ignore
PC2.
The result is a 1D representation of the original 2D data, and it preserves most of the variance. By
using only the first principal component, we have successfully reduced the data from 2D to 1D.
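As a minimal sketch of these mechanics, the code below runs the same 2D-to-1D pipeline in NumPy on a small hypothetical dataset; the numbers (and therefore the printed eigenvalues) are illustrative only and will not match λ1=2.8 and λ2=0.025 from the example above.
import numpy as np

# Hypothetical 2D dataset: 6 observations of 2 correlated features
X = np.array([[2.5, 2.4],
              [0.5, 0.7],
              [2.2, 2.9],
              [1.9, 2.2],
              [3.1, 3.0],
              [2.3, 2.7]])

# Center the data
X_centered = X - X.mean(axis=0)

# Covariance matrix and its eigendecomposition
cov = np.cov(X_centered, rowvar=False)
eigenvalues, eigenvectors = np.linalg.eigh(cov)

# Sort by eigenvalue, descending, so that column 0 is PC1
order = np.argsort(eigenvalues)[::-1]
eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]
print("Eigenvalues:", eigenvalues)

# Keep only PC1: project the 2D data onto a single axis (2D -> 1D)
pc1 = eigenvectors[:, 0]
X_1d = X_centered @ pc1   # shape (6,)
print("1D projection:", X_1d)
Keeping both columns of eigenvectors instead of only pc1 would leave the data in 2D, exactly as discussed above.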
Key Takeaway:
In PCA, you reduce the dimensionality by choosing how many principal components to
keep. In this case, keeping just PC1 reduces the 2D data to 1D while still retaining most
of the variance in the dataset.
If you keep both principal components, you stay in 2D. If you discard the second
principal component, you reduce the data to 1D.
This 1D projection captures most of the information, and you have effectively reduced the
dimensionality of the dataset.