0% found this document useful (0 votes)
3 views

Lecture # 9 Principle Component Analysis

Principal Component Analysis (PCA) is a statistical method for reducing dimensionality while maintaining variance by transforming correlated variables into uncorrelated principal components. It is useful for simplifying models, visualizing complex data, reducing noise, extracting features, and analyzing correlations. PCA is particularly beneficial when dealing with high-dimensional datasets.

Uploaded by

Ezza Mehmood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Lecture # 9 Principle Component Analysis

Principal Component Analysis (PCA) is a statistical method for reducing dimensionality while maintaining variance by transforming correlated variables into uncorrelated principal components. It is useful for simplifying models, visualizing complex data, reducing noise, extracting features, and analyzing correlations. PCA is particularly beneficial when dealing with high-dimensional datasets.

Uploaded by

Ezza Mehmood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

PRINCIPLE COMPONENT

ANALYSIS
Introduction to Data Science
WHAT IS PRINCIPLE COMPONENT
ANALYSIS
Principal Component Analysis (PCA) is a
statistical technique used for dimensionality
reduction while preserving as much variance
as possible. It transforms a set of possibly
correlated variables into a smaller number of
uncorrelated variables called principal
components. The first principal component
accounts for the largest possible variance in
the data set, and each succeeding component,
in turn, has the highest variance possible
under the constraint that it is orthogonal to the
preceding components.
WHEN TO USE PCA?
PCA is typically used in situations where:
 Reducing Dimensionality: When the number of input

variables for a dataset is too high, PCA can reduce the


number of features while retaining the essential parts,
thus simplifying the model without significant loss of
information.
 Visualization: It helps in visualizing complex data by

reducing dimensions to 2D or 3D.


 Noise Reduction: It can help smooth data or remove

noise by focusing on the principal components with


higher variance.
 Feature Extraction and Engineering: PCA can also be

used to discover important relationships in the data or to


develop new features for machine learning models.
 Correlation Analysis: It helps in understanding the

interrelationships among different features of the data.


EXAMPLES

You might also like