RES805-RM-Module 2
SCHOOL OF ENGINEERING
MODULE - 2
Principal Component and Factor Analysis
PRESENTED BY,
ANURAJ N. V
20233MAT0011
Overview
• Introduction
• Latent variable
• Assumptions of Factor Analysis
• Purpose of Factor Analysis
• Types of Factor Analysis
• Principal Component Analysis (PCA)
Introduction
• Factor Analysis is a technique used to reduce a large number of variables
to a smaller number of factors. It is a way to condense the information in many
variables into just a few variables
• Factor Analysis is also called Dimension Reduction
• It is an example of a latent variable model
(Diagram: observed variables such as FOOD, SERVICE, and QUALITY loading on a single underlying FACTOR.)
Latent Variables
• Latent Variables are variables that are not directly observed but are inferred
from other variables.
• Mathematical models that aim to explain observed variables in terms of
latent variables are called latent variable models.
Assumptions of Factor Analysis
• There are no outliers in the data
• The sample size is greater than the number of factors
• The variables must be interrelated
• Metric (interval or ratio) variables are expected
• Multivariate normality is not required
Purpose of Factor Analysis
• Data reduction
• Latent variable discovery
• Simplification of items into subsets of concepts
• Assessing dimensionality
Types of Factor Analysis
• Exploratory Factor Analysis (EFA): Used to discover the underlying structure
• Principal component analysis
• Common factor analysis
• Image factoring
• Maximum likelihood analysis
• Alpha factoring and weighted least squares
• Confirmatory Factor Analysis (CFA): Used to test whether the data fit an
a priori expectation about the data's structure. Uses structural equation modeling
Principal Component Analysis
• PCA is a dimensionality reduction (data reduction) technique
• PCA is used in exploratory data analysis and for building predictive models
How to reduce the dimensions?
• Each data point is projected onto only the first few principal
components to obtain lower-dimensional data while preserving as much of the
data's variation as possible.
• The principal components are the eigenvectors of the data's covariance matrix
Steps for dimensionality reduction using PCA
• Step 1: Standardize the data
• Step 2: Compute the covariance matrix
• Step 3: Calculate the eigenvectors and eigenvalues
• Step 4: Sort the eigenvalues in descending order and compute the principal
components
• Step 5: Reduce the dimensions of the data
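The five steps above can be sketched end to end in NumPy. This is a minimal illustration, not the slides' own code; the data matrix X is made up, with the second feature on a deliberately larger scale.

```python
import numpy as np

# Illustrative data: rows = samples, columns = features (values are made up)
rng = np.random.default_rng(0)
X = rng.normal(size=(10, 2)) * [1.0, 50.0]   # second feature on a much larger scale

# Step 1: standardize each feature (z-score)
Z = (X - X.mean(axis=0)) / X.std(axis=0)

# Step 2: covariance matrix of the standardized data
C = np.cov(Z, rowvar=False)

# Step 3: eigenvalues and eigenvectors of the covariance matrix
eigvals, eigvecs = np.linalg.eigh(C)

# Step 4: sort eigenvalues (and their eigenvectors) in descending order
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Step 5: project onto the first k principal components to reduce dimensions
k = 1
X_reduced = Z @ eigvecs[:, :k]
print(X_reduced.shape)   # (10, 1)
```

Each later slide elaborates one of these steps in turn.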
1. Data Standardization
• Data standardization is the process of converting data to a common scale so
that the data can be processed and analyzed properly.
• Example: consider 1000 samples of people's height (in metres, e.g. 0.5–2) and
weight (in pounds).
• Because weight values are numerically much larger, the weight feature dominates
the height feature.
How to Standardize the Data ?
• Data standardization is done by calculating a z-score (standard score),
given by

𝑍 = (𝑋 − 𝑋̄) / 𝜎

where 𝑋̄ is the mean and 𝜎 the standard deviation of the feature.

(Scatter plot: the raw data, spread over roughly 0 to 3.5 on both axes.)
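The z-score formula can be applied per feature as below. The height/weight sample values are made up for illustration; only the formula comes from the slide.

```python
import numpy as np

# Illustrative height/weight samples (values are made up, not from the slides)
height = np.array([0.5, 1.0, 1.5, 2.0, 2.5])         # metres
weight = np.array([2.0, 60.0, 120.0, 180.0, 240.0])  # pounds

def zscore(x):
    # Z = (X - mean) / standard deviation
    return (x - x.mean()) / x.std()

h_std, w_std = zscore(height), zscore(weight)
# After standardization both features have mean ~0 and standard deviation ~1,
# so the larger-scaled weight feature no longer dominates.
```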
How to Standardize the Data ?
(Scatter plot: after applying 𝑍 = (𝑋 − 𝑋̄) / 𝜎, the data are centered at zero,
ranging roughly from −2 to 2 on both axes.)
2. Covariance Matrix
• Covariance is always measured between two variables or features:

𝑐𝑜𝑣(𝑋₁, 𝑋₂) = Σᵢ₌₁ⁿ (𝑋₁ᵢ − 𝑋̄₁)(𝑋₂ᵢ − 𝑋̄₂) / (𝑛 − 1)

• It measures only the direction of the relationship between the two variables,
not the strength of the relationship.
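The sample-covariance formula can be checked directly against NumPy, which uses the same n − 1 denominator by default. The data values here are made up for illustration.

```python
import numpy as np

# Two illustrative features (values are made up, not the slides' 10 samples)
x1 = np.array([0.5, 1.0, 1.5, 2.0, 2.5])
x2 = np.array([0.7, 1.1, 1.6, 2.2, 2.4])

# cov(X1, X2) = sum((X1 - mean1) * (X2 - mean2)) / (n - 1)
n = len(x1)
cov_manual = np.sum((x1 - x1.mean()) * (x2 - x2.mean())) / (n - 1)

# np.cov returns the full 2x2 covariance matrix; [0, 1] is cov(X1, X2)
cov_numpy = np.cov(x1, x2)[0, 1]
print(np.isclose(cov_manual, cov_numpy))  # True
```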
Example
• Given 10 samples of two features 𝑋₁ and 𝑋₂.
(Scatter plot of the 10 samples, spread over roughly 0 to 3.5 on both axes.)
Example
• Given 10 samples of two features 𝑋₁ and 𝑋₂:

𝐶𝑜𝑣(𝑋₁, 𝑋₂) = Σᵢ₌₁ⁿ (𝑋₁ᵢ − 𝑋̄₁)(𝑋₂ᵢ − 𝑋̄₂) / (𝑛 − 1)

(Worked table of deviation products; e.g. the first sample has 𝑋₁ = 0.5, 𝑋₂ = 0.7,
deviations −1.31 and −1.21, and product 1.5851.)
Example
• For two features, the covariance matrix is:

C = [ 𝐶𝑜𝑣(𝑋₁, 𝑋₁)   𝐶𝑜𝑣(𝑋₁, 𝑋₂)
      𝐶𝑜𝑣(𝑋₂, 𝑋₁)   𝐶𝑜𝑣(𝑋₂, 𝑋₂) ]
Example
• Given 10 samples of two features 𝑋₁ and 𝑋₂:

𝐶𝑜𝑣(𝑋₁, 𝑋₁) = Σᵢ₌₁ⁿ (𝑋₁ᵢ − 𝑋̄₁)² / (𝑛 − 1) = 0.616556

𝐶𝑜𝑣(𝑋₂, 𝑋₂) = Σᵢ₌₁ⁿ (𝑋₂ᵢ − 𝑋̄₂)² / (𝑛 − 1) = 0.7165556
3. Eigenvalues and Eigenvectors
• Eigenvalues and eigenvectors are linear algebra concepts that are
required to determine the principal components from the
covariance matrix.
• Eigenvectors help find the new directions along which the variance is
maximum.
• Eigenvectors are the vectors that keep the same direction when
multiplied by a matrix.
• Eigenvalues are the scalars by which the respective eigenvectors are scaled.
Calculate the Eigenvalues
• From the covariance matrix C, first find the eigenvalues 𝜆 by solving the
characteristic equation det(C − 𝜆I) = 0.
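For a 2×2 matrix the characteristic equation is the quadratic 𝜆² − tr(C)𝜆 + det(C) = 0, which can be checked against NumPy's eigendecomposition. The diagonal entries below are the example's variances (0.616556 and 0.7165556); the off-diagonal covariance is an assumed illustrative value, since the slides do not state it.

```python
import numpy as np

# Diagonals from the slides' example; the off-diagonal 0.61 is an assumption
C = np.array([[0.616556, 0.61],
              [0.61,     0.7165556]])

# Characteristic equation of a 2x2 matrix: lambda^2 - tr(C)*lambda + det(C) = 0
tr, det = np.trace(C), np.linalg.det(C)
roots = np.roots([1.0, -tr, det])

# NumPy's symmetric eigensolver gives the same eigenvalues
eigvals = np.linalg.eigvalsh(C)
print(np.allclose(sorted(roots), sorted(eigvals)))  # True
```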
Calculate the Eigenvectors
• Substitute each eigenvalue 𝜆 into (C − 𝜆I)v = 0 and solve for the components
of the corresponding eigenvector v.
Obtain Principal Components
• After sorting the eigenvalues in descending order, the feature vector is
formed from the corresponding eigenvectors.
Principal Components
• The feature vector is the matrix whose columns are the eigenvectors, each
accounting for a percentage of the total variance.
• The first principal component (PC1) is the first column, corresponding to the
highest eigenvalue.
Transforming and reducing the dimensions of the original data set
• The standardized data are multiplied by the feature vector to obtain the
transformed (reduced) data: Transformed Data = Standardized Data × Feature Vector.
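The projection step can be sketched as a single matrix product. The data here are made up; the feature vector W is obtained as in the earlier steps.

```python
import numpy as np

# Illustrative standardized data (values are made up)
rng = np.random.default_rng(1)
Z = rng.normal(size=(10, 2))

# Feature vector W: eigenvector columns sorted by eigenvalue, descending
eigvals, W = np.linalg.eigh(np.cov(Z, rowvar=False))
W = W[:, np.argsort(eigvals)[::-1]]

# Keep only PC1: project each sample onto the first eigenvector column
transformed = Z @ W[:, :1]
print(transformed.shape)  # (10, 1)
```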
Reconstructing the original data set
• The transformed data are multiplied by the transpose of the feature vector to
reconstruct the standardized data, and the standardization is then reversed to
obtain the reconstructed X.
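A minimal sketch of the reconstruction, continuing the same made-up data; with k smaller than the number of features the reconstruction is only approximate.

```python
import numpy as np

# Illustrative original data (values are made up)
rng = np.random.default_rng(2)
X = rng.normal(size=(10, 2))
mu, sigma = X.mean(axis=0), X.std(axis=0)
Z = (X - mu) / sigma

# Feature vector W, sorted by eigenvalue as before
eigvals, W = np.linalg.eigh(np.cov(Z, rowvar=False))
W = W[:, np.argsort(eigvals)[::-1]]

k = 1
T = Z @ W[:, :k]          # transformed (reduced) data
Z_hat = T @ W[:, :k].T    # reconstructed standardized data
X_hat = Z_hat * sigma + mu  # undo standardization -> reconstructed X
print(X_hat.shape)  # (10, 2)
```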