Multivariate Analysis Notes
Contents

1 Introduction
  1.1 Topics
  1.2 Books to follow
2 Principal Components
  2.1 Variability
  2.2 Construction of Y's
    2.2.1 Lemma
    2.2.2 Proof
  2.3 Components
  2.4 Note
  2.5 A special case
  2.6 How to choose a 'k'?
Lecture 1, Aug 1
1 Introduction
1.1 Topics
• Traditional Topics
  – Multivariate Analysis
  – Multivariate Normal
  – MANOVA
• Non-traditional Topics
1.2 Books to follow
7. S.F. Arnold: Linear Statistical Inference and Multivariate Analysis
2 Principal Components
Suppose we have a random vector $\underset{\sim}{X}_{p \times 1}$ with dispersion matrix $\Sigma_{p \times p}$ (real, symmetric, p.d.). We want to base future analysis on $k \, (\ll p)$ variables.
2.1 Variability
Total variability of a dataset is defined as
\[
\text{Total variability} := \sum_{i=1}^{p} \sigma_{ii} = \sum_{i=1}^{p} \sigma_i^2,
\]
where $\sigma_{ii}$ is the $i$-th diagonal entry of $\Sigma$.
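For instance (numbers invented for illustration, not from the lecture): if
\[
\Sigma = \begin{pmatrix} 4 & 2 \\ 2 & 3 \end{pmatrix},
\qquad \text{then} \qquad
\text{Total variability} = \sigma_{11} + \sigma_{22} = 4 + 3 = 7.
\]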
We would like to reduce the dimension while retaining as much variability as possible.
2.2 Construction of Y's
Write $\underset{\sim}{X} := (X_1, X_2, \ldots, X_p)^T$. We would like to replace this by $Y_1, Y_2, \ldots, Y_k$, $k \ll p$, without losing out much on variability. Each $Y_i$ is taken to be a linear combination of the $X$'s. For the first one, set $Y_1 = l_1^T \underset{\sim}{X}$, so that $\operatorname{Var}(Y_1) = l_1^T \Sigma l_1$; since this can be inflated arbitrarily by rescaling $l_1$, we maximize the normalized ratio
\[
\max_{l_1 \neq 0} \frac{l_1^T \Sigma l_1}{l_1^T l_1}.
\]
2.2.1 Lemma
The ratio above is maximized when $l_1$ is the eigenvector of $\Sigma$ corresponding to the largest eigenvalue.
2.2.2 Proof
Let $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_p \ge 0$ be the eigenvalues of $\Sigma$, with corresponding eigenvectors $e_1, e_2, \ldots, e_p$.
Write the spectral decomposition $\Sigma = P \Lambda P^T$, where $P = (e_1 \; e_2 \; \cdots \; e_p)$ is orthogonal and $\Lambda = \operatorname{diag}(\lambda_1, \ldots, \lambda_p)$. Setting $y := P^T l_1$,
\begin{align*}
\frac{l_1^T \Sigma l_1}{l_1^T l_1}
&= \frac{l_1^T P \Lambda P^T l_1}{l_1^T l_1} \\
&= \frac{(P^T l_1)^T \Lambda (P^T l_1)}{(P^T l_1)^T (P^T l_1)} \qquad [\because P \text{ is orthogonal, so } l_1^T l_1 = (P^T l_1)^T (P^T l_1)] \\
&= \frac{y^T \Lambda y}{y^T y} = \frac{\sum_i \lambda_i y_i^2}{\sum_i y_i^2} \le \lambda_1 \quad (\text{and } \ge \lambda_p).
\end{align*}
The upper bound is attained at $y = (1, 0, \ldots, 0)^T$, i.e. at $l_1 = Py = e_1$, which proves the lemma.
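As a quick numerical sanity check of the lemma, here is a minimal sketch in NumPy; the matrix Sigma below is made up for illustration:

    import numpy as np

    # An arbitrary symmetric positive definite matrix standing in for Sigma
    Sigma = np.array([[4.0, 2.0, 0.5],
                      [2.0, 3.0, 1.0],
                      [0.5, 1.0, 2.0]])

    def rayleigh(l, S):
        # The ratio l^T S l / l^T l from the lemma
        return (l @ S @ l) / (l @ l)

    eigvals, eigvecs = np.linalg.eigh(Sigma)  # eigh: eigenvalues in ascending order
    e1 = eigvecs[:, -1]                       # eigenvector of the largest eigenvalue

    print(rayleigh(e1, Sigma), eigvals[-1])   # the two numbers agree
    for _ in range(1000):                     # no random direction does better
        l = np.random.randn(3)
        assert rayleigh(l, Sigma) <= eigvals[-1] + 1e-12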
2.3 Components
$Y_1 = e_1^T \underset{\sim}{X}$ is the 1st principal component, with $\operatorname{Var}(Y_1) = \lambda_1$.
Principal components are defined to be uncorrelated.
For the second component, take $Y_2 = l_2^T \underset{\sim}{X}$. So we would need to find
\[
\max_{l_2 \neq 0} \frac{l_2^T \Sigma l_2}{l_2^T l_2}, \quad \text{subject to } \operatorname{Cov}(Y_1, Y_2) = 0.
\]
Since $\operatorname{Cov}(Y_1, Y_2) = e_1^T \Sigma l_2 = \lambda_1 e_1^T l_2$, the constraint says $l_2 \perp e_1$, and the maximizer is $l_2 = e_2$.
In general, $Y_j = e_j^T \underset{\sim}{X}$, with $\operatorname{Var}(Y_j) = \lambda_j$, for $1 \le j \le p$.
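The whole construction in code, as a sketch (the population $\Sigma$ is the same made-up matrix as before; the data are simulated, whereas in practice one would use a sample covariance matrix):

    import numpy as np

    rng = np.random.default_rng(0)
    Sigma = np.array([[4.0, 2.0, 0.5],
                      [2.0, 3.0, 1.0],
                      [0.5, 1.0, 2.0]])

    # Simulated n x p data with dispersion matrix Sigma
    X = rng.multivariate_normal(np.zeros(3), Sigma, size=100_000)

    eigvals, P = np.linalg.eigh(Sigma)       # ascending order
    eigvals, P = eigvals[::-1], P[:, ::-1]   # reorder so lambda_1 >= ... >= lambda_p

    Y = X @ P             # j-th column holds Y_j = e_j^T X, row by row
    print(np.cov(Y.T))    # approx. diag(lambda_1, ..., lambda_p):
                          # uncorrelated components with Var(Y_j) = lambda_j
    print(eigvals)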
2.4 Note
\[
\sum_{i=1}^{p} \operatorname{Var}(X_i) = \sum_{i=1}^{p} \sigma_{ii} = \operatorname{tr}(\Sigma) = \sum_{i=1}^{p} \lambda_i = \sum_{i=1}^{p} \operatorname{Var}(Y_i),
\]
so the total variability is preserved in passing to the principal components.
If the variables are uncorrelated, i.e. $\Sigma$ is diagonal, then $Y_1, Y_2, \ldots, Y_p$ will just be a permutation of $X_1, X_2, \ldots, X_p$ in decreasing order of variance.
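Continuing the invented $2 \times 2$ example from above:
\[
\Sigma = \begin{pmatrix} 4 & 2 \\ 2 & 3 \end{pmatrix}:
\quad \lambda^2 - 7\lambda + 8 = 0
\;\Rightarrow\; \lambda_{1,2} = \frac{7 \pm \sqrt{17}}{2},
\quad \lambda_1 + \lambda_2 = 7 = \sigma_{11} + \sigma_{22}.
\]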
2.5 A special case
Suppose we have bivariate data $(X_1, X_2)^T$. We will try to get its principal components. We know the components will be uncorrelated, so once we get the 1st principal component, the other one will be perpendicular to it.
Suppose the data cloud looks like this:
[Figure: scatter of bivariate data on the $X_1$–$X_2$ axes, with rotated axes $Y_1$ (along the direction of greatest spread) and $Y_2$ (perpendicular to it).]
Then $Y_1$ will be the 1st principal component, as most of the variation lies along that axis. So the components are just a rotation of the rectangular axes. This idea extends to higher dimensions for the multivariate normal.
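A standard worked instance of this picture (not computed in the lecture): for equally variable, positively correlated coordinates,
\[
\Sigma = \sigma^2 \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}
\;\Rightarrow\;
e_1 = \tfrac{1}{\sqrt{2}}(1, 1)^T, \quad
e_2 = \tfrac{1}{\sqrt{2}}(1, -1)^T, \quad
\lambda_{1,2} = \sigma^2(1 \pm \rho),
\]
so the principal axes are the original axes rotated by exactly $45°$.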
2.6 How to choose a 'k'?
Plot the eigenvalues $\lambda_j$ against $j$ (the scree plot). We stop and choose $k$ where we observe a major change in slope (the "elbow").
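A minimal sketch of this rule (the eigenvalues are hypothetical; the 90% cutoff shown alongside is a common convention, not from the lecture):

    import numpy as np

    eigvals = np.array([5.2, 2.9, 0.4, 0.3, 0.2])   # hypothetical, sorted descending

    # Scree plot data: eigvals against j = 1..p; the slope flattens after j = 2,
    # so the elbow rule picks k = 2 here.
    prop = np.cumsum(eigvals) / eigvals.sum()        # cumulative share of total variability
    k = int(np.argmax(prop >= 0.90)) + 1             # smallest k explaining >= 90%
    print(k, prop)                                   # k = 2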
Lecture 2, Aug 4