Principal Components Analysis (Part I): Data Science

Principal Components Analysis (PCA) is introduced as a technique to obtain a low-dimensional summary of NBA team statistics data. PCA works by transforming the data to a new coordinate system such that the greatest variance by any projection of the data lies on the first coordinate (called the first principal component), the second greatest variance on the second coordinate, and so on. This has the effect of maximizing the variance explained by each successive component. An example visualization using the first two principal components projects the NBA teams as points in a two-dimensional space that accounts for most of the variance in the original high-dimensional data.

Principal Components Analysis (part I)

Data Science
UTB

CC BY-SA 4.0
Introduction

2 / 77
NBA Team Stats

- NBA Team Stats: regular season (2016-17)
- GitHub file: data/nba-teams-2017.csv
- Source: stats.nba.com
- http://stats.nba.com/teams/traditional/#!?sort=GP&dir=-1

3 / 77
# import the data
dat <- read.csv('data/nba-teams-2017.csv')

# dimensions: 30 teams, 27 variables
dim(dat)

[1] 30 27

# variable names
names(dat)

 [1] "team"                 "games_played"         "wins"
 [4] "losses"               "win_prop"             "minutes"
 [7] "points"               "field_goals"          "field_goals_attempted"
[10] "field_goals_prop"     "points3"              "points3_attempted"
[13] "points3_prop"         "free_throws"          "free_throws_att"
[16] "free_throws_prop"     "off_rebounds"         "def_rebounds"
[19] "rebounds"             "assists"              "turnovers"
[22] "steals"               "blocks"               "block_fga"
[25] "personal_fouls"       "personal_fouls_drawn" "plus_minus"

5 / 77
Exploratory Data Analysis

For illustration purposes, let's focus on the following variables (see the R sketch after this list):

- wins
- losses
- points
- field goals
- assists
- turnovers
- steals
- blocks
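A minimal R sketch of this subsetting step, assuming the dat data frame loaded earlier (column names as printed by names(dat)):

# keep only the eight variables used in the following examples
vars <- c("wins", "losses", "points", "field_goals",
          "assists", "turnovers", "steals", "blocks")
teams <- dat[ , vars]
rownames(teams) <- dat$team   # label rows with the team names
summary(teams)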

6 / 77
EDA: Objects and Variables Perspectives
[Figure: the centered data matrix X, with n rows (objects) indexed by i, p columns (variables) indexed by j, and generic entry xij. Two derived matrices: XXᵀ, the inner products of rows, and (1/n) XᵀX, the covariance matrix.]
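A short sketch of these two matrices in R, assuming the teams data frame defined above:

# centered data matrix (objects in rows, variables in columns)
X <- scale(as.matrix(teams), center = TRUE, scale = FALSE)
n <- nrow(X)

cross_rows <- X %*% t(X)            # X Xᵀ: inner products between teams (n x n)
cov_mat    <- (1 / n) * t(X) %*% X  # (1/n) XᵀX: covariance matrix (p x p)

# R's cov() divides by n-1 instead of n
all.equal(cov_mat * n / (n - 1), cov(X), check.attributes = FALSE)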

7 / 77
EDA: Objects and Variables Perspectives

Data Perspectives
We are interested in analyzing a data set from both perspectives: objects and variables.
At its simplest, we are interested in 2 fundamental purposes:
- Study resemblance among individuals (resemblance among NBA teams)
- Study relationship among variables (relationship among team statistics)

8 / 77
EDA

Exploration
Likewise, we can explore variables at different stages:
- Univariate: one variable at a time
- Bivariate: two variables simultaneously
- Multivariate: multiple variables
Let's see a shiny-app demo (see the apps/ folder in the GitHub repo)

9 / 77
[Figure: one small chart per NBA team (30 teams, from Spurs to Suns) showing the eight selected variables: points, field_goals, losses, assists, wins, turnovers, blocks, steals]
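The original figure appears to show one small profile chart per team; a base-R star-plot sketch in that spirit, assuming the teams data frame from earlier:

# one star per team, segments scaled to each variable's range
stars(teams, labels = rownames(teams), scale = TRUE)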

10 / 77
Correlation heatmap

[Figure: correlation heatmap of wins, losses, points, field_goals, assists, turnovers, steals, blocks; the fill color encodes the Pearson correlation on a scale from -1.0 to 1.0]
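A sketch of how such a heatmap can be drawn, assuming the ggplot2 and reshape2 packages are available (the original figure was likely produced with similar code):

library(ggplot2)
library(reshape2)

# correlation matrix in "long" format: one row per (Var1, Var2) pair
cor_long <- melt(cor(teams), varnames = c("Var1", "Var2"))

ggplot(cor_long, aes(x = Var2, y = Var1, fill = value)) +
  geom_tile() +
  scale_fill_gradient2(low = "blue", mid = "white", high = "red",
                       limits = c(-1, 1), name = "Pearson\nCorrelation") +
  theme(axis.text.x = element_text(angle = 90, vjust = 0.5))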

11 / 77
[Figure: scatterplot matrix of wins, losses, points, field_goals, assists, turnovers, steals, blocks]
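A minimal base-R sketch of this scatterplot matrix, assuming the teams data frame from earlier:

# all pairwise scatterplots of the eight selected variables
pairs(teams, pch = 19, col = "gray30", cex = 0.8)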

12 / 77
What if we could get a better
low-dimensional summary of the data?

13 / 77
[Figure: PCA map of individuals, with the 30 NBA teams plotted on Dim 1 (46.01%) and Dim 2 (20.22%)]

14 / 77
[Figure: PCA circle of correlations, with the eight variables (wins, losses, points, field_goals, assists, turnovers, steals, blocks) plotted on Dim 1 (46.01%) and Dim 2 (20.22%)]
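A sketch of how these two maps can be obtained, assuming the FactoMineR package (whose default plots label the axes "Dim 1 (...%)", as above); base R's prcomp() would give equivalent results:

library(FactoMineR)

pca <- PCA(teams, scale.unit = TRUE, graph = FALSE)

pca$eig                  # eigenvalues and percentages of explained inertia
plot(pca, choix = "ind") # map of individuals (teams)
plot(pca, choix = "var") # circle of correlations (variables)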

15 / 77
About PCA

16 / 77
Data Structure

Principal Components Analysis (PCA) is a multivariate


method that allows us to study and explore a set of
quantitative variables measured on some objects.

17 / 77
Landmarks

- PCA was first introduced by Karl Pearson (1901):
  "On lines and planes of closest fit to systems of points in space"
- Further developed by Harold Hotelling (1933):
  "Analysis of a complex of statistical variables into principal components"
- Singular Value Decomposition (SVD) theorem by Eckart and Young (1936):
  "The approximation of one matrix by another of lower rank"
- Computationally implemented in the 1960s

18 / 77
Core Idea

With PCA we seek to reduce the dimensionality of a data set (condense the information carried by its variables) while retaining as much as possible of the variation present in the data.

19 / 77
PCA: Overall Goals

- Summarize a data set with the help of a small number of synthetic variables (i.e. the Principal Components).
- Visualize the position (resemblance) of individuals.
- Visualize how variables are correlated.
- Interpret the synthetic variables.

20 / 77
Applications

PCA can be used for


1. Dimension Reduction
2. Visualization
3. Feature Extraction
4. Data Compression
5. Smoothing of Data
6. Detection of Outliers
7. Preliminary process for further analyses

21 / 77
About PCA

Approaches:
PCA can be presented using several different but equivalent approaches. Each approach corresponds to a unique perspective and way of thinking about the data.
- Data dispersion from the individuals' standpoint
- Data variability from the variables' standpoint
- Data that follows a decomposition model
I will present PCA by mixing and connecting all of these approaches.

22 / 77
Geometric Approach

23 / 77
Geometric mindset

PCA for Data Visualization


One way to present PCA is based on a data visualization
approach.

To help you understand the main idea of PCA from a geometric standpoint, I'd like to begin by showing you my mug-data example.

24 / 77
Imagine a data set in a "high-dimensional space"

25 / 77
We are looking for Candidate Subspaces

[Figure: three candidate subspaces HA, HB, HC]

26 / 77
with the best low-dimensional representation

[Figure: the data projected onto the candidate subspaces HA, HB, HC]

27 / 77
Best low-dimensional projection

[Figure: the candidate subspace (among HA, HB, HC) giving the best low-dimensional projection]

28 / 77
Geometric Idea

Looking at the cloud of points


Under a purely geometric approach, PCA aims to represent
the cloud of points in a space with reduced dimensionality in
an “optimal” way.

We look for the “best” graphical representation that allows us


to visualize the cloud of individuals in a low dimensional space
(usually 2-dimensions).

29 / 77
Objects in a high-dimensional space

[Figure: cloud of points in R^p]

30 / 77
We look for a subspace such that

[Figure: a candidate subspace in R^p]

31 / 77
the projection of points on it

[Figure: the points projected onto the subspace]

32 / 77
is the best low-dimensional representation

How do you find the associated axes?


33 / 77
Focus on Distances

Distances between individuals


Looking for the best low-dimensional projection means that we want to find a subspace in which the projected distances among points are as similar as possible to the original distances.

34 / 77
Focus on distances between objects

[Figure: two objects i and h with squared distance d^2(i, h), the centroid g, and a subspace H]

35 / 77
We want projected distances to preserve original distances

[Figure: objects i and h and their projections onto the subspace H, with original squared distance d^2(i, h) and projected squared distance dH^2(i, h)]

We want d^2(i, h) to be as close as possible to dH^2(i, h)

36 / 77
Focus on projected distances

The idea is to project the cloud of points onto a plane (or a low-dimensional space) of R^p, chosen in such a manner as to distort the distances between individuals as little as possible.

37 / 77
Distances and Dispersion

Dispersion of Data
Focusing on distances among all pairs of objects implicitly
entails taking into account the dispersion or spread (i.e.
variation) of the data.

Data Configuration
The reason to pay attention to distances and dispersion is to
summarize in a quantitative way the original configuration of
the data points.

38 / 77
How to measure dispersion?
The concept of Inertia

39 / 77
Sum of Squared Distances

Pairwise squared distances

One way to consider the dispersion of the data (in a mathematical form) is by adding the squared distances among all pairs of points.

Squared distances from the centroid

Another way to measure the dispersion of the data is by considering the squared distances of all points around the center of gravity (i.e. the centroid).

40 / 77
Imagine 3 points and their centroid

[Figure: three teams (GSW, UTA, LAL) as points in the space of variables X1, X2, ..., Xp, together with their centroid]

Centroid g is the “average” team.

41 / 77
Dispersion: Sum of all squared distances

[Figure: the same three teams (GSW, UTA, LAL) with the pairwise distances among them]

SSD = 2 d^2(LAL, GSW) + 2 d^2(LAL, UTA) + 2 d^2(GSW, UTA)

42 / 77
SSD = 2n × (sum of squared distances w.r.t. the centroid)

[Figure: the same three teams with their squared distances to the centroid g]

SSD = (2 × 3) × {d^2(LAL, g) + d^2(GSW, g) + d^2(UTA, g)}

43 / 77
Inertia
One way to take into account the dispersion of the data is
with the concept of Inertia.
- Inertia is a term borrowed from the moment of inertia in mechanics (physics).
- This involves thinking about data as a rigid body (i.e. particles).
- We use the term Inertia to convey the idea of dispersion in the data.
- In multivariate methods, the term Inertia generalizes the notion of variance.
- Think of Inertia as a "multidimensional variance"

44 / 77
Cloud of teams in p-dimensional space
[Figure: cloud of the 30 teams (e.g. GSW, LAL) in the p-dimensional space of variables X1, X2, ..., Xp]

45 / 77
Centroid (i.e. the average team)
[Figure: the cloud of teams together with its centroid (the average team)]

46 / 77
Formula of Total Inertia

The Total Inertia, I, is a weighted sum of squared distances among all pairs of objects:

$$I = \frac{1}{2n^2} \sum_{i=1}^{n} \sum_{h=1}^{n} d^2(i, h)$$

47 / 77
Overall variation/spread (around centroid)
[Figure: the cloud of teams with its overall spread around the centroid]

48 / 77
Formula of Total Inertia

Equivalently, the Total Inertia can be calculated in terms of the centroid g:

$$I = \frac{1}{n} \sum_{i=1}^{n} d^2(x_i, g)$$

The Inertia is an average sum of squared distances around the centroid g.

49 / 77
Centered data: centroid is the origin
[Figure: the centered cloud of teams, with the centroid at the origin]

50 / 77
Computing Inertia

$$\begin{aligned}
\text{Inertia} &= \sum_{i=1}^{n} m_i \, d^2(x_i, g) \\
&= \sum_{i=1}^{n} \frac{1}{n} (x_i - g)^\mathsf{T} (x_i - g) \\
&= \frac{1}{n} \mathrm{tr}(X^\mathsf{T} X) \\
&= \frac{1}{n} \mathrm{tr}(X X^\mathsf{T})
\end{aligned}$$

where m_i is the mass (i.e. weight) of individual i, usually 1/n.
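A short numerical check in R, assuming the teams data frame from earlier (note that scale() standardizes with the n-1 denominator, so the total comes out as p(n-1)/n rather than exactly p):

X <- scale(teams)                 # centered and standardized data matrix
n <- nrow(X)

inertia_dist  <- mean(rowSums(X^2))           # (1/n) sum of d^2(x_i, g), with g = 0
inertia_trace <- sum(diag(crossprod(X))) / n  # (1/n) tr(X'X)

c(inertia_dist, inertia_trace)    # identical values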

51 / 77
Finding Principal Components

52 / 77
Inertia Concept

Inertia and PCA


In PCA we look for a low-dimensional subspace having
Projected Inertia as close as possible to the Original Inertia.

Criterion
The criterion used for dimensionality reduction implies that the
inertia of a cloud of points in the optimal subspace is
maximum (but less than the inertia in the original space).

53 / 77
Criterion

Maximize Projected Inertia


We want to maximize the Projected Inertia on the subspace H:

$$\max \sum_{i} d_H^2(x_i, g)$$

Axis of Inertia
To find the subspace H we can look for each of its axes ∆1, ∆2, ..., ∆k and their corresponding vectors v1, v2, ..., vk (k < p).

54 / 77
Looking for a 1st axis

[Figure: cloud of points spanning the axes X1, X2, ..., Xp]

NBA teams in a p-dimensional space

55 / 77
1st axis

[Figure: a first axis (axis 1) passing through the cloud of points]

We want a 1st axis that retains most of the projected inertia

56 / 77
First Axis and Principal Component

Projection of object i on axis ∆1, generated by vector v1:

$$x_i^\mathsf{T} v_1 = \sum_{j=1}^{p} x_{ij} v_{1j}$$

The 1st component z1 is the projection of all points on v1:

$$X v_1 = z_1$$

(we don't really manipulate the axis ∆1, but its associated vector v1)

57 / 77
First Axis and Principal Component

- The axis ∆1 passes through the centroid g (with centered data, g is the origin)
- The axis ∆1 is created by the unit-norm vector v1, the eigenvector of (1/n) XᵀX associated with the largest eigenvalue λ1
- The inertia explained by the axis ∆1 is equal to λ1
- With standardized data, the proportion of inertia explained by ∆1 is λ1/p
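A sketch in R, assuming the teams data frame from earlier. The eigenvectors of (1/n) XᵀX and of the correlation matrix R are the same (only the eigenvalues are rescaled), so cor() and eigen() can be used directly:

X  <- scale(teams)                # standardized data
R  <- cor(teams)                  # correlation matrix

e  <- eigen(R)
v1 <- e$vectors[, 1]              # unit-norm vector generating the first axis
z1 <- X %*% v1                    # first principal component: z1 = X v1

e$values[1] / ncol(X)             # proportion of inertia explained by axis 1

# prcomp() returns the same component, possibly with the opposite sign
p1 <- prcomp(teams, scale. = TRUE)$x[, 1]
all.equal(abs(as.numeric(z1)), abs(as.numeric(p1)))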

58 / 77
2nd axis
[Figure: the cloud of points with a second axis (axis 2) orthogonal to the first (axis 1)]

We want a 2nd axis, orthogonal to ∆1, that retains most of the remaining projected inertia

59 / 77
Second Axis and Principal Component

Projection of object i on axis ∆2, generated by vector v2:

$$x_i^\mathsf{T} v_2 = \sum_{j=1}^{p} x_{ij} v_{2j}$$

The 2nd component z2 is the projection of all points on v2:

$$X v_2 = z_2$$

(we don't really manipulate the axis ∆2, but its associated vector v2)

60 / 77
Second Axis and Principal Component

- The axis ∆2 passes through the centroid g and is perpendicular to ∆1
- The axis ∆2 is created by the unit-norm vector v2, the eigenvector of (1/n) XᵀX associated with the second largest eigenvalue λ2
- The inertia explained by the axis ∆2 is equal to λ2
- With standardized data, the proportion of inertia explained by ∆2 is λ2/p

61 / 77
Computational note

In practice, most software routines for PCA don't really work with the population covariance matrix (1/n) XᵀX.

Instead, most programs work with the sample covariance matrix: (1/(n-1)) XᵀX.

Notice that with standardized data, (1/(n-1)) XᵀX = R, the (sample) correlation matrix.
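A quick check of the n-1 convention in R, again assuming the teams data frame:

X <- scale(teams)   # standardized data
n <- nrow(X)

# sample covariance of standardized data = correlation matrix of the raw data
all.equal(crossprod(X) / (n - 1), cov(X), check.attributes = FALSE)
all.equal(crossprod(X) / (n - 1), cor(teams), check.attributes = FALSE)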

62 / 77
Looking at Variables

63 / 77
Looking at the cloud of standardized variables

[Figure: two standardized variables Xj and Xl shown as vectors in R^n, with the angle θjl between them]

64 / 77
Looking at the cloud of standardized variables

- With standardized data, the p variables are located within a hypersphere of radius 1 in an n-dimensional space.
- We represent them graphically as vectors.
- The scalar product between two variables Xj and Xl is:

$$\langle X_j, X_l \rangle = \sum_{i=1}^{n} x_{ij} x_{il} = \|x_j\| \, \|x_l\| \cos(\theta_{jl})$$

- Notice that:

$$\cos(\theta_{jl}) = \frac{x_j^\mathsf{T} x_l}{\|x_j\| \, \|x_l\|} = \mathrm{cor}(X_j, X_l)$$
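A small R check of this property, assuming the teams data frame from earlier; centering is enough for the cosine to equal the correlation:

x <- scale(teams$wins,   center = TRUE, scale = FALSE)
y <- scale(teams$points, center = TRUE, scale = FALSE)

cos_angle <- sum(x * y) / (sqrt(sum(x^2)) * sqrt(sum(y^2)))

c(cos_angle, cor(teams$wins, teams$points))   # identical values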

65 / 77
Projecting the cloud of standardized variables

- The property cos(θjl) = cor(Xj, Xl) is essential in PCA
- A representation of the cloud of variables can be used to visualize the correlations (through the angles between variables)
- The cloud of variables is also projected onto a low dimensional space.
- In this case, the distance between two variables is computed with inner products.

66 / 77
Projection of best subspace

[Figure: variables A, B, C, D shown as vectors, with their projections HA, HB, HC, HD onto the main plane]

Projection of the scatterplot of variables on the main plane of variability

67 / 77
Projecting the cloud of standardized variables

The projection of variable j onto an axis k is equal to the cosine of the angle θjk.

The criterion maximizes:

$$\sum_{j=1}^{p} \cos^2(\theta_{jk}) = \sum_{j=1}^{p} \mathrm{cor}^2(x_j, z_k)$$

where zk is the new variable that is the most correlated with all of the original variables.

68 / 77
Finding subspace for variables

Projection of variable j on axis H1, generated by vector u1:

$$x_j^\mathsf{T} u_1 = \sum_{i=1}^{n} x_{ij} u_{i1}$$

The synthetic variable u1 can be used to obtain a factor q1:

$$X^\mathsf{T} u_1 = q_1$$

(we don't really manipulate the axis H1, but its associated vector u1)

69 / 77
Finding subspace for variables

Solution: u1 is the first eigenvector of (1/n) XXᵀ, the matrix of inner products between individuals:

$$\frac{1}{n} X X^\mathsf{T} u_1 = \lambda_1 u_1$$

The subsequent dimensions are given by the other eigenvectors u2, u3, ...

And the corresponding variable factors are given by:

$$Q = X^\mathsf{T} U$$

70 / 77
Finding subspace for variables

It can be shown that the PCs can also be obtained as:

$$Z = \frac{1}{\sqrt{n}} \, U \Lambda^{1/2}$$

where:
- Λ is the diagonal matrix of eigenvalues of (1/n) XXᵀ
- U is the matrix of eigenvectors of (1/n) XXᵀ

But keep in mind that PCs can be rescaled.

Note that most PCA programs work with n − 1 instead of n.

71 / 77
Relationship between the representations of Individuals and Variables

72 / 77
Link between representations

SVD of the scaled data matrix:

$$\frac{1}{\sqrt{n-1}} X = U D V^\mathsf{T}$$

$$Z = X V = \frac{1}{\sqrt{n-1}} X Q D^{-1} \quad \Rightarrow \quad V = \frac{1}{\sqrt{n-1}} Q D^{-1}$$

$$Q = X^\mathsf{T} U = \frac{1}{\sqrt{n-1}} X^\mathsf{T} Z D^{-1} \quad \Rightarrow \quad U = \frac{1}{\sqrt{n-1}} Z D^{-1}$$
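A numerical check of these identities with base R's svd(), assuming the standardized teams data from before:

X <- scale(teams)
n <- nrow(X)

s <- svd(X / sqrt(n - 1))    # (1/sqrt(n-1)) X = U D V'
U <- s$u; D <- diag(s$d); V <- s$v

Z <- X %*% V                 # principal components (scores)
Q <- t(X) %*% U              # factors for the variables

all.equal(V, (1 / sqrt(n - 1)) * Q %*% solve(D), check.attributes = FALSE)
all.equal(U, (1 / sqrt(n - 1)) * Z %*% solve(D), check.attributes = FALSE)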

73 / 77
Link between representations

Principal Components or Scores


$$z_{ik} = \frac{1}{\sqrt{n-1}} \times \frac{1}{\sqrt{\lambda_k}} \times \sum_{j=1}^{p} x_{ij} q_{jk}$$

Factors for Variables

$$q_{jk} = \frac{1}{\sqrt{n-1}} \times \frac{1}{\sqrt{\lambda_k}} \times \sum_{i=1}^{n} x_{ij} z_{ik}$$

74 / 77
Principal Components?

Meaning of Principal
The term Principal, as used in PCA, has to do with the
notion of principal axis from geometry and linear algebra

Principal Axis
A principal axis is a certain line in a Euclidean space associated with an ellipsoid or hyperboloid, generalizing the major and minor axes of an ellipse

75 / 77
References

- Exploratory Multivariate Analysis by Example Using R by Husson, Lê and Pagès (2010). Chapter 1: Principal Component Analysis (PCA). CRC Press.
- An R and S-Plus Companion to Multivariate Analysis by Brian Everitt (2004). Chapter 3: Principal Components Analysis. Springer.
- Principal Component Analysis by Ian Jolliffe (2002). Springer.
- Data Mining and Statistics for Decision Making by Stéphane Tufféry (2011). Chapter 7: Factor Analysis. Editions Technip, Paris.

76 / 77
References (French Literature)

- Statistique Exploratoire Multidimensionnelle by Lebart et al. (2004). Chapter 3, section 3: Analyse factorielle discriminante. Dunod, Paris.
- Probabilités, analyse des données et statistique by Gilbert Saporta (2011). Chapter 6: Analyse en Composantes Principales. Editions Technip, Paris.
- Statistique: Méthodes pour décrire, expliquer et prévoir by Michel Tenenhaus (2008). Chapter 10: L'analyse discriminante. Dunod, Paris.
- Analyses factorielles simples et multiples by Brigitte Escofier and Jérôme Pagès (2016, 5th edition). Chapter 2: L'analyse discriminante. Dunod, Paris.

77 / 77
