Lec-04 - Linear Discriminant Analysis

The document discusses Linear Discriminant Analysis (LDA), its advantages over Logistic Regression, and its application in classification problems with multiple classes. It explains the Bayes Decision Boundary and the use of Bayes Theorem for optimal classification, along with parameter estimation for LDA. Additionally, it contrasts LDA with Quadratic Discriminant Analysis (QDA) and provides examples and error rates from classification tasks.

Linear Discriminant Analysis

Dr. Sayak Roychowdhury


Department of Industrial & Systems Engineering,
IIT Kharagpur
Reference Books
• James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An Introduction to Statistical Learning. New York: Springer.
• Hastie, T., Tibshirani, R., & Friedman, J. H. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd ed.). New York: Springer.
Why LDA?
• When the classes are well separated, the parameter estimates for Logistic Regression can become unstable
• When 𝑛 is small and the distribution of 𝑋 is approximately normal, LDA is more stable
• LDA is also attractive when there are more than 2 classes, since it provides a low-dimensional view of the data
• When the assumed population model is right, the Bayes rule is the optimal classifier
Decision boundaries

[Figure: left, linear decision boundaries found by LDA; right, quadratic decision boundaries obtained using LDA.]
Bayes Decision Boundary
• The test error rate of a classification problem is minimized when each new observation is assigned to the class $j$ for which $P(Y = j \mid X = x_0)$ is largest
• A classifier that does this is called the Bayes classifier
• In a two-class problem with $j \in \{1, 2\}$, the Bayes classifier predicts class 1 when
  $P(Y = 1 \mid X = x_0) > 0.5$
• The Bayes classifier produces the lowest possible test error rate, called the Bayes error rate
Bayes Theorem for Classification
• $\Pr(Y = k \mid X = x) = \dfrac{\Pr(X = x \mid Y = k)\,\Pr(Y = k)}{\Pr(X = x)}$
• $\Pr(Y = k \mid X = x) = \dfrac{\pi_k f_k(x)}{\sum_l \pi_l f_l(x)}$
• where $f_k(x) = \Pr(X = x \mid Y = k)$ is the conditional density of $X$ in class $k$, and $\pi_k = \Pr(Y = k)$ is the prior probability of class $k$
• We need to know the class posterior $\Pr(Y = k \mid X = x)$ for optimal classification
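As a quick numerical illustration (the priors and density values here are made up, not from the lecture): suppose $K = 2$, $\pi_1 = 0.3$, $\pi_2 = 0.7$, and at some point $x_0$ the class densities are $f_1(x_0) = 0.20$ and $f_2(x_0) = 0.05$. Then
$\Pr(Y = 1 \mid X = x_0) = \dfrac{0.3 \times 0.20}{0.3 \times 0.20 + 0.7 \times 0.05} = \dfrac{0.060}{0.095} \approx 0.63,$
so the observation would be assigned to class 1 despite its smaller prior.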
Linear Discriminant Analysis for One Predictor
• An observation is assigned to the class $k$ for which $p_k(x) = \Pr(Y = k \mid X = x)$ is greatest
• Assume each class density is Gaussian:
  $f_k(x) = \dfrac{1}{\sigma_k \sqrt{2\pi}} \exp\!\left(-\dfrac{1}{2\sigma_k^2}(x - \mu_k)^2\right)$
  where $\mu_k$ and $\sigma_k^2$ are the mean and variance parameters of the $k$th class
• For Linear Discriminant Analysis, it is assumed that all classes share a common variance:
  $\sigma_1^2 = \sigma_2^2 = \cdots = \sigma_K^2 = \sigma^2$
Linear Discriminant Analysis for One Predictor
• $p_k(x) = \Pr(Y = k \mid X = x) = \dfrac{\pi_k f_k(x)}{\sum_l \pi_l f_l(x)} = \dfrac{\pi_k \frac{1}{\sigma\sqrt{2\pi}} \exp\left(-\frac{1}{2\sigma^2}(x - \mu_k)^2\right)}{\sum_l \pi_l \frac{1}{\sigma\sqrt{2\pi}} \exp\left(-\frac{1}{2\sigma^2}(x - \mu_l)^2\right)}$
• The Bayes classifier assigns an observation at $X = x$ to the class for which $p_k(x)$ is largest
• Taking logs and discarding terms that do not depend on $k$, this is equivalent to assigning the observation to the class for which the discriminant score $\delta_k(x)$ is largest:
  $\delta_k(x) = \dfrac{x\,\mu_k}{\sigma^2} - \dfrac{\mu_k^2}{2\sigma^2} + \log(\pi_k)$
Bayes Decision Boundary
• For $K = 2$ and $\pi_1 = \pi_2$, an observation is assigned to class 1 if $\delta_1(x) > \delta_2(x)$, i.e., if
  $2x(\mu_1 - \mu_2) > \mu_1^2 - \mu_2^2$
• The Bayes decision boundary is the point where the two discriminant scores are equal:
  $x = \dfrac{\mu_1^2 - \mu_2^2}{2(\mu_1 - \mu_2)} = \dfrac{\mu_1 + \mu_2}{2}$
  (for example, with $\mu_1 = 0$ and $\mu_2 = 2$ the boundary sits at the midpoint $x = 1$)
Parameter Estimation
• $\hat{\mu}_k = \dfrac{1}{n_k} \sum_{i: y_i = k} x_i$
• If no knowledge of the prior probability $\pi_k$ is available, it can be estimated from the training proportions:
  $\hat{\pi}_k = \dfrac{n_k}{N}$
• $\hat{\sigma}^2 = \dfrac{1}{N - K} \sum_{k=1}^{K} \sum_{i: y_i = k} (x_i - \hat{\mu}_k)^2$
• The LDA classifier plugs these estimates into the discriminant function and evaluates it at the observation $X = x$:
  $\hat{\delta}_k(x) = \dfrac{x\,\hat{\mu}_k}{\hat{\sigma}^2} - \dfrac{\hat{\mu}_k^2}{2\hat{\sigma}^2} + \log(\hat{\pi}_k)$
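The examples in these slides use R, so a minimal R sketch of these plug-in estimates may help; the simulated sample sizes, means, and variance below are illustrative assumptions, not values from the lecture:

# One predictor, two classes: simulate data (illustrative values only)
set.seed(1)
n1 <- 60; n2 <- 40
x <- c(rnorm(n1, mean = 0), rnorm(n2, mean = 2))   # common sd = 1
y <- c(rep(1, n1), rep(2, n2))

N <- length(x); K <- 2
pi.hat     <- table(y) / N                        # pi_k = n_k / N
mu.hat     <- tapply(x, y, mean)                  # class sample means
sigma2.hat <- sum((x - mu.hat[y])^2) / (N - K)    # pooled variance estimate

# Plug-in discriminant: delta_k(x) = x mu_k / sigma^2 - mu_k^2 / (2 sigma^2) + log(pi_k)
delta <- function(x0, k)
  x0 * mu.hat[k] / sigma2.hat - mu.hat[k]^2 / (2 * sigma2.hat) + log(pi.hat[k])

x0 <- 1.0
which.max(c(delta(x0, 1), delta(x0, 2)))   # class with the largest discriminant

With equal priors, comparing the two discriminants reduces to the midpoint rule from the previous slide: the prediction flips as $x_0$ crosses $(\hat{\mu}_1 + \hat{\mu}_2)/2$.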
Multivariate Gaussian

[Figure: two bivariate Gaussian densities. Left: $X_1$ and $X_2$ uncorrelated, with $\mathrm{Var}(X_1) = \mathrm{Var}(X_2)$; right: $X_1$ and $X_2$ correlated.]
Gaussian Density with Multiple Predictors
• Suppose each class density is multivariate Gaussian:
  $f_k(x) = \dfrac{1}{(2\pi)^{p/2} |\Sigma_k|^{1/2}}\, e^{-\frac{1}{2}(x - \mu_k)^T \Sigma_k^{-1} (x - \mu_k)}$
• LDA is the special case in which the covariance matrix is assumed to be the same for all classes:
  $\Sigma_k = \Sigma \quad \forall k$
LDA with Multiple Predictors

[Figure: left, three Gaussian distributions; right, samples from the three Gaussian distributions, with solid lines indicating the LDA boundaries.]
Linear Discriminant Function
• Discriminant function:
  $\delta_k(x) = x^T \Sigma^{-1} \mu_k - \frac{1}{2}\, \mu_k^T \Sigma^{-1} \mu_k + \log \pi_k$
• Decision rule: $G(x) = \operatorname{argmax}_k\, \delta_k(x)$
  (the predicted class of $x$ is the one with the largest $\delta_k(x)$ value)
• Estimated values:
  $\hat{\pi}_k = \dfrac{N_k}{N}, \qquad \hat{\mu}_k = \dfrac{1}{N_k} \sum_{g_i = k} x_i, \qquad \hat{\Sigma} = \dfrac{1}{N - K} \sum_{k=1}^{K} \sum_{g_i = k} (x_i - \hat{\mu}_k)(x_i - \hat{\mu}_k)^T$
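As a sketch in R of the matrix form above, computing $\hat{\Sigma}$ and $\delta_k(x)$ directly; the two-predictor simulated data are an illustrative assumption:

# Two predictors, two classes (illustrative data)
set.seed(2)
X <- rbind(matrix(rnorm(100), 50, 2),
           matrix(rnorm(100, mean = 1.5), 50, 2))
g <- rep(1:2, each = 50)

N <- nrow(X); K <- 2
pi.hat <- table(g) / N
mu.hat <- rbind(colMeans(X[g == 1, ]), colMeans(X[g == 2, ]))   # K x p matrix of class means

Xc        <- X - mu.hat[g, ]            # center each row by its class mean
Sigma.hat <- crossprod(Xc) / (N - K)    # pooled covariance estimate
Sigma.inv <- solve(Sigma.hat)

# delta_k(x) = x' Sigma^{-1} mu_k - (1/2) mu_k' Sigma^{-1} mu_k + log(pi_k)
delta <- function(x0, k)
  drop(x0 %*% Sigma.inv %*% mu.hat[k, ]
       - 0.5 * mu.hat[k, ] %*% Sigma.inv %*% mu.hat[k, ]) + log(pi.hat[k])

x0 <- c(1, 1)
which.max(sapply(1:K, function(k) delta(x0, k)))   # G(x) = argmax_k delta_k(x)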
Example (Default Data)

• The LDA classifier predicts Yes when P(Default = Yes | X) > 0.5 (the Bayes classifier threshold)
• Overall training error rate: 2.75%
• Error rate among individuals who defaulted: 75.7%
• Sensitivity, the percentage of true defaulters that are correctly identified: 24.3%
• Specificity, the percentage of non-defaulters that are correctly identified: $1 - 23/9667 = 99.8\%$
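A hedged R sketch of this analysis using MASS::lda on the Default data from the ISLR package; the balance + student formula follows the textbook's example and is an assumption about this slide's exact fit:

library(MASS)   # lda()
library(ISLR)   # Default data: default, student, balance, income

lda.fit <- lda(default ~ balance + student, data = Default)
post    <- predict(lda.fit)$posterior[, "Yes"]   # P(Default = Yes | X)
pred    <- ifelse(post > 0.5, "Yes", "No")       # Bayes classifier threshold

table(Predicted = pred, Actual = Default$default)   # confusion matrix
mean(pred != Default$default)                       # overall training error rate
mean(pred[Default$default == "Yes"] == "No")        # error rate among defaulters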
Example (Default Data)

• The LDA classifier predicts Yes when P(Default = Yes | X) > 0.2, i.e., with a lower threshold than the Bayes classifier's 0.5
• Overall training error rate: 3.73%
• Error rate among individuals who defaulted: 41.4%
[Figure: error rates versus the posterior probability threshold, showing the overall error rate, the fraction of defaulters incorrectly classified, and the fraction of errors among non-defaulters.]
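Continuing the sketch above, lowering the threshold to 0.2 trades a slightly higher overall error for a much lower error among defaulters:

pred20 <- ifelse(post > 0.2, "Yes", "No")
mean(pred20 != Default$default)                  # overall training error rate
mean(pred20[Default$default == "Yes"] == "No")   # error rate among defaulters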
ROC Curve

• True Positive Rate = Sensitivity
• False Positive Rate = 1 − Specificity
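A minimal sketch tracing out the ROC curve by hand from the posterior probabilities post computed in the Default sketch above (dedicated packages such as pROC exist; the direct computation simply mirrors the definitions on this slide):

thresholds <- seq(0, 1, by = 0.01)
tpr <- sapply(thresholds, function(t)
  mean(post[Default$default == "Yes"] > t))   # sensitivity at threshold t
fpr <- sapply(thresholds, function(t)
  mean(post[Default$default == "No"]  > t))   # 1 - specificity at threshold t

plot(fpr, tpr, type = "l", xlab = "False Positive Rate",
     ylab = "True Positive Rate", main = "ROC Curve")
abline(0, 1, lty = 2)   # no-information classifier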
Quadratic Discriminant Function
• When the assumption of an equal covariance matrix for all classes is dropped, we get QDA
• Discriminant function:
  $\delta_k(x) = -\frac{1}{2} \log |\Sigma_k| - \frac{1}{2} (x - \mu_k)^T \Sigma_k^{-1} (x - \mu_k) + \log \pi_k$
• Decision rule: $G(x) = \operatorname{argmax}_k\, \delta_k(x)$
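A short sketch with MASS::qda, again on the assumed Default formula from the earlier example; predict() returns the class chosen by the quadratic discriminant:

library(MASS)
library(ISLR)

qda.fit  <- qda(default ~ balance + student, data = Default)
qda.pred <- predict(qda.fit)$class

table(Predicted = qda.pred, Actual = Default$default)
mean(qda.pred != Default$default)   # QDA training error rate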
QDA and LDA
• In LDA, the $K$ response classes are assumed to share a common covariance matrix, whereas in QDA this assumption is dropped
• For $p$ predictors, estimating a single covariance matrix requires estimating $\frac{p(p+1)}{2}$ parameters
• QDA requires a separate covariance matrix for each class, for a total of $\frac{Kp(p+1)}{2}$ parameters
• Hence QDA estimates many more parameters, which leads to higher variance; for example, with $p = 50$ predictors and $K = 3$ classes, QDA must estimate $3 \times 1275 = 3825$ covariance parameters
• The LDA model requires only $Kp$ linear coefficients to be estimated, so it is much less flexible and may lead to higher bias
• If there are relatively few training data points, LDA tends to perform better than QDA
• QDA is recommended when the training set is very large, so that the variance of the classifier is not a major concern
QDA and LDA

[Figure: left, two Gaussian classes with a common correlation between $X_1$ and $X_2$; right, two Gaussian classes with different covariances. Bayes decision boundary shown as a purple dashed line, LDA as black dotted, QDA as green solid.]
Example: Stock Market Data
> plot(lda.fit)
Test Error Rate
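A sketch following the ISLR lab shows where the lda.fit object behind the plot(lda.fit) call above and a test error rate could come from; the Smarket data, the Lag1/Lag2 predictors, and the pre-2005 training split are assumptions taken from the textbook rather than from this slide:

library(MASS)
library(ISLR)   # Smarket data

train   <- Smarket$Year < 2005
lda.fit <- lda(Direction ~ Lag1 + Lag2, data = Smarket, subset = train)
plot(lda.fit)   # histograms of the linear discriminant scores by class

lda.pred <- predict(lda.fit, Smarket[!train, ])
mean(lda.pred$class != Smarket$Direction[!train])   # test error rate on 2005 data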
