04 Chap04 ClassificationMethods LDA QDA

1. Linear discriminant analysis (LDA) and quadratic discriminant analysis (QDA) are classification methods that estimate the Bayes classifier to discriminate between categories.
2. LDA assumes normal distributions with a common variance for each category, while QDA allows different variances.
3. LDA performs well when the variances are similar, while QDA is better when variances differ; however, QDA requires more data to accurately estimate the variances.


IOM 530: Intro. to Statistical Learning

CLASSIFICATION METHODS
Chapter 04 (part 02)

LINEAR DISCRIMINANT ANALYSIS (LDA) &

QUADRATIC DISCRIMINANT ANALYSIS (QDA)



Outline
• Overview of LDA
• Why not Logistic Regression?
• Estimating Bayes’ Classifier
• LDA Example with One Predictor (p = 1)
• LDA Example with More than One Predictor (p > 1)
• LDA on Default Data
• Overview of QDA
• Comparison between LDA and QDA

Linear Discriminant Analysis


• LDA undertakes the same task as Logistic Regression: it
classifies observations into the categories of a categorical
response, for example:
• Making profit or not
• Buy a product or not
• Satisfied customer or not
• Political party voting intention

Why Linear? Why Discriminant?


• LDA involves determining a linear equation (just like
linear regression) that predicts which group a case
belongs to:

D = v1·X1 + v2·X2 + … + vp·Xp + a

• D: discriminant function (score)
• v: discriminant coefficient or weight for the variable
• X: predictor variable
• a: constant
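As a tiny illustrative sketch (not from the slides; the weights and values below are made up), the discriminant score is just a weighted sum of the predictors plus a constant:

```python
import numpy as np

# Hypothetical discriminant coefficients v, predictor values X, and constant a
v = np.array([0.8, -0.3, 1.2])   # weights: larger |v| -> stronger discriminator
X = np.array([2.0, 5.0, 1.0])    # one case's predictor values
a = -0.5                         # constant term

D = v @ X + a                    # D = v1*X1 + v2*X2 + v3*X3 + a
print(D)                         # ~0.8; compare D against a cutoff to assign the group
```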

Purpose of LDA
• Choose the v’s to maximize the distance between the
means of the different categories

• Good predictors tend to have large v’s (weight)

• We want to discriminate between the different categories

• Think of a food recipe: changing the proportions (weights)
of the ingredients will change the characteristics of the
finished cakes. Hopefully that will produce different types
of cake!

Assumptions of LDA
• The observations are a random sample

• Each predictor variable is normally distributed within each class



Why not Logistic Regression?


• Logistic regression is unstable when the classes are well
separated

• When n is small and the distribution of the predictors X is
approximately normal, LDA is more stable than Logistic
Regression

• LDA is more popular when we have more than two
response classes

Bayes’ Classifier
• Bayes’ classifier is the gold standard. Unfortunately, it is
unattainable in practice because the true distribution of Y
given X is unknown.

• So far, we have estimated it with two methods:


• KNN classifier
• Logistic Regression

Estimating Bayes’ Classifier


• With Logistic Regression we modeled the probability of Y
being from the kth class (in the two-class case) as

Pr(Y = k | X = x) = e^(β0 + β1x) / (1 + e^(β0 + β1x))

• However, Bayes’ Theorem states

p_k(x) = Pr(Y = k | X = x) = π_k f_k(x) / Σ_{l=1}^{K} π_l f_l(x)

• π_k: probability of coming from class k (prior probability)
• f_k(x): density function for X given that X is an observation from class k
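As a quick numeric illustration of Bayes’ theorem (a sketch with made-up priors and class densities, not from the slides):

```python
import numpy as np
from scipy.stats import norm

# Made-up priors and class-conditional normal densities (common sigma, as LDA assumes)
priors = np.array([0.7, 0.3])        # pi_1, pi_2
means = np.array([-1.25, 1.25])      # mu_1, mu_2
sigma = 1.0

x = 0.0                              # observation to classify
f = norm.pdf(x, loc=means, scale=sigma)        # f_k(x) for each class
posterior = priors * f / np.sum(priors * f)    # p_k(x) via Bayes' theorem
print(posterior, "-> assign to class", posterior.argmax() + 1)
```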

Estimate π_k and f_k(x)
• We can estimate π_k and f_k(x) to compute p_k(x)

• The most common model for f_k(x) is the Normal Density:

f_k(x) = 1/(√(2π) σ_k) · exp(−(x − μ_k)² / (2σ_k²))

• Using this density, we only need to estimate three
quantities (μ_k, σ², and π_k) to compute p_k(x)

Use Training Data set for Estimation


• The mean μ_k is estimated by the average of all
training observations from the kth class.
• The variance σ² is estimated as a weighted average
of the sample variances of all K classes.
• And π_k is estimated as the proportion of the training
observations that belong to the kth class: π̂_k = n_k / n.
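A minimal sketch of these estimates for one predictor, assuming x is a 1-D NumPy array of observations and y holds integer class labels (the names and data are illustrative):

```python
import numpy as np

def lda_estimates(x, y):
    """Estimate the class means, pooled variance, and priors from 1-D training data."""
    classes = np.unique(y)
    n, K = len(x), len(classes)
    mu = np.array([x[y == k].mean() for k in classes])   # mu_hat_k: class averages
    pi = np.array([np.mean(y == k) for k in classes])    # pi_hat_k = n_k / n
    # Pooled variance: within-class sums of squares divided by n - K
    ss = sum(((x[y == k] - x[y == k].mean()) ** 2).sum() for k in classes)
    return mu, ss / (n - K), pi

# Example with made-up data
x = np.array([1.0, 1.2, 0.8, 3.1, 2.9, 3.0])
y = np.array([0, 0, 0, 1, 1, 1])
print(lda_estimates(x, y))
```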

A Simple Example with One Predictor (p =1)


• Suppose we have only one predictor (p = 1)
• Two normal density functions, f1(x) and f2(x), represent two
distinct classes
• The two density functions overlap, so there is some
uncertainty about the class to which an observation with
an unknown class belongs
• The dashed vertical line represents Bayes’ decision
boundary

Apply LDA
• LDA starts by assuming that each class has a normal
distribution with a common variance

• The mean μ_k and the common variance σ² are estimated
from the training data

• Finally, Bayes’ theorem is used to compute p_k(x), and the
observation is assigned to the class with the maximum
probability among all K probabilities
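A small simulation in the spirit of this setup (a sketch, not the course’s code; the class means ±1.25 and unit common variance are assumptions):

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(0)
# 20 training observations per class, two normal populations with common variance
x = np.concatenate([rng.normal(-1.25, 1.0, 20), rng.normal(1.25, 1.0, 20)])
y = np.array([0] * 20 + [1] * 20)

lda = LinearDiscriminantAnalysis().fit(x.reshape(-1, 1), y)

# With equal priors and a common variance, the Bayes boundary is (mu0 + mu1)/2 = 0
grid = np.linspace(-4, 4, 801).reshape(-1, 1)
lda_boundary = grid[lda.predict(grid) == 1].min()   # smallest x classified as class 1
print(f"LDA boundary ~ {lda_boundary:.2f} (Bayes boundary: 0)")
```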

• 20 observations were drawn from each of the two classes


• The dashed vertical line is the Bayes’ decision boundary
• The solid vertical line is the LDA decision boundary
• Bayes’ error rate: 10.6%
• LDA error rate: 11.1%
• Thus, LDA is performing pretty well!

An Example When p > 1


• If X is multidimensional (p > 1), we use exactly the same
approach except the density function f(x) is modeled using
the multivariate normal density
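For instance, evaluating a bivariate normal density f_k(x) with scipy (the mean vector and covariance matrix here are hypothetical):

```python
import numpy as np
from scipy.stats import multivariate_normal

# Hypothetical class-k parameters for p = 2 predictors
mu_k = np.array([0.0, 1.0])                    # mean vector
Sigma = np.array([[1.0, 0.3],
                  [0.3, 1.0]])                 # covariance matrix (shared under LDA)

x = np.array([0.5, 0.5])
print(multivariate_normal.pdf(x, mean=mu_k, cov=Sigma))   # f_k(x)
```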

• We have two predictors (p = 2)


• Three classes
• 20 observations were generated from each class
• The solid lines are Bayes’ boundaries
• The dashed lines are LDA boundaries

Running LDA on Default Data


• LDA makes 252 + 23 = 275 mistakes on 10,000 predictions
(2.75% misclassification error rate)
• But LDA misses 252/333 = 75.7% of the defaulters!
• Perhaps we shouldn’t use 0.5 as the threshold for predicting
default?

Use 0.2 as Threshold for Default


• Now the total number of mistakes is 235 + 138 = 373
(3.73% misclassification error rate)
• But we now miss only 138/333 = 41.4% of the defaulters
• We can examine the error rates under other thresholds
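A sketch of that threshold experiment on synthetic, Default-like data (everything below is made up for illustration; with the real Default data you would fit on its actual predictors):

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Synthetic stand-in for the Default data: roughly 3% of cases default (hypothetical)
rng = np.random.default_rng(1)
n = 10_000
y = (rng.random(n) < 0.033).astype(int)                 # 1 = defaulter
X = rng.normal(loc=1.5 * y, scale=1.0).reshape(-1, 1)   # defaulters shifted right

lda = LinearDiscriminantAnalysis().fit(X, y)
posterior = lda.predict_proba(X)[:, 1]                  # estimated Pr(default | X)

for t in (0.5, 0.2):
    pred = (posterior >= t).astype(int)
    overall = np.mean(pred != y)                        # overall error rate
    missed = np.mean(pred[y == 1] == 0)                 # fraction of defaulters missed
    print(f"threshold {t}: overall error {overall:.3f}, defaulters missed {missed:.3f}")
```

Lowering the threshold trades a slightly higher overall error rate for catching far more defaulters, exactly the pattern described on this slide.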

Default Threshold Values vs. Error Rates


• Black solid: overall error rate
• Blue dashed: fraction of defaulters missed
• Orange dotted: fraction of non-defaulters incorrectly classified

Quadratic Discriminant Analysis (QDA)


• LDA assumes that every class has the same
variance/covariance matrix
• However, LDA may perform poorly if this assumption is far
from true
• QDA works just like LDA except that it estimates a
separate variance/covariance matrix for each class
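A sketch contrasting the two methods on data whose classes have different covariance matrices (all numbers below are hypothetical):

```python
import numpy as np
from sklearn.discriminant_analysis import (
    LinearDiscriminantAnalysis,
    QuadraticDiscriminantAnalysis,
)

# Two classes with clearly different covariance matrices (made up for illustration)
rng = np.random.default_rng(2)
X0 = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.5], [0.5, 1.0]], size=200)
X1 = rng.multivariate_normal([2.0, 2.0], [[2.0, -0.8], [-0.8, 0.5]], size=200)
X = np.vstack([X0, X1])
y = np.array([0] * 200 + [1] * 200)

for model in (LinearDiscriminantAnalysis(), QuadraticDiscriminantAnalysis()):
    model.fit(X, y)
    print(type(model).__name__, "training accuracy:", round(model.score(X, y), 3))
# When the covariances differ, QDA's quadratic boundary can track the data better
```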

Which is better? LDA or QDA?


• Since QDA allows for different variances among classes,
the resulting boundaries become quadratic

• Which approach is better: LDA or QDA?


• QDA will work best when the variances are very different between
classes and we have enough observations to accurately estimate
the variances
• LDA will work best when the variances are similar among classes
or we don’t have enough data to accurately estimate the variances

Comparing LDA to QDA


• Black dotted: LDA boundary
• Purple dashed: Bayes’ boundary
• Green solid: QDA boundary
• Left: variances of the classes are equal (LDA is a better fit)
• Right: variances of the classes are not equal (QDA is a better fit)

Comparison of Classification Methods


• KNN (Chapter 2)
• Logistic Regression (Chapter 4)
• LDA (Chapter 4)
• QDA (Chapter 4)

Logistic Regression vs. LDA


• Similarity: both Logistic Regression and LDA produce
linear decision boundaries
• Difference: LDA assumes that the observations are drawn
from normal distributions with a common variance in
each class, while logistic regression does not make this
assumption. LDA does better than Logistic Regression
when the normality assumption holds; otherwise logistic
regression can outperform LDA

KNN vs. (LDA and Logistic Regression)


• KNN takes a completely different approach

• KNN is completely non-parametric: no assumptions are
made about the shape of the decision boundary!

• Advantage of KNN: we can expect KNN to dominate
both LDA and Logistic Regression when the decision
boundary is highly non-linear

• Disadvantage of KNN: KNN does not tell us which
predictors are important (there is no table of coefficients)

QDA vs. (LDA, Logistic Regression, and KNN)


• QDA is a compromise between the non-parametric KNN
method and the linear LDA and logistic regression approaches

• If the true decision boundary is:
• Linear: LDA and Logistic Regression outperform
• Moderately non-linear: QDA outperforms
• More complicated: KNN is superior
