Session 16: Discriminant Analysis


Linear Discriminant Analysis

Dr. Rajiv Kumar


IIM Kashipur

Note: The content used in this PPT has been compiled from various sources.
Introduction

Linear discriminant analysis (LDA) is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor or independent variables are continuous or interval in nature.
Introduction…

The objectives of discriminant analysis are as follows:

• Development of discriminant functions, or linear combinations of the predictor or independent variables, that best discriminate between the categories of the criterion or dependent variable (groups).
• Examination of whether significant differences exist among the groups in terms of the predictor variables.
• Determination of which predictor variables contribute most to the intergroup differences.
• Classification of cases to one of the groups based on the values of the predictor variables.
• Evaluation of the accuracy of classification.
Introduction…

• When the criterion variable has two categories, the technique is known as two-group discriminant analysis.
• When three or more categories are involved, the technique is referred to as multiple discriminant analysis.
• The main distinction is that in the two-group case it is possible to derive only one discriminant function, whereas in multiple discriminant analysis more than one function may be computed. In general, with G groups and k predictors, it is possible to estimate up to the smaller of G - 1 and k discriminant functions (see the sketch after this list).
• The first function has the highest ratio of between-groups to within-groups sum of squares. The second function, uncorrelated with the first, has the second highest ratio, and so on. However, not all the functions may be statistically significant.
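
For example, with the iris data used later in this deck, G = 3 species and k = 4 predictors, so at most min(3 - 1, 4) = 2 discriminant functions can be estimated. A minimal sketch with R's built-in iris data set (my addition, using MASS::lda) confirms this:

library(MASS)
# iris: G = 3 groups, k = 4 predictors => min(G - 1, k) = 2 functions
m <- lda(Species ~ ., data = iris)
ncol(m$scaling)   # number of discriminant functions: 2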
Geometric Interpretation

Figure: A Geometric Interpretation of Two-Group Discriminant Analysis
Discriminant Analysis Model

The discriminant analysis model involves linear combinations of the following form:

D = b0 + b1X1 + b2X2 + b3X3 + . . . + bkXk

where:
D = discriminant score
b's = discriminant coefficients or weights
X's = predictor or independent variables

• The coefficients, or weights (b), are estimated so that the groups differ as much as possible on the values of the discriminant function.
• This occurs when the ratio of between-group sum of squares to within-group sum of squares for the discriminant scores is at a maximum.
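
To make the linear combination concrete, here is a minimal sketch with purely hypothetical weights and predictor values (none of these numbers come from the slides):

b0 <- -2.0                      # hypothetical constant term
b  <- c(X1 = 0.5, X2 = 1.2)     # hypothetical weights b1, b2
x  <- c(X1 = 3.0, X2 = 1.5)     # one case's predictor values (hypothetical)
D  <- b0 + sum(b * x)           # D = b0 + b1*X1 + b2*X2
D                               # the case's discriminant score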
Statistics Associated with Discriminant Analysis

Canonical correlation: Canonical correlation measures the extent of association between the discriminant scores and the groups. It is a measure of association between the single discriminant function and the set of dummy variables that define the group membership.

Centroid: The centroid is the mean value of the discriminant scores for a particular group; there are as many centroids as there are groups, one for each group. The means for a group on all the functions are the group centroids.

Classification matrix: Sometimes also called the confusion or prediction matrix, the classification matrix contains the number of correctly classified and misclassified cases.
Statistics Associated with Discriminant Analysis…

Discriminant function coefficients: The discriminant function coefficients (unstandardized) are the multipliers of the variables when the variables are in the original units of measurement.

Discriminant scores: The unstandardized coefficients are multiplied by the values of the variables. These products are summed and added to the constant term to obtain the discriminant scores.

Eigenvalue: For each discriminant function, the eigenvalue is the ratio of between-group to within-group sums of squares. Large eigenvalues imply superior functions.
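
As a hedged illustration: once an LDA model has been fitted with MASS::lda (as in the R example later in this deck), the squares of the singular values stored in model$svd reflect each function's between-group to within-group variance ratio, so their relative sizes show each function's share of the discriminating power:

ev <- model$svd^2   # squared singular values, one per discriminant function
ev / sum(ev)        # each function's share ('proportion of trace' in the lda printout)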
Statistics Associated with Discriminant Analysis…

Standardized discriminant function coefficients: The standardized discriminant function coefficients are the discriminant function coefficients used as the multipliers when the variables have been standardized to a mean of 0 and a variance of 1.

Structure correlations: Also referred to as discriminant loadings, the structure correlations represent the simple correlations between the predictors and the discriminant function.

Total correlation matrix: If the cases are treated as if they were from a single sample and the correlations computed, a total correlation matrix is obtained.

Wilks' λ: Sometimes also called the U statistic, Wilks' λ for each predictor is the ratio of the within-group sum of squares to the total sum of squares. Its value varies between 0 and 1. Large values of λ (near 1) indicate that the group means do not seem to be different; small values of λ (near 0) indicate that the group means seem to be different.
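
As a hedged sketch, an overall Wilks' λ for the iris example used later can be obtained from a one-way MANOVA on the same predictors (this uses base R's manova, not shown in the original slides; df is the data frame loaded in the R example below):

fit <- manova(cbind(Sepal.Length, Sepal.Width, Petal.Length, Petal.Width) ~ Species, data = df)
summary(fit, test = "Wilks")   # small lambda => group means appear to differ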
Conducting Discriminant Analysis

If N1 = N2: Cut-off Point = (Centroid1 + Centroid2)/2

Otherwise: Cut-off Point = (N1 x Centroid2 + N2 x Centroid1)/(N1 + N2)

where N1, N2 = number of training cases in group 1 and group 2, respectively
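
A minimal sketch of this rule with hypothetical centroids and group sizes (the values are illustrative only):

centroid1 <- -1.2; n1 <- 40   # hypothetical group 1 centroid and size
centroid2 <-  0.9; n2 <- 60   # hypothetical group 2 centroid and size
if (n1 == n2) {
  cutoff <- (centroid1 + centroid2) / 2
} else {
  cutoff <- (n1 * centroid2 + n2 * centroid1) / (n1 + n2)
}
cutoff   # cases scoring above this point are assigned to group 2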
LDA in R (1 of 3)

library(readxl)   # to read the Excel file
library(MASS)     # provides lda()
df <- read_excel("E:/BC2/IrisData.xlsx")
#print(df)
model <- lda(Species ~ Sepal.Length + Sepal.Width + Petal.Length + Petal.Width, data = df)
model
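
If the Excel file is not at hand, the same model can be fitted from R's built-in iris data set; assuming the spreadsheet mirrors it (an assumption on my part, based on the identical column names), this one line is equivalent:

model <- lda(Species ~ Sepal.Length + Sepal.Width + Petal.Length + Petal.Width, data = iris)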
LDA in R (2 of 3)

Confusion matrix and Hit Ratio (Accuracy) of the model:

predicted_class <- predict(model, newdata = df)$class
predicted_class
# Here we consider training data. One can use test data.
table(df$Species, predicted_class)   # Confusion Matrix

Output (Confusion Matrix):

             setosa versicolor virginica
setosa           50          0         0
versicolor        0         48         2
virginica         0          1        49

Hit Ratio (Accuracy) = Correct Predictions / Total Predictions
                     = 147/(147 + 3)
                     = 98%
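
The hit ratio can also be computed directly from the predictions (a one-line addition, not in the original slides):

mean(predicted_class == df$Species)   # 0.98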
LDA in R (3 of 3)

Predicting a case: Sepal.Length = 5.8, Sepal.Width = 2.7, Petal.Length = 4.1, Petal.Width = 1

l1 <- list("Sepal.Length" = 5.8, "Sepal.Width" = 2.7, "Petal.Length" = 4.1, "Petal.Width" = 1)
predicted_class <- predict(model, l1)$class   # One can use a data frame instead of a list
predicted_class

Output: the predicted class for this case.
Logistic Regression vs. Discriminant Analysis

1. Logistic regression: no assumption of MV normality. Discriminant analysis: assumes MV normality.
2. Logistic regression: can accommodate large numbers of predictors more easily. Discriminant analysis: a large number of predictors violates MV normality and cannot be accommodated as easily.
3. Logistic regression: categorical predictors are OK (e.g., dummy codes). Discriminant analysis: predictors must be continuous, interval level.
4. Logistic regression: less powerful when assumptions are met. Discriminant analysis: more powerful when assumptions are met.
5. Logistic regression: few assumptions, typically met in practice. Discriminant analysis: many assumptions, rarely met in practice.
6. Logistic regression: categorical IVs can be dummy coded. Discriminant analysis: categorical IVs create problems.

MV = Multivariate, IV = Independent Variable
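
As a hedged illustration of the two approaches side by side, the sketch below fits both models to a two-group subset of the built-in iris data (this comparison code is my addition, not from the slides):

# Two-group subset: drop setosa so the response is binary
df2 <- droplevels(subset(iris, Species != "setosa"))
logit_fit <- glm(Species ~ Sepal.Length + Petal.Length,
                 family = binomial, data = df2)                         # logistic regression
lda_fit <- MASS::lda(Species ~ Sepal.Length + Petal.Length, data = df2) # discriminant analysis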
Primary Scales of Measurement

Nominal
  Basic characteristics: numbers identify and classify objects.
  Examples: Social Security numbers, numbering of football players, brand numbers, store types, gender (male, female).
  Permissible statistics: percentages, mode.

Ordinal
  Basic characteristics: numbers indicate the relative positions of the objects but not the magnitude of differences between them.
  Examples: quality rankings, rankings of teams in a tournament, preference rankings, market position, social class, satisfaction, intention.
  Permissible statistics: percentile, median.

Interval
  Basic characteristics: differences between objects can be compared; the zero point is arbitrary.
  Examples: temperature (Fahrenheit, centigrade), attitudes, opinions, index numbers.
  Permissible statistics: range, mean, standard deviation.

Ratio
  Basic characteristics: the zero point is fixed; ratios of scale values can be computed.
  Examples: length, weight, age, income, costs, sales, market shares.
  Permissible statistics: geometric mean, harmonic mean.
