Reference Material: LDA

Linear discriminant analysis (LDA) is a technique used for classification and dimensionality reduction. It determines linear combinations of features that separate classes best by maximizing between-class variance while minimizing within-class variance. LDA addresses limitations of logistic regression like handling multi-class classification and performing better with well-separated classes. While LDA is supervised, principal component analysis is unsupervised and aims to capture maximum variance regardless of class labels.


LINEAR DISCRIMINANT ANALYSIS

An in-depth look at LDA with emphasis on Key Concepts & Working Procedure
Contents -

1. Overview
2. Key Concepts & Terminologies
3. LDA Implementation
4. Conclusion
5. Use Cases of LDA
6. Further Reading

Overview –
• In 1936, the statistician Ronald Fisher introduced dichotomous discriminant analysis in "The use of multiple measurements in taxonomic problems", which was later generalized into Linear Discriminant Analysis. LDA went on to become a common method in pattern recognition and machine learning. The core idea behind LDA is to determine a linear combination of features that is able to discriminate between two (or more) classes. This linear combination can also be used for dimensionality reduction.

• This means LDA serves a purpose similar to both logistic regression (for classification) and Principal Component Analysis (for dimensionality reduction). Let us briefly compare LDA with each of these two techniques.

LDA vs Logistic regression
• LDA and logistic regression are both multivariate statistical methods used to determine the relationship between a set of independent variables and a categorical dependent variable.

• In logistic regression, the probability of a data point belonging to a class is obtained, or, more precisely, the odds of the outcome are determined (the ratio of the probability of the event occurring to the probability of it not occurring).

• In LDA, orthogonal (mutually perpendicular) discriminant functions are estimated so as to maximize the difference between the group means (class labels) while minimizing the standard deviation within the groups. The predicted class for a data point is therefore the one whose corresponding linear function takes the highest value.
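To make the contrast concrete, here is a minimal sketch (not part of the original material) that fits both classifiers on a synthetic multi-class dataset using scikit-learn; the dataset and parameter choices are illustrative assumptions only.

from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic three-class problem, purely for illustration
X, y = make_classification(n_samples=500, n_features=5, n_informative=3,
                           n_classes=3, n_clusters_per_class=1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

lda = LinearDiscriminantAnalysis().fit(X_train, y_train)
logreg = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Both expose predict()/predict_proba(); LDA picks the class whose linear
# discriminant function is highest, logistic regression the class with the
# highest modeled probability.
print("LDA accuracy:   ", lda.score(X_test, y_test))
print("LogReg accuracy:", logreg.score(X_test, y_test))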

Limitations of Logistic Regression
Logistic regression, though a very powerful classification algorithm, has certain limitations.

I Multi-class classification –
Logistic regression is primarily intended to be used as a binary classifier, although it can be extended to multi-class classification.

II Poor performance with small sample size –
If the sample size is small, the estimated parameters can be highly unreliable. A reasonably large number of observations (a decent sample size) is necessary for a stable logistic regression model.

III Poor performance with well separated classes –
Although it may sound counter-intuitive, logistic regression performs poorly when the two classes are well separated: if the features separate the classes perfectly, the estimated coefficients diverge to infinity.
For further clarification, please refer to the article on "Quasi Separation on Logistic Regression".

All three of the above limitations of logistic regression are usually handled well by LDA, which can therefore be used as an alternative classifier in such situations.
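As an illustration of the separation issue, the following hedged sketch (an assumed setup, not from the original text) fits logistic regression with weaker and weaker L2 regularization on perfectly separated one-dimensional data; the fitted coefficient keeps growing because the unpenalized solution diverges, while LDA's coefficient is fixed by the class means and pooled covariance.

import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-5, 1, size=(50, 1)),   # class 0, well to the left
               rng.normal(+5, 1, size=(50, 1))])  # class 1, well to the right
y = np.array([0] * 50 + [1] * 50)

# Larger C means a weaker penalty; with perfect separation the coefficient grows
# without bound as the penalty is relaxed.
for C in (1.0, 100.0, 10000.0):
    coef = LogisticRegression(C=C, max_iter=10000).fit(X, y).coef_.ravel()[0]
    print(f"LogisticRegression(C={C:g}) coefficient: {coef:.2f}")

# LDA's coefficient is computed directly from the data and stays well defined.
print("LDA coefficient:", LinearDiscriminantAnalysis().fit(X, y).coef_.ravel()[0])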

LDA versus PCA -
Both Linear Discriminant Analysis (LDA) and Principal Component Analysis (PCA)
are linear transformation methods which closely relate to each other.


[Figure omitted] Source – Linear Discriminant Analysis – Bit by Bit, by Sebastian Raschka

• The major difference between LDA and PCA is that LDA is a supervised learning technique whereas PCA is an unsupervised machine learning algorithm. In practice, this means PCA aims to find the dimensions that capture the maximum variance irrespective of the class labels (intuitively, it treats the entire data set as a single class).
• LDA, on the other hand, takes into account the different classes present in the data and finds the dimensions that capture both maximum variance and maximum separability among the classes.
• Although one might expect LDA to always outperform PCA since it uses the class labels directly, studies (PCA versus LDA, Aleix M. Martinez et al., 2001) have shown that this does not necessarily hold when only very few samples are available for some classes.
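The following short sketch (an illustrative example, not prescribed by the original material) projects the Iris data to two dimensions with both methods using scikit-learn, to make the supervised/unsupervised distinction concrete.

from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)

# PCA ignores y entirely: it keeps the directions of maximum overall variance.
X_pca = PCA(n_components=2).fit_transform(X)

# LDA uses y: it keeps the directions that best separate the three species.
X_lda = LinearDiscriminantAnalysis(n_components=2).fit_transform(X, y)

print("PCA projection shape:", X_pca.shape)   # (150, 2)
print("LDA projection shape:", X_lda.shape)   # (150, 2)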

Key Concepts and a few terminologies –
Before diving headfirst into the working of the LDA algorithm, let us discuss a few key concepts that are crucial for a holistic understanding of LDA.
1. Bayes’ Theorem –
Bayes’ theorem relates the conditional and marginal probabilities of Events A and B:
P(A|B) = P(B|A) · P(A) / P(B)

Where,
• P(A) is the prior probability or marginal probability of A. It is "prior" in the sense that it does not take into account any information about B.
• P(A|B) is the conditional probability of A, given B. It is also called the posterior probability because it is
derived from or depends upon the specified value of B.
• P(B|A) is the conditional probability of B given A.
• P(B) is the prior or marginal probability of B, and acts as a normalizing constant.
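As a tiny numeric illustration (the probability values below are made up for the example), the posterior follows directly from the three quantities above:

p_A = 0.3          # prior P(A)
p_B_given_A = 0.8  # likelihood P(B|A)
p_B = 0.5          # marginal P(B), the normalizing constant

p_A_given_B = p_B_given_A * p_A / p_B   # posterior P(A|B)
print(p_A_given_B)                      # 0.48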

2. Discriminant functions -
• Prediction using Bayes' theorem assumes that the forms of the underlying probability distributions are known, and the training samples are used only to estimate the values of their parameters. Discriminant functions, on the other hand, do not require knowledge of the forms of the underlying probability distributions.
• To find the linear discriminant functions, we minimize a criterion function, the training error, i.e. the average loss incurred in classifying the set of training samples.

g(x) = wᵀx + w0

Where g(x) = Discriminant function
w = Weight vector
w0 = Bias or threshold weight
The bias and weight vector are initialized with small random numbers (similar to weight initialization for neural networks), and these values are updated during minimization of the training error.

The linear discriminant function divides the feature space by a hyperplane
decision surface. The orientation of the surface is determined by the normal
vector w, and the location of the surface is determined by the bias w0.

How predictions are made using the discriminant functions -


For a discriminant function of the form presented before, a binary classifier
implements the following decision rule:
1. If the value of the discriminant function at x, i.e. g(x), is greater than 0, then assign x to class I.
2. Similarly, if g(x) < 0, then assign it to class II.
In the event that g(x) = 0, x can ordinarily be assigned to either of the two classes, or can be left undefined.
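A minimal sketch of this decision rule, with an arbitrary (not fitted) weight vector and bias chosen purely for illustration:

import numpy as np

w = np.array([0.7, -1.2])   # weight vector (normal to the decision hyperplane)
w0 = 0.5                    # bias / threshold weight

def predict(x):
    g = np.dot(w, x) + w0   # g(x) = w.x + w0
    if g > 0:
        return "class I"
    elif g < 0:
        return "class II"
    return "on the boundary (g(x) = 0)"

print(predict(np.array([2.0, 0.5])))   # g = 1.3  -> class I
print(predict(np.array([0.0, 1.0])))   # g = -0.7 -> class II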

3. Mahalanobis Distance –
The Mahalanobis distance (MD) is the distance between two points in multivariate space. In a regular Euclidean space, variables (e.g. x, y, z) are represented by axes drawn at right angles to each other, and the distance between any two points can be measured with a ruler.
For uncorrelated variables, the Euclidean distance equals the MD. However, if two or more variables are correlated, the axes are no longer at right angles, and such measurements become impossible with a ruler. In addition, with more than three variables the points cannot be plotted in regular 3D space at all. The MD solves this measurement problem, as it measures distances between points even when the variables are correlated and there are many of them.

MD(µ1, µ2, Σ) = (µ1 - µ2)ᵀ Σ⁻¹ (µ1 - µ2)   [where µ1 and µ2 are two population means with a common dispersion matrix Σ]
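A short NumPy sketch of this formula, with made-up means and dispersion matrix (note that some libraries, e.g. SciPy, report the square root of this quantity as the Mahalanobis distance):

import numpy as np

mu1 = np.array([2.0, 3.0])
mu2 = np.array([4.0, 1.0])
sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])                 # common dispersion (covariance) matrix

diff = mu1 - mu2
md_sq = diff @ np.linalg.inv(sigma) @ diff     # (mu1 - mu2)^T Sigma^-1 (mu1 - mu2)
print(md_sq)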

LDA Implementation
Assumptions in LDA –
• Multivariate normality: The independent variables must follow a normal distribution.
• Homogeneity of variance/covariance (homoscedasticity): Variances among group
variables are the same across levels of predictors.
• Multicollinearity: Predictive power can decrease with an increased correlation
between predictor variables.

Preparation of Data for LDA –

The LDA algorithm makes the assumptions about the data discussed above, hence it is a good idea to prepare the data before applying LDA, whether for classification or for dimensionality reduction, although past studies suggest that discriminant analysis is relatively robust to slight violations of these assumptions.

Some of the preparatory steps are as follows –
1. Variable Transformation -
LDA assumes a normal distribution of the input variables. Hence different transformations (e.g. log and square root) can be applied to the data to make it more normal (or near normal).


2. Standardization –
Another LDA assumption is that the input variables have the same variance. Thus, standardizing the variables to have a mean of 0 and a standard deviation of 1 is advised. This can be done when we are looking for the standardized coefficients in the linear discriminant model.

3. Outlier Treatment –
Linear models are very sensitive to outliers, but whether a high-valued observation is treated as an outlier depends greatly on the particular use case. Once that decision is made, outliers can be treated to avoid skewing the basic statistics.
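Putting these preparatory steps together, here is a hedged sketch of one possible preprocessing pipeline before LDA (the transformer choices and synthetic data are assumptions for illustration, not a prescribed recipe):

import numpy as np
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler

X, y = make_classification(n_samples=300, n_features=4, random_state=0)
X = np.exp(X)  # make the features strictly positive and right-skewed for the demo

pipeline = make_pipeline(
    FunctionTransformer(np.log1p),   # variable transformation toward normality
    StandardScaler(),                # mean 0, standard deviation 1
    LinearDiscriminantAnalysis(),
)
pipeline.fit(X, y)
print("Training accuracy:", pipeline.score(X, y))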

Mathematical formulation of the LDA

LDA can be derived from a simple probabilistic model which models the class-conditional distribution of the data P(x | y = k) for each class k. Predictions can then be obtained by applying Bayes' rule to each sample x:

P(y = k | x) = P(x | y = k) · P(y = k) / P(x)     (eq. 1)

where all the expressions have their usual meaning (as discussed earlier).

And we select the class k which maximizes this posterior probability.

Now, P(x | y = k) is modeled as a multivariate Gaussian distribution with density:

P(x | y = k) = 1 / ((2π)^(d/2) |Σ|^(1/2)) · exp( -½ (x - µk)ᵀ Σ⁻¹ (x - µk) )     (eq. 2)

where d is the number of features and Σ is the shared covariance matrix. Substituting the probability distribution (eq. 2) into Bayes' rule (eq. 1), taking the logarithm and dropping the terms that do not depend on the class k, we get the discriminant score:

δk(x) = -½ (x - µk)ᵀ Σ⁻¹ (x - µk) + log P(y = k) + constant     (eq. 3)

• The term (x - µk)ᵀ Σ⁻¹ (x - µk) corresponds to the Mahalanobis distance between the sample and the class mean. The Mahalanobis distance tells how close x is to the mean µk, while also accounting for the variance of each feature.

• We can thus interpret LDA as assigning x to the class whose mean is the closest in terms of Mahalanobis distance, while also accounting for the class prior probabilities.
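A minimal from-scratch sketch of this interpretation (the means, shared covariance and priors below are hard-coded illustrative values; a real implementation would estimate them from training data):

import numpy as np

def lda_predict(x, means, cov, priors):
    cov_inv = np.linalg.inv(cov)
    scores = []
    for mu, prior in zip(means, priors):
        diff = x - mu
        maha = diff @ cov_inv @ diff          # (x - mu_k)^T Sigma^-1 (x - mu_k)
        scores.append(-0.5 * maha + np.log(prior))
    return int(np.argmax(scores))             # class with the highest score

means = [np.array([0.0, 0.0]), np.array([3.0, 3.0])]
cov = np.eye(2)                               # shared covariance matrix
priors = [0.5, 0.5]

print(lda_predict(np.array([0.5, 0.2]), means, cov, priors))  # -> 0
print(lda_predict(np.array([2.8, 3.1]), means, cov, priors))  # -> 1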

Let us take a look at how P(k | x) can be obtained for a continuous variable x.

P(k) is the prior probability that the native class of x is k, and it has to be specified by the user. By default, all classes usually receive an equal prior P(k) = 1/number_of_classes; alternatively, P(k) can be the count of occurrences of class k divided by the total number of occurrences of all the classes.

P(x | k) is the probability of observing point x given that its class is k. The main issue in finding a value for this term is that the variables are continuous, not discrete. Hence, we need to compute the Probability Density Function (PDF).

Once we have PDF(x | k) for each of the classes, we can normalize as below.
Consider two classes, k and m. Then,
P(k | x) = P(k) · PDF(x | k) / [ P(k) · PDF(x | k) + P(m) · PDF(x | m) ]
and
P(m | x) = P(m) · PDF(x | m) / [ P(k) · PDF(x | k) + P(m) · PDF(x | m) ]

Each x is substituted into the above two equations, and the point is classified to the class for which P(Class | Data) is the highest.
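For a single continuous variable, this two-class normalization can be sketched with Normal densities standing in for PDF(x | k); the means, standard deviations and priors below are made-up illustrative values:

from scipy.stats import norm

p_k, p_m = 0.6, 0.4                 # class priors P(k), P(m)
pdf_k = norm(loc=0.0, scale=1.0)    # PDF(x | k)
pdf_m = norm(loc=3.0, scale=1.0)    # PDF(x | m)

x = 1.2
num_k = p_k * pdf_k.pdf(x)
num_m = p_m * pdf_m.pdf(x)
post_k = num_k / (num_k + num_m)    # P(k | x)
post_m = num_m / (num_k + num_m)    # P(m | x)

print(post_k, post_m)               # x goes to the class with the larger posterior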

Limitations of LDA –
1. LDA is a parametric method since it assumes unimodal Gaussian likelihoods. If the distributions are significantly non-Gaussian, the LDA projections will not be able to preserve any complex structure of the data, which may be needed for classification.

2. LDA will fail when the discriminatory information lies not in the means but in the variances of the classes.

3. LDA produces at most C - 1 feature projections, where C is the number of classes. If the classification error estimates establish that more features are needed, some other method must be employed to provide those additional features.

Conclusion:
As we have discussed, LDA is a very useful linear algorithm which can be used for both dimensionality reduction and classification problems. LDA can be seen as an alternative to PCA for dimensionality reduction, and to logistic regression for classification, in situations where those algorithms do not perform satisfactorily. The procedure of LDA is straightforward and can be summarized as follows –
1) Find the variance (also called scatter) within and between the classes.
2) Find the linear combinations which maximize the between-class variance and minimize the within-class variance.
3) Transform the data as per the new linear combinations (hyperplanes). Dimensionality reduction is achieved at this step.
4) Predict the class for each data point using the Bayesian approach on the reduced dimensions (top k dimensions).
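A compact end-to-end sketch of this procedure using scikit-learn (the Wine dataset is an illustrative choice, not one used in the original material):

from sklearn.datasets import load_wine
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import train_test_split

X, y = load_wine(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

lda = LinearDiscriminantAnalysis(n_components=2)   # at most C - 1 = 2 components
X_train_2d = lda.fit_transform(X_train, y_train)   # steps 1-3: dimensionality reduction
y_pred = lda.predict(X_test)                       # step 4: Bayes-rule classification

print("Reduced training shape:", X_train_2d.shape)
print("Test accuracy:", lda.score(X_test, y_test))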

Use Cases of LDA:
1. Altman's Z-score model
In 1968, the American finance professor Edward Altman developed a model to predict the chances of a business going bankrupt within the next two years. The coefficients used in the model were calculated using Fisher's discriminant analysis.
The formula for the Z-score bankruptcy model is as follows:
Z = 0.012·X1 + 0.014·X2 + 0.033·X3 + 0.006·X4 + 0.999·X5
where X1, X2, X3 and X4 are in percentage points, and
X1 = working capital / total assets
X2 = retained earnings / total assets
X3 = earnings before interest and taxes / total assets
X4 = market value of equity / total liabilities
X5 = sales / total assets
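A small illustrative computation of the Z-score (the ratio values below are made up for the example):

def altman_z(x1, x2, x3, x4, x5):
    # Coefficients from the Z-score formula above; X1-X4 in percentage points
    return 0.012 * x1 + 0.014 * x2 + 0.033 * x3 + 0.006 * x4 + 0.999 * x5

# e.g. working capital/total assets = 25%, retained earnings/total assets = 30%, ...
z = altman_z(x1=25.0, x2=30.0, x3=12.0, x4=150.0, x5=1.1)
print(round(z, 3))  # in Altman's original study, scores above roughly 2.99 fell in the "safe" zone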

2. Facial Recognition:
The aim of a typical facial recognition task is to identify the faces represented by
a very large number of pixel values, with each pixel serving as a feature. LDA is
often used to reduce the number of features to a more manageable number for
further classification. The linear combinations obtained using Fisher’s linear
discriminant are called Fisher faces.

3. Medical Field:
Linear discriminant analysis (LDA) is used to classify a patient's disease state as mild, moderate, or severe based upon the patient's various parameters and the medical treatment they are undergoing. This helps doctors decide whether to intensify or reduce the pace of treatment.

Further Reading:
❖ An Introduction to Statistical Learning: with Applications in R, Chapter 4,
Page 138.
❖ Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning, Chapter 8
❖ Applied Predictive Modeling, Chapter 12, Page 287
❖ Linear Discriminant Analysis bit by bit (examples with Python)

❖ Linear Discriminant Analysis (includes a link to an interactive LDA interface)


❖ The mathematics behind how sklearn performs Linear Discriminant Analysis.

