
PRINCIPLES OF MACHINE LEARNING
CLASSIFICATION II
ACADEMIC YEAR 2022/2023
QUEEN MARY UNIVERSITY OF LONDON

SOLUTIONS

EXERCISE #1 (SOL): Let's start by plotting the dataset (we will use the symbol X for class A and O for class B).

Both classes overlap. In fact, two samples have the same predictor value but different labels.

• A Gaussian distribution has two parameters, namely the mean µ and the standard deviation σ. The estimator for the mean is:

$$\hat{\mu} = \frac{1}{N}\sum_i x_i$$

The estimator for the standard deviation can be obtained as the square root of the estimator of the variance σ². There are two estimators for the variance, one biased and one unbiased:

$$\hat{\sigma}^2 = \frac{1}{N}\sum_i (x_i - \hat{\mu})^2 \;\;\text{(biased)} \qquad\qquad \hat{\sigma}^2 = \frac{1}{N-1}\sum_i (x_i - \hat{\mu})^2 \;\;\text{(unbiased)}$$

Using the estimator of the mean and the square root of the unbiased estimator of the
variance we get for each class:

$$\mu_A = (-2 - 1 + 0 + 1 + 2)/5 = 0$$
$$\sigma_A = \sqrt{\frac{(-2-0)^2 + (-1-0)^2 + (0-0)^2 + (1-0)^2 + (2-0)^2}{4}} = 1.58$$
$$\mu_B = (1 + 2 + 3 + 4 + 5)/5 = 3$$
$$\sigma_B = \sqrt{\frac{(1-3)^2 + (2-3)^2 + (3-3)^2 + (4-3)^2 + (5-3)^2}{4}} = 1.58$$

Note that both classes have the same standard deviation. If we plot them, we get:

[Figure: the two fitted Gaussian class densities plotted over the predictor axis, from -3 to 6]
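As a quick check, the estimates (and the plot) can be reproduced with a few lines of Python. This is a minimal sketch using the samples listed in the exercise; the variable names are illustrative.

```python
import numpy as np
from scipy.stats import norm
import matplotlib.pyplot as plt

x_A = np.array([-2, -1, 0, 1, 2])
x_B = np.array([1, 2, 3, 4, 5])

# Sample means and unbiased standard deviations (ddof=1 uses the N-1 denominator)
mu_A, sigma_A = x_A.mean(), x_A.std(ddof=1)
mu_B, sigma_B = x_B.mean(), x_B.std(ddof=1)
print(mu_A, sigma_A)   # 0.0, 1.58...
print(mu_B, sigma_B)   # 3.0, 1.58...

# Plot the two fitted Gaussian densities over the range shown in the figure
x = np.linspace(-3, 6, 200)
plt.plot(x, norm.pdf(x, mu_A, sigma_A), label="class A")
plt.plot(x, norm.pdf(x, mu_B, sigma_B), label="class B")
plt.legend()
plt.show()
```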
• Given a sample xi , the Bayes classifier compares the posterior probabilities P (A|xi ) and
P (B|xi ) to classify it:

$$\frac{P(A|x_i)}{P(B|x_i)} > 1 \;\Rightarrow\; \hat{y}_i = A \qquad\qquad \frac{P(A|x_i)}{P(B|x_i)} < 1 \;\Rightarrow\; \hat{y}_i = B$$

Using Bayes rule, we can express the posterior probabilities in terms of the priors P (A)
and P (B) and the class densities p(x|A) and p(x|B). The class densities are Gaussian
and have the same standard deviation (as in linear discriminant analysis) and the priors
are P (A) = 0.5 and P (B) = 0.5. The classifier is then:

$$\frac{P(A|x_i)}{P(B|x_i)} = \frac{P(A)\,p(x_i|A)}{P(B)\,p(x_i|B)} = \frac{0.5\,p(x_i|A)}{0.5\,p(x_i|B)} = \frac{p(x_i|A)}{p(x_i|B)} > 1 \;\Rightarrow\; \hat{y}_i = A$$
$$\frac{P(A|x_i)}{P(B|x_i)} = \frac{P(A)\,p(x_i|A)}{P(B)\,p(x_i|B)} = \frac{0.5\,p(x_i|A)}{0.5\,p(x_i|B)} = \frac{p(x_i|A)}{p(x_i|B)} < 1 \;\Rightarrow\; \hat{y}_i = B$$

• If the priors are P (A) = 0.1 and P (B) = 0.9 instead and the class densities (also known
as likelihoods) are the same, we get:

$$\frac{0.1\,p(x_i|A)}{0.9\,p(x_i|B)} = \frac{p(x_i|A)}{9\,p(x_i|B)} > 1, \quad\text{i.e.}\quad \frac{p(x_i|A)}{p(x_i|B)} > 9 \;\Rightarrow\; \hat{y}_i = A$$
$$\frac{0.1\,p(x_i|A)}{0.9\,p(x_i|B)} = \frac{p(x_i|A)}{9\,p(x_i|B)} < 1, \quad\text{i.e.}\quad \frac{p(x_i|A)}{p(x_i|B)} < 9 \;\Rightarrow\; \hat{y}_i = B$$
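The decision rule can be sketched in a few lines of Python. This is a minimal sketch, not part of the original solution: the means and standard deviations are the estimates obtained above, and the priors are passed as arguments so that both the 0.5/0.5 and the 0.1/0.9 cases can be tried.

```python
from scipy.stats import norm

mu_A, sigma_A = 0.0, 1.58
mu_B, sigma_B = 3.0, 1.58

def bayes_classify(x, prior_A=0.5, prior_B=0.5):
    # Compare P(A)p(x|A) with P(B)p(x|B); equivalent to the posterior ratio test.
    score_A = prior_A * norm.pdf(x, mu_A, sigma_A)
    score_B = prior_B * norm.pdf(x, mu_B, sigma_B)
    return "A" if score_A > score_B else "B"

print(bayes_classify(1.0))                             # equal priors -> A
print(bayes_classify(1.0, prior_A=0.1, prior_B=0.9))   # skewed priors favour B
```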

EXERCISE #2 (SOL): A Gaussian distribution in a 2D predictor space has two parameters, namely the mean vector µ and the covariance matrix Σ:

$$\mu = \begin{pmatrix} \mu_A \\ \mu_B \end{pmatrix} \qquad\qquad \Sigma = \begin{pmatrix} \Sigma_{AA} & \Sigma_{AB} \\ \Sigma_{BA} & \Sigma_{BB} \end{pmatrix}$$

If the predictors xA and xB are independent, the covariance matrix is diagonal:

$$\Sigma = \begin{pmatrix} \Sigma_{AA} & 0 \\ 0 & \Sigma_{BB} \end{pmatrix}$$

and its diagonal entries are actually the variances of the marginal class densities:

$$\Sigma = \begin{pmatrix} \sigma_A^2 & 0 \\ 0 & \sigma_B^2 \end{pmatrix}$$

Then, for each class we simply need to estimate the parameters of the marginal class densities. In total, there are 6 marginal densities to estimate (3 classes × 2 predictors). Note that in this problem the subscripts A and B identify the predictors, whereas in the previous problem they were used to identify the classes. The means are
 
$$\mu_{\bullet} = \begin{pmatrix} 2 \\ 8 \end{pmatrix} \qquad \mu_{\bullet} = \begin{pmatrix} 7 \\ 5 \end{pmatrix} \qquad \mu_{\bullet} = \begin{pmatrix} 3 \\ 3 \end{pmatrix}$$

And the covariance matrices:

$$\Sigma_{\bullet} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \qquad \Sigma_{\bullet} = \begin{pmatrix} 1.1 & 0 \\ 0 & 8.9 \end{pmatrix} \qquad \Sigma_{\bullet} = \begin{pmatrix} 0.5 & 0 \\ 0 & 0.5 \end{pmatrix}$$

After obtaining the mean and covariance matrix for each class density, it is convenient to check that the results make sense: the means should correspond to the centre of each class, and the variances should describe the spread of the samples around those centres.
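As a sketch of the estimation step, the following Python fragment computes the mean and the diagonal covariance matrix for one class. The sample array is a hypothetical placeholder, since the actual exercise data is in the problem sheet rather than reproduced here.

```python
import numpy as np

# Hypothetical 2D samples for one class (columns: xA, xB)
samples = np.array([[1.5, 7.0], [2.5, 9.0], [2.0, 8.0], [1.0, 8.5], [3.0, 7.5]])

mu_hat = samples.mean(axis=0)           # mean of each predictor
var_hat = samples.var(axis=0, ddof=1)   # unbiased marginal variances
Sigma_hat = np.diag(var_hat)            # diagonal covariance (independence assumption)

print(mu_hat)      # estimated class centre
print(Sigma_hat)   # estimated diagonal covariance matrix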

Figure 1: scatter plot of the three classes in the (xA, xB) predictor space, with xA and xB ranging from 0 to 10.

The boundaries of the classifier consist of the points where two or more posterior probabilities are equal. The posterior probabilities can be expressed in terms of the priors and the class densities. Note that in this exercise the priors are P(•) = 5/20 = 1/4, P(•) = 5/20 = 1/4 and P(•) = 10/20 = 1/2.
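One way to visualise the decision regions (and hence the boundaries, which are the points where two or more posteriors are equal) is to evaluate the unnormalised posteriors on a grid. This is a minimal sketch using the means, covariances and priors obtained above; it is not part of the original solution.

```python
import numpy as np
from scipy.stats import multivariate_normal

means  = [np.array([2, 8]), np.array([7, 5]), np.array([3, 3])]
covs   = [np.diag([1.0, 1.0]), np.diag([1.1, 8.9]), np.diag([0.5, 0.5])]
priors = [0.25, 0.25, 0.5]

# Grid covering the predictor space shown in Figure 1
xA, xB = np.meshgrid(np.linspace(0, 10, 201), np.linspace(0, 10, 201))
grid = np.dstack([xA, xB])

# Unnormalised posteriors P(class) p(x|class); the normaliser is common to all classes
scores = np.stack([p * multivariate_normal(m, c).pdf(grid)
                   for m, c, p in zip(means, covs, priors)])
decision = scores.argmax(axis=0)   # predicted class index at each grid point
# plt.contourf(xA, xB, decision) would show the regions and their boundaries
```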

EXERCISE #3 (SOL): The dataset consists of 53 samples in a 2D predictor space. There are 30 • samples and 23 • samples. In this exercise we are asked to consider several classifiers defined by the boundaries xB = 0.5, xB = 1.5, xB = 3.5, xB = 5.5, xB = 7.5 and xB = 9.5. We are assuming that samples above each boundary are classified as •, and below as •. A confusion matrix shows the number of correctly and incorrectly classified samples as follows:

                       Actual •                     Actual •
  Predicted •   # of • samples labeled •     # of • samples labeled •
  Predicted •   # of • samples labeled •     # of • samples labeled •

We have 6 boundaries, i.e. 6 different classifiers, hence we need to produce a different confusion matrix for each. For each classifier, we need to plot the boundary, count the number of samples correctly and incorrectly classified for each class, and fill in the entries of the confusion matrix.

  xB = 0.5        Actual •   Actual •
  Predicted •         0          0
  Predicted •        23         30

  xB = 1.5        Actual •   Actual •
  Predicted •         5          2
  Predicted •        18         28

  xB = 3.5        Actual •   Actual •
  Predicted •        15          5
  Predicted •         8         25

  xB = 5.5        Actual •   Actual •
  Predicted •        20         14
  Predicted •         3         16

  xB = 7.5        Actual •   Actual •
  Predicted •        21         24
  Predicted •         2          6

  xB = 9.5        Actual •   Actual •
  Predicted •        23         30
  Predicted •         0          0
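A sketch of how each confusion matrix could be computed is given below. The label and predictor arrays are hypothetical stand-ins for the 53 samples, which are read off the scatter plot in the problem sheet; here samples below the boundary are assigned to the positive class, which matches the direction of the tables above (the sensitivity grows as the boundary moves up).

```python
import numpy as np

rng = np.random.default_rng(0)
y  = np.concatenate([np.ones(23, dtype=int), np.zeros(30, dtype=int)])  # hypothetical labels
xB = np.concatenate([rng.uniform(0, 6, 23), rng.uniform(3, 10, 30)])    # hypothetical predictor values

def confusion(boundary):
    y_hat = (xB < boundary).astype(int)      # below the boundary -> positive class
    tp = np.sum((y_hat == 1) & (y == 1))     # positives correctly classified
    fp = np.sum((y_hat == 1) & (y == 0))     # negatives predicted as positive
    fn = np.sum((y_hat == 0) & (y == 1))
    tn = np.sum((y_hat == 0) & (y == 0))
    return np.array([[tp, fp], [fn, tn]])    # rows: predicted, columns: actual

for c in [0.5, 1.5, 3.5, 5.5, 7.5, 9.5]:
    print(f"xB = {c}:\n{confusion(c)}")
```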

We will assume • is the positive class and • the negative class. The sensitivity is calculated as:

$$\text{Sensitivity} = \frac{\#\ \bullet \text{ samples correctly classified}}{\#\ \bullet \text{ samples}}$$

and the specificity as:

$$\text{Specificity} = \frac{\#\ \bullet \text{ samples correctly classified}}{\#\ \bullet \text{ samples}}$$

For each classifier, we have the following values of sensitivity and specificity:

  xB = 0.5:   Sensitivity = 0/23 = 0      Specificity = 30/30 = 1
  xB = 1.5:   Sensitivity = 5/23          Specificity = 28/30
  xB = 3.5:   Sensitivity = 15/23         Specificity = 25/30
  xB = 5.5:   Sensitivity = 20/23         Specificity = 16/30
  xB = 7.5:   Sensitivity = 21/23         Specificity = 6/30
  xB = 9.5:   Sensitivity = 23/23 = 1     Specificity = 0/30 = 0
Plotting the sensitivity against 1 - specificity we obtain the ROC curve.

[Figure: ROC curve, sensitivity against 1 - specificity, for the boundaries xB = c]
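The ROC points, together with a trapezoidal estimate of the area under the curve, can be reproduced directly from the sensitivity and specificity values listed above. A minimal sketch:

```python
import numpy as np
import matplotlib.pyplot as plt

# Sensitivity and specificity for xB = 0.5, 1.5, 3.5, 5.5, 7.5, 9.5 (from above)
sens = np.array([0, 5, 15, 20, 21, 23]) / 23
spec = np.array([30, 28, 25, 16, 6, 0]) / 30

fpr = 1 - spec                # x-axis of the ROC curve
auc = np.trapz(sens, fpr)     # points are already ordered by increasing fpr
print(f"AUC (trapezoidal, 6 points) = {auc:.3f}")

plt.plot(fpr, sens, marker="o")
plt.xlabel("1 - specificity")
plt.ylabel("sensitivity")
plt.show()
```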

EXERCISE #4 (SOL): The boundaries defined by xB = xA + c are straight lines with slope 1 and intercept c. Each value of c defines a different boundary and hence a different classifier. Let's assume that samples above each boundary are classified as •, and below as •.

For each classifier (there are 6) we need to plot the boundary, count the number of samples correctly and incorrectly classified for each class, and fill in their confusion matrices.

The confusion matrices are:

  c = -8.5        Actual •   Actual •
  Predicted •         0          0
  Predicted •        23         30

  c = -4.5        Actual •   Actual •
  Predicted •         8          1
  Predicted •        15         29

  c = -1.5        Actual •   Actual •
  Predicted •        16          6
  Predicted •         7         24

  c = 1.5         Actual •   Actual •
  Predicted •        20         18
  Predicted •         3         12

  c = 4.5         Actual •   Actual •
  Predicted •        22         27
  Predicted •         1          3

  c = 8.5         Actual •   Actual •
  Predicted •        23         30
  Predicted •         0          0

The sensitivity and specificity values are:
  c = -8.5:   Sensitivity = 0/23 = 0      Specificity = 30/30 = 1
  c = -4.5:   Sensitivity = 8/23          Specificity = 29/30
  c = -1.5:   Sensitivity = 16/23         Specificity = 24/30
  c = 1.5:    Sensitivity = 20/23         Specificity = 12/30
  c = 4.5:    Sensitivity = 22/23         Specificity = 3/30
  c = 8.5:    Sensitivity = 23/23 = 1     Specificity = 0/30 = 0
Plotting the sensitivity against 1 - specificity we obtain the ROC curves. The red curve corresponds to the family xB = xA + c, the blue one to xB = c. The family of classifiers xB = xA + c is slightly better than xB = c.
The area under the curve (AUC) is a measure of the goodness of a family of classifiers that can be calibrated (here, by varying the boundary parameter c).
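As a rough comparison of the two families, the trapezoidal AUC can be computed from the operating points listed in Exercises 3 and 4. This is a minimal sketch; with these six operating points per family, the xB = xA + c family gives a slightly larger area, consistent with the conclusion above.

```python
import numpy as np

# Family xB = c (Exercise 3)
sens_c = np.array([0, 5, 15, 20, 21, 23]) / 23
spec_c = np.array([30, 28, 25, 16, 6, 0]) / 30

# Family xB = xA + c (Exercise 4)
sens_ac = np.array([0, 8, 16, 20, 22, 23]) / 23
spec_ac = np.array([30, 29, 24, 12, 3, 0]) / 30

# Trapezoidal area under sensitivity vs (1 - specificity)
auc_c  = np.trapz(sens_c, 1 - spec_c)
auc_ac = np.trapz(sens_ac, 1 - spec_ac)
print(f"AUC (xB = c):      {auc_c:.3f}")
print(f"AUC (xB = xA + c): {auc_ac:.3f}")
```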
[Figure: ROC curves, sensitivity against 1 - specificity; red for xB = xA + c, blue for xB = c]
