0% found this document useful (0 votes)

9 views48 pages

D3 IT Performance Metrics May 2023

The document discusses the performance estimation of machine learning classifiers, focusing on metrics derived from the confusion matrix, including accuracy, precision, recall, and F1 score. It highlights the limitations of predictive accuracy, especially in imbalanced datasets, and emphasizes the importance of alternative metrics for evaluating classifier effectiveness. Additionally, it touches on multiclass classification and its distinction from binary classification.

Uploaded by

aditya tulsyan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views48 pages

D3 IT Performance Metrics May 2023

Uploaded by

aditya tulsyan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 48

Performance Estimation of

Machine Learning Model

5/17/2023 1
Performance Estimation of a Classifier
Predicted Actual
Accuracy = ¾ =0.75 or 75%
0 1
1 1
0 0
0 0

 Various other metrics may be derived from a single

matrix known as “Confusion Matrix” or “Contingency
Matrix”

Predicted/ 1 0
Actual
1 True Positive (TP) False Positive (FP)
0 False Negative (FN) True Negative (TN)

5/17/2023
Accuracy = TP+TN/FP+FN+TP+TN 2
Performance Estimation of a Classifier
 True Positive (TP)
 The predicted value matches the actual value
 The actual value was positive and the model predicted a positive
value
 True Negative (TN)
 The predicted value matches the actual value
 The actual value was negative and the model predicted a negative
 False Positive (FP) – Type 1 error
 The predicted value was falsely predicted
 The actual value was negative but the model predicted a positive
 False Negative (FN) – Type 2 error
 The predicted value was falsely predicted
 The actual value was positive but the model predicted a negative
value
5/17/2023 3
Performance Estimation of a Classifier
Predicted Actual
Accuracy = ¾ =0.75 or 75%
0 1
1 1
0 0
0 0

 “Confusion Matrix” or “Contingency Matrix”

Predicted + -
/Actual
+ ++ +-
- -+ --

Predicted 1 0
/Actual
1 1 0
0 1 2
5/17/2023 4
Performance Estimation of a Classifier
 Predictive accuracy works fine, when the classes
are balanced
 That is, every class in the data set are equally
important

 In fact, data sets with imbalanced class

distributions are quite common in many real life
applications
 When the classifier classified a test data set with
imbalanced class distributions then, predictive
accuracy on its own is not a reliable indicator of
a classifier’s effectiveness.
5/17/2023 5
Performance Estimation of a Classifier
 Datasets to identify customer churn where a
vast majority of customers will continue using
the service. Specifically, Telecommunication
companies where Churn Rate is lower than 2
%.
 Data sets to identify rare diseases in medical
diagnostics etc.
 Natural Disaster like Earthquakes etc
 Credit card/financial transactions

5/17/2023 6
Performance Estimation of a Classifier
Effectiveness of Predictive Accuracy
 Given a data set of stock markets, we are to classify
them as “good” and “worst”. Suppose, in the data set,
out of 100 entries, 98 belong to “good” class and only
2 are in “worst” class.
 With this data set, if classifier’s predictive accuracy is 0.98, a very high
value!
 Here, there is a high chance that 2 “worst” stock markets may
incorrectly be classified as “good”

 On the other hand, if the predictive accuracy is 0.02, then none of the
stock markets may be classified as “good”

5/17/2023 7
Performance Estimation of a Classifier
Effectiveness of Predictive Accuracy
 Given a data set of stock markets, we are to classify
them as “good” and “worst”. Suppose, in the data set,
out of 100 entries, 98 belong to “good” class and only
2 are in “worst” class.
 With this data set, if classifier’s predictive accuracy is 0.98, a very high
value!
 Here, there is a high chance that 2 “worst” stock markets may
incorrectly be classified as “good”

 On the other hand, if the predictive accuracy is 0.02, then none of the
stock markets may be classified as “good”

5/17/2023 8
Performance Estimation of a Classifier
Predicted Fever Actual Fever Status
1 1 TP
0 0 TN
0 0 TN
1 0 FP
0 0 TN
1 0 FP
0 1 FN
1 1 TP
0 0 TN
1 0 FP

TP = 30, TN = 940, FP = 20, FN = 10

Overall Accuracy = 97%
5/17/2023 9
Performance Estimation of a Classifier
 Our model is saying “I can predict sick people 97% of the
time”.
 However, it is doing the opposite. It is predicting the
people who will not get sick with 97% accuracy while the
sick are spreading the virus!
 Do you think this is a correct metric for our model given
the seriousness of the issue?
 Shouldn’t we be measuring how many positive cases we
can predict correctly to arrest the spread of the contagious
virus? Or maybe, out of the correctly predicted cases, how
many are positive cases to check the reliability of our
model?

5/17/2023 10
Performance Estimation of a Classifier
 Thus, when the classifier classified a test data set
with imbalanced class distributions, then
predictive accuracy on its own is not a reliable
indicator of a classifier’s effectiveness.
 This necessitates an alternative metrics to judge
the classifier.

5/17/2023 11
Performance Estimation of a Classifier
 These metrics may be derived from a single matrix
known as “Confusion Matrix” or “Contingency Matrix”

5/17/2023 12
Performance Estimation of a Classifier
 source: https://en.wikipedia.org/wiki/Sensitivity_and_specificity

 Imagine a study evaluating a new test that

screens people for a disease. Each person
taking the test either has or does not have the
disease.
 The test outcome can be positive (classifying
the person as having the disease) or negative
(classifying the person as not having the
disease).
 The test results for each subject may or may
not match the subject’s actual status.
5/17/2023 13
Performance Estimation of a Classifier
 True positive (TP): Sick people correctly identified
as sick
 Transaction is fraud and you predicted it as fraud
 False positive (FP): Healthy people incorrectly
identified as sick
 Transaction is genuine but you predicted it as
fraud
 False negative (FN): Sick people incorrectly
identified as healthy
 Transaction is fraud but you predicted it as
genuine
5/17/2023 14
Performance Estimation of a Classifier
 True negative (TN): Healthy people correctly
identified as healthy
 Accuracy (ACC):
 ACC = (TP + TN) / (TP + FP + FN + TN)
 (TP + TN) denotes the number of correct
classifications
 (FP + FN) denotes the number of errors in
classification.
 For a perfect classifier FP = FN = 0, that is, there
would be no Type 1 or Type 2 errors.
5/17/2023 15
Performance Estimation of a Classifier
 Precision or positive predictive value (PPV):
 PPV = TP / (TP + FP)
 Precision tells us how many of the correctly
predicted cases actually turned out to be
positive.
 This would determine whether our model is
reliable or not.
 30/30+20 = 3/5 = 60% of the correctly
predicted cases turned out to be positive
cases Predicted 1 0
/Actual
1 30 20
5/17/2023 0 10 940 16
Performance Estimation of a Classifier
 Sensitivity (SEN) or recall of the positive class
or True Positive Rate (TPR) or hit rate:
 SEN = TP / P = TP / (TP+FN)
 Correctly predicted/ real or actual positive
 Recall tells us how many of the actual
positive cases we were able to predict
correctly with our model.
 30/30+10 = ¾ = 75% of the correctly predicted cases
turned out to be positive cases
Predicted 1 0
/Actual
1 30 20
5/17/2023 0 10 940 17
Recall
 Specificity (SPC) or recall of the negative class or
True Negative Rate:
 SPC = TN / N = TN / (TN+FP)
 Recall is important in medical cases where it doesn’t matter
whether we raise a false alarm but the actual positive cases
should not go undetected!
 In our example, Recall would be a better metric because we
don’t want to accidentally discharge an infected person and let
them mix with the healthy population thereby spreading the
virus.
 But there will be cases where there is no clear distinction
between whether Precision is more important or Recall.

5/17/2023 18
Performance Estimation of a Classifier
 F1 Score (or F-score) which is a weighted
average or harmonic mean of precision and
recall are useful to deal with imbalanced
datasets
 High value of F1 ensures that Precision and
Recall both are reasonably high

5/17/2023 19
Analysis with Performance Measurement Metrics
 Based on the various performance metrics, we can characterize a classifier.

 We do it in terms of TPR, FPR, Precision and Recall and Accuracy

 Case 1: Perfect Classifier
When every instance is correctly classified, it is called the perfect classifier. In
this case, TP = P, TN = N and CM is

𝑃
TPR = =1
𝑃 Predicted Class
0
FPR = =0 + -
𝑁
𝑃
Precision = = 1 + P 0
Actual

𝑃 class

2×1
F1 Score = =1 - 0 N
1+1
𝑃+𝑁
Accuracy = =1
𝑃+𝑁

5/17/2023 20
Analysis with Performance Measurement Metrics

 Case 2: Worst Classifier

When every instance is wrongly classified, it is called the worst classifier. In this
case, TP = 0, TN = 0 and the CM is

Predicted Class
0
TPR = =0 + -
𝑃
𝑁 + 0 P
FPR = =1

Actual
class
𝑁
0 - N 0
Precision = =0
𝑁
F1 Score = Not applicable
as Recall + Precision = 0
0
Accuracy = =0
𝑃+𝑁

5/17/2023 21
Analysis with Performance Measurement Metrics

 Case 3: Ultra-Liberal Classifier

The classifier always predicts the + class correctly. Here, the False Negative
(FN) and True Negative (TN) are zero. The CM is

Predicted Class
𝑃
TPR = =1 + -
𝑃
𝑁 + P 0
FPR = =1

Actual
class
𝑁
𝑃 - N 0
Precision =
𝑃+𝑁
2𝑃
F1 Score =
2𝑃+𝑁
𝑃
Accuracy = =0
𝑃+𝑁

5/17/2023 22
Analysis with Performance Measurement Metrics

 Case 4: Ultra-Conservative Classifier

This classifier always predicts the - class correctly. Here, the False Negative
(FN) and True Negative (TN) are zero. The CM is

Predicted Class
0
TPR = =0 + -
𝑃
0 + 0 p
FPR = =0

Actual
class
𝑁
- 0 N
Precision = Not applicable
(as TP + FP = 0)
F1 Score = Not applicable
𝑁
Accuracy = =0
𝑃+𝑁

5/17/2023 23
Predictive Accuracy versus TPR and FPR
 One strength of characterizing a classifier by its TPR and FPR is that they do
not depend on the relative size of P and N.
 The same is also applicable for FNR and TNR and others measures from CM.

 In contrast, the Predictive Accuracy, Precision, Error Rate, F1 Score, etc. are
affected by the relative size of P and N.

 FPR, TPR, FNR and TNR are calculated from the different rows of the CM.
 On the other hand Predictive Accuracy, etc. are derived from the values in both
rows.

 This suggests that FPR, TPR, FNR and TNR are more effective than
Predictive Accuracy, etc.

5/17/2023 24
Confusion Matrix
A classifier is built on a dataset regarding Good and Worst
classes of stock markets. The model is then tested with a test set
of 10000 unseen instances. The result is shown in the form of a
confusion matrix. The result is self explanatory.

Class Good Worst Total Rate(%)

Good 6954 46 7000 99.34
Worst 412 2588 3000 86.27
Total 7366 2634 10000 95.52

Predictive accuracy?

5/17/2023 25
Multiclass Classification
 Multiclass (or multinomial) classification is a type
of supervised learning task in machine learning and
data science where the goal is to predict the
categorical class label of an input data sample from
a set of two or more possible classes.
 In other words, given an input, the model has to
predict which of the multiple possible classes that
input belongs to. For example, given an image of an
animal, the task might be to classify it as a cat, dog,
or bird.
5/17/2023 26
Multiclass Classification
 Multiclass classification is different from binary
classification, where the goal is to predict a binary label,
such as "yes" or "no," or "true" or "false." Multiclass
classification can be performed using various algorithms
such as logistic regression, decision trees, random forests,
support vector machines, neural networks, etc.
 Some algorithms (such as Random Forest classifiers or
naive Bayes classifiers) are capable of handling multiple
classes directly. Others (such as SVM classifiers or Linear
classifiers) are strictly binary classifiers. However, there are
various strategies that you can use to perform multiclass
classification using multiple binary classifiers

5/17/2023 27
Multilabel Classification
 goal is to predict multiple categorical class labels for an input data
sample.
 In other words, instead of predicting a single class label for a data
sample, the model predicts multiple labels.
 For example, in a movie recommendation system, a movie might
have multiple genre labels, such as "comedy," "action," and
"drama."
 Multilabel classification is different from multiclass classification,
where the goal is to predict a single class label from multiple
possible classes. In multilabel classification, the number of possible
labels is not limited to two or more classes, and a data sample can
belong to one or more labels.
 Multilabel classification can be performed using various algorithms
such as binary relevance, label powerset, classifier chains, and
hierarchical classification.
5/17/2023 28
Confusion Matrix for Multiclass Classifier
 Having m classes, confusion matrix is a table of size
m×m , where, element at (i, j) indicates the number of
instances of class i but classified as class j.
 To have good accuracy for a classifier, ideally most
diagonal entries should have large values with the rest of
23 1 4 0 1
entries being close to zero.
2 35 6 2 2
3 1 73 3 7
4 2 4 50 3
5 4 2 5 28
 Confusion matrix may have additional rows or columns
to provide total or recognition rates per class.

5/17/2023 29
Confusion Matrix for Multiclass
Classifier
 Unlike binary classification, there are no positive or
negative classes here.
 At first, it might be a little difficult to find TP, TN, FP
and FN since there are no positive or negative classes,
but it’s actually pretty easy.
 What we have to do here is to find TP, TN, FP and FN for
each individual class. For example, if we take class
Apple, then let’s see what are the values of the metrics
from the confusion matrix.

5/17/2023 30
Source: https://towardsdatascience.com/confusion-matrix-for-your-multi-class-machine-learning-model-ff9aa3bf7826
Confusion Matrix for Multiclass Classifier
TP = 7
TN = (2+3+2+1) = 8
FP = (8+9) = 17
FN = (1+3) = 4
Since we have all the
necessary metrics for class
Apple from the confusion
matrix, now we can calculate
the performance measures
for class Apple. For example,
class Apple has
Precision = 7/(7+17) = 0.29
Recall = 7/(7+4) = 0.64
5/17/2023
F1-score = 0.40 31
Source: https://towardsdatascience.com/confusion-matrix-for-your-multi-class-machine-learning-model-ff9aa3bf7826
Confusion Matrix for Multiclass
Classifier

5/17/2023 32
Source: https://towardsdatascience.com/confusion-matrix-for-your-multi-class-machine-learning-model-ff9aa3bf7826
Confusion Matrix for Multiclass Classifier
Confusion matrix with multiple class
Following table shows the confusion matrix of a classification problem with six
classes labeled as C1, C2, C3, C4, C5 and C6.

Class C1 C2 C3 C4 C5 C6
C1 52 10 7 0 0 1
C2 15 50 6 2 1 2
C3 5 6 6 0 0 0
C4 0 2 0 10 0 1
C5 0 1 0 0 7 1
C6 1 3 0 1 0 24

Predictive accuracy?

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 33
Confusion Matrix for Multiclass Classifier
 In case of multiclass classification, sometimes one class is important enough to
be regarded as positive with all other classes combined together as negative.

 Thus a large confusion matrix of m*m can be concised into 2*2 matrix.

m×m CM to 2×2 CM
 For example, the CM shown in Example transformed into a CM of size 2×2
considering the class C1 as the positive class and classes C2, C3, C4, C5 and C6
combined together as negative.

Class + -
+ 52 18
- 21 123

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 34
ROC Curves

5/17/2023 35
ROC Curves
 ROC is an abbreviation of Receiver Operating Characteristic come from the
signal detection theory, developed during World War 2 for analysis of radar
images.

 In the context of classifier, ROC plot is a useful tool to study the behaviour of
a classifier or comparing two or more classifiers.

 A ROC plot is a two-dimensional graph, where, X-axis represents FP rate

(FPR) and Y-axis represents TP rate (TPR).

 Since, the values of FPR and TPR varies from 0 to 1 both inclusive, the two
axes thus from 0 to 1 only.

 Each point (x, y) on the plot indicating that the FPR has value x and the TPR
value y.

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 36
ROC Plot
 A typical look of ROC plot with few points in it is shown in the following
figure.

 Note the four cornered points are the four extreme cases of classifiers

Identify the four extreme classifiers.

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 37
Interpretation of Different Points in ROC Plot
 Le us interpret the different points in the ROC plot.

 The four points (A, B, C, and D)

 A: TPR = 1, FPR = 0, the ideal model, i.e., the perfect classifier, no false results
 B: TPR = 0, FPR = 1, the worst classifier, not able to predict a single instance
 C: TPR = 0, FPR = 0, the model predicts every instance to be a Negative class, i.e., it is
an ultra-conservative classifier
 D: TPR = 1, FPR = 1, the model predicts every instance to be a Positive class, i.e., it is
an ultra-liberal classifier
Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.
5/17/2023 38
Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
Interpretation of Different Points in ROC Plot
 Le us interpret the different points in the ROC plot.

 The points on diagonals

 The diagonal line joining point C(0,0) and D(1,1) corresponds to random guessing
 Random guessing means that a record is classified as positive (0r negative) with a certain probability
 Suppose, a test set contacting N+ positive and N- negative instances. Suppose, the classifier guesses
any instances with probability p
 Thus, the random classifier is expected to correctly classify p.N+ of the positive instances and p.N- of
the negative instances
 Hence, TPR = FPR = p
 Since TPR = FPR, the random
rd
classifier results reside on the main diagonals
Data Mining: Concepts and Techniques, (3 Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.
5/17/2023 39
Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
Interpretation of Different Points in ROC Plot
 Let us interpret the different points in the ROC plot.

 The points on the upper diagonal region

 All points, which reside on upper-diagonal region are corresponding to classifiers
“good” as their TPR is as good as FPR (i.e., FPRs are lower than TPRs)
 Here, X is better than Z as X has higher TPR and lower FPR than Z.

 If we compare X and Y, neither classifier is superior to the other

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

5/17/2023
Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014 40
Interpretation of Different Points in ROC Plot
 Let us interpret the different points in the ROC plot.

 The points on the lower diagonal region

 The Lower-diagonal triangle corresponds to the classifiers that are worst than random
classifiers
 Note: A classifier that is worst than random guessing, simply by reversing its
prediction, we can get good results.
 W’(0.2, 0.4) is the better version than W(0.4, 0.2), W’ is a mirror reflection of W
Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 41
Tuning a Classifier through ROC Plot
 Using ROC plot, we can compare two or more classifiers by their TPR and
FPR values and this plot also depicts the trade-off between TPR and FPR of a
classifier.

 Examining ROC curves can give insights into the best way of tuning
parameters of classifier.

 For example, in the curve C2, the result is degraded after the point P.
Similarly for the observation C1, beyond Q the settings are not acceptable.
Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 42
Comparing Classifiers trough ROC Plot
 Two curves C1 and C2 are corresponding to the experiments to choose two
classifiers with their parameters.

 Here, C1 is better than C2 when FPR is less than 0.3.

 However, C2 is better, when FPR is greater than 0.3.

 Clearly, neither of these two classifiers dominates the other.

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 43
Comparing Classifiers trough ROC Plot
 We can use the concept of “area under curve” (AUC) as a better method to
compare two or more classifiers.
 If a model is perfect, then its AUC = 1.
 If a model simply performs random guessing, then its AUC = 0.5
 A model that is strictly better than other, would have a larger value of AUC
than the other.

 Here, C3 is best, and C2 is better than C1 as AUC(C3)>AUC(C2)>AUC(C1).

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 44
A Quantitative Measure of a Classifier
 The concept of ROC plot can be extended to compare quantitatively using
Euclidean distance measure.

 See the following figure for an explanation.

 Here, C(fpr, tpr) is a classifier and 𝜹 denotes the Euclidean distance between
the best classifier (0, 1) and C. That is,
 𝜹= 𝑓𝑝𝑟 2 + (1 − 𝑡𝑝𝑟)2

Data Mining: Concepts and Techniques, (3rd Edn.), Jiawei Han, Micheline Kamber, Morgan Kaufmann, 2015.

Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison-Wesley, 2014
5/17/2023 45
Performance Estimation of a Regression Model

 Different Measures :
 Mean Absolute Error (MAE)
 Mean Squared Error (MSE)
 Root Mean Squared Error (RMSE)
 R Squared (R2 )

Actual (y) Predicted Difference

(y’) (y-y’)
80 70 10
60 65 -5
70 55 15
50 60 -10
85 75 10

5/17/2023 46
Actual Predicted Predicted
1 1 1 1
2 4 3 4
3 9 8 9 0.80
4 16 17 16
5 25 23 25 R squared =1

5/17/2023 47
Performance Estimation of a Regression Model
 The Mean Absolute Error (or MAE) is the average of the
absolute differences between predictions and actual
values. It gives an idea of how wrong the predictions
were.
 The measure gives an idea of the magnitude of the error,
but no idea of the direction (e.g. over or under
predicting).
 The R^2 (or R Squared) metric provides an indication of
the goodness of fit of a set of predictions to the actual
values. In statistical literature, this measure is called the
coefficient of determination.
 This is a value between 0 and 1 for no-fit and perfect fit
5/17/2023 48

Data Preprocessing in Data Mining PDF
100% (3)
Data Preprocessing in Data Mining PDF
327 pages
Lesson 4 - Performance Metrics
No ratings yet
Lesson 4 - Performance Metrics
46 pages
Chapter 11 - Similarity
100% (1)
Chapter 11 - Similarity
37 pages
Fem Objective Questions
0% (2)
Fem Objective Questions
7 pages
FS-719 Numerical Methods PDF
No ratings yet
FS-719 Numerical Methods PDF
2 pages
KNN Evaluation
No ratings yet
KNN Evaluation
51 pages
6.data Mining - Classification
No ratings yet
6.data Mining - Classification
37 pages
Data Mining: Accuracy and Error Measures For Classification and Prediction
No ratings yet
Data Mining: Accuracy and Error Measures For Classification and Prediction
15 pages
Pennachi - Theory of Asset Pricing
100% (1)
Pennachi - Theory of Asset Pricing
570 pages
Marketing Measurement and Forecasting
86% (14)
Marketing Measurement and Forecasting
16 pages
ML Unit 3
No ratings yet
ML Unit 3
127 pages
Roller Coaster SImulation
No ratings yet
Roller Coaster SImulation
75 pages
Anyons in An Exactly Solved Model and Beyond: Alexei Kitaev
No ratings yet
Anyons in An Exactly Solved Model and Beyond: Alexei Kitaev
113 pages
6 Evaluation
No ratings yet
6 Evaluation
57 pages
Performance Metrics
No ratings yet
Performance Metrics
12 pages
Share Class 6 Pt2 2025
No ratings yet
Share Class 6 Pt2 2025
4 pages
Ex: Luggage / Baggage / Breakage / Advice / Furniture / Information / Scenery / Poetry / Work / Soap / Food / Bread / Fish / Paper / Machinery Etc
No ratings yet
Ex: Luggage / Baggage / Breakage / Advice / Furniture / Information / Scenery / Poetry / Work / Soap / Food / Bread / Fish / Paper / Machinery Etc
3 pages
Unit 5 Classification PDF
No ratings yet
Unit 5 Classification PDF
131 pages
Data Mining and Machine Learning: Fundamental Concepts and Algorithms
No ratings yet
Data Mining and Machine Learning: Fundamental Concepts and Algorithms
79 pages
Lec07 Classification ModelEvaluation Ensemble
No ratings yet
Lec07 Classification ModelEvaluation Ensemble
62 pages
Lesson 6 Analytics Methods
No ratings yet
Lesson 6 Analytics Methods
12 pages
1 Complex Numbers
No ratings yet
1 Complex Numbers
9 pages
Unit 6-Feature Engineering and Sensitivity Analysis
No ratings yet
Unit 6-Feature Engineering and Sensitivity Analysis
63 pages
Confusion Matrix and Performance Evaluation Metrics
No ratings yet
Confusion Matrix and Performance Evaluation Metrics
13 pages
Notes 03
No ratings yet
Notes 03
38 pages
Unit 4 Learning
No ratings yet
Unit 4 Learning
100 pages
Lectures3 5
No ratings yet
Lectures3 5
57 pages
Introduction To Artificial Intelligence: Amna Iftikhar Fall ' 2019 1
No ratings yet
Introduction To Artificial Intelligence: Amna Iftikhar Fall ' 2019 1
33 pages
Ch01 ICS422 03
No ratings yet
Ch01 ICS422 03
46 pages
Lect 02 Evaluation Part 1
No ratings yet
Lect 02 Evaluation Part 1
33 pages
Balancing Hard To Balance Equations PDF
No ratings yet
Balancing Hard To Balance Equations PDF
2 pages
Pythagorean Triples: Determine Whether Each Set of Numbers Form A Pythagorean Triple. 12, 20, 16 8, 15, 17 1, 7, 5
No ratings yet
Pythagorean Triples: Determine Whether Each Set of Numbers Form A Pythagorean Triple. 12, 20, 16 8, 15, 17 1, 7, 5
2 pages
Confusion Matrix and Performance Evaluation Metrics
No ratings yet
Confusion Matrix and Performance Evaluation Metrics
13 pages
Computer Architecture ECE 361 Lecture 5: The Design Process & ALU Design
No ratings yet
Computer Architecture ECE 361 Lecture 5: The Design Process & ALU Design
55 pages
L9 RBF+PM
No ratings yet
L9 RBF+PM
33 pages
Unit6 - 7 Issues
No ratings yet
Unit6 - 7 Issues
53 pages
Resilience-Oriented Optimal Operation of Networked Hybrid Microgrids
No ratings yet
Resilience-Oriented Optimal Operation of Networked Hybrid Microgrids
11 pages
DL IT324a 4
No ratings yet
DL IT324a 4
52 pages
S1 Evaluate Performance LKW 1mar2025
No ratings yet
S1 Evaluate Performance LKW 1mar2025
26 pages
6 Evaluarea Performantei
No ratings yet
6 Evaluarea Performantei
43 pages
CH-5 ML
No ratings yet
CH-5 ML
36 pages
Intermediate Analytics-Regression-Week 3-1
No ratings yet
Intermediate Analytics-Regression-Week 3-1
44 pages
A Comparative Study On Text Representation Schemes in Text Categorization
No ratings yet
A Comparative Study On Text Representation Schemes in Text Categorization
11 pages
Chapter 10
No ratings yet
Chapter 10
31 pages
3-Performance Measures
No ratings yet
3-Performance Measures
35 pages
Session 1 Evaluation Model
No ratings yet
Session 1 Evaluation Model
58 pages
08 Classifier Evaluation
No ratings yet
08 Classifier Evaluation
39 pages
ANGLES
No ratings yet
ANGLES
9 pages
Classification - Performance Evlaution
No ratings yet
Classification - Performance Evlaution
13 pages
Model Evaluation and Selection
No ratings yet
Model Evaluation and Selection
41 pages
IE 527 Intelligent Engineering Systems: Basic Concepts Model/performance Evaluation Overfitting
No ratings yet
IE 527 Intelligent Engineering Systems: Basic Concepts Model/performance Evaluation Overfitting
18 pages
CH 6
No ratings yet
CH 6
24 pages
Model Performance Assessment
No ratings yet
Model Performance Assessment
13 pages
Model Evaluation and Selection
No ratings yet
Model Evaluation and Selection
22 pages
Day 19
No ratings yet
Day 19
40 pages
Improved Spectrogram Analysis For ECG Signal in Emergency Medical Applications
No ratings yet
Improved Spectrogram Analysis For ECG Signal in Emergency Medical Applications
6 pages
JNN 5.2 Confusion Matrix and Performance Evaluation Metrics
No ratings yet
JNN 5.2 Confusion Matrix and Performance Evaluation Metrics
13 pages
Balance This Case Histories From Difficult Balance Jobs 1714274382
No ratings yet
Balance This Case Histories From Difficult Balance Jobs 1714274382
20 pages
CST 42315 Dam - L9 1
No ratings yet
CST 42315 Dam - L9 1
15 pages
Lecture - (3-4) Evaluation Metrices Classification and Regression
No ratings yet
Lecture - (3-4) Evaluation Metrices Classification and Regression
28 pages
IFEM Solution Ch15
No ratings yet
IFEM Solution Ch15
3 pages
Class 10 Science Chapter 8 Presentation
No ratings yet
Class 10 Science Chapter 8 Presentation
68 pages
19-Performance Metrics
No ratings yet
19-Performance Metrics
23 pages
06-FSSR DS610 2024 2025T1 Metrics
No ratings yet
06-FSSR DS610 2024 2025T1 Metrics
24 pages
2-Training and Testing Models, Evaluation Metrics-01-07-2023
No ratings yet
2-Training and Testing Models, Evaluation Metrics-01-07-2023
23 pages
Delivery Feet Data Using K Mean Clustering With Applied SPSS
No ratings yet
Delivery Feet Data Using K Mean Clustering With Applied SPSS
2 pages
Ai DS 2 Book-Chpt-5
No ratings yet
Ai DS 2 Book-Chpt-5
17 pages
Chapter 5 Model Evaluation
No ratings yet
Chapter 5 Model Evaluation
21 pages
L22 KNN+Metrics
No ratings yet
L22 KNN+Metrics
18 pages
ML Model Evaluation
No ratings yet
ML Model Evaluation
17 pages
Cristian Quiñonez Fase2
No ratings yet
Cristian Quiñonez Fase2
7 pages
Session 2 Evaluation Boosting Bagging Contemporary Business Anaytics
No ratings yet
Session 2 Evaluation Boosting Bagging Contemporary Business Anaytics
17 pages
Evaluation Metrics-ML
No ratings yet
Evaluation Metrics-ML
16 pages
Accuracy and Error Measures
No ratings yet
Accuracy and Error Measures
14 pages
A10 Model Performance v2 2up
No ratings yet
A10 Model Performance v2 2up
11 pages
Confusion Matrix and Performance Evaluation Metrics
No ratings yet
Confusion Matrix and Performance Evaluation Metrics
13 pages
DSP - Mod2 QB
No ratings yet
DSP - Mod2 QB
15 pages
The Classification of Stocks With Basic Financial Indicators An Application of Cluster Analysis On The BIST 100 Index
No ratings yet
The Classification of Stocks With Basic Financial Indicators An Application of Cluster Analysis On The BIST 100 Index
29 pages
Evaluation Measures
No ratings yet
Evaluation Measures
8 pages
Evaluation Measures For Machine Learning Models
No ratings yet
Evaluation Measures For Machine Learning Models
6 pages
Ads 5
No ratings yet
Ads 5
5 pages
A New and Simple Method To Estimate RE) and Ko (E) in DAEM Model From Three Set of Experimental Data
No ratings yet
A New and Simple Method To Estimate RE) and Ko (E) in DAEM Model From Three Set of Experimental Data
6 pages
Ads Exp4
No ratings yet
Ads Exp4
3 pages
Bece Practice Questions
No ratings yet
Bece Practice Questions
11 pages
80-20 Curve (Pareto) PDF
No ratings yet
80-20 Curve (Pareto) PDF
6 pages
Dimitri Vey - Multisymplectic Geometry and Loop Quantum Gravity: Toward A Covariant Canonical Quantum Gravity
No ratings yet
Dimitri Vey - Multisymplectic Geometry and Loop Quantum Gravity: Toward A Covariant Canonical Quantum Gravity
19 pages
Project Proposal
No ratings yet
Project Proposal
8 pages
Dosage Calculations Made Easy for Nursing Students: 500+ Step-by-Step Practice Problems with Complete Solutions and Explanations
From Everand
Dosage Calculations Made Easy for Nursing Students: 500+ Step-by-Step Practice Problems with Complete Solutions and Explanations
Stanley Lawrence Richardson
No ratings yet
PTCB Pharmacy Calculations Workbook: Master Alligations, Dilutions, IV Flow Rates, Dosages & Conversions with Over 350 Practice Questions with Detailed Explanations
From Everand
PTCB Pharmacy Calculations Workbook: Master Alligations, Dilutions, IV Flow Rates, Dosages & Conversions with Over 350 Practice Questions with Detailed Explanations
Stanley Lawrence Richardson
No ratings yet
Exercises of Statistical Inference
From Everand
Exercises of Statistical Inference
Simone Malacrida
No ratings yet

D3 IT Performance Metrics May 2023

Uploaded by

D3 IT Performance Metrics May 2023

Uploaded by

Performance Estimation of

Machine Learning Model

 Various other metrics may be derived from a single

 “Confusion Matrix” or “Contingency Matrix”

 In fact, data sets with imbalanced class

TP = 30, TN = 940, FP = 20, FN = 10

 Imagine a study evaluating a new test that

 We do it in terms of TPR, FPR, Precision and Recall and Accuracy

 Case 2: Worst Classifier

 Case 3: Ultra-Liberal Classifier

 Case 4: Ultra-Conservative Classifier

Class Good Worst Total Rate(%)

 A ROC plot is a two-dimensional graph, where, X-axis represents FP rate

Identify the four extreme classifiers.

 The four points (A, B, C, and D)

 The points on diagonals

 The points on the upper diagonal region

 If we compare X and Y, neither classifier is superior to the other

 The points on the lower diagonal region

 Here, C1 is better than C2 when FPR is less than 0.3.

 However, C2 is better, when FPR is greater than 0.3.

 Clearly, neither of these two classifiers dominates the other.

 Here, C3 is best, and C2 is better than C1 as AUC(C3)>AUC(C2)>AUC(C1).

 See the following figure for an explanation.

Actual (y) Predicted Difference

You might also like