Lecture - 3

Performance Metrics in Machine Learning

Prof. Chaitali Mhatre
Assistant Professor

🞂 Evaluating the performance of a machine learning model is one of the important steps in building an effective ML model.
🞂 To evaluate the performance or quality of the model, different metrics are used; these are known as performance metrics or evaluation metrics.
🞂 Performance metrics are the easiest way to measure the performance of a classification problem where the output can belong to two or more classes.

🞂 Not all metrics can be used for all types of problems; hence, it is important to know and understand which metrics should be used.
🞂 To evaluate the performance of a classification model, different metrics are used, some of which are as follows:
🞂 Accuracy
🞂 Confusion Matrix
🞂 Precision
🞂 Recall
🞂 F1-Score
🞂 AUC (Area Under the Curve)-ROC
🞂 A confusion matrix is a table with two dimensions, "Actual" and "Predicted"; between them, the dimensions cover the "True Positives (TP)", "True Negatives (TN)", "False Positives (FP)" and "False Negatives (FN)" cases, as shown below:

                  Predicted: 1             Predicted: 0
Actual: 1         True Positive (TP)       False Negative (FN)
Actual: 0         False Positive (FP)      True Negative (TN)
🞂 True Positive (TP) − the case when both the actual class and the predicted class of a data point are 1.
🞂 True Negative (TN) − the case when both the actual class and the predicted class of a data point are 0.
🞂 False Positive (FP) − the case when the actual class of a data point is 0 but the predicted class is 1.
🞂 False Negative (FN) − the case when the actual class of a data point is 1 but the predicted class is 0.
🞂 The sklearn.metrics module implements several loss, score, and utility functions to measure classification performance. Some metrics may require probability estimates of the positive class, confidence values, or binary decision values.

🞂 We can use the confusion_matrix function of sklearn.metrics to compute the confusion matrix of our classification model, as sketched below.
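
🞂 A minimal sketch of this call; the label lists below are made up purely for illustration. For 0/1 labels, sklearn lays the matrix out with actual classes as rows and predicted classes as columns, i.e. [[TN, FP], [FN, TP]]:

    # Hypothetical actual and predicted labels, for illustration only
    from sklearn.metrics import confusion_matrix

    y_actual    = [1, 1, 1, 1, 0, 0, 0, 0]
    y_predicted = [1, 1, 1, 0, 1, 1, 0, 0]

    # Rows are actual classes, columns are predicted classes:
    # [[TN, FP],
    #  [FN, TP]]
    print(confusion_matrix(y_actual, y_predicted))
    # [[2 2]
    #  [1 3]]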
🞂 Consider an example: the total number of predictions is 165, of which the model predicted YES 110 times and NO 55 times.
🞂 In reality, there are 60 cases in which the patients don't have the disease and 105 cases in which they do.

Total cases: 165      Actual (YES)    Actual (NO)
Predicted (YES)       100             10
Predicted (NO)        5               50
Accuracy is the most common performance metric for classification algorithms. It may be defined as the number of correct predictions made as a ratio of all predictions made.

We can easily calculate it from the confusion matrix with the help of the following formula:

Accuracy = (TP + TN) / (TP + FP + FN + TN)
         = (100 + 50) / 165
         ≈ 91%
🞂 We can use the accuracy_score function of sklearn.metrics to compute the accuracy of our classification model, as sketched below.

🞂 The sklearn.metrics functions let you assess the quality of your predictions.
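
🞂 A minimal sketch of accuracy_score on the same hypothetical labels used above; the result is simply the fraction of predictions that match the actual labels:

    from sklearn.metrics import accuracy_score

    y_actual    = [1, 1, 1, 1, 0, 0, 0, 0]
    y_predicted = [1, 1, 1, 0, 1, 1, 0, 0]

    # (TP + TN) / total = (3 + 2) / 8
    print(accuracy_score(y_actual, y_predicted))   # 0.625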
🞂 Precision, often used in document retrieval, may be defined as the proportion of documents returned by our ML model that are actually correct.
🞂 We can easily calculate it from the confusion matrix with the help of the following formula:

Precision = TP / (TP + FP)
          = 100 / (100 + 10)
          ≈ 91%
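
🞂 A minimal sketch of the corresponding sklearn call, precision_score, on the same hypothetical labels as before:

    from sklearn.metrics import precision_score

    y_actual    = [1, 1, 1, 1, 0, 0, 0, 0]
    y_predicted = [1, 1, 1, 0, 1, 1, 0, 0]

    # TP / (TP + FP) = 3 / (3 + 2)
    print(precision_score(y_actual, y_predicted))   # 0.6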
🞂 Recall may be defined as the proportion of actual positives that are correctly returned by our ML model. We can easily calculate it from the confusion matrix with the help of the following formula:

Recall = TP / (TP + FN)
       = 100 / (100 + 5)
       ≈ 95%
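
🞂 A minimal sketch of the corresponding sklearn call, recall_score, on the same hypothetical labels:

    from sklearn.metrics import recall_score

    y_actual    = [1, 1, 1, 1, 0, 0, 0, 0]
    y_predicted = [1, 1, 1, 0, 1, 1, 0, 0]

    # TP / (TP + FN) = 3 / (3 + 1)
    print(recall_score(y_actual, y_predicted))   # 0.75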
🞂 Specificity, in contrast to recall, may be defined as the proportion of actual negatives that are correctly identified by our ML model. We can easily calculate it from the confusion matrix with the help of the following formula:

Specificity = TN / (TN + FP)
            = 50 / (50 + 10)
            ≈ 83%
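
🞂 sklearn.metrics has no dedicated specificity function; a common sketch (assuming binary 0/1 labels) is to derive it from the confusion matrix:

    from sklearn.metrics import confusion_matrix

    y_actual    = [1, 1, 1, 1, 0, 0, 0, 0]
    y_predicted = [1, 1, 1, 0, 1, 1, 0, 0]

    # unpack the 2x2 matrix and apply TN / (TN + FP)
    tn, fp, fn, tp = confusion_matrix(y_actual, y_predicted).ravel()
    print(tn / (tn + fp))   # 2 / (2 + 2) = 0.5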
🞂 The F1 score is the harmonic mean of precision and recall. The best possible value of F1 is 1 and the worst is 0. We can calculate it with the help of the following formula:

🞂 F1 = 2 × (precision × recall) / (precision + recall)
      = 2 × (0.91 × 0.95) / (0.91 + 0.95)
      ≈ 0.93, which is close to the ideal value of 1.
🞂 The F1 score gives equal relative weight to precision and recall.
🞂 We can use the classification_report function of sklearn.metrics to get the classification report of our classification model, as sketched below.
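
🞂 A minimal sketch of both calls on the same hypothetical labels; f1_score returns the single F1 value, while classification_report prints precision, recall, F1 and support for every class:

    from sklearn.metrics import f1_score, classification_report

    y_actual    = [1, 1, 1, 1, 0, 0, 0, 0]
    y_predicted = [1, 1, 1, 0, 1, 1, 0, 0]

    # 2 * (precision * recall) / (precision + recall) = 2 * (0.6 * 0.75) / 1.35
    print(f1_score(y_actual, y_predicted))                 # ~0.67
    print(classification_report(y_actual, y_predicted))    # per-class summary table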
🞂 AUC (Area Under Curve)-ROC (Receiver Operating Characteristic) is a performance metric for classification problems, based on varying threshold values.
🞂 As the name suggests, the ROC is a probability curve and the AUC measures separability.
🞂 In simple words, the AUC-ROC metric tells us how capable the model is of distinguishing between the classes. The higher the AUC, the better the model.
🞂 Mathematically, the ROC curve is created by plotting the TPR (True Positive Rate, i.e. sensitivity or recall) against the FPR (False Positive Rate, i.e. 1 − specificity) at various threshold values, with TPR on the y-axis and FPR on the x-axis.
🞂 An ROC curve (receiver operating
characteristic curve) is a graph showing the
performance of a classification model at all
classification thresholds. This curve plots two
parameters:
🞂 True Positive Rate

🞂 False Positive Rate


🞂 True Positive Rate (TPR) is a synonym for
recall and is therefore defined as follows:

🞂 TPR = TP / (TP + FN)
      = 100 / 105
      ≈ 0.95
🞂 False Positive Rate (FPR) is equal to 1 − specificity and is defined as follows:

🞂 FPR = FP / (FP + TN)
      = 10 / 60
      ≈ 0.17
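
🞂 A minimal sketch of computing the ROC curve and AUC with sklearn; the y_scores below are hypothetical predicted probabilities of the positive class (e.g. from a model's predict_proba):

    from sklearn.metrics import roc_curve, roc_auc_score

    y_actual = [0, 0, 1, 1]
    y_scores = [0.1, 0.4, 0.35, 0.8]   # hypothetical probabilities of class 1

    # FPR and TPR at each candidate threshold; plotting tpr vs. fpr gives the ROC curve
    fpr, tpr, thresholds = roc_curve(y_actual, y_scores)
    print(roc_auc_score(y_actual, y_scores))   # area under the curve, here 0.75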
