
Evaluation Metrics

• Not only in machine learning but also in general life, especially business life, you will hear questions like "How accurate is your product?" or "How precise is your machine?".

• When people get replies like "the most accurate product in its field!" or "This machine has the highest imaginable precision!", they are comforted by both answers. But should they be? Indeed, the terms accurate and precise are very often used interchangeably.

• But in a nutshell, we can say: accuracy is a measure of the closeness of some measurements to a specific value, while precision is the closeness of the measurements to each other.
• We need them for evaluating ML algorithms, or more precisely, their results.
• Four important metrics are used to evaluate the results of classification. The metrics are:

• Accuracy
• Precision
• Recall
• F1-Score

We will introduce each of these metrics and discuss the pros and cons of each of them. Each measures something different about a classifier's performance. These metrics will be of utmost importance for all of machine learning.
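As a quick orientation, here is a minimal sketch of how these four metrics can be computed with scikit-learn. The labels are assumed example data (1 = spam, 0 = ham), not from the slides:

from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

# Assumed example labels: 1 = spam, 0 = ham
y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 1, 0]

print("Accuracy: ", accuracy_score(y_true, y_pred))   # 0.8
print("Precision:", precision_score(y_true, y_pred))  # 0.8
print("Recall:   ", recall_score(y_true, y_pred))     # 0.8
print("F1-Score: ", f1_score(y_true, y_pred))         # 0.8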
Accuracy
• Accuracy is a measure of the closeness of the measurements to a specific value, while precision is the closeness of the measurements to each other, i.e. not necessarily to a specific value.
• To put it in other words: if we have a set of data points from repeated measurements of the same quantity, the set is said to be accurate if their average is close to the true value of the quantity being measured. On the other hand, the set is said to be precise if the values are close to each other.
• The two concepts are independent of each other, which means that a set of data can be accurate, or precise, both, or neither.
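To make the distinction concrete, here is a small sketch, assuming a true value of 10.0 and two made-up sets of repeated measurements: the first is accurate but not precise, the second precise but not accurate:

import statistics

true_value = 10.0
accurate_not_precise = [8.0, 12.0, 9.5, 10.5]    # mean 10.0, but widely spread
precise_not_accurate = [12.1, 12.0, 12.2, 12.1]  # tightly clustered, but mean 12.1

for name, data in [("accurate, not precise", accurate_not_precise),
                   ("precise, not accurate", precise_not_accurate)]:
    print(name, "| mean:", statistics.mean(data),
          "| stdev:", round(statistics.stdev(data), 2))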
Confusion Matrix
• Before we continue with the term accuracy, we want to
make sure that you understand what a confusion matrix is
about.
• A confusion matrix, also called a contingency table or error matrix, is used to visualize the performance of a classifier.
• The columns of the matrix represent the instances of the predicted classes and the rows represent the instances of the actual classes. (Note: It can be the other way around as well.)
• In the case of binary classification the table has 2 rows and 2 columns.
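For binary classification, this 2 x 2 table can be produced directly; a minimal sketch, reusing the assumed spam/ham labels from above:

from sklearn.metrics import confusion_matrix

# Assumed example labels: 1 = spam, 0 = ham
y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 1, 0]

# Rows are the actual classes, columns the predicted classes,
# with label order 0 (ham), 1 (spam):
# [[TN FP]
#  [FN TP]]
print(confusion_matrix(y_true, y_pred))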
Accuracy in Classification
• Accuracy is also used as a statistical measure.
• Accuracy is a statistical measure which is defined as the quotient of correct predictions (both True Positives (TP) and True Negatives (TN)) made by a classifier, divided by the sum of all predictions made by the classifier, including False Positives (FP) and False Negatives (FN). Therefore, the formula for quantifying binary accuracy is:

Accuracy = (TP + TN) / (TP + TN + FP + FN)
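Translating this formula directly into code, with assumed counts for illustration:

# Assumed counts for illustration
TP, TN, FP, FN = 45, 40, 4, 11

accuracy = (TP + TN) / (TP + TN + FP + FN)
print(accuracy)  # 0.85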
Exercise: Before you go on with the text, think about what the value of precision means. If you look at the precision measure of our spam filter example, what does it tell you about the quality of the spam filter? What do the results of the confusion matrix of an ideal spam filter look like? What is worse: high FP or high FN values?

You will find the answers indirectly in the following explanations.

Incidentally, the ideal spam filter would have 0 values for both FP and FN.

The previous result means that 11 mail pieces out of a hundred will be classified as ham, even though they are spam; 89 are correctly classified as ham. This is a point where we should talk about the costs of misclassification. It is troublesome when a spam mail is not recognized as "spam" and is instead presented to us as "ham". If the percentage is not too high, it is annoying but not a disaster. In contrast, when a non-spam message is wrongly labeled as spam, the email will in many cases not be shown or will even be automatically deleted. This carries a high risk of losing customers and friends, for example. The precision measure makes no statement about this last-mentioned problem class. What about other measures?
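To see this numerically, here is a small sketch treating "ham" as the positive class, with counts matching the text (89 of 100 predicted-ham mails are truly ham) and an assumed number of ham mails lost to the spam folder:

# Counts from the text, plus one assumed value
tp_ham = 89   # predicted ham, actually ham
fp_ham = 11   # predicted ham, actually spam
fn_ham = 25   # ham wrongly labeled as spam (assumed for illustration)

precision_ham = tp_ham / (tp_ham + fp_ham)
recall_ham = tp_ham / (tp_ham + fn_ham)

print(precision_ham)  # 0.89, unaffected by the lost ham mails
print(recall_ham)     # ~0.78, drops as more ham lands in the spam folder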
