Confusion Matrix & Evaluation Metrics in Machine Learning
Evaluation Metrics in Machine Learning
Metrics to Evaluate Machine Learning Classification Algorithms
• Now that we have an idea of the different types of
classification models, it is crucial to choose the right
evaluation metrics for those models.
• We will cover the most commonly used metrics: accuracy, precision, recall, F1 score, and the ROC (Receiver Operating Characteristic) curve with its AUC (Area Under the Curve), as previewed in the sketch below.
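As a quick preview, the following minimal sketch shows one way to compute these metrics in Python with scikit-learn (assumed to be installed); the labels and scores are made up purely for illustration.

# Preview: computing the metrics listed above with scikit-learn.
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, f1_score, roc_auc_score)

y_true  = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]   # actual classes (made up)
y_pred  = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]   # hard predictions (made up)
y_score = [0.9, 0.2, 0.8, 0.4, 0.1, 0.7,
           0.6, 0.3, 0.95, 0.05]           # predicted probabilities (made up)

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall   :", recall_score(y_true, y_pred))
print("F1 score :", f1_score(y_true, y_pred))
print("ROC AUC  :", roc_auc_score(y_true, y_score))  # AUC uses scores, not hard labels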
Understanding the Confusion Matrix in Machine Learning
• Machine learning models are increasingly used in
various applications to classify data into different
categories.
• However, evaluating the performance of these models
is crucial to ensure their accuracy and reliability.
• One essential tool in this evaluation process is the
confusion matrix.
What is a Confusion Matrix?
• A confusion matrix is a simple table that shows how well a
classification model is performing by comparing its
predictions to the actual results.
• It breaks down the predictions into four categories:
• Correct predictions for both classes (true positives and true
negatives) and
• Incorrect predictions (false positives and false negatives).
• This helps you understand where the model is making
mistakes, so you can improve it.
A 2×2 Confusion Matrix
• The matrix displays the number of instances produced by the
model on the test data.
• True Positive (TP): The model correctly predicted a positive
outcome (the actual outcome was positive).
• True Negative (TN): The model correctly predicted a negative
outcome (the actual outcome was negative).
• False Positive (FP): The model incorrectly predicted a positive
outcome (the actual outcome was negative). Also known as a
Type I error.
• False Negative (FN): The model incorrectly predicted a
negative outcome (the actual outcome was positive). Also
known as a Type II error (see the sketch after this list).
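The following minimal sketch, assuming scikit-learn is available, shows how these four counts can be read off a 2×2 confusion matrix; the labels are made up for illustration.

from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 1, 0, 0]   # actual classes (made up)
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]   # model predictions (made up)

# For binary labels, confusion_matrix returns rows = actual, columns = predicted:
# [[TN, FP],
#  [FN, TP]]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print("TP:", tp, "TN:", tn, "FP:", fp, "FN:", fn)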
Example - Confusion Matrix for Dog Image Recognition with Numbers
• [Figure: 2×2 confusion matrix for the dog image recognition example; actual dog count = 6.]
ROC Curve and AUC
• The ROC curve plots the true positive rate (TPR) against the false positive rate (FPR) across classification thresholds, and the AUC (Area Under the Curve) is the area under that curve.
• It measures the overall performance of a binary classification model.
• Because both TPR and FPR range between 0 and 1, the area always lies between 0 and 1, and a greater AUC denotes better model performance.
• Our main goal is to maximize this area so that the model achieves a high TPR and a low FPR across thresholds.
• The AUC equals the probability that the model assigns a randomly chosen positive instance a higher predicted probability than a randomly chosen negative instance (see the sketch below).
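The following minimal sketch, assuming scikit-learn is available, computes the ROC curve and its AUC; the labels and scores are made up for illustration.

from sklearn.metrics import roc_curve, roc_auc_score

y_true  = [0, 0, 1, 1, 0, 1, 1, 0]                   # actual classes (made up)
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.6, 0.3]  # predicted probabilities (made up)

fpr, tpr, thresholds = roc_curve(y_true, y_score)  # TPR and FPR at each threshold
auc = roc_auc_score(y_true, y_score)               # area under that curve

print("FPR:", fpr)
print("TPR:", tpr)
print("AUC:", auc)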
Type 1 and Type 2 Errors
Type 1 Error
• A Type 1 Error occurs when the model incorrectly predicts a positive instance, but the actual instance is negative.
• This is also known as a false positive. Type 1 Errors affect the precision of a model, which measures the accuracy of positive predictions (see the sketch after the example below).
Example:
• This occurs when a diagnostic test predicts that a patient has the disease (positive result), but the patient is actually healthy (negative case).
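The following minimal sketch uses made-up counts to show how false positives (Type 1 errors) pull precision down, taking precision = TP / (TP + FP).

tp, fp = 40, 10                       # hypothetical counts: 10 healthy patients wrongly flagged
print("Precision:", tp / (tp + fp))   # 40 / 50 = 0.8

fp = 40                               # more Type 1 errors, same true positives
print("Precision:", tp / (tp + fp))   # 40 / 80 = 0.5, precision drops as FP grows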
Type 2 Error
• A Type 2 Error occurs when the model fails to predict a positive instance,
even though it is actually positive.
• This is also known as a false negative.
• Type 2 Errors impact the recall of a model, which measures how well the model identifies all actual positive cases (see the sketch after the example below).
Example:
Scenario: A diagnostic test is used to detect a particular disease in patients.
• This occurs when the test predicts that a patient is healthy (negative result), but the patient actually has the disease (positive case).
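Similarly, the following minimal sketch uses made-up counts to show how false negatives (Type 2 errors) pull recall down, taking recall = TP / (TP + FN).

tp, fn = 40, 10                    # hypothetical counts: 10 diseased patients the test missed
print("Recall:", tp / (tp + fn))   # 40 / 50 = 0.8

fn = 40                            # more Type 2 errors, same true positives
print("Recall:", tp / (tp + fn))   # 40 / 80 = 0.5, recall drops as FN grows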