
Tikrit University
College of Computer Science and Mathematics
Department of Computer Science

Machine Learning
Lecture Four
Introduction to Machine Learning (cont.)
1
Topics
Performance evaluation metrics for machine learning methods:

1. Classification Metrics

2. Regression Metrics

3. Clustering Metrics

2
Some performance evaluation metrics for machine learning methods
• Evaluating the performance of machine learning methods and algorithms is essential, as it shows how effective a particular technique or algorithm is for a given problem.
• Here are key performance metrics for different types of machine learning
problems:
1. Classification Metrics

2. Regression Metrics

3. Clustering Metrics

3
Classification Metrics

For tasks where the goal is to assign a label to each input (e.g., spam
detection, image classification):
- Accuracy
- Confusion Matrix
- Precision
- Recall
- F1-score

4
Accuracy

Accuracy is a performance metric defined as the number of correct results of the learning process divided by the total number of results, as shown in the following equation:

$$\text{Accuracy} = \frac{\text{No. of Correct Results}}{\text{Total No. of Results}}$$

It is best suited to balanced datasets, where each class has a roughly equal number of samples.
5
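A minimal sketch of the accuracy calculation in Python, using a small set of hypothetical true labels and predictions (illustrative only, not data from the slides):

```python
# Hypothetical true labels and model predictions (illustrative only).
y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]
y_pred = [1, 0, 1, 0, 0, 1, 0, 1, 1, 1]

# Accuracy = number of correct results / total number of results.
correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
accuracy = correct / len(y_true)
print(f"Accuracy = {accuracy:.2f}")  # 8 correct out of 10 -> 0.80
```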
Confusion matrix
• A confusion matrix is an n×n matrix used to evaluate the performance of a classification model.

• For binary classification, the confusion matrix is a 2×2 matrix.

• If there are three target classes, the confusion matrix is a 3×3 matrix, and so on.
• The terms of the two-class confusion matrix are listed on the next slide.

6
Confusion matrix(cont.)

Terminologies used in the confusion matrix:

• True Positive → Positive class which is predicted as positive.
• True Negative → Negative class which is predicted as negative.
• False Positive → Negative class which is predicted as positive. [Type I Error]
• False Negative → Positive class which is predicted as negative. [Type II Error]

7
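A short sketch that counts the four confusion-matrix terms directly from the definitions above, continuing the hypothetical binary-label example:

```python
y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]
y_pred = [1, 0, 1, 0, 0, 1, 0, 1, 1, 1]

# Count each cell of the 2x2 confusion matrix (1 = positive, 0 = negative).
TP = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)  # predicted positive, actually positive
TN = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)  # predicted negative, actually negative
FP = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)  # Type I error
FN = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)  # Type II error
print(TP, TN, FP, FN)  # 5 3 1 1
```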
Precision

Precision measures how many of the predicted positive cases were actually positive. High precision means few false positives. Precision is calculated as follows:

$$\text{Precision} = \frac{TP}{TP + FP}$$

8
Recall
Recall measures how many of the actual positive cases were correctly predicted. High recall means few false negatives. Recall is calculated as follows:

$$\text{Recall} = \frac{TP}{TP + FN}$$

9
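A minimal sketch computing precision and recall from confusion-matrix counts; the TP, FP, FN values come from the hypothetical example above, not from the slides:

```python
TP, FP, FN = 5, 1, 1  # counts from the hypothetical example above

precision = TP / (TP + FP)  # few false positives -> high precision
recall = TP / (TP + FN)     # few false negatives -> high recall
print(f"Precision = {precision:.3f}, Recall = {recall:.3f}")  # 0.833, 0.833
```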
F1-Score
• The F1 score is a metric used to evaluate the performance of a classification model in machine learning. It is the harmonic mean of precision and recall, so it provides a balance between these two metrics. The F1 score is particularly useful when there is an imbalance in the number of instances between the positive and negative classes, as it considers both false positive and false negative predictions.

$$F_1\text{-score} = \frac{2\,(\text{precision} \times \text{recall})}{\text{precision} + \text{recall}}$$
10
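Continuing the hypothetical sketch, the F1 score follows directly from the formula above:

```python
precision, recall = 0.833, 0.833  # values from the sketch above

# Harmonic mean of precision and recall.
f1 = 2 * (precision * recall) / (precision + recall)
print(f"F1 = {f1:.3f}")  # 0.833 when precision and recall are equal
```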
F1 score vs Accuracy

• Accuracy counts true positives and true negatives together; it says nothing about how the errors split into false positives and false negatives. If accuracy is 95%, we do not know how the remaining 5% is distributed between false positives and false negatives.

• The F1 score is built from false positives and false negatives. For models where this error distribution matters, the F1 score is the metric used to evaluate performance.

11
• Example: A cancer dataset has 100 records, of which 94 are cancer records and 6 are non-cancer records. The model predicts 90 of the 94 cancer records correctly; the remaining 4 cancer records are predicted incorrectly [4 → FN]. Compute the recall.

12
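A worked solution, applying the recall formula from slide 9 to the numbers given above (TP = 90, FN = 4):

$$\text{Recall} = \frac{TP}{TP + FN} = \frac{90}{90 + 4} = \frac{90}{94} \approx 0.957$$

That is, about 95.7% of the actual cancer cases are correctly identified.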
Calculating metrics from the Confusion Matrix
• Let’s take the “Email Spam Filtering” example. Our task is to detect spam
emails. So spam emails are marked as 1 and not spam emails are marked as 0.
• I have taken 10 records. Let’s say our model prediction looks like this.

13
14
15
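The prediction table shown on the original slides is not reproduced here, so the sketch below uses a hypothetical labelling of 10 emails (spam = 1, not spam = 0) purely to illustrate how the metrics are read off the confusion matrix:

```python
# Hypothetical labels for 10 emails (spam = 1, not spam = 0); not the slide's data.
y_true = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]

TP = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))  # 3
TN = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))  # 4
FP = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))  # 1
FN = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))  # 2

accuracy  = (TP + TN) / (TP + TN + FP + FN)        # 0.70
precision = TP / (TP + FP)                         # 0.75
recall    = TP / (TP + FN)                         # 0.60
f1 = 2 * precision * recall / (precision + recall)
print(accuracy, precision, recall, round(f1, 3))   # 0.7 0.75 0.6 0.667
```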
Regression Metrics

For tasks where the goal is to predict continuous values (e.g., house
prices, stock prices):
- Mean Absolute Error (MAE)
- Mean Squared Error (MSE)
- Root Mean Squared Error (RMSE)

16
Mean absolute error
• It is measured as the average absolute difference between the predicted values and the actual values and is used to assess the effectiveness of a regression model.
• The Mean Absolute Error is calculated as:

$$\text{MAE} = \frac{1}{n}\sum_{i=1}^{n} \left| y_i - \hat{y}_i \right|$$

where n is the number of samples, y_i the actual value, and ŷ_i the predicted value.
17
• Example (MAE): Here is a set of actual prices of crafts and the prices predicted by the algorithm:

Actual:    25, 15, 20, 30, 40
Predicted: 28, 14, 22, 29, 38

Solution:
• Here, n = 5,
• MAE = 1/5 * (|25-28| + |15-14| + |20-22| + |30-29| + |40-38|) = 1/5 * (3+1+2+1+2) = 1/5 * 9 = 1.8

18
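A short sketch reproducing the MAE calculation above in Python:

```python
actual    = [25, 15, 20, 30, 40]  # actual prices from the example
predicted = [28, 14, 22, 29, 38]  # prices predicted by the algorithm

# Average absolute difference between predicted and actual values.
mae = sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)
print(mae)  # 1.8
```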
Mean square error
• The Mean Squared Error measures how close a regression line is to a set of data points.

• A smaller MSE is preferred because it indicates that the data points are dispersed closely around the regression line.

• The Mean Squared Error is calculated as:

$$\text{MSE} = \frac{1}{n}\sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2$$

where n represents the number of samples, y_i the actual value, and ŷ_i the predicted value.

19
• Calculate the Mean Squared Error for a product's monthly sales data.
• Step 1: Calculate the squared error of each data point:

20
• Step 2: Calculate the Mean Squared Error:

The MSE for this model is 8.17.


21
Root Mean Squared Error (RMSE)

It is the square root of the MSE and gives an error in the same units as the target variable. The RMSE is calculated as:

$$\text{RMSE} = \sqrt{\text{MSE}} = \sqrt{\frac{1}{n}\sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2}$$

The RMSE for the model mentioned in the previous slide is √8.17 ≈ 2.86.

22
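The monthly sales table from the MSE example is not reproduced here, so the sketch below reuses the craft-price data from the MAE example to show how MSE and RMSE are computed:

```python
import math

actual    = [25, 15, 20, 30, 40]
predicted = [28, 14, 22, 29, 38]

# Mean Squared Error: average of the squared differences.
mse = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)

# Root Mean Squared Error: square root of the MSE, in the target's units.
rmse = math.sqrt(mse)
print(round(mse, 2), round(rmse, 2))  # 3.8 1.95
```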
Clustering Metrics

- Clustering algorithms are unsupervised learning techniques used to divide data into groups (clusters) such that the data points within each group are more similar to each other than to those in other groups.

- Evaluating the performance of clustering algorithms can be challenging since there is no predefined classification to rely on.

23
Clustering Metrics(cont.)
Several evaluation metrics are used to assess the quality of clustering. These
metrics can be divided into two main categories:

- Internal Evaluation Metrics: These metrics rely on the information available from the data itself, such as the distances between points or their similarity. The goal is to evaluate the clustering quality based on the internal structure of the data.

- External Evaluation Metrics: These metrics rely on prior knowledge of the correct labels or classifications of the data and are usually used when labeled or pre-categorized data is available.

24
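As an illustration of the two categories, assuming scikit-learn is available: the silhouette score is a common internal metric (computed from the data and the cluster assignments alone), while the adjusted Rand index is a common external metric (it compares the clustering against known labels). Neither metric is named on the slides; they are given here only as examples.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score, adjusted_rand_score

# Small synthetic dataset with known ("true") groupings.
X, y_true = make_blobs(n_samples=150, centers=3, random_state=0)

# Cluster the data without using the labels.
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# Internal metric: uses only the data and the cluster assignments.
print("Silhouette:", silhouette_score(X, labels))

# External metric: compares the clustering with the known labels.
print("Adjusted Rand index:", adjusted_rand_score(y_true, labels))
```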
