
Introduction to Machine Learning

Course Teacher:
Dr. M. Shahidur Rahman
Professor, DoCSE, SUST
Model Performance and Evaluation Metrics

Topics covered:
 Evaluation Metrics
 Model Performance Evaluation
 Model Selection
Model Performance and Evaluation Metrics

 In the classification domain, the simplest visualization of the success of a model is normally described using the confusion matrix.
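
As an illustration, a confusion matrix can be computed with scikit-learn as follows; the labels and predictions below are made-up placeholder values:

from sklearn.metrics import confusion_matrix

# Hypothetical true labels and model predictions (placeholder values)
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# Rows are true classes, columns are predicted classes: [[TN, FP], [FN, TP]]
cm = confusion_matrix(y_true, y_pred)
tn, fp, fn, tp = cm.ravel()
print(cm)
print(f"TP={tp}, FP={fp}, TN={tn}, FN={fn}")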
Evaluation Metrics

 Accuracy: (TP + TN) / (TP + TN + FP + FN)

 True positive rate (TPR) or recall or hit rate or sensitivity: TP / (TP + FN)

 Precision or positive predictive value: TP / (TP + FP)

 F1 Score: 2 × (Precision × Recall) / (Precision + Recall)
Evaluation Metrics…

 Specificity or true negative rate (TNR): TN / (TN + FP)

 Miss rate or false negative rate (FNR): FN / (FN + TP)

 False Positive Rate (FPR): FP / (FP + TN)
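
As a quick illustration of these definitions, the base measures can be computed directly from the four confusion-matrix counts; the counts used below are arbitrary placeholders:

def rates(tp, fp, tn, fn):
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    recall = tp / (tp + fn)           # TPR / sensitivity / hit rate
    precision = tp / (tp + fp)        # positive predictive value
    specificity = tn / (tn + fp)      # TNR
    fpr = fp / (fp + tn)              # false positive rate
    fnr = fn / (fn + tp)              # miss rate
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, specificity, fpr, fnr, f1

print(rates(tp=40, fp=10, tn=45, fn=5))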


Evaluation Metrics…

 Accuracy and classification error are informative measures of success when the data is balanced in terms of the classes.
 When the data is imbalanced, i.e., one class is represented in larger proportion than the other class in the dataset, these measures become biased towards the majority class and give a wrong estimate of success.
 In such cases, base measures, such as true positive rate (TPR), false positive rate (FPR), true negative rate (TNR), and false negative rate (FNR), become useful.
 Metrics such as the F1 score combine the base measures to give an overall measure of success.
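
A small illustration of this bias, using made-up numbers: a classifier that always predicts the majority class reaches 95% accuracy on a 95:5 imbalanced set, while its TPR is 0.

# Hypothetical imbalanced test set: 95 negatives, 5 positives
y_true = [0] * 95 + [1] * 5
y_pred = [0] * 100              # a "classifier" that always predicts the majority class

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
print(accuracy)        # 0.95 -- looks good
print(tp / (tp + fn))  # TPR (recall) = 0.0 -- reveals the failure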
Evaluation Metrics…

 The curve that plots TPR and FPR for a classifier at various thresholds is known as the receiver-operating characteristic (ROC) curve.
 Precision and recall can be plotted at different thresholds, giving the precision-recall curve (PRC).
 The areas under each curve are respectively known as auROC and auPRC and are popular metrics of performance.
 In particular, auPRC is generally considered to be an informative metric in the presence of imbalanced classes.
ROC AUC

 A perfect classifier would fall into the top-left corner of the graph with a TPR of 1 and an FPR of 0.
 Based on the ROC curve, we compute the ROC area under the curve (ROC AUC) to characterize the performance of a classification model.
 Higher ROC AUC means better classification performance.
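
For illustration, one common way to obtain the ROC curve and ROC AUC is scikit-learn's roc_curve and roc_auc_score; the labels and predicted scores below are placeholder values:

from sklearn.metrics import roc_curve, roc_auc_score

# Hypothetical true labels and predicted probabilities (placeholder values)
y_true  = [0, 0, 1, 1, 0, 1, 0, 1]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7, 0.5, 0.9]

fpr, tpr, thresholds = roc_curve(y_true, y_score)  # points of the ROC curve
auc = roc_auc_score(y_true, y_score)               # area under that curve
print(auc)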
Regression Evaluation Metrics

 Average prediction error: (1/n) Σ (yᵢ - ŷᵢ)

 Mean absolute error (MAE): (1/n) Σ |yᵢ - ŷᵢ|

 Root mean squared error (RMSE): √( (1/n) Σ (yᵢ - ŷᵢ)² )

 Relative squared error (RSE) is used when two errors are measured in different units: Σ (yᵢ - ŷᵢ)² / Σ (yᵢ - ȳ)²

(Here yᵢ is the actual value, ŷᵢ the predicted value, ȳ the mean of the actual values, and n the number of examples.)
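
These regression metrics can be computed in a few lines of NumPy; the targets and predictions below are placeholder values:

import numpy as np

# Hypothetical targets and predictions (placeholder values)
y_true = np.array([3.0, 5.0, 2.5, 7.0])
y_pred = np.array([2.5, 5.0, 4.0, 8.0])

mae  = np.mean(np.abs(y_true - y_pred))
rmse = np.sqrt(np.mean((y_true - y_pred) ** 2))
rse  = np.sum((y_true - y_pred) ** 2) / np.sum((y_true - y_true.mean()) ** 2)
print(mae, rmse, rse)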
Ratio for partitioning a dataset into training and test datasets
 In general, we don't want to allocate too much information to the test set.
 However, the smaller the test set, the more inaccurate the estimation of the
generalization error.
 Dividing a dataset into training and test datasets is all about balancing this
tradeoff.
 In practice, the most commonly used splits are 60:40, 70:30, or 80:20,
depending on the size of the initial dataset.
 For large datasets, 90:10 or 99:1 splits are also common and appropriate.
 For example, if the dataset contains more than 100,000 training examples, it
might be fine to withhold only 10,000 examples for testing in order to get a
good estimate of the generalization performance.
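
For example, an 80:20 split can be produced with scikit-learn's train_test_split; the Iris data here is only a stand-in dataset for illustration:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)   # stand-in dataset

# 80:20 split; stratify keeps the class proportions similar in both parts
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=1)
print(X_train.shape, X_test.shape)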
Underfitting and overfitting

 A model can also suffer from underfitting (high bias), which means that our model is not complex enough to capture the pattern in the training data well and also suffers from low performance on unseen data.
 If a model is too complex for a given training dataset (too many parameters in the model), it tends to overfit the training data and does not generalize well to unseen data.
Debugging algorithms with learning and validation curves
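
As a sketch of the idea (the model and dataset are stand-ins, not the ones used in the slides), a learning curve can be generated with scikit-learn's learning_curve and inspected for the underfitting and overfitting patterns described above:

import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import learning_curve

X, y = load_breast_cancer(return_X_y=True)   # stand-in dataset

# Training and validation accuracy at increasing training-set sizes
sizes, train_scores, val_scores = learning_curve(
    LogisticRegression(max_iter=10000), X, y,
    train_sizes=np.linspace(0.1, 1.0, 5), cv=10)
print(sizes)
print(train_scores.mean(axis=1))  # high train, low validation -> overfitting
print(val_scores.mean(axis=1))    # both low -> underfitting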
Hyperparameter tuning

 Validation techniques are meant to answer the question of how to select a model(s) with the right hyperparameter values.
 Hyperparameters are parameters set before training a machine learning model. They are not learned from the data but are manually configured to optimize model performance. Examples: learning rate (𝛼), number of trees, kernel type in SVM.

[Figure: validation curve for hyperparameter C, the inverse regularization parameter of the LogisticRegression classifier, where C=1 provides the best performance.]
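
A sketch of a validation curve over C for a LogisticRegression classifier, along the lines of the figure described above; the dataset and parameter range are stand-ins chosen only for illustration:

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import validation_curve

X, y = load_breast_cancer(return_X_y=True)   # stand-in dataset

# Validation accuracy for different values of the inverse regularization strength C
param_range = [0.001, 0.01, 0.1, 1.0, 10.0, 100.0]
train_scores, val_scores = validation_curve(
    LogisticRegression(max_iter=10000), X, y,
    param_name="C", param_range=param_range, cv=10)
for C, score in zip(param_range, val_scores.mean(axis=1)):
    print(C, round(score, 3))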
Holdout cross-validation

 A classic approach for estimating the generalization performance of ML models is holdout cross-validation, in which the dataset is split into separate training and test (and, for model selection, validation) sets.
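
A minimal sketch of a holdout split into training, validation, and test sets; the 60:20:20 proportions and the Iris data are assumptions used only for illustration:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)   # stand-in dataset

# Holdout validation: 60% train, 20% validation (for tuning), 20% test (final estimate)
X_train, X_tmp, y_train, y_tmp = train_test_split(X, y, test_size=0.4, random_state=1)
X_val, X_test, y_val, y_test = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=1)
print(len(X_train), len(X_val), len(X_test))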
K-fold cross validation

 The validation process needs a large number of labeled data points for creating the training set and the validation set.
 Collecting a large labeled set is usually difficult.
 In such cases, instead of physically separating the training set and validation set, k-fold cross-validation is used.
K-fold cross validation…

 Once we have found satisfactory hyperparameter values, we can retrain the model on the complete training dataset and obtain a final performance estimate using the independent test dataset.
 The value of k in k-fold cross-validation is typically k = 10.
 A special case of k-fold cross-validation is the leave-one-out cross-validation (LOOCV) method, where k = n, the number of training examples.
 LOOCV is recommended for working with very small datasets.
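
A sketch of 10-fold cross-validation (and LOOCV as the special case k = n) with scikit-learn's cross_val_score; the model and dataset are stand-ins:

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score, LeaveOneOut

X, y = load_breast_cancer(return_X_y=True)   # stand-in dataset
model = LogisticRegression(max_iter=10000)

# 10-fold cross-validation: mean accuracy over the 10 held-out folds
scores = cross_val_score(model, X, y, cv=10)
print(scores.mean(), scores.std())

# LOOCV is the special case k = n (one fold per training example)
loo_scores = cross_val_score(model, X, y, cv=LeaveOneOut())
print(loo_scores.mean())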
Model Selection

 Use cross-validation or k-fold cross-validation for fine-tuning the performance of an ML model by varying its hyperparameter values.
 Choose the model that performs best on relevant criteria such as accuracy.
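
One common way to combine k-fold cross-validation with hyperparameter search is GridSearchCV; the model, grid, and dataset below are stand-ins chosen only for illustration:

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

X, y = load_breast_cancer(return_X_y=True)   # stand-in dataset

# 10-fold CV over a small grid of C values; the best model is selected by mean accuracy
grid = GridSearchCV(LogisticRegression(max_iter=10000),
                    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},
                    cv=10, scoring="accuracy")
grid.fit(X, y)
print(grid.best_params_, grid.best_score_)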
