2-Training and Testing Models, Evaluation Metrics-01-07-2023

Training and Testing Models

• Machine Learning algorithms enable machines to make predictions and solve problems on the basis of past observations or experiences.
• An algorithm takes these experiences or observations from the training data that is fed to it.
• Further, one of the great things about ML algorithms is that they can learn and improve over time on their own, as they are trained with relevant training data.
• Once the model has been trained enough with the relevant training data, it is tested with the test data.
• We can understand the whole process of training and testing in three steps (a minimal Python sketch follows this list):
• Feed: First, we train the model by feeding it the training input data.
• Define: The training data is tagged with the corresponding outputs (in Supervised Learning), and the model transforms the training data into feature vectors or a set of numeric data features.
• Test: In the last step, we test the model by feeding it the test data (an unseen dataset). This step checks whether the model has been trained effectively and can generalize well.
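A minimal sketch of this feed/define/test workflow, assuming scikit-learn and a synthetic dataset (both are illustrative choices, not prescribed by the slides):

# Feed / Define / Test sketch with scikit-learn (illustrative assumption)
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Feed: labelled training data
X, y = make_classification(n_samples=1000, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Define: the model learns a mapping from feature vectors to labels
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

# Test: evaluate generalization on unseen data
print("Test accuracy:", model.score(X_test, y_test))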

Training vs Testing
• The main difference between training data and testing data is that training data is the subset of the original data used to train the machine learning model, whereas testing data is used to check the accuracy of the model.
• The training dataset is generally larger than the testing dataset. Common ratios for splitting train and test datasets are 80:20, 70:30, or 90:10 (see the sketch below).
• Training data is well known to the model since it is used for training, whereas testing data is like unseen/new data to the model.
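A short sketch of those split ratios, assuming scikit-learn's train_test_split (an illustrative choice of helper):

# Common train/test split ratios expressed with scikit-learn
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(100).reshape(-1, 1)   # 100 toy samples
y = np.arange(100)

for test_size in (0.2, 0.3, 0.1):   # 80:20, 70:30, 90:10
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=0)
    print(f"train={len(X_train)}  test={len(X_test)}")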
Evaluation Metrics
• Performance metrics are used to evaluate the effectiveness of a machine learning model.

Performance Metrics for Regression
• Regression analysis is a subfield of supervised machine learning.
• It aims to model the relationship between a certain number of features and a continuous target variable.
• The following performance metrics are used for evaluating a regression model (a scikit-learn sketch follows the list):
• Mean Absolute Error (MAE)
• Mean Squared Error (MSE)
• Root Mean Squared Error (RMSE)
• R-Squared
• Adjusted R-Squared
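A hedged sketch of these metrics, assuming scikit-learn and the toy values used in the worked examples below (actual = 5, 10, 15, 20; predicted = 4.8, 10.6, 14.3, 20.1):

# Regression metrics on the slides' toy predictions
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

y_true = np.array([5, 10, 15, 20])
y_pred = np.array([4.8, 10.6, 14.3, 20.1])

mae = mean_absolute_error(y_true, y_pred)
mse = mean_squared_error(y_true, y_pred)
rmse = np.sqrt(mse)                      # square root of MSE
r2 = r2_score(y_true, y_pred)

print(f"MAE={mae:.3f}  MSE={mse:.3f}  RMSE={rmse:.3f}  R^2={r2:.4f}")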

Mean Squared Error (MSE)
• MSE is the average of the squared differences between actual and predicted values: MSE = (1/n) * Σ (actual_i - predicted_i)^2.
• Because the error term is squared, MSE is more sensitive to outliers than Mean Absolute Error (MAE).
• Thus, for actual values (5, 10, 15, 20) and predictions (4.8, 10.6, 14.3, 20.1):
MSE = 1/4 * ((5-4.8)^2 + (10-10.6)^2 + (15-14.3)^2 + (20-20.1)^2) = 0.225

Root Mean Squared Error (RMSE)
• Since MSE contains squared error terms, we take the square root of the MSE, which gives the Root Mean Squared Error: RMSE = sqrt(MSE). This puts the error back in the same units as the target variable.
• Thus, RMSE = (0.225)^0.5 ≈ 0.474

R-Squared
• R-squared is calculated by dividing the sum of squared residuals from the regression model (SS_res) by the total sum of squares of errors from the average model (SS_tot) and subtracting the result from 1: R^2 = 1 - SS_res / SS_tot.
• R-squared is also known as the Coefficient of Determination.
• It measures the degree to which the input variables explain the variation of the output/predicted variable (a short sketch follows).
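A minimal sketch of R^2 computed from its definition, reusing the toy values above (assumed NumPy implementation):

# R^2 = 1 - SS_res / SS_tot
import numpy as np

y_true = np.array([5, 10, 15, 20])
y_pred = np.array([4.8, 10.6, 14.3, 20.1])

ss_res = np.sum((y_true - y_pred) ** 2)          # residual sum of squares
ss_tot = np.sum((y_true - y_true.mean()) ** 2)   # total sum of squares (average model)
r2 = 1 - ss_res / ss_tot
print(f"R^2 = {r2:.4f}")                         # ~0.9928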
Adjusted R-squared
• Adjusted R^2 = 1 - (1 - R^2) * (N - 1) / (N - p - 1), where N is the total sample size (number of rows) and p is the number of predictors (number of columns).
• The limitation of R-squared is that it either stays the same or increases with the addition of more variables, even if they have no relationship with the output variable.
• To overcome this limitation, Adjusted R-squared penalizes you for adding variables that do not improve the existing model.
• Hence, if you are building a linear regression on multiple variables, it is always suggested that you use Adjusted R-squared to judge the goodness of the model.
• If there is only one input variable, R-squared and Adjusted R-squared are the same (a small sketch of the formula follows).
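A sketch of that formula (the example values of R^2, N, and p are illustrative assumptions):

# Adjusted R^2 = 1 - (1 - R^2) * (N - 1) / (N - p - 1)
def adjusted_r2(r2: float, n_samples: int, n_predictors: int) -> float:
    return 1 - (1 - r2) * (n_samples - 1) / (n_samples - n_predictors - 1)

# Adding predictors (p) without improving R^2 lowers the adjusted score
print(adjusted_r2(r2=0.90, n_samples=50, n_predictors=5))    # ~0.8886
print(adjusted_r2(r2=0.90, n_samples=50, n_predictors=10))   # ~0.8744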
Performance Metrics for Classification
• Classification is the problem of identifying to which of a set of categories/classes a new observation belongs, based on a training set of data containing records whose class labels are known.
• The following performance metrics are used for evaluating a classification model:
• Accuracy
• Precision and Recall
• Specificity
• F1-score
• AUC-ROC

• To understand the different metrics, we must first understand the confusion matrix.
• A confusion matrix is a table that is often used to describe the performance of a classification model (or "classifier") on a set of test data for which the true values are known (a small scikit-learn example follows).
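A hedged sketch of building a confusion matrix with scikit-learn (the label vectors below are made up for illustration):

# Confusion matrix for a binary problem
from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(f"TN={tn}  FP={fp}  FN={fn}  TP={tp}")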

For Multiclass
(Slide figure not reproduced: the confusion matrix generalizes to multiple classes, with one row and one column per class.)
• TN- True negatives (actual 0 predicted 0)
• TP- True positives (actual 1 predicted 1)
• FP- False positives (actual 0 predicted 1)
• FN- False Negatives (actual 1 predicted 0)
• Consider the following values for the confusion
matrix-
• True negatives (TN) = 300
• True positives (TP) = 500
• False negatives (FN) = 150
• False positives (FP) = 50

Accuracy
• Accuracy is defined as the ratio of the number of correct predictions to the total number of predictions: Accuracy = (TP + TN) / (TP + TN + FP + FN). Its value lies in [0, 1].
• In general, higher accuracy means a better model (TP and TN must be high).
• However, accuracy is not a useful metric in the case of an imbalanced dataset (a dataset with an uneven distribution of classes).
• Say we have data for 1000 patients, of which 50 have cancer and 950 do not. A dumb model that always predicts "no cancer" will have an accuracy of 95%, but it is of no practical use, since in this case we want the number of False Negatives to be at a minimum.
• Thus, we have different metrics like recall, precision, F1-score, etc.
• Thus, Accuracy using the above values will be (500+300)/(500+50+150+300) = 800/1000 = 80%

Recall
• Recall = TP / (TP + FN). It is a useful metric in the case of cancer detection, where we want to minimize the number of False Negatives, since we don't want the model to mark a patient suffering from cancer as safe.
• On the other hand, predicting a healthy patient as cancerous is not as big an issue, since further diagnosis will make it clear that he does not have cancer. Recall is also known as Sensitivity.
• Thus, Recall using the above values will be 500/(500+150) = 500/650 = 76.92%

Precision
• Precision = TP / (TP + FP). It is useful when we want to reduce the number of False Positives.
• Consider a system that predicts whether a received e-mail is spam or not. Taking spam as the positive class, we do not want the system to predict non-spam e-mails (important e-mails) as spam, i.e., the aim is to reduce the number of False Positives.
• Thus, Precision using the above values will be 500/(500+50) = 500/550 = 90.91%

Specificity
• Specificity is defined as the ratio of True Negatives to the sum of True Negatives and False Positives: Specificity = TN / (TN + FP).
• We want the value of specificity to be high. Its value lies in [0, 1].
• Thus, Specificity using the above values will be 300/(300+50) = 300/350 = 85.71%
F1-score
• F1-score is a metric that combines Precision and Recall; it is the harmonic mean of the two: F1 = 2 * (Precision * Recall) / (Precision + Recall).
• Its value lies in [0, 1] (the higher the value, the better the F1-score).
• Using Precision = 0.9091 and Recall = 0.7692, F1-score = 0.8333 = 83.33% (a sketch verifying all of the classification metrics above follows).
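A hedged verification sketch computing Accuracy, Recall, Precision, Specificity, and F1-score directly from the confusion-matrix counts used above (TP=500, TN=300, FP=50, FN=150):

# Classification metrics from raw confusion-matrix counts
TP, TN, FP, FN = 500, 300, 50, 150

accuracy = (TP + TN) / (TP + TN + FP + FN)
recall = TP / (TP + FN)                    # sensitivity
precision = TP / (TP + FP)
specificity = TN / (TN + FP)
f1 = 2 * precision * recall / (precision + recall)

print(f"Accuracy    = {accuracy:.2%}")     # 80.00%
print(f"Recall      = {recall:.2%}")       # 76.92%
print(f"Precision   = {precision:.2%}")    # 90.91%
print(f"Specificity = {specificity:.2%}")  # 85.71%
print(f"F1-score    = {f1:.2%}")           # 83.33%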

AUC-ROC
• The AUC (Area Under the Curve) - ROC (Receiver Operating Characteristic) curve is one of the most important evaluation metrics for checking a classification model's performance.
• The ROC curve is plotted with FPR on the X-axis and TPR on the Y-axis, and AUC is the area under that curve.
• An AUC of 1 corresponds to a perfect classifier and 0.5 to random guessing; if the value is less than 0.5, then the model is even worse than a random guessing model (a short sketch follows).
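A sketch of computing the ROC curve and AUC with scikit-learn (the labels and scores below are illustrative assumptions):

# ROC curve (FPR vs TPR) and AUC
import numpy as np
from sklearn.metrics import roc_auc_score, roc_curve

y_true = np.array([0, 0, 1, 1, 0, 1, 1, 0])
y_score = np.array([0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.6, 0.3])  # predicted probabilities for class 1

fpr, tpr, thresholds = roc_curve(y_true, y_score)
print("FPR:", fpr)
print("TPR:", tpr)
print("AUC:", roc_auc_score(y_true, y_score))   # 1.0 = perfect, 0.5 = random guessing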