Performance Metrics in Machine Learning (ML) and Deep Learning (DL)
In both Machine Learning (ML) and Deep Learning (DL), the performance of
a model is evaluated using various metrics, depending on the type of task
being solved, whether it is classification, regression, or another specialized
task. For regression tasks, there are specific performance metrics that help
assess how well a model predicts continuous numerical values.
Here, we will cover the key regression metrics, their applications, formulas,
and use cases; a short code sketch follows each metric.
1.) R² Score (Coefficient of Determination)
• The R² score (also known as the Coefficient of Determination) is one of the most commonly used metrics for
evaluating the performance of regression models. It measures how well the model’s predictions approximate the actual
values. An R² score of 1.0 indicates perfect predictions, while a score of 0.0 means the model performs no better than
predicting the mean of the target variable.
Applications:
• R² is used to assess the goodness-of-fit of a regression model.
• It’s most commonly applied in linear regression and generalized
linear models.
Interpretation:
• R² = 1: Perfect model.
• R² = 0: Model doesn’t explain any of the variance.
• Negative R²: Model performs worse than a model that just
predicts the mean of the target variable.
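As a minimal sketch (the array values below are made up purely for illustration), R² can be computed by hand from its definition, R² = 1 − SS_res / SS_tot, or with scikit-learn's r2_score:

import numpy as np
from sklearn.metrics import r2_score

# Made-up example values, used only to illustrate the computation
y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5, 0.0, 2.0, 8.0])

# R^2 = 1 - SS_res / SS_tot
ss_res = np.sum((y_true - y_pred) ** 2)           # residual sum of squares
ss_tot = np.sum((y_true - y_true.mean()) ** 2)    # total sum of squares
r2_manual = 1 - ss_res / ss_tot

print(r2_manual, r2_score(y_true, y_pred))        # the two values agree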
2.) Mean Absolute Error (MAE)
• Mean Absolute Error (MAE) measures the average magnitude of errors between predicted values and
actual values. It provides a straightforward interpretation of prediction errors in the same units as the
target variable.
Applications:
• MAE is commonly used for regression tasks where the
emphasis is on minimizing the absolute differences between
actual and predicted values.
• It is easy to interpret, especially when dealing with non-technical
stakeholders.
Interpretation:
• The lower the MAE, the better the model’s predictions are.
• MAE = 0 means perfect predictions.
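A minimal sketch with the same made-up values; MAE is simply the average of the absolute differences, and scikit-learn's mean_absolute_error returns the same number:

import numpy as np
from sklearn.metrics import mean_absolute_error

# Made-up example values for illustration
y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5, 0.0, 2.0, 8.0])

# MAE = (1/n) * sum(|y_true - y_pred|)
mae_manual = np.mean(np.abs(y_true - y_pred))

print(mae_manual, mean_absolute_error(y_true, y_pred))  # 0.5 for both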
3.) Mean Absolute Percentage Error (MAPE)
• Mean Absolute Percentage Error (MAPE) is used to express the prediction accuracy of the model as a percentage. It
represents the average absolute percentage difference between the actual and predicted values. MAPE is particularly
useful when comparing the performance of different models across datasets of varying scales.
Applications:
• MAPE is used in forecasting models, especially in time series
forecasting (e.g., sales predictions, stock prices).
• It is highly effective when you want to understand percentage
errors and provide a normalized metric for the model’s
performance.
Interpretation:
• MAPE = 0% means perfect predictions.
• A higher MAPE indicates that the model is less accurate.
• However, MAPE is sensitive to zero values in the actual
dataset, which can cause undefined results.
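A minimal NumPy sketch, assuming all actual values are non-zero so the percentage stays defined (the numbers are made up); recent scikit-learn versions also offer mean_absolute_percentage_error, which returns a fraction rather than a percentage:

import numpy as np

# Made-up example values; every actual value is non-zero so MAPE is defined
y_true = np.array([100.0, 200.0, 150.0, 80.0])
y_pred = np.array([110.0, 190.0, 140.0, 100.0])

# MAPE = (100/n) * sum(|(y_true - y_pred) / y_true|)
mape = 100 * np.mean(np.abs((y_true - y_pred) / y_true))
print(f"MAPE: {mape:.2f}%")  # about 11.67% for these values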
4.) Mean Directional Accuracy (MDA)
• Mean Directional Accuracy (MDA) measures the model’s ability to predict the correct direction of change, i.e.,
whether the next value will move up or down relative to the previous observation. It is a common metric in
applications like stock market prediction, where directionality is often more important than the exact predicted value.
Applications:
• MDA is especially important in financial
forecasting and time series prediction
where the direction of movement is
more critical than the exact magnitude of
the change.
Interpretation:
• MDA = 1 means the model always
predicts the correct direction.
• MDA = 0 means the model always predicts the
wrong direction; a value around 0.5 indicates it is
no better than random guessing.
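scikit-learn does not ship an MDA function, so the sketch below computes it directly with NumPy on a made-up series; the direction of each predicted step is compared with the direction of the actual step, both measured from the previous actual observation:

import numpy as np

# Made-up time-series values for illustration
actual    = np.array([100.0, 102.0, 101.0, 105.0, 104.0])
predicted = np.array([100.0, 101.5, 102.0, 104.0, 103.0])

# Direction of the actual change vs. direction of the predicted change,
# both taken relative to the previous actual observation
actual_dir    = np.sign(actual[1:] - actual[:-1])
predicted_dir = np.sign(predicted[1:] - actual[:-1])

mda = np.mean(actual_dir == predicted_dir)
print(f"MDA: {mda:.2f}")  # fraction of steps where the direction matched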
5.) Explained Variance Score (EVS)
• The Explained Variance Score (EVS) measures the proportion of variance in the target variable that is explained by the
model. It provides insight into the model’s ability to capture the variability in the data.
Applications:
• EVS is useful in cases where the proportion of explained
variance is important, such as in linear regression, ridge
regression, or principal component analysis (PCA).
Interpretation:
• EVS = 1 means the model explains all the variance.
• EVS = 0 means the model explains none of the variance.
• Negative EVS means the model is worse than predicting the
mean value.
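A minimal sketch using made-up values; scikit-learn exposes this metric as explained_variance_score, and it can also be computed directly as 1 − Var(y_true − y_pred) / Var(y_true):

import numpy as np
from sklearn.metrics import explained_variance_score

# Made-up example values for illustration
y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5, 0.0, 2.0, 8.0])

# EVS = 1 - Var(residuals) / Var(y_true)
evs_manual = 1 - np.var(y_true - y_pred) / np.var(y_true)

print(evs_manual, explained_variance_score(y_true, y_pred))  # the two values agree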
Classification Performance Metrics
Accuracy is a simple metric that measures the overall correctness of a classification model, calculated as the ratio of correctly predicted instances
(both true positives and true negatives) to the total number of instances. It is effective when the classes in the dataset are balanced. However,
accuracy can be misleading in imbalanced datasets where one class vastly outnumbers the other, as a model could perform well simply by
predicting the majority class.
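A minimal sketch with made-up binary labels, using scikit-learn's accuracy_score:

from sklearn.metrics import accuracy_score

# Made-up binary labels for illustration
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# Accuracy = (TP + TN) / total predictions
print(accuracy_score(y_true, y_pred))  # 6 of 8 correct -> 0.75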
Precision focuses on the accuracy of positive predictions, measuring the proportion of true positives out of all predicted positives. It is crucial in
scenarios where false positives are costly, such as in fraud detection or email spam filtering, where predicting a negative as positive could lead to
unnecessary actions or expenses.
Recall, also known as sensitivity or the true positive rate, calculates the proportion of true positives out of all actual positives. This metric is
important when the cost of missing positive instances is high, such as in medical diagnostics, where failing to identify a disease (false negative)
could have serious consequences.
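Precision and recall can be read off the same made-up labels used above; the sketch below uses scikit-learn's precision_score and recall_score:

from sklearn.metrics import precision_score, recall_score

# Same made-up labels as in the accuracy example
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# Precision = TP / (TP + FP); Recall = TP / (TP + FN)
print(precision_score(y_true, y_pred))  # 3 of the 4 predicted positives are correct -> 0.75
print(recall_score(y_true, y_pred))     # 3 of the 4 actual positives are recovered -> 0.75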
F1 Score combines both precision and recall into a single metric by taking their harmonic mean. It is particularly useful when dealing with
imbalanced datasets, as it provides a balance between precision and recall, penalizing extreme values in either metric. It is often preferred over
accuracy when there is a need to handle both false positives and false negatives effectively.
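Continuing with the same made-up labels, scikit-learn's f1_score returns the harmonic mean of the precision and recall computed above:

from sklearn.metrics import f1_score

# Same made-up labels as in the previous examples
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# F1 = 2 * (precision * recall) / (precision + recall)
print(f1_score(y_true, y_pred))  # harmonic mean of 0.75 and 0.75 -> 0.75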
ROC-AUC (Receiver Operating Characteristic - Area Under Curve) measures the trade-off between the true positive rate and the false positive
rate across different thresholds. The AUC represents the model’s ability to distinguish between classes, with a higher AUC indicating better overall
performance. It is a powerful metric for evaluating binary classifiers, especially in imbalanced datasets, as it does not depend on a fixed threshold
for classification.
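ROC-AUC is computed from predicted scores or probabilities rather than hard labels; the scores below are made up for illustration and passed to scikit-learn's roc_auc_score:

from sklearn.metrics import roc_auc_score

# Made-up labels and predicted probabilities for the positive class
y_true  = [1, 0, 1, 1, 0, 1, 0, 0]
y_score = [0.9, 0.2, 0.4, 0.8, 0.3, 0.7, 0.6, 0.1]

# AUC is threshold-independent: it reflects how well positives are ranked above negatives
print(roc_auc_score(y_true, y_score))  # 0.9375 for these made-up scores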
Confusion Matrix provides a detailed breakdown of a model’s performance by showing the counts of true positives, false positives, true negatives,
and false negatives. It helps to visualize where the model is making errors and is the foundation for many of the other performance metrics. It is
especially useful when precision and recall are important and helps in understanding how the model is performing across different classes.
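A minimal sketch with the same made-up labels; scikit-learn's confusion_matrix returns the counts with actual classes as rows and predicted classes as columns:

from sklearn.metrics import confusion_matrix

# Same made-up labels as in the earlier classification examples
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# For binary 0/1 labels the layout is:
# [[TN, FP],
#  [FN, TP]]
print(confusion_matrix(y_true, y_pred))
# [[3 1]
#  [1 3]]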