
Experiment No.

Aim:
To implement and explore performance evaluation metrics for data models

Theory:
In machine learning, once we build a model, we need to evaluate its performance. This helps in
understanding how well the model is performing and whether it is suitable for making
predictions. For classification problems, there are several performance evaluation metrics that
help quantify the effectiveness of a model. These metrics include Accuracy, Precision, Recall,
F1 Score, and the Confusion Matrix.

1. Confusion Matrix:
The Confusion Matrix is a table used to describe the performance of a classification algorithm.
It compares the predicted labels with the actual true labels of the dataset. It contains the
following four components:
● True Positive (TP): The number of correct predictions where the model predicted the
positive class, and it is actually positive.
● True Negative (TN): The number of correct predictions where the model predicted the
negative class, and it is actually negative.
● False Positive (FP): The number of incorrect predictions where the model predicted the
positive class, but it is actually negative (also known as a Type I error).
● False Negative (FN): The number of incorrect predictions where the model predicted the
negative class, but it is actually positive (also known as a Type II error).
The Confusion Matrix is represented as:
                    Predicted Positive       Predicted Negative
Actual Positive     True Positive (TP)       False Negative (FN)
Actual Negative     False Positive (FP)      True Negative (TN)

2. Accuracy:
Accuracy is the simplest and most commonly used evaluation metric. It measures the overall
performance of the model and is calculated as the ratio of correct predictions to the total number
of predictions.
● High accuracy indicates that the model is performing well overall.
● However, accuracy can be misleading, especially if the dataset is imbalanced (e.g., one
class is much larger than the other). In such cases, even a model that predicts the
majority class most of the time could have high accuracy but poor performance in
predicting the minority class.
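
Expressed with the confusion-matrix counts defined above, accuracy is computed as:

Accuracy = (TP + TN) / (TP + TN + FP + FN)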

3. Precision:
Precision is a metric that measures how many of the predicted positive instances are actually
positive. It answers the question: Of all the instances that were predicted as positive, how many
were truly positive?

● High precision means that when the model predicts positive, it is very likely to be
correct.
● Precision is particularly important when the cost of a false positive is high (for example,
in medical diagnoses where misdiagnosing healthy patients as sick could be harmful).
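
In the same notation, precision is computed as:

Precision = TP / (TP + FP)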

4. Recall (Sensitivity or True Positive Rate):
Recall (also known as Sensitivity or True Positive Rate) measures how many actual positive
instances were correctly predicted by the model. It answers the question: Of all the actual
positives, how many did the model successfully identify?

● High recall indicates that the model is correctly identifying most of the positive cases.
● Recall is critical when the cost of missing a positive instance (false negative) is high. For
instance, in fraud detection or disease detection, we prefer to catch most of the positive
cases, even if it means having some false positives.
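
In the same notation, recall is computed as:

Recall = TP / (TP + FN)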

5. F1 Score:
The F1 Score is the harmonic mean of Precision and Recall. It provides a single score that
balances both precision and recall, especially when the classes are imbalanced.

● The F1 score is a good metric when we want to balance both precision and recall. It is
particularly useful when we have an imbalanced dataset, where one class is much more
frequent than the other.
● The F1 score gives a better idea of a model’s performance compared to accuracy, as it
considers both false positives and false negatives.
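
Combining the two previous metrics, the F1 score is computed as:

F1 Score = 2 × (Precision × Recall) / (Precision + Recall)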
Key Takeaways:
● Confusion Matrix: A table summarizing the true vs. predicted labels. It shows how
many instances were correctly/incorrectly classified.
● Accuracy: The overall percentage of correct predictions made by the model.
● Precision: The ratio of correctly predicted positive instances to the total predicted
positives. It tells you how reliable your positive predictions are.
● Recall: The ratio of correctly predicted positive instances to the actual total positives. It
tells you how many of the actual positive instances were caught by the model.
● F1 Score: The harmonic mean of precision and recall. It is a balanced metric for
imbalanced datasets.
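
As a quick illustration of the formulas above, the short sketch below computes the four metrics by hand from a small set of made-up binary labels (the labels and counts here are hypothetical and chosen only for illustration; the program that follows uses scikit-learn on the iris dataset instead):

# Hypothetical binary labels, used only to illustrate the formulas above
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

# Count the four confusion-matrix components
TP = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
TN = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
FP = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
FN = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

# Apply the metric formulas from the theory section
accuracy = (TP + TN) / (TP + TN + FP + FN)
precision = TP / (TP + FP)
recall = TP / (TP + FN)
f1 = 2 * precision * recall / (precision + recall)

print(f'TP={TP}, TN={TN}, FP={FP}, FN={FN}')
print(f'Accuracy: {accuracy:.4f}')
print(f'Precision: {precision:.4f}')
print(f'Recall: {recall:.4f}')
print(f'F1 Score: {f1:.4f}')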

Program and output:


# Import necessary libraries
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (accuracy_score, precision_score, recall_score, f1_score,
                             confusion_matrix, classification_report)
import matplotlib.pyplot as plt
import seaborn as sns
# Load the iris dataset
iris = load_iris()
X = iris.data
y = iris.target

# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
# Train a logistic regression model
model = LogisticRegression(max_iter=200)
model.fit(X_train, y_train)

# Make predictions
y_pred = model.predict(X_test)

# Accuracy score
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy:.4f}')

# Precision score (weighted for multi-class classification)
precision = precision_score(y_test, y_pred, average='weighted')
print(f'Precision (Weighted): {precision:.4f}')

# Recall score (weighted for multi-class classification)
recall = recall_score(y_test, y_pred, average='weighted')
print(f'Recall (Weighted): {recall:.4f}')

# F1 score (weighted for multi-class classification)
f1 = f1_score(y_test, y_pred, average='weighted')
print(f'F1 Score (Weighted): {f1:.4f}')

# Confusion Matrix
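# Note: the iris dataset has three classes, so this confusion matrix is 3x3
# rather than the 2x2 binary layout shown in the theory section.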
conf_matrix = confusion_matrix(y_test, y_pred)
print("Confusion Matrix:")
print(conf_matrix)

# Plotting the confusion matrix
plt.figure(figsize=(6, 5))
sns.heatmap(conf_matrix, annot=True, fmt='d', cmap='Blues', xticklabels=iris.target_names,
yticklabels=iris.target_names)
plt.title('Confusion Matrix')
plt.xlabel('Predicted Labels')
plt.ylabel('True Labels')
plt.show()

# Classification Report (Precision, Recall, F1 for each class)
class_report = classification_report(y_test, y_pred)
print("\nClassification Report:")
print(class_report)
