ROC Auc

The ROC-AUC Curve is a graphical tool that evaluates the performance of binary classification models by plotting the True Positive Rate against the False Positive Rate at various thresholds. The AUC value, ranging from 0 to 1, quantifies the model's ability to distinguish between classes, with higher values indicating better performance. This metric is particularly useful in business contexts such as customer churn prediction, fraud detection, medical diagnosis, and targeted marketing.

Uploaded by

Lipika Sharma

ROC-AUC Curve: Overview and Explanation

The ROC (Receiver Operating Characteristic) curve is a graphical
representation of the performance of a binary classification model. It plots
the True Positive Rate (TPR) against the False Positive Rate (FPR) at
various threshold settings. The AUC (Area Under the Curve) is the area
under this ROC curve and is a numerical measure of how well the model
distinguishes between classes.

Key Terminology:

1. True Positive Rate (TPR) / Sensitivity / Recall: The proportion of
actual positives that are correctly identified by the model.

$$TPR = \frac{TP}{TP + FN}$$

Where:

o TP = True Positives

o FN = False Negatives

2. False Positive Rate (FPR): The proportion of actual negatives that
are incorrectly identified as positive by the model.

$$FPR = \frac{FP}{FP + TN}$$

Where:

o FP = False Positives

o TN = True Negatives

3. AUC (Area Under the Curve): The area under the ROC curve. It equals
the probability that the model ranks a randomly chosen positive
instance higher than a randomly chosen negative instance.
The AUC value ranges from 0 to 1:

o AUC = 1: Perfect classifier; every positive is ranked above every negative.

o AUC = 0.5: Random classifier, no discriminative ability.

o AUC < 0.5: Worse than random; the model tends to rank negatives above positives.
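The ranking interpretation of AUC can be checked directly by counting pairs. The sketch below is a minimal illustration, not an optimized implementation: the function name `pairwise_auc` and the labels and scores are made up for this example, and ties are counted as half a correct pair (a common convention, assumed here).

```python
from itertools import product


def pairwise_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a pair.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p, n in product(pos, neg):
        if p > n:
            wins += 1.0
        elif p == n:
            wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical data, invented for illustration only.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.7, 0.6, 0.4, 0.3, 0.1]

auc = pairwise_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

Here 8 of the 9 positive-negative pairs are ordered correctly, so the AUC is 8/9. The double loop is quadratic in the number of examples, which is fine for a demonstration but not for large datasets.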

ROC Curve Construction:

To plot the ROC curve:

• Vary the decision threshold across the range of predicted scores (0 to 1 for probabilities).

• For each threshold, calculate the TPR and FPR.

• Plot the points (FPR, TPR) to generate the curve.
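The three steps above can be sketched in plain Python. This is an illustrative sketch under stated assumptions: the helper names `roc_points` and `trapezoid_area` and the data are hypothetical, an instance is predicted positive when its score is greater than or equal to the threshold, and the candidate thresholds are the distinct scores plus one value above all of them.

```python
def roc_points(labels, scores):
    """Return a list of (FPR, TPR) points, one per candidate threshold."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    # Start above every score (nothing predicted positive), then step down
    # through each distinct score so that one more group flips to positive.
    thresholds = [float("inf")] + sorted(set(scores), reverse=True)
    points = []
    for t in thresholds:
        tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= t)
        fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= t)
        points.append((fp / n_neg, tp / n_pos))
    return points


def trapezoid_area(points):
    """Area under the piecewise-linear curve through the given points."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Hypothetical data, invented for illustration only.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.7, 0.6, 0.4, 0.3, 0.1]

points = roc_points(labels, scores)
area = trapezoid_area(points)
for fpr, tpr in points:
    print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
print(f"Area under the curve = {area:.3f}")
```

The curve starts at (0, 0), where nothing is predicted positive, and ends at (1, 1), where everything is.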

Example:

Imagine you have a binary classifier that predicts whether a customer will
churn (leave the service, which we treat as "1") or not churn (stay, treated
as "0").

• True Positives (TP): Customers who churned and the model
correctly predicted churn.

• False Positives (FP): Customers who didn't churn but the model
incorrectly predicted churn.

• True Negatives (TN): Customers who didn't churn and the model
correctly predicted no churn.

• False Negatives (FN): Customers who churned but the model
incorrectly predicted no churn.

Step-by-Step Example:

• The model predicts a probability for each instance in the test set (for
example, for customer churn, the probability that a customer will
churn).

• Vary the threshold for predicting "1" (churn), from 0 to 1.

o At each threshold, calculate TPR and FPR.

• Plot these values (FPR, TPR) on the ROC curve.

If your model gives the following probabilities for a set of customers
(predictions shown at a threshold of 0.5):

• Customer A: 0.85 (Predicted: Churn)

• Customer B: 0.45 (Predicted: No Churn)

• Customer C: 0.60 (Predicted: Churn)

• Customer D: 0.20 (Predicted: No Churn)

Raising or lowering the threshold changes which customers are predicted to
churn. Given each customer's actual outcome, every threshold yields one
(FPR, TPR) point on the curve.
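To make this concrete, the sketch below computes (FPR, TPR) for the four customers at several thresholds. The probabilities come from the example; the true outcomes are not stated in the text, so the ones used here (A and B churned, C and D stayed) are assumptions chosen purely for illustration, as is the helper name `rates_at`.

```python
# Probabilities from the example; the actual outcomes (1 = churned,
# 0 = stayed) are ASSUMED for illustration and do not come from the text.
customers = {
    "A": (0.85, 1),
    "B": (0.45, 1),
    "C": (0.60, 0),
    "D": (0.20, 0),
}


def rates_at(threshold):
    """Return (FPR, TPR) when predicting churn for scores >= threshold."""
    tp = fp = fn = tn = 0
    for score, churned in customers.values():
        predicted_churn = score >= threshold
        if predicted_churn and churned:
            tp += 1
        elif predicted_churn and not churned:
            fp += 1
        elif churned:
            fn += 1
        else:
            tn += 1
    return fp / (fp + tn), tp / (tp + fn)


for t in (0.9, 0.7, 0.5, 0.3, 0.1):
    fpr, tpr = rates_at(t)
    print(f"threshold={t:.1f}  FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Under these assumed outcomes, the 0.5 threshold flags A and C, giving one true positive and one false positive, so both TPR and FPR are 0.5. Lowering the threshold to 0.3 also catches B and raises TPR to 1.0 without adding a false positive.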

Business Use Cases of ROC-AUC Curve:

1. Customer Churn Prediction:

o Problem: A telecom company wants to predict which
customers are likely to leave (churn). Using a machine
learning model, they can classify customers as churn or not
churn.

o Why ROC-AUC: The ROC-AUC helps evaluate how well the
model differentiates between customers who will churn and
those who won't, across various thresholds. If the AUC is high,
the model does a good job of identifying at-risk customers.

2. Fraud Detection:

o Problem: In financial transactions, the goal is to detect
fraudulent transactions. A model is trained to classify
transactions as fraudulent (1) or not (0).

o Why ROC-AUC: In fraud detection, false positives (incorrectly
classifying a legitimate transaction as fraudulent) can be
costly. The ROC curve shows the trade-off between catching
fraud (TPR) and raising false alarms (FPR) at each threshold,
and the AUC summarizes the model's overall performance
across all thresholds.

3. Medical Diagnosis:

o Problem: A model is developed to detect whether a patient
has a particular disease (binary classification: sick vs. not
sick).

o Why ROC-AUC: Medical decisions often involve different
thresholds for risk tolerance. The ROC-AUC helps doctors
assess how well the model discriminates between healthy and
sick patients across various threshold levels (e.g., tolerating
more false positives when a missed diagnosis would have
severe consequences).

4. Marketing and Targeting:

o Problem: A company builds a model to predict which
customers are likely to respond to a marketing campaign
(binary: will respond or will not respond).

o Why ROC-AUC: The model's ability to differentiate between
responders and non-responders can be crucial in targeting the
right customers. The ROC-AUC provides insight into how well
the model can distinguish between these groups, helping
marketers optimize their targeting.
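Several of the use cases above come down to picking an operating threshold that matches a tolerance for false positives. One possible policy, sketched below, is to take the threshold with the highest TPR whose FPR stays within a budget. The function name `best_threshold`, the `max_fpr` parameter, and the data are all hypothetical; this is one way to choose a threshold, not a method prescribed by the text.

```python
def best_threshold(labels, scores, max_fpr):
    """Pick the threshold with the highest TPR subject to FPR <= max_fpr.

    Returns (tpr, threshold, fpr), or None if no threshold qualifies.
    An instance is predicted positive when its score is >= the threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best = None
    # Walk thresholds from strictest to most lenient; on equal TPR the
    # stricter threshold found earlier is kept.
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= t)
        fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= t)
        fpr, tpr = fp / n_neg, tp / n_pos
        if fpr <= max_fpr and (best is None or tpr > best[0]):
            best = (tpr, t, fpr)
    return best


# Hypothetical data, invented for illustration only.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.7, 0.6, 0.4, 0.3, 0.1]

strict = best_threshold(labels, scores, max_fpr=0.0)
lenient = best_threshold(labels, scores, max_fpr=0.34)
print("No false positives allowed:", strict)
print("Up to 34% FPR allowed:    ", lenient)
```

With no false positives allowed, the chosen threshold is 0.7 and only two of the three positives are caught. Allowing roughly one false alarm in three lowers the threshold to 0.4 and catches all of them.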

Interpreting ROC-AUC in Business Context:

• High AUC (close to 1): A model that does an excellent job of
distinguishing between the positive and negative classes. This is
desirable in critical use cases like fraud detection or medical
diagnosis.

• AUC ~ 0.5: The model has no discriminatory ability, indicating
random guessing. This typically calls for improvements in the model
or data.

• AUC < 0.5: A model worse than random guessing, which would be
problematic in any business setting, as it means the model
systematically ranks negatives above positives. This often points
to inverted labels or a pipeline bug; reversing the model's scores
would yield an AUC above 0.5.
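A model with an AUC below 0.5 still carries ranking information, just pointing the wrong way. The sketch below uses made-up labels and scores and a hypothetical helper `rank_auc` (pairwise counting, ties counted as half) to show that negating the scores turns an AUC of a into 1 - a.

```python
def rank_auc(labels, scores):
    """Fraction of (positive, negative) pairs with the positive ranked higher."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical scores from a model that mostly ranks negatives above positives.
labels = [1, 1, 0, 0]
bad_scores = [0.2, 0.6, 0.5, 0.9]

auc_bad = rank_auc(labels, bad_scores)

# Negating every score reverses the ranking of all pairs.
flipped_scores = [-s for s in bad_scores]
auc_flipped = rank_auc(labels, flipped_scores)

print(f"original AUC = {auc_bad:.2f}, flipped AUC = {auc_flipped:.2f}")
```

In practice an AUC well below 0.5 is usually a cue to check label encoding and which class's probability column is being scored, rather than a reason to flip the scores and ship the model.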

Example Code (Python):

import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Generate synthetic binary classification dataset
X, y = make_classification(n_samples=1000, n_features=20, n_classes=2,
                           random_state=42)

# Split dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
                                                    random_state=42)

# Train a RandomForest model
model = RandomForestClassifier(random_state=42)
model.fit(X_train, y_train)

# Predict probabilities on the test set (column 1 is the positive class)
y_probs = model.predict_proba(X_test)[:, 1]

# Compute ROC curve and AUC
fpr, tpr, thresholds = roc_curve(y_test, y_probs)
roc_auc = roc_auc_score(y_test, y_probs)

# Plot the ROC curve, with the diagonal as the random-classifier baseline
plt.figure()
plt.plot(fpr, tpr, color='darkorange', lw=2,
         label=f'ROC curve (AUC = {roc_auc:.2f})')
plt.plot([0, 1], [0, 1], color='navy', lw=2, linestyle='--')
plt.xlim([0.0, 1.0])
plt.ylim([0.0, 1.05])
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Receiver Operating Characteristic (ROC) Curve')
plt.legend(loc='lower right')
plt.show()

Conclusion:

The ROC-AUC curve is a powerful tool for evaluating the performance of
binary classification models. It provides insights into how well the model
distinguishes between positive and negative classes across different
thresholds. For businesses, it can be crucial for making informed
decisions, especially in high-stakes areas like fraud detection, medical
diagnosis, and customer churn prediction.
