
CS340 Machine Learning ROC Curves

This document discusses performance measures for binary classifiers such as precision, recall, and ROC curves. It explains that precision measures the proportion of predicted positives that are actual positives, while recall measures the proportion of actual positives that are correctly predicted as such. The document also discusses how precision and recall can be visualized using precision-recall curves and ROC curves. Finally, it notes that accuracy, precision, and recall are not always the best metrics, and mutual information may be a better measure in some cases.


CS340 Machine learning ROC curves

Performance measures for binary classifiers


Confusion matrix, contingency table

precision = positive predictive value (PPV) = TP / P-hat, where P-hat = TP + FP is the number of predicted positives

Sensitivity = recall = true positive rate (TPR) = hit rate = TP / P = 1 - FNR

False negative rate (FNR) = false rejection rate = type II error rate = FN / P = 1 - TPR

False positive rate (FPR) = false acceptance rate = type I error rate = FP / N = 1 - specificity

Specificity = true negative rate = TN / N = 1 - FPR
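The identities above can be sketched as a small helper; the function name `binary_metrics` and the dictionary keys are my own, not from the slides, and all denominators are assumed nonzero.

```python
def binary_metrics(tp, fp, fn, tn):
    """Standard rates from the four confusion-matrix counts."""
    p, n = tp + fn, fp + tn            # actual positives P and negatives N
    p_hat = tp + fp                    # predicted positives, P-hat
    return {
        "precision": tp / p_hat,       # positive predictive value (PPV)
        "recall": tp / p,              # sensitivity = TPR = hit rate
        "fnr": fn / p,                 # false negative rate = 1 - TPR
        "fpr": fp / n,                 # false positive rate = 1 - specificity
        "specificity": tn / n,         # TNR = 1 - FPR
        "accuracy": (tp + tn) / (p + n),
    }
```

Each returned value corresponds to one of the identities above, e.g. fpr and specificity always sum to 1.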

Performance depends on threshold


Declare xn to be positive (ŷn = 1) if p(y = 1|xn) > θ, otherwise declare it negative (ŷn = 0):

ŷn = 1  ⟺  p(y = 1|xn) > θ

The number of TPs and FPs depends on the threshold θ. As we change θ, we get different (TPR, FPR) points:

TPR = p(ŷ = 1|y = 1),   FPR = p(ŷ = 1|y = 0)

Example
i            1    2    3    4    5    6    7    8    9
yi           1    1    1    1    1    0    0    0    0
p(yi=1|xi)   0.9  0.8  0.7  0.6  0.5  0.4  0.3  0.2  0.1
ŷi(θ = 0)    1    1    1    1    1    1    1    1    1
ŷi(θ = 0.5)  1    1    1    1    1    0    0    0    0
ŷi(θ = 1)    0    0    0    0    0    0    0    0    0
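The threshold sweep in this example can be reproduced with a short sketch (the helper name `roc_points` is illustrative). Ties at the threshold are counted as positive here, which matches the θ = 0.5 row of the table.

```python
def roc_points(scores, labels, thresholds):
    """Return an (FPR, TPR) point for each threshold.

    An item is predicted positive when its score >= threshold.
    """
    p = sum(labels)              # number of actual positives
    n = len(labels) - p          # number of actual negatives
    points = []
    for t in thresholds:
        preds = [1 if s >= t else 0 for s in scores]
        tp = sum(1 for yh, y in zip(preds, labels) if yh == 1 and y == 1)
        fp = sum(1 for yh, y in zip(preds, labels) if yh == 1 and y == 0)
        points.append((fp / n, tp / p))
    return points

# The example above: scores sorted by confidence, labels as in the table.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1]
labels = [1, 1, 1, 1, 1, 0, 0, 0, 0]
```

Sweeping θ over {0, 0.5, 1} gives the three (FPR, TPR) points (1, 1), (0, 1), and (0, 0).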

i            1    2    3    4    5    6    7    8    9
yi           1    1    1    1    1    0    0    0    0
p(yi=1|xi)   0.9  0.8  0.7  0.6  0.2  0.6  0.3  0.2  0.1
ŷi(θ = 0)    1    1    1    1    1    1    1    1    1
ŷi(θ = 0.5)  1    1    1    1    0    1    0    0    0
ŷi(θ = 1)    0    0    0    0    0    0    0    0    0

Performance measures
EER: equal error rate / crossover error rate (the operating point where false positive rate = false negative rate); smaller is better.
AUC: area under the ROC curve; larger is better.
Accuracy = (TP + TN) / (P + N).
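As an illustration, AUC can be approximated from a finite set of (FPR, TPR) points with the trapezoidal rule (the helper name `auc` is mine, not from the slides):

```python
def auc(points):
    """Area under the ROC curve by the trapezoidal rule.

    `points` is an iterable of (FPR, TPR) pairs; they are sorted by FPR first.
    """
    pts = sorted(points)
    area = 0.0
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0   # trapezoid between adjacent points
    return area
```

A perfect classifier's curve [(0, 0), (0, 1), (1, 1)] gives AUC 1.0; the chance diagonal [(0, 0), (1, 1)] gives 0.5.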

Precision-recall curves
Useful when the notion of a negative (and hence the FPR) is not well defined, or when there are too many negatives (rare event detection).

Recall: of those that exist, how many did you find?
Precision: of those that you found, how many are correct?

The F-score is the harmonic mean of precision and recall:

F = 2 / (1/P + 1/R) = 2PR / (P + R)

precision = p(y = 1|ŷ = 1),   recall = p(ŷ = 1|y = 1)
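A minimal sketch of the F-score computation (assuming P and R are both positive; the function name is illustrative):

```python
def f_score(precision, recall):
    """F1 score: the harmonic mean of precision and recall, 2PR / (P + R)."""
    return 2 * precision * recall / (precision + recall)
```

For example, f_score(1.0, 0.8) gives 8/9 ≈ 0.889: the harmonic mean is pulled toward the smaller of the two rates.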

Word of caution
Consider binary classifiers A, B, and C, described by their joint distributions p(ŷ, y) (rows ŷ, columns y):

A      y=1   y=0      B      y=1   y=0      C      y=1    y=0
ŷ=1    0.9   0.1      ŷ=1    0.8   0        ŷ=1    0.78   0
ŷ=0    0     0        ŷ=0    0.1   0.1      ŷ=0    0.12   0.1

Clearly A is useless, since it always predicts label 1 regardless of the input. Also, B is slightly better than C (less probability mass wasted on the off-diagonal entries). Yet here are the performance metrics:
Metric   Accuracy   Precision   Recall   F-score
A        0.90       0.90        1.000    0.947
B        0.90       1.00        0.889    0.941
C        0.88       1.00        0.867    0.929

Mutual information is a better measure


The MI between estimated and true label is
I(Ŷ, Y) = Σ_{ŷ=0}^{1} Σ_{y=0}^{1} p(ŷ, y) log [ p(ŷ, y) / (p(ŷ) p(y)) ]

This gives the intuitively correct ranking B > C > A:


Metric   Accuracy   Precision   Recall   F-score   Mutual information
A        0.90       0.90        1.000    0.947     0
B        0.90       1.00        0.889    0.941     0.1865
C        0.88       1.00        0.867    0.929     0.1735
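The mutual-information column can be reproduced from the joint distributions of A, B, and C given earlier. This sketch uses the natural logarithm, which matches the values shown; the function name and the matrix layout (rows ŷ = 1, 0; columns y = 1, 0) are my own conventions.

```python
import math

def mutual_information(joint):
    """MI between predicted and true labels, from a 2x2 joint p(yhat, y)."""
    p_yhat = [sum(row) for row in joint]        # marginal over predictions
    p_y = [sum(col) for col in zip(*joint)]     # marginal over true labels
    mi = 0.0
    for i in range(2):
        for j in range(2):
            p = joint[i][j]
            if p > 0:                           # 0 log 0 = 0 by convention
                mi += p * math.log(p / (p_yhat[i] * p_y[j]))
    return mi

# Joint distributions p(yhat, y) for the three classifiers (rows yhat = 1, 0).
A = [[0.9, 0.1], [0.0, 0.0]]
B = [[0.8, 0.0], [0.1, 0.1]]
C = [[0.78, 0.0], [0.12, 0.1]]
```

This recovers MI of 0 for A, about 0.1865 for B, and about 0.1735 for C, i.e. the ranking B > C > A.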
