Performance
Binary Classification
                     Predicted Output
                       1         0
True Output     1      TP        FN
(Target)        0      FP        TN
Accuracy = (TP+TN)/(TP+TN+FP+FN)
Precision = TP/(TP+FP)
Recall = TP/(TP+FN)
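A minimal Python sketch of these three measures computed from the four confusion-matrix counts (the counts below are placeholders, not values from the slides):

```python
# Placeholder confusion-matrix counts for illustration.
TP, FN = 38, 12   # actual positives: predicted 1 vs. predicted 0
FP, TN = 2, 48    # actual negatives: predicted 1 vs. predicted 0

accuracy  = (TP + TN) / (TP + TN + FP + FN)
precision = TP / (TP + FP)
recall    = TP / (TP + FN)
print(f"accuracy={accuracy:.2f}  precision={precision:.2f}  recall={recall:.2f}")
```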
Precision
(Confusion matrix as above: Predicted Output 1/0 vs. True Output (Target))
Precision = TP/(TP+FP)
The percentage of predicted positives that are actual (target) positives
Recall
(Confusion matrix as above: Predicted Output 1/0 vs. True Output (Target))
Recall = TP/(TP+FN)
The percentage of actual (target) positives that were predicted as positive
Other measures - Precision vs. Recall
• Considering precision and recall lets us choose an ML approach that maximizes what we are most interested in (precision or recall), not just accuracy.
• Tradeoff - ML parameters can also be adjusted to accomplish the goal of the application
• Break-even point: precision = recall
• F1 (F-score) = 2·(precision·recall)/(precision + recall) - the harmonic mean of precision and recall (see the sketch below)
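A minimal sketch of the F1 computation, using made-up precision/recall values rather than anything from the slides:

```python
# Hypothetical precision and recall values for illustration only.
precision, recall = 0.95, 0.76

# F1 is the harmonic mean: it sits between the two, closer to the smaller one.
f1 = 2 * precision * recall / (precision + recall)
print(f"F1 = {f1:.3f}")
```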
ROC Curves and Area under curve
• Receiver Operating Characteristic
• Developed in WWII to statistically model false positive and false
negative detections of radar operators
• Standard measure in medicine and biology
• True positive rate (sensitivity) vs. false positive rate (1 - specificity)
• True positive rate (probability of predicting true when it is true):
TPR = P(Pred:T|T) = Sensitivity = Recall = TP/P = TP/(TP+FN)
• False positive rate (probability of predicting true when it is false):
FPR = P(Pred:T|F) = FP/N = FP/(TN+FP) = 1 - specificity, where specificity (the true negative rate) = TN/N = TN/(TN+FP) (see the sketch after this list)
• Want to maximize TPR and minimize FPR
• How would you do each independently?
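A sketch of computing TPR and FPR at a single decision threshold; the scores, labels, and threshold below are hypothetical:

```python
# Hypothetical classifier scores and true labels (1 = positive, 0 = negative).
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.35, 0.3, 0.1]
labels = [1,   1,   0,   1,   1,   0,    0,   0]
threshold = 0.5

preds = [1 if s >= threshold else 0 for s in scores]
TP = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 1)
FP = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 0)
FN = sum(1 for p, y in zip(preds, labels) if p == 0 and y == 1)
TN = sum(1 for p, y in zip(preds, labels) if p == 0 and y == 0)

tpr = TP / (TP + FN)   # sensitivity = recall = TP/P
fpr = FP / (FP + TN)   # FP/N = 1 - specificity
print(f"TPR={tpr:.2f}  FPR={fpr:.2f}")
```

Raising the threshold lowers both rates; lowering it raises both, which is the tradeoff the next slide addresses.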
ROC Curves and ROC Area
• Want to find the right balance
• But the right balance/threshold can differ for each task considered
• How do we know which algorithms are robust and accurate across
many different thresholds? – ROC curve
• Each point on the ROC curve represents a different tradeoff (cost
ratio) between true positive rate and false positive rate
• The standard measures show accuracy for only one setting of the cost-ratio threshold, whereas the ROC curve shows accuracy across all settings, letting us compare how robust one algorithm is to different thresholds relative to another (a sketch of such a threshold sweep follows below)
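A sketch of tracing a ROC curve by sweeping the threshold over a classifier's scores and estimating the area under it with the trapezoid rule; the data are hypothetical and not from the slides:

```python
# Hypothetical scores and labels; sweep thresholds to collect (FPR, TPR) points.
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.35, 0.3, 0.1]
labels = [1,   1,   0,   1,   1,   0,    0,   0]
P = sum(labels)           # number of actual positives
N = len(labels) - P       # number of actual negatives

points = []
for t in sorted(set(scores), reverse=True):
    preds = [1 if s >= t else 0 for s in scores]
    tp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 1)
    fp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 0)
    points.append((fp / N, tp / P))            # (FPR, TPR) at this threshold

points = [(0.0, 0.0)] + points + [(1.0, 1.0)]  # endpoints: threshold = 1 and 0
points.sort()

# Trapezoid-rule estimate of the area under the ROC curve.
auc = sum((x2 - x1) * (y1 + y2) / 2
          for (x1, y1), (x2, y2) in zip(points, points[1:]))
print(f"AUC ≈ {auc:.3f}")
```

Sorting the (FPR, TPR) points by FPR before applying the trapezoid rule keeps the area estimate well defined even when several thresholds map to the same point.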
Example: ROC curve with points labeled by threshold. Assume thresholds:
• Threshold = 1, ROC point (0,0): all outputs are 0, so TPR = P(T|T) = 0 and FPR = P(T|F) = 0
• Threshold = 0, ROC point (1,1): TPR = 1, FPR = 1
• Threshold = .8: TPR = .38, FPR = .02 - better precision
• Threshold = .5: TPR = .82, FPR = .18 - better accuracy
• Threshold = .3: TPR = .95, FPR = .43 - better recall
Accuracy is maximized at the point closest to the top left corner.
Note that sensitivity = recall, and the lower the false positive rate, the higher the precision.
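A sketch of the "closest to the top-left corner" rule applied to the threshold points listed above (the (FPR, TPR) values are the ones from this example):

```python
# (threshold, FPR, TPR) triples taken from the example above.
candidates = [(1.0, 0.00, 0.00), (0.8, 0.02, 0.38), (0.5, 0.18, 0.82),
              (0.3, 0.43, 0.95), (0.0, 1.00, 1.00)]

# Squared distance of each ROC point to the ideal corner (FPR=0, TPR=1).
best = min(candidates, key=lambda c: c[1] ** 2 + (c[2] - 1.0) ** 2)
print(f"threshold={best[0]} lies closest to the top-left corner "
      f"(FPR={best[1]}, TPR={best[2]})")
```

On these values the rule picks threshold = .5, the point labeled "better accuracy" above.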
ROC Properties
• Area Properties
• 1.0 - Perfect prediction
• .9 - Excellent
• .7 - Moderate
• .5 - Random
• ROC area represents performance over all possible thresholds
• If two ROC curves do not intersect then one method dominates over
the other
• If they do intersect then one method is better for some thresholds,
and is worse for others
• In the example curves, the blue algorithm is better for precision, the yellow for recall, and the red for neither
• Can choose method and balance based on goals
Performance Measurement Summary
• Some of these measures (ROC, F-score) gaining popularity
• Can allow you to look at a range of thresholds
• However, they do not extend to multi-class situations, which are very common
• However, medicine, finance, etc. have lots of two-class problems
• Could always cast the problem as a set of two-class problems, but that can be inconvenient
• Accuracy handles multi-class outputs and is still the most common measure, but it is often combined with other measures such as ROC, etc.