Week 05 Classification Performance
TOD 533
Classification Performance:
Validation and metrics
Amit Das
TODS / AMSOM / AU
05-09-2024
k-fold Cross-validation
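• Randomize the order of the training data, if necessary
• Divide the training data into k equally sized subsets (folds)
• Hold out subset 1 for testing the model
• Train the model on subsets 2, 3, …, k
• Repeat so that each subset serves once as the test set, then average the k results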
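The splitting procedure can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the function name `k_fold_indices`, the fixed `seed`, and the 10-row example are hypothetical, and the model fitting step is left out.

```python
import random

def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_idx, test_idx) pairs, one per fold (hypothetical helper)."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # randomize the row order
    # Fold i takes every k-th shuffled index, so fold sizes differ by at most 1.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]  # the held-out subset
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train_idx, test_idx

# Hypothetical example: 10 rows, 5 folds.
splits = list(k_fold_indices(n_samples=10, k=5))
for train_idx, test_idx in splits:
    print("test rows:", sorted(test_idx), "| training rows:", len(train_idx))
```

In practice a model would be fit on `train_idx` and scored on `test_idx` inside the loop, and the k scores would then be averaged.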
Confusion Matrix / Classification Table
Performance: accuracy
Performance: precision
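As a minimal sketch, both measures can be computed directly from the four cells of the confusion matrix, using the standard definitions accuracy = (TP + TN) / (TP + TN + FP + FN) and precision = TP / (TP + FP). The abbreviations TP, TN, FP, FN (true/false positives/negatives) and the counts below are assumptions introduced for illustration; they do not come from the slides.

```python
def accuracy(tp, tn, fp, fn):
    """Share of all cases that were classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)

def precision(tp, fp):
    """Share of predicted positives that are truly positive.

    Returning 0.0 when nothing is predicted positive is a convention
    chosen for this sketch; the ratio is undefined in that case.
    """
    return tp / (tp + fp) if (tp + fp) else 0.0

# Hypothetical confusion-matrix counts.
tp, tn, fp, fn = 40, 45, 5, 10
acc = accuracy(tp, tn, fp, fn)   # 85 correct out of 100
prec = precision(tp, fp)         # 40 of the 45 predicted positives
print(round(acc, 3), round(prec, 3))
```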
Performance: specificity
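Specificity and its counterpart, sensitivity, follow the same pattern: specificity = TN / (TN + FP) is the share of actual negatives that are correctly identified, and sensitivity = TP / (TP + FN) is the share of actual positives that are correctly identified. The sketch below assumes hypothetical counts; the function names are illustrative only.

```python
def specificity(tn, fp):
    """True negative rate: correctly identified negatives / all actual negatives."""
    return tn / (tn + fp)

def sensitivity(tp, fn):
    """True positive rate (recall): correctly identified positives / all actual positives."""
    return tp / (tp + fn)

# Hypothetical confusion-matrix counts.
tp, tn, fp, fn = 40, 45, 5, 10
spec = specificity(tn, fp)   # 45 of the 50 actual negatives
sens = sensitivity(tp, fn)   # 40 of the 50 actual positives
print(spec, sens)
```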
Results
Predictive Measures
Accuracy Specificity Sensitivity
0.783 0.890 0.582
Note. The cut-off value is set to 0.5
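The cut-off matters because it turns predicted probabilities into class labels, and moving it trades sensitivity against specificity. The following sketch uses made-up labels and probabilities (not the data behind the table above) and assumes a case is labelled positive when its probability is at or above the cut-off; software packages may differ on how they treat a tie.

```python
def classify(probabilities, cutoff=0.5):
    """Label a case 1 (positive) when its probability reaches the cut-off."""
    return [1 if p >= cutoff else 0 for p in probabilities]

def confusion_counts(y_true, y_pred):
    """Return (TP, TN, FP, FN) for binary 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn

# Hypothetical true classes and predicted probabilities.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
probs = [0.9, 0.6, 0.4, 0.3, 0.55, 0.1, 0.2, 0.8]

results = {}
for cutoff in (0.3, 0.5, 0.7):
    tp, tn, fp, fn = confusion_counts(y_true, classify(probs, cutoff))
    results[cutoff] = (tp / (tp + fn), tn / (tn + fp))  # (sensitivity, specificity)
    print(cutoff, results[cutoff])
```

In this toy data, lowering the cut-off raises sensitivity at the cost of specificity, and raising it does the reverse.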
Accuracy
Specificity
Precision Sensitivity
F-measure
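• Harmonic mean of precision and recall: $F_1 = \frac{2 \cdot \text{precision} \cdot \text{recall}}{\text{precision} + \text{recall}}$
• More generally, $F_\beta = \frac{(1 + \beta^2) \cdot \text{precision} \cdot \text{recall}}{\beta^2 \cdot \text{precision} + \text{recall}}$, where $\beta > 1$ weights recall more heavily and $\beta < 1$ weights precision more heavily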
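A short sketch of the computation, assuming the generalization meant here is the usual F-beta score, in which `beta` sets how much more weight recall receives than precision (`beta = 1` gives the plain harmonic mean). The precision and recall values are hypothetical.

```python
def f_beta(precision, recall, beta=1.0):
    """F-beta = (1 + beta^2) * P * R / (beta^2 * P + R); beta = 1 gives F1.

    Returning 0.0 when the denominator is zero is a convention chosen
    for this sketch.
    """
    b2 = beta * beta
    denom = b2 * precision + recall
    return (1 + b2) * precision * recall / denom if denom else 0.0

# Hypothetical classifier with high precision but modest recall.
f1 = f_beta(0.8, 0.5)            # harmonic mean of 0.8 and 0.5
f2 = f_beta(0.8, 0.5, beta=2)    # recall weighted more heavily, so the score drops
print(round(f1, 3), round(f2, 3))
```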
ROC Curves
• ROC stands for Receiver Operating Characteristic, a term from
signal detection theory, which was developed during
World War II for the analysis of radar images.
• For classifiers, the ROC plot is a useful tool for
• studying the behavior of a single classifier, or
• comparing two or more classifiers.
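To make the plot concrete: an ROC curve is traced by sweeping the cut-off over the classifier's scores and recording, at each cut-off, the false positive rate (1 - specificity) against the true positive rate (sensitivity). The sketch below is a minimal illustration on made-up labels and scores; the function name `roc_points` is hypothetical.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) points from (0, 0) to (1, 1), one per distinct score."""
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    points = [(0.0, 0.0)]  # cut-off above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for t, s in zip(y_true, scores) if s >= threshold and t == 1)
        fp = sum(1 for t, s in zip(y_true, scores) if s >= threshold and t == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points

# Hypothetical true classes (1 = positive) and classifier scores.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
points = roc_points(y_true, scores)
for fpr, tpr in points:
    print(round(fpr, 2), round(tpr, 2))
```

Plotting these points for two classifiers on the same axes is one way to compare them: the curve that lies closer to the top-left corner separates the classes better.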
Amit’s Grades
AUC > 0.9 Excellent
AUC 0.8 to 0.9 Very Good
AUC 0.7 to 0.8 Good
AUC 0.6 to 0.7 Needs Improvement
AUC 0.5 to 0.6 Hopeless
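As a rough sketch of how a classifier would be placed on this scale, the AUC can be computed as the share of (positive, negative) pairs in which the positive case receives the higher score, with ties counted as half. Two assumptions are made here and are not from the slide: a value exactly on a shared boundary is assigned to the lower band (consistent with "AUC > 0.9"), and values below 0.5 return `None` because the scale does not cover them. The data are made up.

```python
def auc(y_true, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties = 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def grade(auc_value):
    """Map an AUC to the grade bands above (boundary handling is an assumption)."""
    if auc_value > 0.9:
        return "Excellent"
    if auc_value > 0.8:
        return "Very Good"
    if auc_value > 0.7:
        return "Good"
    if auc_value > 0.6:
        return "Needs Improvement"
    if auc_value >= 0.5:
        return "Hopeless"
    return None  # below 0.5: not covered by the scale

# Hypothetical true classes (1 = positive) and classifier scores.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
auc_value = auc(y_true, scores)   # 8 of the 9 pairs are ranked correctly
print(round(auc_value, 3), grade(auc_value))
```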
Separability of classes