05-09-2024

TOD 533
Classification Performance:
Validation and metrics
Amit Das
TODS / AMSOM / AU
[email protected]

Model validation: Holdout sample


• Training set: data for training the model (finding optimum values of its parameters)
• Validation set: data withheld from training, used to assess performance
• Opportunity to set / refine some model (hyper)parameters
• Test set: expose the model to the data of interest for prediction (a sketch of the split follows below)

• Avoid overfitting – customizing the model to quirks of the training data that are absent in other (particularly, target) data
• Prefer simpler models (Occam’s razor)
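A minimal Python sketch of such a three-way split, assuming scikit-learn; the synthetic data is a hypothetical stand-in for a real dataset:

# Three-way holdout: training / validation / test.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=768, n_features=8, random_state=42)

# Carve off 20% as the test set, then split the remainder into
# training (60% of all data) and validation (20% of all data).
X_rest, X_test, y_rest, y_test = train_test_split(X, y, test_size=0.20, random_state=42)
X_train, X_val, y_train, y_val = train_test_split(X_rest, y_rest, test_size=0.25, random_state=42)

# Fit on the training set, tune (hyper)parameters against the
# validation set, and touch the test set only once at the end.

The 60/20/20 proportions are illustrative; any split that leaves enough data for training can be used.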


k-fold Cross-validation
• Divide the training data into k equally-sized subsets (folds)
• Randomize the order of records first, if necessary
• Train the model on subsets 2, 3, …, k
• Hold out subset 1 for testing the model
• Repeat with subsets 2, 3, …, k serving as the test set in turn
• Stratified k-fold cross-validation preserves the class proportions within each fold
• Average performance over the k runs (accuracy, …) – see the sketch below
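A minimal sketch of stratified 10-fold cross-validation, assuming scikit-learn (the synthetic data and the choice of logistic regression are illustrative):

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

X, y = make_classification(n_samples=768, n_features=8, random_state=42)

# Stratification preserves the class proportions in every fold.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=42)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=cv, scoring="accuracy")

# Average performance over the k runs.
print(scores.mean(), scores.std())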

Comparing predicted to actual

Confusion Matrix (also called a Classification Table)


Performance: accuracy
Accuracy = (TP + TN) / (TP + TN + FP + FN): the fraction of all cases classified correctly

Performance: precision
Precision = TP / (TP + FP): the fraction of predicted positives that are actually positive


Performance: sensitivity (recall)
Sensitivity (Recall) = TP / (TP + FN): the fraction of actual positives correctly identified

Performance: specificity
Specificity = TN / (TN + FP): the fraction of actual negatives correctly identified


Accuracy, precision, sensitivity and specificity


                             Actual
                   Positive               Negative
Predicted
  Positive    True Positive (TP)     False Positive (FP)
  Negative    False Negative (FN)    True Negative (TN)

Accuracy              (TP + TN) / (TP + TN + FP + FN)
Precision             TP / (TP + FP)
Sensitivity (Recall)  TP / (TP + FN)
Specificity           TN / (TN + FP)

In the Diabetes context


                              Predicted
                    Diabetic                  Healthy
Actual
  Diabetic    True Positive (TP) 153    False Negative (FN) 115
  Healthy     False Positive (FP) 60    True Negative (TN) 440

Accuracy              (TP + TN) / (TP + TN + FP + FN) = 593 / 768 = 0.772
Precision             TP / (TP + FP) = 153 / 213 = 0.718
Sensitivity (Recall)  TP / (TP + FN) = 153 / 268 = 0.571
Specificity           TN / (TN + FP) = 440 / 500 = 0.880
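These values can be reproduced directly from the counts in the table; a short Python check:

# Counts from the diabetes confusion matrix above.
TP, FN, FP, TN = 153, 115, 60, 440

accuracy    = (TP + TN) / (TP + TN + FP + FN)  # 0.772
precision   = TP / (TP + FP)                   # 0.718
sensitivity = TP / (TP + FN)                   # 0.571
specificity = TN / (TN + FP)                   # 0.880

print(accuracy, precision, sensitivity, specificity)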


Jamovi output: Classification table


Results
Classification Table – …
                     Predicted
Observed           tested_negative   tested_positive   % Correct
tested_negative    445               55                89.0
tested_positive    112               156               58.2
Note. The cut-off value is set to 0.5

Results
Predictive Measures
Accuracy   Specificity   Sensitivity
0.783      0.890         0.582
Note. The cut-off value is set to 0.5
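Jamovi's table comes from thresholding predicted probabilities at the stated cut-off; a sketch of the same computation with scikit-learn (data and model are illustrative stand-ins, so the counts will differ from Jamovi's):

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix

X, y = make_classification(n_samples=768, n_features=8, random_state=42)
model = LogisticRegression(max_iter=1000).fit(X, y)

# Classify as positive whenever P(y = 1) >= 0.5 (the cut-off).
p = model.predict_proba(X)[:, 1]
pred = (p >= 0.5).astype(int)

tn, fp, fn, tp = confusion_matrix(y, pred).ravel()
print(tn, fp, 100 * tn / (tn + fp))  # observed-negative row, % correct
print(fn, tp, 100 * tp / (fn + tp))  # observed-positive row, % correct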

Accuracy of classification: Logistic Regression

[Tool screenshot: classifier output with the Accuracy figure highlighted]


Confusion Matrix: Logistic Regression

[Tool screenshot: confusion matrix with Specificity, Precision, and Sensitivity highlighted]

F-measure
• Harmonic mean of precision and recall:
  F1 = 2 × Precision × Recall / (Precision + Recall)
• More generally,
  Fb = (1 + b²) × Precision × Recall / (b² × Precision + Recall)
• b < 1 focuses on precision, while b > 1 emphasizes recall (see the sketch below)
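Both forms are available in scikit-learn; a small sketch with illustrative labels:

from sklearn.metrics import f1_score, fbeta_score

y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

print(f1_score(y_true, y_pred))               # harmonic mean of precision and recall
print(fbeta_score(y_true, y_pred, beta=0.5))  # b < 1: focuses on precision
print(fbeta_score(y_true, y_pred, beta=2.0))  # b > 1: emphasizes recall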


MCC (Matthews correlation coefficient)

• It can be calculated from the confusion matrix as:
  MCC = (TP × TN − FP × FN) / sqrt((TP + FP)(TP + FN)(TN + FP)(TN + FN))
• MCC ranges from −1 to +1: +1 is perfect prediction, 0 is random guessing, −1 is total disagreement (see the sketch below)
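scikit-learn provides this directly; a small sketch with illustrative labels:

from sklearn.metrics import matthews_corrcoef

y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

# +1 = perfect prediction, 0 = random guessing, -1 = total disagreement.
print(matthews_corrcoef(y_true, y_pred))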

ROC Curves
• ROC is an abbreviation of Receiver Operating Characteristic, a term from signal detection theory developed during World War II (for the analysis of radar images).
• In the context of classifiers, a ROC plot is a useful tool for studying
• the behavior of a single classifier, or
• comparing two or more classifiers.

• A ROC plot is a two-dimensional graph where the x-axis represents the FP rate (FPR) and the y-axis represents the TP rate (TPR). A sketch of how to draw one follows below.
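A minimal sketch of drawing a ROC curve, assuming scikit-learn and matplotlib (data and model are illustrative):

import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=768, n_features=8, random_state=42)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=42)

p = LogisticRegression(max_iter=1000).fit(X_tr, y_tr).predict_proba(X_te)[:, 1]
fpr, tpr, _ = roc_curve(y_te, p)  # one (FPR, TPR) point per threshold

plt.plot(fpr, tpr)
plt.plot([0, 1], [0, 1], linestyle="--")  # random-guessing diagonal
plt.xlabel("FP rate (FPR)")
plt.ylabel("TP rate (TPR)")
plt.show()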


Comparing classifiers using ROC Plot


• We can use the concept of the “area under the curve” (AUC) as a method to compare two or more classifiers
• If a model is perfect, then its AUC = 1
• If a model simply performs random guessing, then its AUC = 0.5
• A model that is strictly better than another has a larger value of AUC than the other

• [Figure: ROC curves for three classifiers C1, C2, C3] Here, C3 is best, and C2 is better than C1, as AUC(C3) > AUC(C2) > AUC(C1)

Comparison of Area under the ROC curve (AUC)


Classifier   Logistic   Discriminant   KNN-5   Naïve Bayes   Decision Tree   Decision Rules
AUC          0.832      0.832          0.766   0.819         0.751           0.739
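A sketch of how such a comparison could be produced, scoring several classifiers by AUC on a common test split (the models and synthetic data are illustrative, so the numbers will differ from the table):

from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=768, n_features=8, random_state=42)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=42)

models = {
    "Logistic": LogisticRegression(max_iter=1000),
    "Discriminant": LinearDiscriminantAnalysis(),
    "KNN-5": KNeighborsClassifier(n_neighbors=5),
    "Naive Bayes": GaussianNB(),
    "Decision Tree": DecisionTreeClassifier(random_state=42),
}
for name, m in models.items():
    p = m.fit(X_tr, y_tr).predict_proba(X_te)[:, 1]  # P(positive class)
    print(name, round(roc_auc_score(y_te, p), 3))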

Amit’s Grades
AUC > 0.9       Excellent
AUC 0.8 to 0.9  Very Good
AUC 0.7 to 0.8  Good
AUC 0.6 to 0.7  Needs Improvement
AUC 0.5 to 0.6  Hopeless


Multiway Classification: The Iris dataset

SepalLength  SepalWidth  PetalLength  PetalWidth  Species
5.1          3.5         1.4          0.2         Iris-setosa
4.9          3.0         1.4          0.2         Iris-setosa
4.7          3.2         1.3          0.2         Iris-setosa
4.6          3.1         1.5          0.2         Iris-setosa
5.0          3.6         1.4          0.2         Iris-setosa
7.0          3.2         4.7          1.4         Iris-versicolor
6.4          3.2         4.5          1.5         Iris-versicolor
6.9          3.1         4.9          1.5         Iris-versicolor
5.5          2.3         4.0          1.3         Iris-versicolor
6.5          2.8         4.6          1.5         Iris-versicolor
6.3          3.3         6.0          2.5         Iris-virginica
5.8          2.7         5.1          1.9         Iris-virginica
7.1          3.0         5.9          2.1         Iris-virginica
6.3          2.9         5.6          1.8         Iris-virginica
6.5          3.0         5.8          2.2         Iris-virginica

Multinomial Logistic Regression


Model Coefficients - Species
Species                         Predictor    Estimate   SE      Z         p      Odds ratio
Iris-versicolor - Iris-setosa   Intercept    18.68      30.3    0.6165    0.538  1.30e+8
                                PetalWidth   -3.09      39.7    -0.0779   0.938  0.04535
                                PetalLength  13.95      52.6    0.2655    0.791  1.15e+6
                                SepalWidth   -8.65      134.2   -0.0645   0.949  1.75e-4
                                SepalLength  -5.32      76.7    -0.0694   0.945  0.00488
Iris-virginica - Iris-setosa    Intercept    -23.70     31.2    -0.7594   0.448  5.10e-11
                                PetalWidth   15.10      40.2    0.3756    0.707  3.61e+6
                                PetalLength  23.34      52.9    0.4415    0.659  1.37e+10
                                SepalWidth   -15.31     134.2   -0.1140   0.909  2.25e-7
                                SepalLength  -7.78      76.7    -0.1015   0.919  4.17e-4


Multiway classification (Weka)

Logistic Regression with ridge parameter of 1.0E-8


Coefficients...
                  Class
Variable      Iris-setosa   Iris-versicolor
===========================================
SepalLength   21.8065       2.4652
SepalWidth    4.5648        6.6809
PetalLength   -26.3083      -9.4293
PetalWidth    -43.887       -18.2859
Intercept     8.1743        42.637

=== Confusion Matrix ===
  a   b   c   <-- classified as
 50   0   0 | a = Iris-setosa
  0  46   4 | b = Iris-versicolor
  0   2  48 | c = Iris-virginica
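A minimal sketch of a comparable multiclass fit and confusion matrix with scikit-learn, evaluated on the training data as in the Weka output (scikit-learn's default L2 penalty plays a role similar to Weka's ridge parameter):

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix

X, y = load_iris(return_X_y=True)

# Multinomial logistic regression on all four predictors.
model = LogisticRegression(max_iter=1000).fit(X, y)

# Rows = actual species, columns = predicted species.
print(confusion_matrix(y, model.predict(X)))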

Separability of classes
• Iris-setosa is linearly separable from the other two species, so the maximum-likelihood estimates of the logistic model diverge – hence the very large coefficients, standard errors, and p-values near 1 in the outputs above; Weka's small ridge parameter regularizes the fit to keep the coefficients finite

