Lec5 Classification



Intro to classification

CSCI-P 556
ZORAN TIGANJ
Reminders/Announcements

• Don’t forget the quiz deadline today.


Today: Intro to classification

• After the regression example, we will now cover a classification example.

• We will use the MNIST dataset, a set of 70,000 small images of digits handwritten by high school students and employees of the US Census Bureau. Each image is labeled with the digit it represents.

MNIST dataset
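
A minimal loading sketch, assuming scikit-learn's fetch_openml and the conventional 60,000/10,000 train/test split (the split itself is not shown on the slide):

from sklearn.datasets import fetch_openml

# Download MNIST from OpenML: 70,000 images, each 28x28 = 784 pixel values
mnist = fetch_openml('mnist_784', version=1, as_frame=False)
X, y = mnist["data"], mnist["target"]  # y holds the digit labels as strings ('0'..'9')

# Conventional split: first 60,000 images for training, last 10,000 for testing
X_train, X_test, y_train, y_test = X[:60000], X[60000:], y[:60000], y[60000:]
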
Examples of MNIST digits

Training a Binary Classifier

• Let’s simplify the problem for now and only try to identify one digit, for example the number 5.
• This “5-detector” will be an example of a binary classifier, capable of distinguishing between just two classes: 5 and not-5.
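
A sketch of the 5-detector, assuming the X_train/y_train arrays from the loading sketch above and an SGDClassifier (a linear model trained with stochastic gradient descent; the random_state value is arbitrary):

from sklearn.linear_model import SGDClassifier

# Binary labels: True for every 5, False for every other digit
y_train_5 = (y_train == '5')
y_test_5 = (y_test == '5')

# Train the binary "5 vs not-5" classifier
sgd_clf = SGDClassifier(random_state=42)
sgd_clf.fit(X_train, y_train_5)

sgd_clf.predict([X_train[0]])  # array([ True]) if it thinks the first image is a 5
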
Performance Measures

• Let’s do cross-validation.

cross_val_score performs K-fold cross-validation, which means splitting the training set into K folds (in this case, three), then making predictions and evaluating them on each fold using a model trained on the remaining folds.

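A sketch of that evaluation, reusing the sgd_clf 5-detector from the previous sketch:

from sklearn.model_selection import cross_val_score

# Three-fold cross-validation, scoring each held-out fold by accuracy;
# returns an array with one accuracy value per fold
cross_val_score(sgd_clf, X_train, y_train_5, cv=3, scoring="accuracy")
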
Performance Measures

• Well, before you get too excited, let’s look at a very dumb classifier that just classifies every single image in the “not-5” class.

• Accuracy is not a good measure here due to the skewness in the data: this demonstrates why accuracy is generally not the preferred performance measure for classifiers, especially when you are dealing with skewed datasets (i.e., when some classes are much more frequent than others).

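One way to build that baseline, sketched below (the class name Never5Classifier is just illustrative):

import numpy as np
from sklearn.base import BaseEstimator
from sklearn.model_selection import cross_val_score

class Never5Classifier(BaseEstimator):
    """Predicts 'not 5' for every single image."""
    def fit(self, X, y=None):
        return self
    def predict(self, X):
        return np.zeros((len(X),), dtype=bool)

# About 90% of the training images are not 5s, so this useless model
# still scores roughly 90% accuracy under 3-fold cross-validation
cross_val_score(Never5Classifier(), X_train, y_train_5, cv=3, scoring="accuracy")
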
Confusion Matrix

• A much better way to evaluate the performance of a classifier is to look at the confusion matrix.
• The general idea is to count the number of times instances of class A are classified as class B.
• For example, to know the number of times the classifier confused images of 5s with 3s, you would look in the fifth row and third column of the confusion matrix.

Confusion Matrix

• Each row in a confusion matrix represents an actual class, while each column represents a predicted class.
• The first row of this matrix considers non-5 images (the negative class): 53,057 of them were correctly classified as non-5s (they are called true negatives), while the remaining 1,522 were wrongly classified as 5s (false positives).
• The second row considers the images of 5s (the positive class): 1,325 were wrongly classified as non-5s (false negatives), while the remaining 4,096 were correctly classified as 5s (true positives).

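A sketch of how such a matrix can be computed (the exact counts quoted above come from one particular training run, so your numbers may differ):

from sklearn.model_selection import cross_val_predict
from sklearn.metrics import confusion_matrix

# Out-of-fold predictions: each training instance is predicted by a model
# that did not see it during training
y_train_pred = cross_val_predict(sgd_clf, X_train, y_train_5, cv=3)

confusion_matrix(y_train_5, y_train_pred)
# Layout:  [[true negatives,  false positives],
#           [false negatives, true positives ]]
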
Precision of the classifier

• The confusion matrix gives you a lot of information, but sometimes you may prefer a more concise metric.
• An interesting one to look at is the accuracy of the positive predictions; this is called the precision of the classifier:

precision = TP / (TP + FP)

• TP is the number of true positives,
• FP is the number of false positives.

Sensitivity

• Precision is typically used along with another metric named recall, also called sensitivity or the true positive rate (TPR): the ratio of positive instances that are correctly detected by the classifier:

recall = TP / (TP + FN)

• FN is the number of false negatives.

Confusion matrix - illustration

Precision and recall of 5-detector

• Scikit-Learn provides several functions to compute classifier metrics, including precision and recall:

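A sketch of those two calls, reusing the out-of-fold predictions y_train_pred from the confusion-matrix sketch:

from sklearn.metrics import precision_score, recall_score

precision_score(y_train_5, y_train_pred)  # TP / (TP + FP)
recall_score(y_train_5, y_train_pred)     # TP / (TP + FN)
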
F1 score

• It is often convenient to combine precision and recall into a single metric called the F1 score, in particular if you need a simple way to compare two classifiers.
• The F1 score is the harmonic mean of precision and recall:

F1 = 2 * precision * recall / (precision + recall)

• Whereas the regular mean treats all values equally, the harmonic mean gives much more weight to low values.
• As a result, the classifier will only get a high F1 score if both recall and precision are high.

F1 score of 5-detector
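
A sketch of the corresponding scikit-learn call, again using y_train_pred:

from sklearn.metrics import f1_score

# Harmonic mean of precision and recall for the 5-detector
f1_score(y_train_5, y_train_pred)
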
Precision/Recall Trade-off

• The F1 score favors classifiers that have similar precision and recall.
• This is not always what you want: in some contexts you mostly care about precision, and in other contexts you really care about recall.
• For example, if you trained a classifier to detect videos that are safe for kids, you would probably prefer a classifier that rejects many good videos (low recall) but keeps only safe ones (high precision).
• On the other hand, suppose you train a classifier to detect shoplifters in surveillance images: it is probably fine if your classifier has only 30% precision as long as it has 99% recall (sure, the security guards will get a few false alerts, but almost all shoplifters will get caught).
• Unfortunately, you can’t have it both ways: increasing precision reduces recall, and vice versa. This is called the precision/recall trade-off.

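A sketch of how the trade-off can be explored with decision scores and precision_recall_curve, reusing the sgd_clf 5-detector (the 0.90 precision target below is just an illustrative choice):

import numpy as np
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import precision_recall_curve

# Decision scores instead of hard predictions: higher score = more "5-like"
y_scores = cross_val_predict(sgd_clf, X_train, y_train_5, cv=3,
                             method="decision_function")

# Precision and recall for every possible decision threshold
precisions, recalls, thresholds = precision_recall_curve(y_train_5, y_scores)

# Raising the threshold increases precision but lowers recall (and vice versa);
# here we pick the lowest threshold that reaches at least 90% precision
threshold_90 = thresholds[np.argmax(precisions >= 0.90)]
y_train_pred_90 = (y_scores >= threshold_90)
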
The receiver operating characteristic (ROC) curve

• The receiver operating characteristic (ROC) curve is another common tool used with binary classifiers.
• It is very similar to the precision/recall curve, but instead of plotting precision versus recall, the ROC curve plots the true positive rate (another name for recall) against the false positive rate (FPR).
• The FPR is the ratio of negative instances that are incorrectly classified as positive. It is equal to 1 – TNR, where the true negative rate (TNR) is the ratio of negative instances that are correctly classified as negative.
• The TNR is also called specificity. Hence, the ROC curve plots sensitivity (recall) versus 1 – specificity.

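A sketch of plotting the ROC curve from the same decision scores (y_scores from the trade-off sketch above):

import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve

# FPR and TPR for every possible decision threshold
fpr, tpr, thresholds = roc_curve(y_train_5, y_scores)

plt.plot(fpr, tpr, label="SGD 5-detector")
plt.plot([0, 1], [0, 1], 'k--', label="purely random classifier")
plt.xlabel("False positive rate (1 - specificity)")
plt.ylabel("True positive rate (recall / sensitivity)")
plt.legend()
plt.show()
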
Area under the curve (AUC)

• One way to compare classifiers is to measure the area under the curve (AUC).
• A perfect classifier will have a ROC AUC equal to 1, whereas a purely random classifier will have a ROC AUC equal to 0.5.
• Scikit-Learn provides a function to compute the ROC AUC:

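A minimal sketch using roc_auc_score with the decision scores from above:

from sklearn.metrics import roc_auc_score

# 1.0 for a perfect classifier, 0.5 for a purely random one
roc_auc_score(y_train_5, y_scores)
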
Comparing different classifiers using ROC

OvR vs OvO

u Some algorithms (such as SGD classifiers, Random Forest classifiers, and


naive Bayes classifiers) are capable of handling multiple classes natively.
u Others (such as Logistic Regression or Support Vector Machine classifiers)
are strictly binary classifiers. However, there are various strategies that you
can use to perform multiclass classification with multiple binary classifiers.
u One way to create a system that can classify the digit images into 10 classes
(from 0 to 9) is to train 10 binary classifiers, one for each digit (a 0-detector, a 1-
detector, a 2-detector, and so on). This is called the one-versus-the-rest (OvR)
strategy (also called one-versus-all ).
26
OvR vs OvO

u Some algorithms (such as SGD classifiers, Random Forest classifiers, and


naive Bayes classifiers) are capable of handling multiple classes natively.
u Others (such as Logistic Regression or Support Vector Machine classifiers)
are strictly binary classifiers. However, there are various strategies that you
can use to perform multiclass classification with multiple binary classifiers.
u Another strategy is to train a binary classifier for every pair of digits: one to
distinguish 0s and 1s, another to distinguish 0s and 2s, another for 1s and 2s, and
so on. This is called the one-versus-one (OvO) strategy.
u The main advantage of OvO is that each classifier only needs to be trained on
the part of the training set for the two classes that it must distinguish.
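A sketch of forcing either strategy explicitly with scikit-learn's wrapper classes (wrapping an SVC here is just an illustrative choice; the full multiclass labels y_train are used, not the binary y_train_5):

from sklearn.multiclass import OneVsRestClassifier, OneVsOneClassifier
from sklearn.svm import SVC

# OvR: 10 binary classifiers, one per digit; the highest score wins
ovr_clf = OneVsRestClassifier(SVC())

# OvO: one binary classifier per pair of digits, 10 * 9 / 2 = 45 in total
ovo_clf = OneVsOneClassifier(SVC())

# Training on a subset keeps this sketch fast; SVC scales poorly with dataset size
ovo_clf.fit(X_train[:1000], y_train[:1000])
ovo_clf.predict([X_train[0]])
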
Multilabel Classification

• Consider a face-recognition classifier: what should it do if it recognizes several people in the same picture?
• It should attach one tag per person it recognizes. Say the classifier has been trained to recognize three faces: Alice, Bob, and Charlie.
• Then when the classifier is shown a picture of Alice and Charlie, it should output [1, 0, 1] (meaning “Alice yes, Bob no, Charlie yes”).
• Such a classification system that outputs multiple binary tags is called a multilabel classification system.

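A minimal multilabel sketch on MNIST rather than faces (the two tags here, "digit is 7 or larger" and "digit is odd", are purely illustrative; KNeighborsClassifier supports multilabel targets directly):

import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Two binary tags per image: [is the digit >= 7, is the digit odd]
y_train_int = y_train.astype(np.uint8)
y_large = (y_train_int >= 7)
y_odd = (y_train_int % 2 == 1)
y_multilabel = np.c_[y_large, y_odd]

knn_clf = KNeighborsClassifier()
knn_clf.fit(X_train, y_multilabel)
knn_clf.predict([X_train[0]])  # e.g. array([[False,  True]]) for an image of a 5
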
Next time

• Linear regression, from Chapter 4 of the Hands-On Machine Learning textbook.
