Chap 3, Part 1: Classification

Classification

ML Team
Outline:
1- Introduction
2- Classification:
- Basic concepts
- Process
3- Classification Methods Evaluation:
- Confusion Matrix
- Accuracy, Recall, Precision and F1 Score
- ROC Curve
- AUC
Introduction:
What is Classification?

• Classification is a machine learning task that predicts the value of a categorical class label: for a given observation (data point), it identifies and assigns the corresponding class (label).
• It is an instance of Supervised Learning (the analogous task in Unsupervised Learning is known as Clustering).
• The quality of a classification model is evaluated, for example through its error rate.
Data and Goal:

• Data: A set of data records (also called examples,


instances or cases) described by
– k attributes: A1, A2, … Ak.
– a class: Each example is labelled with a pre-defined
class.
• Goal: To learn a classification model from the data that
can be used to predict the classes of new (future, or test)
cases/instances.
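As an illustration (not part of the original slides), here is a minimal sketch of how such a labelled dataset could be represented in Python: one row of attribute values per example, plus one pre-defined class label per example. All values are made up.

```python
# Hypothetical toy dataset: 3 examples, k = 3 attributes (A1, A2, A3), one class label each.
X = [
    [5.1, 3.5, 1.4],   # attributes of example 1
    [6.2, 2.9, 4.3],   # attributes of example 2
    [4.9, 3.0, 1.5],   # attributes of example 3
]
y = ["class_A", "class_B", "class_A"]   # pre-defined class of each example
```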
Classification Problems: Examples
1) Email spam detection
2) Handwritten digit recognition
3) Customer behavior prediction
4) Image classification
5) Anomaly detection problems such as fraud detection
6) Etc.
Classification Process:

Step 1: Construction of the model from the learning set (training set).

Step 2: Use of the model:
- Test the accuracy of the model on the test set.
- Use it to predict the class of new observations.
Step 1: Building the classification Model

• Divide the labelled data into a set of training data (used for the learning process) and a set of testing data (used for the evaluation process).
Data Repartition
Commonly:
1) Training set: 80% (or 75%) of the data.
2) Testing set: 20% (or 25%) of the data.
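A minimal sketch of this 80/20 repartition using scikit-learn's train_test_split; the dataset is generated synthetically here only to keep the snippet self-contained.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Hypothetical labelled dataset: 1000 examples, 10 attributes, 2 classes.
X, y = make_classification(n_samples=1000, n_features=10, random_state=0)

# 80% training / 20% testing (use test_size=0.25 for a 75/25 split).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=42, stratify=y
)
print(X_train.shape, X_test.shape)   # (800, 10) (200, 10)
```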
Cross Validation
• Cross-validation consists of training and then validating the model on several possible partitions of the training set:
- Divide the training data into k subsets (folds).
- In turn, use k-1 folds as training data and the remaining fold as validation data.
k-fold cross-validation
• Using k-fold cross-validation for hyper-parameter tuning is common when the size of the training data is small.
  – It also leads to a better and less noisy estimate of the model performance by averaging the results across several folds.
• E.g., 5-fold cross-validation (sketched in code below):
  1. Split the training data into 5 equal folds.
  2. First use folds 2-5 for training and fold 1 for validation.
  3. Repeat by using fold 2 for validation, then fold 3, fold 4, and fold 5.
  4. Average the results over the 5 runs (for reporting purposes).
  5. Once the best hyper-parameters are determined, evaluate the model on the test data.
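A minimal sketch of 5-fold cross-validation with scikit-learn's cross_val_score; the synthetic data and the logistic-regression model are illustrative assumptions, not choices made by these slides.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
model = LogisticRegression(max_iter=1000)

# Each of the 5 folds is used once for validation while the other 4 folds are used for training.
scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
print(scores)          # one accuracy value per fold
print(scores.mean())   # averaged, less noisy estimate of model performance
```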
Step 1: Building the classification Model

• The chosen machine learning algorithm creates a predictive model from the training data.
Step 2: Evaluation of the classification Model

• The testing data are used to evaluate the constructed model.
• Based on the obtained results, the model may be modified or retrained on new data.
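The two steps can be sketched in a few lines of scikit-learn code; the decision tree and the synthetic data below are arbitrary illustrative choices, not the method of these slides.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Step 1: build the model from the training set.
model = DecisionTreeClassifier(max_depth=5, random_state=0)
model.fit(X_train, y_train)

# Step 2: evaluate the model on the testing set.
accuracy = model.score(X_test, y_test)
print("accuracy:", accuracy, "error rate:", 1 - accuracy)
```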
Evaluation of classification methods
• Evaluating the performance of a classification model is essential:
  • To know whether the model is globally significant: does my model really reflect causality?
  • To get an idea of deployment performance: how reliable will my model be (and at what cost) once I use it?
  • To compare several candidate models: which of several models will be the most effective given my objectives?

⇒ The measurement and evaluation of the performance of a classification model is always done on the test sample: it is necessary to test the performance of the model on data that have not been used to build the classification model.
Confusion Matrix:
• To evaluate the performance of a classification model, we present four indicators which are calculated from the confusion matrix:

1) Accuracy
2) Recall
3) Precision
4) F1 score
Example:
• We have a database of customers who have subscribed to a service: some are still subscribers, others have cancelled the service.
• We build a churn score: for each customer, we predict whether they will cancel or keep their subscription the following month.
• What is the performance of this score?
• How much can I trust it to predict future cancellations?
Confusion Matrix:
The confusion matrix cross-tabulates predicted classes against actual classes, yielding the counts of true positives (TP), false positives (FP), false negatives (FN) and true negatives (TN).

Accuracy:
Accuracy = (TP + TN) / (TP + TN + FP + FN): the proportion of all examples that are correctly classified.

Precision:
Precision = TP / (TP + FP)

• Precision answers the following question: what proportion of positive identifications was actually correct?
⇒ A classification model producing no false positives has a precision of 1.0.
Recall:
Recall = TP / (TP + FN): the number of correctly classified positive examples divided by the total number of actual positive examples in the test set.

• Recall answers the following question: what proportion of actual positives was identified correctly?
• It gives an indication of the proportion of false negatives.
⇒ A model producing no false negatives has a recall of 1.0.
An example

• Consider a confusion matrix in which we classified only one positive example correctly and no negative examples wrongly (for instance TP = 1, FP = 0, FN = 99). It gives:
  – precision p = 100% and
  – recall r = 1%.
• Note: precision and recall only measure classification performance on the positive class.
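The original confusion matrix is not reproduced here, but any counts with TP = 1, FP = 0 and FN = 99 (the TN count does not matter) give the same result, as this small check shows:

```python
# Hypothetical counts consistent with the example: one true positive found,
# no negatives classified wrongly, 99 positives missed.
TP, FP, FN, TN = 1, 0, 99, 900

precision = TP / (TP + FP)   # 1.0  -> 100%
recall    = TP / (TP + FN)   # 0.01 ->   1%
print(precision, recall)
```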
Precision and recall measures

• To evaluate the performance of a model in a complete way, it is necessary to analyze both the precision and the recall.
• Precision and recall are frequently in tension: improving precision usually comes at the expense of recall, and vice versa.
• Various tools have been created to assess precision and recall simultaneously; the F-score is one of them.
F1-score
• It is hard to compare two classifiers using two measures. The F1 score combines precision and recall into one measure:

F1 = 2 * p * r / (p + r)

• It is the harmonic mean of precision and recall; the harmonic mean of two numbers tends to be closer to the smaller of the two.
• For the F1 value to be large, both p and r must be large.
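A small sketch computing precision, recall and F1 with scikit-learn on made-up labels, and checking that the F1 score equals the harmonic mean 2pr/(p + r):

```python
from sklearn.metrics import f1_score, precision_score, recall_score

# Made-up ground-truth labels and predictions.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]

p = precision_score(y_true, y_pred)   # TP / (TP + FP)
r = recall_score(y_true, y_pred)      # TP / (TP + FN)
f1 = f1_score(y_true, y_pred)         # harmonic mean of p and r
print(p, r, f1, 2 * p * r / (p + r))  # the last two values are equal
```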
Receiver Operating Characteristic curve
• It is commonly called the ROC curve.
• It is a plot of the true positive rate (TPR) against the false positive rate (FPR).

• True positive rate: TPR = TP / (TP + FN)
• False positive rate: FPR = FP / (FP + TN)
ROC Curve: Sensitivity and Specificity
• In statistics, there are two other evaluation measures:
  – Sensitivity: same as TPR.
  – Specificity: also called the True Negative Rate (TNR), Specificity = TN / (TN + FP).

• Then we have: Sensitivity = TPR and FPR = 1 - Specificity.
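A quick numeric check of these relations on hypothetical confusion-matrix counts:

```python
# Hypothetical confusion-matrix counts.
TP, FP, FN, TN = 80, 20, 10, 90

tpr = TP / (TP + FN)           # true positive rate = sensitivity
fpr = FP / (FP + TN)           # false positive rate
specificity = TN / (TN + FP)   # true negative rate
print(tpr, fpr, 1 - specificity)   # fpr equals 1 - specificity
```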
Receiver Operating Characteristic curve
• The ROC curve is a tool for evaluating and comparing models:
- Independently of the misassignment (confusion) matrix: it allows us to know whether a model M1 will be better than a model M2 regardless of the confusion matrix.
- It remains usable even with very unbalanced class distributions, without the perverse effects the confusion matrix suffers from when an assignment must be made.
- It is a graphical tool that visualizes performance: a single glance should allow us to see the model best suited to our purposes.
Example ROC curves
Drawing an ROC curve
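The example curves and the drawing procedure from the slides are not reproduced here; the following sketch shows one common way to draw an ROC curve with scikit-learn and matplotlib (synthetic data and a logistic-regression scorer, chosen only for illustration):

```python
import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
scores = model.predict_proba(X_test)[:, 1]   # predicted probability of the positive class

# One (FPR, TPR) point per decision threshold.
fpr, tpr, thresholds = roc_curve(y_test, scores)
plt.plot(fpr, tpr, label="model")
plt.plot([0, 1], [0, 1], "--", label="random guessing")
plt.xlabel("False positive rate")
plt.ylabel("True positive rate")
plt.legend()
plt.show()
```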
Area under the curve (AUC)
• Which classifier is better, C1 or C2?
  – It depends on which region of the curve you look at.
• Can we have a single measure?
  – Yes: we compute the area under the curve (AUC).
• If the AUC of Ci is greater than that of Cj, Ci is said to be better than Cj.
  – If a classifier is perfect, its AUC value is 1.
  – If a classifier makes only random guesses, its AUC value is 0.5.
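A minimal sketch comparing two classifiers by their AUC with scikit-learn's roc_auc_score; the two models and the synthetic data are illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Two candidate classifiers, compared by a single number: the AUC.
c1 = LogisticRegression(max_iter=1000).fit(X_train, y_train)
c2 = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)

auc_c1 = roc_auc_score(y_test, c1.predict_proba(X_test)[:, 1])
auc_c2 = roc_auc_score(y_test, c2.predict_proba(X_test)[:, 1])
print(auc_c1, auc_c2)   # 1.0 = perfect classifier, 0.5 = random guessing
```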
ROC curve Comparison
AUC Evaluation

Perfect case:
• The curve of M1 is always above that of M2:
⇒ there cannot exist a situation where M2 would be a better classification model.
AUC Evaluation

Possible case: overlap between ROC curves
• M1 is dominated by all the other models, so it can be eliminated.
• In our example, the convex hull is formed by the curves of M3 and M2.
• M4 may be better than M3 in some cases, but in those cases it is outperformed by M2.
⇒ M4 can be eliminated.
Mann-Whitney test
• A rank-based statistic used to show that two distributions are different.
• In our context, it is used to show that the positives (+) obtain, on average, higher scores than the negatives (-).
• A statistical test can be derived from the sum of the ranks of the positives: S+.

• Mann-Whitney measure: U+ = S+ - n+(n+ + 1)/2 and AUC = U+ / (n+ x n-), where n+ and n- denote the numbers of positive and negative examples.
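A small sketch, on made-up scores, showing that the AUC can indeed be recovered from the rank-sum of the positives, in agreement with scikit-learn's roc_auc_score (this version assumes no tied scores):

```python
import numpy as np
from sklearn.metrics import roc_auc_score

# Hypothetical scores assigned by a classifier to positive (1) and negative (0) examples.
y_true = np.array([1, 1, 1, 0, 0, 0, 0])
scores = np.array([0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1])

# Ranks over the pooled scores (rank 1 = smallest score), then the rank-sum of the positives.
ranks = scores.argsort().argsort() + 1
s_plus = ranks[y_true == 1].sum()

n_pos = (y_true == 1).sum()
n_neg = (y_true == 0).sum()

# Mann-Whitney U statistic of the positives and its normalisation, which equals the AUC.
u_plus = s_plus - n_pos * (n_pos + 1) / 2
auc_from_ranks = u_plus / (n_pos * n_neg)
print(auc_from_ranks, roc_auc_score(y_true, scores))   # the two values agree
```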
Classification Algorithms
• k-Nearest Neighbours (k-NN)
• Decision Tree
• Support Vector Machine (SVM)
• Naive Bayes
• Logistic Regression
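As an illustrative (not prescriptive) sketch, the algorithms listed above can be compared on a synthetic dataset with scikit-learn and 5-fold cross-validation; default hyper-parameters are used here.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)

models = {
    "k-NN": KNeighborsClassifier(n_neighbors=5),
    "Decision Tree": DecisionTreeClassifier(max_depth=5, random_state=0),
    "SVM": SVC(),
    "Naive Bayes": GaussianNB(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
}

# Mean cross-validated accuracy for each algorithm.
for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
    print(f"{name}: {scores.mean():.3f}")
```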
