
Binary Classification PDF

Binary classification is a machine learning technique that categorizes data points into one of two classes. It works by training on a labeled dataset to learn the relationship between input features and binary outputs. Common models for binary classification include logistic regression, neural networks, support vector machines, random forests, and naive Bayes. Performance is evaluated using metrics like accuracy, precision, recall, F1 score, and the ROC curve. Challenges include imbalanced classes, overfitting, label noise, feature selection, interpretability, and scalability to large datasets.

Uploaded by

ashish kadam


Binary Classification

What is Binary Classification?

In machine learning, binary classification is a supervised learning task in which a model assigns each new observation to one of two outcomes, usually represented as 0 or 1, true or false, or positive or negative.

Examples include predicting whether a credit card transaction is fraudulent, whether an email is spam, and whether a customer will purchase a product.
How Binary Classification Works

In binary classification, the algorithm is trained on a labeled dataset, where each data point is associated with a binary label.

The algorithm then learns to map the input features to the corresponding binary label. Once trained, the algorithm can be used to predict the binary label for new, unseen data points.
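The train-then-predict workflow described above can be sketched with a deliberately tiny learner. This is a minimal illustration, not one of the models from the slides: the dataset values, the single "transaction amount" feature, and the helper names `fit_threshold` and `predict` are all assumptions made for the example.

```python
# Toy labeled dataset: one feature (transaction amount) and a binary label
# (1 = fraud, 0 = not fraud). The values are made up for illustration.
train_X = [12.0, 25.0, 40.0, 55.0, 300.0, 450.0, 520.0, 610.0]
train_y = [0, 0, 0, 0, 1, 1, 1, 1]


def fit_threshold(xs, ys):
    """Learn the cut-off that misclassifies the fewest training points.

    The learned rule is: predict 1 if x > threshold, else 0.
    """
    values = sorted(set(xs))
    # Candidate cut-offs: midpoints between consecutive distinct values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]
    best_threshold, best_errors = None, len(xs) + 1
    for t in candidates:
        errors = sum((x > t) != bool(y) for x, y in zip(xs, ys))
        if errors < best_errors:
            best_threshold, best_errors = t, errors
    return best_threshold


def predict(threshold, xs):
    """Apply the learned rule to new, unseen data points."""
    return [1 if x > threshold else 0 for x in xs]


threshold = fit_threshold(train_X, train_y)   # training step
predictions = predict(threshold, [30.0, 700.0])  # prediction step
print(threshold, predictions)  # 177.5 [0, 1]
```

Real models learn from many features at once, but the two phases are the same: fit parameters on labeled data, then apply them to unlabeled data.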
Common Binary Classification Models

Logistic Regression

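Logistic regression is used for binary classification problems, where the output variable is categorical with two possible values. It models the probability of the positive class by passing a weighted sum of the input features through the logistic (sigmoid) function. For example, it can predict whether a customer will buy a product based on their demographics and purchase history.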
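As a rough sketch of how logistic regression can be fitted, the block below uses plain batch gradient descent on the log loss. The learning rate, the number of epochs, the two features (a scaled age and a count of past purchases), and the data values are all assumptions chosen for illustration; production libraries use more robust solvers and regularization.

```python
import math


def sigmoid(z):
    """Logistic function: maps any real number into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n, n_features = len(X), len(X[0])
    w, b = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for xi, yi in zip(X, y):
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
            err = p - yi  # gradient of the log loss w.r.t. the linear score
            for j in range(n_features):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict_proba(w, b, x):
    """Estimated probability that x belongs to the positive class."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


# Hypothetical customers: [age scaled to 0-1, number of past purchases].
X = [[0.2, 0], [0.3, 1], [0.4, 0], [0.5, 4], [0.6, 5], [0.7, 6]]
y = [0, 0, 0, 1, 1, 1]  # 1 = bought the product

w, b = fit_logistic(X, y)
will_buy = predict_proba(w, b, [0.55, 4.5]) >= 0.5
wont_buy = predict_proba(w, b, [0.3, 0.0]) >= 0.5
print(will_buy, wont_buy)
```

The 0.5 cut-off used to turn a probability into a label is itself a choice; other thresholds trade precision against recall.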
Neural Networks

Neural networks are designed to recognize patterns in raw input and to interpret sensory data such as images or audio. Despite their many advantages, neural networks require significant computational resources, and fitting one can become complicated as the number of observations grows into the thousands and beyond.
Support Vector Machines

A support vector machine (SVM) is typically used for classification problems. It constructs a hyperplane that maximizes the margin, that is, the distance between the hyperplane and the nearest data points of each class. This hyperplane is known as the decision boundary, and it separates the classes of data points (e.g., oranges vs. apples) on either side of the plane.
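To make the idea of a decision boundary concrete, the sketch below classifies points by which side of a hyperplane they fall on. It does not train an SVM: the weight vector `w` and bias `b` are hand-picked, hypothetical values standing in for the parameters a fitted linear SVM would produce, and the two features are unnamed placeholders.

```python
import math

# Hypothetical parameters of an already-fitted linear SVM (not learned here).
w = [1.0, -1.0]
b = 0.0


def decision_function(x):
    """Linear score w.x + b; its sign says which side of the hyperplane x is on."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def signed_distance(x):
    """Geometric distance from x to the hyperplane, positive on the 'orange' side."""
    norm = math.sqrt(sum(wj * wj for wj in w))
    return decision_function(x) / norm


def predict(x):
    """Assign a class according to the side of the decision boundary."""
    return "orange" if decision_function(x) >= 0 else "apple"


print(predict([3.0, 1.0]), predict([1.0, 3.0]))  # orange apple
print(round(signed_distance([3.0, 1.0]), 3))     # 1.414
```

Training an SVM amounts to choosing `w` and `b` so that the smallest such distance over the training points is as large as possible.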
Random Forest

Random forest is another flexible supervised machine learning algorithm, used for both classification and regression. It is an ensemble learning algorithm that combines multiple decision trees to improve accuracy and reduce overfitting.
Naive Bayes

Naive Bayes assumes that the features (input variables) are conditionally independent of each other given the class label. This is a "naive" assumption because, in reality, features may be correlated with each other. The three main types of Naive Bayes algorithms are Gaussian Naive Bayes, Multinomial Naive Bayes, and Bernoulli Naive Bayes.
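The independence assumption is what keeps the computation simple: per-feature likelihoods are multiplied (or their logs added). Below is a minimal sketch of the Bernoulli variant for binary features, with Laplace smoothing. The spam example, the two word-presence features, and the function names are assumptions for illustration, and the sketch expects both classes to appear in the training data.

```python
import math


def fit_bernoulli_nb(X, y, alpha=1.0):
    """Estimate class priors and P(feature_j = 1 | class) for classes 0 and 1.

    alpha is the Laplace smoothing constant, so no probability is exactly 0 or 1.
    """
    model = {}
    n_features = len(X[0])
    for c in (0, 1):
        rows = [xi for xi, yi in zip(X, y) if yi == c]
        prior = len(rows) / len(y)
        probs = [
            (sum(r[j] for r in rows) + alpha) / (len(rows) + 2 * alpha)
            for j in range(n_features)
        ]
        model[c] = (prior, probs)
    return model


def predict_nb(model, x):
    """Return the class with the larger log posterior (up to a shared constant)."""
    scores = {}
    for c, (prior, probs) in model.items():
        log_p = math.log(prior)
        for xj, pj in zip(x, probs):
            # Independence assumption: per-feature log-likelihoods simply add up.
            log_p += math.log(pj if xj == 1 else 1.0 - pj)
        scores[c] = log_p
    return max(scores, key=scores.get)


# Hypothetical emails: [contains "free", contains "meeting"]; label 1 = spam.
X = [[1, 0], [1, 0], [1, 1], [0, 1], [0, 1], [0, 0]]
y = [1, 1, 1, 0, 0, 0]

model = fit_bernoulli_nb(X, y)
print(predict_nb(model, [1, 0]), predict_nb(model, [0, 1]))  # 1 0
```

Gaussian and Multinomial Naive Bayes follow the same pattern and differ only in how the per-feature likelihood is modeled.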
Evaluating a Binary Classification Model

Key Concepts

For example, in a medical diagnosis scenario:

True Positive (TP) is when the patient is diseased and the model predicts "diseased".

False Positive (FP), or Type I error, is when the patient is healthy but the model predicts "diseased".

True Negative (TN) is when the patient is healthy and the model predicts "healthy".

False Negative (FN), or Type II error, is when the patient is diseased and the model predicts "healthy".
Impact of False Negatives and False Positives

False negatives and false positives can have different impacts depending on the specific problem and context of the classification model.

In a medical diagnosis scenario, a false negative can result in a patient not receiving the necessary treatment for a disease, leading to a worsened health condition.

In airport security screening, a false positive result for a potential threat can result in unnecessary delays and inconvenience for the passengers.
Confusion Matrix

                      ACTUAL Positive                 ACTUAL Negative
PREDICTED Positive    True Positive (TP)              False Positive (FP)
                      Correctly predicts a            Incorrectly predicts a
                      diseased patient as diseased    healthy patient as diseased
PREDICTED Negative    False Negative (FN)             True Negative (TN)
                      Incorrectly predicts a          Correctly predicts a
                      diseased patient as healthy     healthy patient as healthy
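The four cells of the confusion matrix can be counted directly from paired lists of actual and predicted labels. The sketch below assumes labels are encoded as 1 (diseased) and 0 (healthy); the function name and the sample labels are made up for illustration.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FP, TN, FN for labels encoded as 1 (positive) and 0 (negative)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


# Hypothetical results for eight patients.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]  # actual condition
y_pred = [1, 0, 0, 1, 0, 0, 0, 0]  # model prediction

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
print(tp, fp, tn, fn)  # 1 1 4 2
```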
Example:

                      ACTUAL Positive    ACTUAL Negative
PREDICTED Positive    TP = 5             FP = 10
PREDICTED Negative    FN = 15            TN = 70

Accuracy = (TP + TN) / (TP + TN + FP + FN) = (5 + 70) / (5 + 70 + 10 + 15) = 0.75

Recall = TP / (TP + FN) = 5 / (5 + 15) = 0.25

Precision = TP / (TP + FP) = 5 / (5 + 10) ≈ 0.33

F1 Score = 2 x (Precision x Recall) / (Precision + Recall) ≈ 0.29
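The arithmetic for this example can be checked with a few lines of code. This is a minimal sketch using the counts from the example matrix (TP = 5, FP = 10, FN = 15, TN = 70); the function name is an assumption, and the sketch does not guard against division by zero.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, recall, precision and F1 from confusion-matrix counts.

    Assumes every denominator is non-zero, as in the example counts.
    """
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    recall = tp / (tp + fn)
    precision = tp / (tp + fp)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, recall, precision, f1


acc, rec, prec, f1 = classification_metrics(tp=5, fp=10, fn=15, tn=70)
print(round(acc, 2), round(rec, 2), round(prec, 2), round(f1, 2))
# 0.75 0.25 0.33 0.29
```

With these counts the F1 score is exactly 2/7, about 0.2857, which rounds to 0.29 at two decimal places.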
Accuracy

Accuracy = (TP + TN) / (TP + TN + FP + FN)

Accuracy = Number of correct answers / Total number of answers

When we want to analyze the performance of a binary classifier, the most common and accessible metric is accuracy. It tells us how many items in our dataset the model classified correctly, as a fraction of the total.

Accuracy is not recommended as an evaluation metric when we are working with an imbalanced dataset, because a model can score highly simply by always predicting the majority class.
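A small made-up example shows why accuracy can mislead here. Assume 100 samples of which only 5 are positive, and a "model" that always predicts the negative class; both the class ratio and the trivial model are assumptions for illustration.

```python
# Hypothetical imbalanced dataset: 95 negatives and 5 positives.
y_true = [0] * 95 + [1] * 5
# A trivial "model" that always predicts the majority (negative) class.
y_pred = [0] * 100

accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)
tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
recall = tp / (tp + fn)

print(accuracy, recall)  # 0.95 0.0
```

The model reaches 95% accuracy while detecting none of the positive cases, which is why recall, precision, and F1 are usually reported alongside accuracy.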
Recall or Sensitivity

Recall = TP / (TP + FN)

Recall is also called sensitivity, or the true positive rate, because it measures how sensitive the model is to the positive class: the fraction of actual positives that the model detects. Pushing recall higher usually makes the model less precise, because it also starts to classify negative examples as positive.

E.g., in the case of tumor detection, we want our model to have high recall, so that every case that might be malignant is flagged and subjected to human inspection. We don't want a malignant tumor to go unnoticed, and we will gladly accept false positives.
Precision or Positive Predictive Value

Precision = TP / (TP + FP)

Precision is the accuracy calculated only over the examples predicted as positive. It is also called positive predictive value, and it tells us how often we are correct when we classify an example as positive. Precision should not be confused with specificity, which is the true negative rate, TN / (TN + FP).

A high precision model is conservative: it doesn't always recognize the positive class, but when it does, we can be confident that its answer is correct.

A high recall model is liberal: it recognizes the positive class much more often, but in doing so it tends to include a lot of noise as well (false positives).
Precision / Recall Trade-off

Both precision and recall range from 0 to 1. As a general rule of thumb, the closer to 1, the better the model is. Unfortunately, for a given model you usually can't have the best of both worlds: raising the classification threshold to increase precision tends to lower recall, and vice versa.
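The trade-off is easiest to see by sweeping the decision threshold over a fixed set of model scores. In the sketch below, the scores, labels, and thresholds are made-up values, and reporting a precision of 0.0 when nothing is predicted positive is a convention assumed for this example only.

```python
def precision_recall_at(threshold, y_true, scores):
    """Precision and recall when predicting positive for scores >= threshold."""
    y_pred = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    # Assumed convention: precision is 0.0 if no positives are predicted.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn)
    return precision, recall


# Hypothetical model scores (estimated probability of the positive class).
y_true = [0, 0, 1, 0, 1, 1]
scores = [0.1, 0.3, 0.4, 0.6, 0.7, 0.9]

results = {t: precision_recall_at(t, y_true, scores) for t in (0.2, 0.5, 0.8)}
for t, (precision, recall) in results.items():
    print(t, round(precision, 2), round(recall, 2))
# 0.2 0.6 1.0
# 0.5 0.67 0.67
# 0.8 1.0 0.33
```

As the threshold rises from 0.2 to 0.8, precision climbs from 0.6 to 1.0 while recall falls from 1.0 to 0.33.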
F1 Score

F1 Score = 2 x (Precision x Recall) / (Precision + Recall)

The F1 score combines precision and recall into one metric.

It is the harmonic mean of precision and recall, and is one of the most widely used metrics for evaluating binary classification models.

If our F1 score increases, it means that our model has improved in precision, recall, or both.
ROC Curve

A Receiver Operating Characteristic (ROC) curve is a plot of the True Positive Rate (TPR = TP / (TP + FN), the same quantity as recall) against the False Positive Rate (FPR = FP / (FP + TN)) for different classification thresholds. Generally, the closer the ROC curve is to the upper left corner, the better the model performs.
ROC AUC

The area under the ROC curve (AUC) is a single scalar value that measures the overall performance of the model. The AUC ranges from 0 to 1, with a higher value indicating better performance. An AUC of 0.5 indicates a random guess, while an AUC of 1.0 indicates perfect classification.
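Both the ROC curve and its AUC can be computed by hand for a small set of scores. The sketch below lowers the threshold one distinct score at a time to collect (FPR, TPR) points, then integrates with the trapezoidal rule. The labels, scores, and function names are assumptions for illustration, and the sketch expects at least one positive and one negative label.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) points obtained by lowering the threshold step by step."""
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for t, s in zip(y_true, scores) if s >= threshold and t == 1)
        fp = sum(1 for t, s in zip(y_true, scores) if s >= threshold and t == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points):
    """Area under the piecewise-linear ROC curve (trapezoidal rule)."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Hypothetical labels and model scores.
y_true = [0, 0, 1, 0, 1, 1]
scores = [0.1, 0.3, 0.4, 0.6, 0.7, 0.9]

points = roc_points(y_true, scores)
print(round(auc(points), 3))  # 0.889
```

The result, 8/9, matches another reading of AUC: of the nine positive-negative pairs in this data, eight have the positive example scored higher than the negative one.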
Challenges in Binary Classification Models

Imbalanced Classes: In many real-world scenarios, the positive and negative classes are not equally represented in the dataset. When one class has significantly more instances than the other, it can lead to biased models that have poor predictive performance on the minority class.

Overfitting: Overfitting occurs when the model is too complex and fits the noise in the training data instead of the underlying patterns. This can result in poor generalization performance and reduced predictive accuracy on new data.

Label Noise: In some cases, the labels in the training data may be noisy or incorrect, which can adversely affect the model's performance.

Feature Selection: The performance of binary classification models can depend heavily on the quality and relevance of the input features. Feature selection can be challenging when dealing with high-dimensional data.

Model Interpretability: Binary classification models can be highly complex and difficult to interpret, especially when using non-linear or deep learning models. Interpretability is important in many applications, such as healthcare, where the model's predictions must be explained to clinicians and patients.

Scalability: Binary classification models can require significant computational resources and memory, especially when dealing with large datasets or complex models.
Follow #DataRanch on
LinkedIn for more...

linkedin.com/company/dataranch
