Lecture 11 - 09.09.24 Classification Part 1

Classification - 1

Prof. Sashikumaar Ganesan, IISc Bangalore


Agenda

• Introduction to Classification
• Logistic Regression
• Decision Trees
• Classification Metrics
• Confusion Matrix
• Q&A
Introduction to Classification and Classification Algorithms
Classification

Classification is a type of supervised learning where the goal is to predict the categorical class labels of new instances based on labelled training data.

[Figures: location of classification on the ML tree; a sample classification]


Introduction to Classification

Different Classification Algorithms

Logistic Regression
It is a simple and widely used method for binary classification. It
uses a logistic function to model the probability of the output
class.

Decision Trees
A decision tree is a hierarchical model that partitions the
feature space into a set of rectangular regions. Each leaf
node represents a class label.

Image Credits : Logistic regression – statquest , Decision Tree: cs.cmu.edu


Introduction to Classification

Different Classification Algorithms – Contd.

Random Forest
• Random forest is an ensemble method that combines
multiple decision trees to improve the prediction
accuracy.
• It creates a set of decision trees on randomly selected
subsets of the training data and then combines their
predictions.
Naive Bayes
• It is a probabilistic classifier that uses Bayes'
theorem to predict the class label of a new
instance.
• Naive Bayes assumes that the features are
conditionally independent given the class label.

Image Credits : Random Forest – wikipedia , Naive Bayes: https://medium.com/analytics-vidhya/na%C3%AFve-bayes-algorithm-5bf31e9032a2
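To make these four algorithms concrete, here is a minimal sketch that trains each of them on the same data; the synthetic dataset and parameter choices below are illustrative assumptions, not taken from the lecture.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import GaussianNB

# Illustrative synthetic binary-classification dataset
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Decision Tree": DecisionTreeClassifier(random_state=42),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
    "Naive Bayes": GaussianNB(),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    print(name, model.score(X_test, y_test))  # mean accuracy on held-out data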


Introduction to Classification

Different Classification Algorithms – Contd.

Support Vector Machines (SVM)


• SVM is a powerful classification method that constructs a hyperplane in a high-dimensional space to separate the classes.
• SVMs maximize the margin between the hyperplane and the closest points from each class.

Image Credits : wikipedia


Introduction to Classification
SVM Use Cases

When to use SVM?
• Binary classification with clear separation
• High-dimensional, small/medium datasets

Types of SVM:
• Linear SVM
• Non-linear SVM (kernel-based)
• Support Vector Regression (SVR)

When not to use SVM?
• Very large or imbalanced datasets
• When probabilistic outputs are needed

Image Credits : wikipedia
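As a rough sketch of the SVM types listed above (the toy data and kernel settings are my own illustration):

from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, n_features=10, random_state=0)

linear_svm = SVC(kernel="linear")            # Linear SVM
rbf_svm = SVC(kernel="rbf", gamma="scale")   # Non-linear (kernel-based) SVM

for model in (linear_svm, rbf_svm):
    model.fit(X, y)
    print(model.kernel, model.score(X, y))

Note that SVC returns decision scores rather than probabilities by default; setting probability=True fits an extra calibration step, which is one reason SVMs are less convenient when probabilistic outputs are needed.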


Introduction to Classification

Binary Classification

• Binary classification has only two labels for classification
• Relevant examples for this model are
• Email – spam/not spam
• Anomaly / not Anomaly
• Popular algorithms used for binary classification
• Naïve Bayes
• Logistic Regression
• Support Vector Machines (SVM)
Introduction to Classification

Multi-Class Classification
• Multi-class classification has more than two labels for
classification
• Relevant examples for this model are
• Handwritten digits recognition
• Face expression classification
• Popular algorithms used for multi-class classification
• Naïve Bayes
• Random Forest
• Decision trees
• SVM
Introduction to Classification

Multi-Label Classification

• Multi-label classification involves two or more class labels, where one or more class labels may be predicted for each example.
• Relevant examples for this model are
• Image classification with multiple objects on image
• Popular Multilabel algorithms
• Multi label decision trees
• Multi Label Random forests
Image Credits : https://towardsdatascience.com/yolo-object-detection-with-opencv-and-python-21e50ac599e9
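A minimal multi-label sketch (synthetic data; the choice of a random forest, which in scikit-learn accepts a multi-hot label matrix directly, is my assumption rather than the lecture's):

from sklearn.datasets import make_multilabel_classification
from sklearn.ensemble import RandomForestClassifier

# Each row of Y is a multi-hot vector: a sample may carry several labels at once
X, Y = make_multilabel_classification(n_samples=200, n_classes=4, random_state=0)

clf = RandomForestClassifier(random_state=0).fit(X, Y)
print(Y[0])                   # e.g. [0 1 1 0] - two labels on one sample
print(clf.predict(X[:1])[0])  # the prediction is also a multi-hot vector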
Practical Example
Multi-Label vs Multi-Class -> Practical Example (Songs)

Multi-Class:
• Placing all your songs into a specific folder, such as by year or by music director
• Once placed, the song will belong to (or will be inside) that specific folder only

Multi-Label:
• Tagging your songs in your media player under different playlists
• The song can be part of multiple playlists
Classification

Summary

• Classification is a type of supervised learning where the goal is to predict the categorical class labels
• There are different types of classification models, such as decision trees, random forests, logistic regression, and SVMs
• The most used types of classification are binary, multi-class, and multi-label
• In multi-label classification, an object can be associated with multiple classes based on a probability distribution
Test your understanding

1. True or False: In multi-class classification, each instance can belong to only one class.

2. True or False: Multi-label classification allows an instance to belong to multiple classes simultaneously.

3. Which of the following is NOT typically used as a classification algorithm?
a) Random Forest b) Support Vector Machine c) Linear Regression d) Naive Bayes
Solutions

1. Answer: True. Explanation: In multi-class classification, each instance is assigned to exactly one class out of three or more possible classes. This is different from multi-label classification, where an instance can belong to multiple classes simultaneously.

2. Answer: True Explanation: In multi-label classification, each instance can be associated with
multiple classes at the same time. For example, a movie could be classified as both "action"
and "comedy".

3. Answer: Linear Regression Explanation: Linear Regression is primarily used for regression
tasks, where the goal is to predict a continuous numerical value. It's not typically used for
classification problems, which involve predicting discrete class labels.
Logistic Regression
Logistic Regression

• Logistic regression is a statistical method used for binary or multi-class classification problems
• The logistic regression model is based on the logistic
function, which maps any real-valued input to a
probability value between 0 and 1.

Image Credits : Statquest


Logistic Regression

How is it different from linear regression?

• Linear regression maps the input values to the output values (in continuous
domain)

• The logistic regression model is based on the logistic function, which maps any
real-valued input to a probability value between 0 and 1.
Logistic Regression

Solution: use a function of z that goes from 0 to 1: σ(z) = 1 / (1 + e^(−z)), where z = w∙x + b

Credits : Stanford – logistic regression


Logistic Regression

Idea of Logistic Regression

• Compute w∙x+b
• Pass it through the sigmoid function: σ(w∙x+b)
• Treat it as a probability

Here the value 0.5 is the decision boundary: if σ(w∙x+b) > 0.5, we predict the positive class; otherwise, the negative class.
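A small numerical sketch of these three steps (the weights, bias, and input below are made-up values for illustration):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical learned parameters and one input vector
w = np.array([0.5, -1.2, 0.8])
b = 0.1
x = np.array([1.0, 0.3, 2.0])

z = np.dot(w, x) + b          # step 1: compute w.x + b
p = sigmoid(z)                # step 2: map to a probability in (0, 1)
label = int(p > 0.5)          # step 3: apply the 0.5 decision boundary
print(z, p, label)            # 1.84, ~0.86, 1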
Logistic Regression

Idea of Logistic Regression

[Figure omitted. Credits: Stanford – logistic regression]


Test your understanding

1. Which of the following is the standard activation function used in Logistic Regression?
a) ReLU b) Sigmoid c) Tanh d) Softmax

2. Which of the following loss functions is typically used in Logistic Regression?
a) Mean Squared Error b) Cross-Entropy Loss c) Hinge Loss d) Huber Loss

3. A Logistic Regression model outputs a probability of 0.7 for a certain instance. If the
classification threshold is 0.6, how will this instance be classified in a binary problem?
Solutions

1. Answer: b) Sigmoid. Explanation: The sigmoid function is used to transform the output to a probability between 0 and 1.

2. Answer: b) Cross-Entropy Loss. Explanation: Cross-Entropy Loss (also known as Log Loss) is the standard loss function for Logistic Regression, as it measures the performance of a classification model whose output is a probability value between 0 and 1.

3. Answer: The instance will be classified as the positive class (typically labeled as 1). Explanation: Since 0.7 > 0.6 (the threshold), the model predicts the positive class for this instance.
Accuracy of the Classification Model
Accuracy

MNIST Handwritten Dataset

• A set of small images of digits handwritten by high school students and employees of the US Census Bureau
• Generally referred to as the "hello world" problem of ML classification
• 70,000 datapoints with 10 classes, each image of size 28x28 (784 features)

Image Credits : Statquest


Accuracy

Let's look at a binary classification model

Let's build a basic binary classification model using the MNIST dataset, where we must predict whether a digit is a "4" or not.

Positive class: images of the digit 4. Negative class: images of all other digits.

Reference: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow – Géron, 3rd Edition
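Following the approach in Géron's book (cited above), a sketch of this "4 or not 4" detector could look like the following; the choice of SGDClassifier and the exact settings are my assumptions.

from sklearn.datasets import fetch_openml
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import cross_val_score

mnist = fetch_openml("mnist_784", version=1, as_frame=False)
X, y = mnist.data, mnist.target           # y holds string labels "0".."9"
X_train, y_train = X[:60000], y[:60000]   # conventional MNIST train split

y_train_4 = (y_train == "4")              # True for 4s, False otherwise

sgd_clf = SGDClassifier(random_state=42)
print(cross_val_score(sgd_clf, X_train, y_train_4, cv=3, scoring="accuracy"))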
Accuracy

How well did the model perform?

• Based on three-fold cross-validation, our model has obtained an accuracy of 96.47%
• Which seems like a reasonably good result for the given model
But is this metric sufficient?

Did it quantify the model properly?

Reference: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow – Géron, 3rd Edition
Accuracy

Let's build our custom classifier


This classifier never predicts anything as "positive": it classifies every image as "not a 4" (negative).

Reference : Hands of Machine learning with scikit learn keras and tensorflow – Geron 3rd Edition
Accuracy

How did my custom model perform?

Prediction accuracy of the "Never 4" model: 90.3%

Reference: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow – Géron, 3rd Edition
Accuracy

How did this happen?

• To really understand this, we need to look at the distribution of class labels within our dataset
• More than 90% of the data belongs to the negative class (digits other than 4)
• This means a model that predicts every instance as negative will be about 90% accurate

Reference: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow – Géron, 3rd Edition
Accuracy

So can't I use accuracy to measure performance?

• Accuracy is still a valid measure of the performance of a classification model
• However, accuracy is not the preferred metric when we have a highly imbalanced dataset
• For example: credit card fraud detection (where 99% of the data is not fraud)
• Therefore, we need a metric that takes the different types of misclassification into account (actual positive predicted as negative, or actual negative predicted as positive)
Accuracy

Summary

• Accuracy is the ratio of the number of correct classifications to the total number of classifications
• Accuracy can give a misleading picture of the quality of the model when the dataset is imbalanced
Confusion Matrix
Confusion Matrix

What is a confusion matrix ?

The confusion matrix (also called the error matrix) summarizes the performance of a classification algorithm by tabulating predicted labels against actual labels

Reference: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow – Géron, 3rd Edition
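A minimal sketch of computing a binary confusion matrix with scikit-learn (the label vectors are toy values of my own choosing):

from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1, 1, 0, 1, 0]   # actual labels
y_pred = [0, 1, 1, 1, 0, 0, 1, 0]   # model predictions

# Rows are actual classes, columns are predicted classes:
# [[TN, FP],
#  [FN, TP]]
print(confusion_matrix(y_true, y_pred))   # [[3 1], [1 3]] here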
Confusion Matrix

Precision
• Assesses the accuracy of positive predictions made by a model
• Ratio of true positive predictions to the total number of positive
predictions (both true positives and false positives)
• Gauges the proportion of correctly predicted positive instances out of all
instances the model predicted as positive
• Valuable for evaluating the model's capability to avoid making incorrect
positive predictions and to minimize false positives

Precision = TP / (TP + FP)
Confusion Matrix

Recall
• Ability of a model to correctly identify all relevant instances in the dataset
• Ratio of true positive predictions to the total number of actual positive
instances
• Quantifies the model's effectiveness in capturing and "recalling" instances of a
particular class, thereby providing insight into its ability to minimize false
negatives (missed instances)

Recall = TP / (TP + FN)
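Continuing the toy confusion-matrix example above, the two formulas can be checked directly:

from sklearn.metrics import precision_score, recall_score

y_true = [0, 0, 1, 1, 1, 0, 1, 0]
y_pred = [0, 1, 1, 1, 0, 0, 1, 0]

print(precision_score(y_true, y_pred))  # TP / (TP + FP) = 3 / 4 = 0.75
print(recall_score(y_true, y_pred))     # TP / (TP + FN) = 3 / 4 = 0.75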
Confusion Matrix

Understanding Precision and Recall

• Precision can be made 100% by making just a single positive prediction and ensuring it is correct (since FP = 0, precision will be 1)
• Recall is also known as sensitivity or the True Positive Rate
• Recall can be increased by reducing the number of False Negatives (i.e., mistakenly classifying a positive instance as negative)
Confusion Matrix

Understanding Precision and Recall – Contd.


Scenario:

• An operator at a radar screen identifies enemy missiles as dots. However, occasionally, the radar may also display dots for flocks of birds or other obstacles.

High-precision operator:
• Will mark dots on the screen as missiles cautiously to avoid false positives, ensuring only actual missiles are selected.
• Would have chosen very few dots as missiles.

High-recall operator:
• Will choose most dots as missiles to avoid missing actual missiles (false negatives).
• Would have selected numerous dots as missiles.
Confusion Matrix

Understanding Precision and Recall – Contd.


Need for high precision

• Instances where reducing false positives is essential include scenarios like ensuring the
safety of videos for children.
• While allowing a few false negatives (labeling a child-safe video as unsafe) might be acceptable, the priority is to prevent any non-child-safe video from being falsely marked as safe (a false positive).
Need for high recall
• Instances where reducing false negatives is crucial include situations such as identifying
shoplifters in a high-end jewellery store's surveillance video.
• In this case, the priority is to minimize instances where shoplifters are not detected (false
negatives).
• This might involve adjusting the model to accept more false positives (misidentifying non-shoplifters as shoplifters) as a trade-off.
Confusion Matrix

F1 Score

• The F1 score is a single metric that combines precision and recall, offering a balanced evaluation of a model's binary classification performance.
• Harmonic Mean: It is calculated as the harmonic mean of precision and recall, providing a measure that considers both false positives and false negatives:

F1 = 2 × (Precision × Recall) / (Precision + Recall)
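A quick check of the harmonic-mean formula against scikit-learn, reusing the same toy labels as the earlier sketches:

from sklearn.metrics import f1_score, precision_score, recall_score

y_true = [0, 0, 1, 1, 1, 0, 1, 0]
y_pred = [0, 1, 1, 1, 0, 0, 1, 0]

p = precision_score(y_true, y_pred)
r = recall_score(y_true, y_pred)
print(2 * p * r / (p + r))       # harmonic mean computed by hand
print(f1_score(y_true, y_pred))  # matches scikit-learn's F1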
Confusion Matrix

F1 Score

• Range:
• The F1 score ranges between 0 and 1, with higher values
indicating better model performance.
• Class Imbalance:
• Useful when dealing with imbalanced datasets, where one
class is more prevalent than the other.
• Balancing Precision and Recall:
• Helps find a middle ground between identifying true positive
instances (precision) and capturing all relevant positive
instances (recall).
Test your understanding
1. In a binary classification problem, which of the following is NOT a component of a confusion
matrix? a) True Positives b) False Negatives c) True Negatives d) Actual Positives

2. True or False: Accuracy is always the best metric to evaluate a classification model, regardless
of class imbalance.

3. Which metric is defined as the ratio of correctly predicted positive samples to the total
predicted positive samples? a) Recall b) Precision c) F1-score d) Specificity

4. What does the area under the ROC curve (AUC-ROC) represent? a) The model's ability to
distinguish between classes b) The model's accuracy c) The model's precision d) The model's
recall

5. In a binary classification problem, a model achieves the following results: True Positives = 80, False Positives = 20, False Negatives = 10, True Negatives = 90. Calculate the model's precision and recall.
Solutions

1. Answer: d) Actual Positives Explanation: A confusion matrix typically contains True Positives, True
Negatives, False Positives, and False Negatives. "Actual Positives" is a sum of True Positives and False
Negatives, not a direct component of the matrix.

2. Answer: False Explanation: Accuracy can be misleading in cases of class imbalance. Other metrics like
precision, recall, or F1-score may be more appropriate depending on the problem and class distribution.

3. Answer: b) Precision Explanation: Precision is defined as TP / (TP + FP), where TP is True Positives and FP
is False Positives. It measures the accuracy of positive predictions.

4. Answer: a) The model's ability to distinguish between classes Explanation: AUC-ROC represents the
model's ability to distinguish between positive and negative classes across various thresholds. A higher
AUC indicates better discrimination.

5. Answer: Precision = TP / (TP + FP) = 80 / (80 + 20) = 0.8 or 80% Recall = TP / (TP + FN) = 80 / (80 + 10) =
0.889 or 88.9%
Multiclass Classification
Multiclass Classification

How can we build Multi-class classifier models

• We have learnt how binary classifiers classify data into either positive or negative class.
• However, not all binary classifiers, such as SVM and SGD classifiers, are inherently equipped to handle multi-class classification problems
• We can combine an ensemble of binary classification models to perform multi-class classification
• There are two main strategies:
• One vs All (One vs Rest, OVR)
• One vs One (OVO)
Multiclass Classification

One vs All

• For an N-class classification problem, we build N binary classification models
• In this case, the classification models will be
• Green vs [Red,Blue]
• Blue vs [Red,Green]
• Red vs [Blue,Green]

Image Reference : https://www.cc.gatech.edu/classes/AY2016/cs4476_fall/results/proj4/html/jnanda3/index.html


Multiclass Classification

One vs All – Contd.

• The final classification is based on the score (e.g., probability or decision-function value) that each binary classifier assigns to its own class
• For example:
• Green vs [Red,Blue] : 0.8
• Blue vs [Red,Green] : 0.4
• Red vs [Blue,Green] : -0.3
• Based on the scores, we classify the instance as Green

Image Reference : https://www.researchgate.net/figure/The-considered-one-vs-all-multiclass-classification-approach_fig2_257018675


Multiclass Classification

One vs All – Continued

• To use any binary classifier with the OVR strategy, we can make use of the OneVsRestClassifier from the sklearn module (see the sketch below)
• This method may perform poorly on imbalanced datasets, since we will be comparing one class against all the other classes combined
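A minimal sketch of the OVR strategy with OneVsRestClassifier (the base estimator and toy data are my choices):

from sklearn.datasets import make_classification
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import LinearSVC

# Three-class toy problem
X, y = make_classification(n_samples=300, n_classes=3, n_informative=4,
                           random_state=0)

ovr = OneVsRestClassifier(LinearSVC()).fit(X, y)
print(len(ovr.estimators_))   # N = 3 binary classifiers, one per class
print(ovr.predict(X[:5]))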
Multiclass Classification

One vs One

• We build an ensemble of binary classification models, where we compare only two classes at a time
• To compare each class with every other class, the number of classifiers required is

Total Classifiers = N × (N − 1) / 2

Image Reference : https://www.sciencedirect.com/science/article/abs/pii/S0950705116301459


Multiclass Classification

One vs One

• For this example, where N = 3, we get a total of 3 classifiers:
• Green vs Blue
• Green vs Red
• Blue vs Red
• Each binary classifier predicts one class label; the majority vote among these labels is taken as the final class
• Models like SVM default to the OVO strategy, since SVMs scale poorly with training-set size and it is easier to train many smaller models than a few bigger ones (see the sketch below)
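A matching sketch of the OVO strategy with OneVsOneClassifier (same toy setup as the OVR sketch above):

from sklearn.datasets import make_classification
from sklearn.multiclass import OneVsOneClassifier
from sklearn.svm import LinearSVC

X, y = make_classification(n_samples=300, n_classes=3, n_informative=4,
                           random_state=0)

ovo = OneVsOneClassifier(LinearSVC()).fit(X, y)
print(len(ovo.estimators_))   # N*(N-1)/2 = 3 pairwise classifiers
print(ovo.predict(X[:5]))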
Multiclass Classification

Summary

• One vs One builds N*(N-1)/2 classifiers, while One vs All uses N classifiers for a multi-class classification problem with N classes
• For large datasets with multiple classes, OVR can be challenging, as it may lead to an imbalanced dataset for each binary classifier
• OVO deals with a comparatively smaller dataset per model (since only 2 classes are involved); however, the number of models is much higher than in OVR
