Machine Learning
Classification is the task of sorting things into sub-categories, but done by a machine. If that doesn’t sound like much, imagine your computer being able to tell you apart from a stranger, a potato from a tomato, or an A grade from an F. Classification is a form of supervised machine learning in which we train a model on labeled data. In machine learning and statistics, classification is the problem of identifying which of a set of categories (sub-populations) a new observation belongs to, on the basis of a training set of data containing observations whose category membership is known.
Types of Classification
Binary Classification: In binary classification, the goal is to classify the input into one of two classes or categories. Example – on the basis of a person’s given health conditions, determine whether the person has a certain disease or not.
Multiclass Classification: In multiclass classification, the goal is to classify the input into one of three or more classes or categories. Example – on the basis of data about different species of flowers, determine which species an observation belongs to.
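The distinction between the two can be sketched with two of scikit-learn’s bundled datasets (assuming scikit-learn is installed): breast cancer is a binary problem (malignant vs. benign), while iris is multiclass (three flower species).

```python
# Minimal sketch: counting the distinct target labels tells us whether a
# dataset poses a binary or a multiclass classification problem.
from sklearn.datasets import load_breast_cancer, load_iris

binary = load_breast_cancer()  # two classes: malignant vs. benign
multi = load_iris()            # three classes: three iris species

print(len(set(binary.target)))  # 2 -> binary classification
print(len(set(multi.target)))   # 3 -> multiclass classification
```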
Types of classification algorithms
Linear Classifiers: Linear models create a linear decision boundary between classes. They
are simple and computationally efficient. Some of the linear classification models are as
follows:
Logistic Regression
Support Vector Machines with kernel = ‘linear’
Single-layer Perceptron
Stochastic Gradient Descent (SGD) Classifier
Non-linear Classifiers: Non-linear models create a non-linear decision boundary between classes, which lets them capture more complex relationships. Some of the non-linear classification models are as follows:
K-Nearest Neighbours
Kernel SVM
Naive Bayes
Decision Tree Classification
Ensemble learning classifiers:
Random Forests
AdaBoost
Bagging Classifier
Voting Classifier
ExtraTrees Classifier
Multi-layer Artificial Neural Networks
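As a concrete sketch of a linear classifier (assuming scikit-learn is installed), the example below fits Logistic Regression on a toy 2-D dataset; the learned weights and bias define the linear decision boundary w·x + b = 0.

```python
# Minimal sketch: a linear classifier on a toy two-cluster dataset.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy 2-D data: class 0 clustered near the origin, class 1 shifted away.
rng = np.random.RandomState(0)
X = np.vstack([rng.randn(50, 2), rng.randn(50, 2) + 3.0])
y = np.array([0] * 50 + [1] * 50)

clf = LogisticRegression().fit(X, y)
print("weights:", clf.coef_, "bias:", clf.intercept_)  # the linear boundary
print("training accuracy:", clf.score(X, y))
```

Because the two clusters are (mostly) linearly separable, a single straight line already classifies almost every point correctly.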
In machine learning, classification learners can also be classified as either “lazy” or “eager”
learners.
Lazy Learners: Lazy learners, also known as instance-based learners, do not build a model during the training phase. Instead, they simply store the training data and use it to classify new instances at prediction time. Training is therefore very fast, but prediction can be slow, and lazy learners tend to be less effective in high-dimensional spaces or when the number of training instances is large. Examples of lazy learners include k-nearest neighbors and case-based reasoning.
Eager Learners: Eager learners, also known as model-based learners, learn a model from the training data during the training phase and use this model to classify new instances at prediction time. Training takes longer, but prediction is fast, and eager learners tend to be more effective in high-dimensional spaces and on large training datasets. Examples of eager learners include decision trees, random forests, and support vector machines.
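The contrast can be sketched with scikit-learn (assumed installed): k-NN is lazy, so “fitting” it amounts to storing the data, while a decision tree is eager and builds its model up front.

```python
# Minimal sketch: a lazy learner (k-NN) vs. an eager learner (decision tree).
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# k-NN "training" just stores X and y; all work happens at prediction time.
lazy = KNeighborsClassifier(n_neighbors=5).fit(X, y)

# The decision tree does its work now, building a model used later for fast predictions.
eager = DecisionTreeClassifier(random_state=0).fit(X, y)

print("k-NN training accuracy:", lazy.score(X, y))
print("tree training accuracy:", eager.score(X, y))
```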
Classification algorithms are widely used in many real-world applications across various domains. Some algorithms, however, are designed specifically for binary classification problems. Examples include:
Logistic Regression
Perceptron
Support Vector Machines
As such, they cannot be used for multi-class classification tasks, at least not directly. Instead, heuristic methods can be used to split a multi-class classification problem into multiple binary classification datasets and to train one binary classification model on each. Two such heuristics are:
One-vs-Rest (OvR)
One-vs-One (OvO)
One-vs-rest (OvR for short, also referred to as One-vs-All or OvA) is a heuristic method for
using binary classification algorithms for multi-class classification.
It involves splitting the multi-class dataset into multiple binary classification problems. A
binary classifier is then trained on each binary classification problem and predictions are
made using the model that is the most confident.
For example, given a multi-class classification problem with examples for each of the classes ‘red,’ ‘blue,’ and ‘green,’ the dataset could be divided into three binary classification datasets as follows:
Binary classification problem 1: red vs. [blue, green]
Binary classification problem 2: blue vs. [red, green]
Binary classification problem 3: green vs. [red, blue]
A possible downside of this approach is that it requires one model to be created for each
class. For example, three classes requires three models. This could be an issue for large
datasets (e.g. millions of rows), slow models (e.g. neural networks), or very large numbers of
classes (e.g. hundreds of classes).
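The one-vs-rest strategy can be sketched with scikit-learn’s `OneVsRestClassifier` wrapper (assuming scikit-learn is installed): it wraps a binary classifier and fits one copy of it per class.

```python
# Minimal sketch: one-vs-rest on a three-class problem.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

X, y = load_iris(return_X_y=True)  # three classes

# The wrapper trains one binary model per class: class i vs. everything else.
ovr = OneVsRestClassifier(LogisticRegression(max_iter=1000)).fit(X, y)
print("underlying binary models:", len(ovr.estimators_))  # one per class
```

At prediction time the wrapper scores each underlying binary model and reports the class whose model is most confident, matching the description above.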
One-vs-One (OvO for short) is another heuristic method for using binary classification
algorithms for multi-class classification.
For example, consider a multi-class classification problem with four classes: ‘red,’ ‘blue,’ ‘green,’ and ‘yellow.’ This could be divided into six binary classification datasets as follows:
Binary classification problem 1: red vs. blue
Binary classification problem 2: red vs. green
Binary classification problem 3: red vs. yellow
Binary classification problem 4: blue vs. green
Binary classification problem 5: blue vs. yellow
Binary classification problem 6: green vs. yellow
This is significantly more datasets, and in turn, models than the one-vs-rest strategy
described in the previous section.
The formula for calculating the number of binary datasets, and in turn models, is as follows:
(NumClasses * (NumClasses – 1)) / 2
We can see that for four classes, this gives us the expected value of six binary classification problems:
(4 * (4 – 1)) / 2
= (4 * 3) / 2
= 12 / 2
= 6
At prediction time, each binary classification model predicts one class label, and the class that receives the most predictions or votes is the one output by the one-vs-one strategy.
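The pair-count formula and the voting step can both be sketched in plain Python. The six pairwise votes below are hypothetical, chosen only to illustrate the majority vote on the four-class example above.

```python
# Minimal sketch of one-vs-one counting and voting, using only the stdlib.
from collections import Counter

def n_pairs(n_classes):
    # Number of one-vs-one binary problems: n * (n - 1) / 2.
    return n_classes * (n_classes - 1) // 2

print(n_pairs(4))  # 6, matching the worked example above

# Hypothetical predictions for one test example: each of the six pairwise
# models (red/blue, red/green, red/yellow, blue/green, blue/yellow,
# green/yellow) votes for one of the two classes it was trained on.
pairwise_votes = ["red", "red", "yellow", "blue", "yellow", "yellow"]

winner, count = Counter(pairwise_votes).most_common(1)[0]
print(winner, count)  # 'yellow' wins with 3 of 6 votes
```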