0% found this document useful (0 votes)
26 views8 pages

ML Unit 3 Part 2

Uploaded by

jkdprince3
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
26 views8 pages

ML Unit 3 Part 2

Uploaded by

jkdprince3
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 8

CLASSIFICATION

MODEL
Introduction
● When we are trying to predict a categorical or nominal variable, then
problem is known as a classification problem.
● A classification problem is one where the output variable is a
category such as ‘red’ or ‘blue’ or ‘malignant tumour’ or ‘benign
tumour’, etc
● Where as when we are trying to predict a numerical variable such as
‘price’, ‘weight’, etc. the problem falls under the category of
regression.

Classification Problem: the whole problem centres around assigning a


label or category or class to a test data on the basis of the label or
category or class information that is imparted by the training data.
● And objective is to assign a class label, we call this type of problem
as a classification problem
● A classification model is obtained from the labelled training data by a
classifier algorithm.
● On the basis of the model, a class label (e.g. ‘Intel’ as in the case of the
test data referred in Fig) is assigned to the test data.
● Classification is a type of supervised learning where a target feature,
which is of categorical type, is predicted for test data on the basis of the
information imparted by the training data.
● The target categorical feature is known as class .
● Some typical classification problems include the following:
 Image classification
 Disease prediction
 Win–loss prediction of games
 Prediction of natural calamity such as earthquake, flood, etc.
 Handwriting recognition
CLASSIFICATION LEARNING STEPS
● First, there is a problem which is to be solved, and then, the required
data is evaluated and pre-processed based on the algorithm.
● Algorithm selection is a critical point in supervised learning.
● The result after iterative training rounds is a classifier for the problem
 Problem Identification: Identifying the problem is the first
step in the supervised learning model. The problem needs to be a
well-formed problem.
 Identification of Required Data: On the basis of the problem
identified above, the required data set that precisely represents the
identified problem needs to be identified/evaluated
● Data Pre-processing: This is related to the cleaning/transforming the
data set. This step ensures that all the unnecessary/irrelevant data
elements are removed.
● Definition of Training Data Set: Before starting the analysis, the user
should decide what kind of data set is to be used as a training set.
● Algorithm Selection: This involves determining the structure of the
learning function and the corresponding learning algorithm.
● Training: The learning algorithm identified in the previous
step is run on the gathered training set for further fine tuning.
● Evaluation with the Test Data Set: Training data is run on the
algorithm, and its performance is measured here.
COMMON CLASSIFICATION ALGORITHMS
● Let us now delve into some common classification algorithms.

1. k-Nearest Neighbour (kNN)


2. Decision tree
3. Random forest
4. Support Vector Machine (SVM)
5. Naïve Bayes classifier

You might also like