
Unit-III

Classification
Topics to Be Covered
• What is Classification?
• General approach to Classification
• K-Nearest Neighbor Algorithm
• Logistic regression
• Decision Trees
• Naive Bayesian
• Support Vector Machine (SVM)
What is Classification?
• Classification is a supervised machine learning method in which
the model tries to predict the correct label for a given input.

• In classification, the model is first trained on the training
data, then evaluated on test data, and finally used to make
predictions on unseen data.

For example, an algorithm can learn to predict whether

🡪 a given email is spam or ham (not spam)

🡪 a tumor is malignant or benign

Classification model

🡪 The target feature is of categorical type.

🡪 The target categorical feature is known as the class.
Classification Terminologies in ML
• Classifier – An algorithm that maps the input data to a specific category.

• Classification Model – The model draws conclusions from the input data it was
trained on and predicts the class or category for new data.

• Feature – An individual measurable property of the phenomenon being observed.

• Binary Classification – Classification with two outcomes, e.g.
True or False / 1 or 0 / Yes or No.

• Multi-Class Classification – Classification with more than two classes; in
multi-class classification each sample is assigned to one and only one label or
target.

• Multi-label Classification – Classification where each sample may be
assigned to a set of labels or targets (a short sketch follows this list).
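To make the three settings concrete, here is a minimal sketch of how the targets look in each case (the arrays, class encodings and example labels below are hypothetical illustrations, not data from this unit):

import numpy as np

# Binary classification: one label per sample, two possible values.
y_binary = np.array([0, 1, 1, 0])          # e.g. spam (1) vs. ham (0)

# Multi-class classification: one label per sample, more than two classes.
y_multiclass = np.array([0, 2, 1, 2])      # e.g. T-shirt sizes S=0, M=1, L=2

# Multi-label classification: each sample may carry several labels at once,
# usually encoded as a binary indicator matrix (samples x labels).
y_multilabel = np.array([[1, 0, 1],        # sample 0 has labels 0 and 2
                         [0, 1, 0],        # sample 1 has label 1 only
                         [1, 1, 1]])       # sample 2 has all three labels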
Binary Classification
Multi-Class Classification
Multi-label Classification
Classification Usecases
• Image classification
• Disease prediction
• Win–loss prediction of games
• Prediction of natural calamities such as earthquakes and floods
• Handwriting recognition
• Document Classification
• Spam Filters
Classification Model Steps in ML

1. Problem Identification
2. Identification of Required Data
3. Data Pre-processing
4. Definition of Training Data Set
5. Algorithm Selection
6. Training
7. Evaluation with the Test Data Set
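These steps map directly onto a typical workflow in code. Below is a minimal sketch using scikit-learn (its bundled Iris dataset stands in for the "required data"; all class and function names used are from scikit-learn's public API), covering pre-processing, training-set definition, algorithm selection, training and evaluation:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Steps 2-3: obtain the required data and prepare a scaler for pre-processing
X, y = load_iris(return_X_y=True)
scaler = StandardScaler()

# Step 4: define the training data set (hold out 30% of the data for testing)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# Steps 5-6: select an algorithm and train it
clf = KNeighborsClassifier(n_neighbors=5)
clf.fit(X_train, y_train)

# Step 7: evaluate with the test data set
print("Test accuracy:", accuracy_score(y_test, clf.predict(X_test)))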
Algorithms for Classification
• K-Nearest Neighbour
• Logistic Regression
• Decision Tree
• Support Vector Machine
• Naive Bayes
• Random Forest
K-Nearest Neighbor (KNN) Algorithm
• K-Nearest Neighbors is one of the simplest supervised machine
learning algorithms used for classification. It classifies a data point
based on its neighbors’ classifications: it stores all available cases and
classifies new cases based on feature similarity.

• It is a lazy learning algorithm that stores all instances of the
training data in an n-dimensional space.

• Classification is computed from a simple majority vote of the k nearest
neighbors of each point.

• To label a new point, it looks at the labeled points closest to that new
point, also known as its nearest neighbors. Those neighbors vote, and
whichever label most of the neighbors have becomes the label of the new
point. The “k” is the number of neighbors it checks.
KNN
K-Nearest Neighbor (KNN) Algorithm
• The distance function can be the Euclidean, Manhattan, Minkowski
or Hamming distance.

How to Choose the Factor ‘K’?

• The KNN algorithm is based on feature similarity. Selecting the right K
value is a process called parameter tuning, and it is important for
achieving higher accuracy.

• A common heuristic is k ≈ sqrt(n), where n is the total number of
training data points.

• An odd value of ‘k’ is preferred to avoid ties in the majority vote.

KNN works well when the data is labeled, noise-free and the dataset is small.
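A minimal sketch of the sqrt(n) heuristic with the odd-k adjustment (the helper function name is my own, not part of any library):

import math

def choose_k(n_samples: int) -> int:
    """Heuristic: k is about sqrt(n), forced to be odd to avoid tied votes."""
    k = max(1, round(math.sqrt(n_samples)))
    return k if k % 2 == 1 else k + 1

print(choose_k(25))   # 5
print(choose_k(100))  # 11 (sqrt(100) = 10, bumped to the next odd number)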
kNN algorithm
• Input: training data set, test data set (or data points), and the value of ‘k’ (i.e. the
number of nearest neighbours to be considered)

Steps:

• Step-1: Choose the K value.

• Step-2: Calculate similarity based on a distance function.

• Step-3: Find the K nearest neighbours and rank them by minimal distance.

• Step-4: Among these k neighbours, count the number of data points in
each category.

• Step-5: Assign the new data point to the category with the maximum
number of neighbours.
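The steps above can be implemented directly. A minimal from-scratch sketch in Python, using Euclidean distance and a simple majority vote (the function name and toy data are hypothetical, not from the slides):

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3):
    """Classify x_new by a majority vote among its k nearest training points."""
    # Step-2: Euclidean distance from x_new to every training point
    distances = np.sqrt(((X_train - x_new) ** 2).sum(axis=1))
    # Step-3: indices of the k nearest neighbours (smallest distances)
    nearest = np.argsort(distances)[:k]
    # Steps 4-5: count labels among the neighbours and return the majority
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]

# Toy example: two features per point, two classes
X_train = np.array([[1.0, 1.0], [1.5, 2.0], [5.0, 5.0], [6.0, 5.5]])
y_train = np.array(["A", "A", "B", "B"])
print(knn_predict(X_train, y_train, np.array([1.2, 1.5]), k=3))  # prints "A"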
How Does KNN work?
• Consider a dataset with two variables,
height (cm) and weight (kg), in which each
data point is classified as Normal or
Underweight.
How Does KNN work?
• Suppose we have the height, weight and T-shirt
size of some customers, and we need to
predict the T-shirt size of a new customer
given only their height and weight. The data,
including height, weight and T-shirt size, is
shown below.

If a customer has a height of 161 cm and a weight of
61 kg, what would his T-shirt size be?
How Does KNN work?
• Step 1: Calculate similarity based on a distance function.
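A minimal sketch of this step (the customer records below are hypothetical stand-ins for the table shown on the slide; the original table is not reproduced here):

import math

# Hypothetical (height cm, weight kg, T-shirt size) records
customers = [
    (158, 58, "M"), (160, 59, "M"), (160, 60, "M"),
    (163, 61, "L"), (165, 63, "L"), (168, 66, "L"),
]
new_customer = (161, 61)  # height and weight of the customer to classify

# Step 1: Euclidean distance from the new customer to every known customer
for height, weight, size in customers:
    d = math.sqrt((height - new_customer[0]) ** 2 + (weight - new_customer[1]) ** 2)
    print(f"({height} cm, {weight} kg, {size}) -> distance {d:.2f}")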
Another Problem
K-Nearest Neighbor (KNN) Algorithm
Applications
• Recommender Systems
• Document / Content Searching
Advantages
• Extremely simple algorithm – easy to understand
• Very effective in certain situations, e.g. for recommender system
design
• Training is very fast, since almost no computation is done during the
training phase
Disadvantages
• Does not build a model from the training data (lazy learner); all work
is deferred to prediction time
• Computation cost at prediction time is high, because the distance to
every training sample must be calculated.
Metrics to Evaluate ML Classification Algorithms
• True positives: the number of positive observations the model
correctly predicted as positive.

• False positives: the number of negative observations the model
incorrectly predicted as positive.

• True negatives: the number of negative observations the model
correctly predicted as negative.

• False negatives: the number of positive observations the model
incorrectly predicted as negative.
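From these four counts the usual classification metrics follow. A minimal sketch (the labels and predictions are hypothetical; the accuracy, precision, recall and F1 formulas are the standard definitions):

# Hypothetical true labels and model predictions (1 = positive, 0 = negative)
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))  # true positives
fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))  # false positives
tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))  # true negatives
fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))  # false negatives

accuracy  = (tp + tn) / (tp + tn + fp + fn)
precision = tp / (tp + fp)
recall    = tp / (tp + fn)
f1        = 2 * precision * recall / (precision + recall)
print(accuracy, precision, recall, f1)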
