
K-Nearest Neighbour Classification using Python (Lazy Learner)

Let's read it...

• Recently, I read an article describing a new type of dining experience. Patrons are served in a completely darkened restaurant by waiters who move carefully around memorized routes using only their sense of touch and sound.
• The allure of these establishments is rooted in the
idea that depriving oneself of visual sensory input
will enhance the sense of taste and smell, and foods
will be experienced in new and exciting ways. Each
bite is said to be a small adventure in which the
diner discovers the flavors the chef has prepared.
What sort of Machine Learning?

• This dining experience embodies an idea that can be used for machine learning, as does another maxim involving poultry: "birds of a feather flock together."
• In other words, things that are alike are likely to
have properties that are alike.
• We can use this principle to classify data by
placing it in the category with the most similar,
or "nearest" neighbors.
Nearest Neighbor Classification

• In a single sentence, nearest neighbor classifiers are defined by their characteristic of classifying unlabeled examples by assigning them the class of the most similar labeled examples. Despite the simplicity of this idea, nearest neighbor methods are extremely powerful. They have been used successfully for:
– Computer vision applications, including optical character recognition and facial recognition in both still images and video
– Predicting whether a person will enjoy a movie that he or she has been recommended (as in the Netflix challenge)
– Identifying patterns in genetic data, for use in detecting specific proteins or diseases
The kNN Algorithm

• The kNN algorithm begins with a training dataset made up of examples that are classified into several categories, as labeled by a nominal variable.
• Assume that we have a test dataset containing unlabeled examples that otherwise have the same features as the training data.
• For each record in the test dataset, kNN identifies k records in the training data that are the "nearest" in similarity, where k is an integer specified in advance.
• The unlabeled test instance is assigned the class of the majority of the k nearest neighbors.
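As a minimal from-scratch sketch of the procedure just described (not code from these slides; the names knn_predict, train_X, train_y and query are illustrative), the following Python function finds the k closest training examples by Euclidean distance and returns the majority class among them:

    from collections import Counter
    import math

    def knn_predict(train_X, train_y, query, k=3):
        # Distance from the query to every labeled training example
        distances = []
        for features, label in zip(train_X, train_y):
            d = math.sqrt(sum((f - q) ** 2 for f, q in zip(features, query)))
            distances.append((d, label))
        # Keep the k closest examples and take a majority vote on their labels
        distances.sort(key=lambda pair: pair[0])
        nearest_labels = [label for _, label in distances[:k]]
        return Counter(nearest_labels).most_common(1)[0][0]

    # Toy usage: the query point sits near the two 'a' examples
    print(knn_predict([[1, 1], [2, 2], [8, 8]], ['a', 'a', 'b'], [1.5, 1.5], k=3))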
Example: (figure slides omitted; they plot food ingredients by sweetness and crunchiness and show a new tomato to be classified)
Calculating Distance

• Locating the tomato's nearest neighbors requires a distance function, or a formula that measures the similarity between two instances.
• There are many different ways to calculate
distance.
• Traditionally, the kNN algorithm uses Euclidean
distance, which is the distance one would measure
if you could use a ruler to connect two points,
illustrated in the previous figure by the dotted
lines connecting the tomato to its neighbors.
Distance

• Euclidean distance is specified by the following formula, where p and q are the examples to be compared, each having n features. The term p1 refers to the value of the first feature of example p, while q1 refers to the value of the first feature of example q:

    dist(p, q) = sqrt((p1 - q1)^2 + (p2 - q2)^2 + ... + (pn - qn)^2)

• The distance formula involves comparing the values of each feature. For example, to calculate the distance between the tomato (sweetness = 6, crunchiness = 4) and the green bean (sweetness = 3, crunchiness = 7), we can use the formula as follows:

    dist(tomato, green bean) = sqrt((6 - 3)^2 + (4 - 7)^2) = sqrt(9 + 9) = sqrt(18) ≈ 4.2
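To double-check this arithmetic in code, here is a small NumPy snippet (not part of the original slides; the feature values are the ones given above):

    import numpy as np

    tomato = np.array([6, 4])       # (sweetness, crunchiness)
    green_bean = np.array([3, 7])   # (sweetness, crunchiness)

    # Euclidean distance: square root of the summed squared feature differences
    distance = np.sqrt(np.sum((tomato - green_bean) ** 2))
    print(distance)                 # about 4.24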
Distance

(Figure omitted: a comparison of Manhattan distance and Euclidean distance between two points.)
Closest Neighbors (figure omitted)
Choosing appropriate k

• Deciding how many neighbors to use for kNN determines how well the model will generalize to future data.
• The balance between overfitting and underfitting the training data is a problem known as the bias-variance tradeoff.
• Choosing a large k reduces the impact of variance caused by noisy data, but can bias the learner such that it runs the risk of ignoring small, but important patterns.
Choosing appropriate k

• In practice, choosing k depends on the difficulty of the concept to be learned and the number of records in the training data.
• Typically, k is set somewhere between 3 and 10. One common practice is to set k equal to the square root of the number of training examples.
• In the classifier, we might set k = 4, because there were 15 example ingredients in the training data and the square root of 15 is 3.87.
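The square-root rule is only a starting point; another common option is to compare several candidate values of k by cross-validation. A hedged sketch using scikit-learn, with the Iris dataset standing in for whatever labeled data is at hand:

    from sklearn.datasets import load_iris
    from sklearn.model_selection import cross_val_score
    from sklearn.neighbors import KNeighborsClassifier

    # Iris is used here only as a stand-in labeled dataset
    X, y = load_iris(return_X_y=True)

    # Try each k in the 3..10 range mentioned above and report mean CV accuracy
    for k in range(3, 11):
        knn = KNeighborsClassifier(n_neighbors=k)
        score = cross_val_score(knn, X, y, cv=5).mean()
        print(f"k={k}: accuracy={score:.3f}")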
Python Packages needed: KNN

• pandas
– Data Analytics
• numpy
– Numerical Computing
• matplotlib.pyplot
– Plotting graphs
• sklearn
– KNN Classes
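Assuming these packages are installed, a typical import block for a KNN script might look as follows (the aliases pd, np and plt are conventions, not requirements):

    import pandas as pd                    # data analytics (tabular data handling)
    import numpy as np                     # numerical computing
    import matplotlib.pyplot as plt        # plotting graphs
    from sklearn.neighbors import KNeighborsClassifier   # KNN classifier class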
Sample Application
Problem Statement:
KNN – Classification : Dataset

Q(X=6, Y=6), Class = ?
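The dataset table from this slide is not reproduced here, so the sketch below uses made-up placeholder points purely to show how the query Q(X=6, Y=6) could be classified with scikit-learn; substitute the actual X/Y values and class labels from the slide.

    import numpy as np
    from sklearn.neighbors import KNeighborsClassifier

    # Placeholder training data (NOT the values from the slide's table)
    X_train = np.array([[1, 2], [2, 1], [3, 3], [6, 5], [7, 7], [8, 6]])
    y_train = np.array(['A', 'A', 'A', 'B', 'B', 'B'])

    knn = KNeighborsClassifier(n_neighbors=3)   # k = 3 nearest neighbours
    knn.fit(X_train, y_train)

    query = np.array([[6, 6]])                  # Q(X=6, Y=6)
    print(knn.predict(query))                   # predicted class of Q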
Useful resources

• www.pythonprogramminglanguage.com
• www.scikit-learn.org
• www.towardsdatascience.com
• www.medium.com
• www.analyticsvidhya.com
• www.kaggle.com
• www.stephacking.com
• www.github.com
Thank you
