0% found this document useful (0 votes)

61 views62 pages

K-Means Clustering and K-Nearest Neighbors Algorithm

Uploaded by

Griffithe Here

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

61 views62 pages

K-Means Clustering and K-Nearest Neighbors Algorithm

Uploaded by

Griffithe Here

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 62

K-Means Clustering &

K-Nearest Neighbors
Algorithm
K-Means
Clustering
What is K-Means Clustering
Algorithm?
K-means clustering is a method of vector quantization. K-
means clustering aims to partition n observations into k
clusters in which each observation belongs to the cluster
with the nearest mean, serving as a prototype of the
cluster.

3
What is K-Means Clustering
Algorithm?
It is basically a type of unsupervised learning method. An
unsupervised learning method is a method in which we draw
references from datasets consisting of input data without
labeled responses. Generally, it is used as a process to find
meaningful structure, explanatory underlying processes,
generative features, and groupings inherent in a set of
examples.

4
What is
Clustering?
Clustering is the task of dividing the population
or data points into a number of groups such that
data points in the same groups are more similar
to other data points in the same group and
dissimilar to the data points in other groups. It
is basically a collection of objects on the basis
of similarity and dissimilarity between them.

5
For example
The data points in the graph below clustered together can be classified into one
single group. We can distinguish the clusters, and we can identify that there are 3
clusters in the below picture.

6
It is not necessary for clusters to be a spherical.
Such as :

7
Why Clustering?
Clustering is very much important as it determines the
intrinsic grouping among the unlabeled data present. There
are no criteria for a good clustering. It depends on the user,
what is the criteria they may use which satisfy their need.

8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
Clustering Methods:
1. Density-Based Methods :

These methods consider the clusters as the dense region

having some similarity and different from the lower
dense region of the space. These methods have good
accuracy and ability to merge two clusters.

49
Clustering Methods:
2. Hierarchical Based Methods :

The clusters formed in this method forms a tree type

structure based on the hierarchy. New clusters are
formed using the previously formed one.
It is divided into two category:

 Agglomerative (bottom up approach)

 Divisive (top down approach) .

50
Clustering Methods:
3. Partitioning Methods :

These methods partition the objects into k clusters and

each partition forms one cluster. This method is used to
optimize an objective criterion similarity function such
as when the distance is a major parameter.

51
Clustering Methods:
4. Grid-based Methods :

In this method the data space are formulated into a

finite number of cells that form a grid-like structure. All
the clustering operation done on these grids are fast and
independent of the number of data objects.

52
Applications of Clustering in
different fields:
1. Marketing: It can be used to characterize
& discover customer segments for
marketing purposes.
2. Biology: It can be used for classification
among different species of plants and
animals.
3. Libraries: It is used in clustering different
books on the basis of topics and
information.
53
Applications of Clustering in
different fields:
4. Insurance: It is used to acknowledge the
customers, their policies and identifying the
frauds.
5. City Planning: It is used to make groups of
houses and to study their values based on
their geographical locations and other
factors present.
6. Earthquake studies: By learning the
earthquake affected areas we can determine
the dangerous zones.
54
Pseudocode of K-Means
Clustering:
 Initialize k means with random values.
 For a given number of iterations:
 Iterate through items:

 Find the mean closest to the item

 Assign item to mean
 Update mean

55
K-Nearest
Neighbors
What is K-Nearest Neighbors
Algorithm?
In pattern recognition, the k-nearest neighbors algorithm
(k-NN) is a non-parametric method used for classification
and regression. In both cases, the input consists of the k
closest training examples in the feature space. The output
depends on whether k-NN is used for classification or
regression:

57
K-NN Approach
 k-NN assumes that all instances are points
in some n-dimensional space and defines
neighbors in terms of distance (usually
Euclidean in R-space).
 k is the number of neighbors considered.

58
K-NN Classification
In k-NN classification, the output is a class
membership. An object is classified by a plurality vote
of its neighbors, with the object being assigned to the
class most common among its k nearest neighbors (k is
a positive integer, typically small). If k = 1, then the
object is simply assigned to the class of that single
nearest neighbor.

59
Example of k-NN classification. The test sample (green dot) should be classified
either to blue squares or to red triangles. If k = 3 (solid line circle) it is assigned to the
red triangles because there are 2 triangles and only 1 square inside the inner circle. If
k = 5 (dashed line circle) it is assigned to the blue squares (3 squares vs. 2 triangles
inside the outer circle).

60
K-NN Regression
In k-NN regression, the output is the property value for
the object. This value is the average of the values
of k nearest neighbors.

61
Thanks!
Any Questions?
You can find us at:
@OurHome

NN Unit - 1
No ratings yet
NN Unit - 1
27 pages
7.introduction To Clustering
No ratings yet
7.introduction To Clustering
11 pages
Distance-Based Methods - KNN
No ratings yet
Distance-Based Methods - KNN
8 pages
Unit 4
No ratings yet
Unit 4
29 pages
Chapter 3 Unsupervised Learning
No ratings yet
Chapter 3 Unsupervised Learning
45 pages
9.54 Class 13: Unsupervised Learning
No ratings yet
9.54 Class 13: Unsupervised Learning
54 pages
UNIT-6 K Means Clustering
No ratings yet
UNIT-6 K Means Clustering
12 pages
Cluster
No ratings yet
Cluster
50 pages
DM Lecture 06
No ratings yet
DM Lecture 06
32 pages
IS4242 W8 Similarity, NN and Clusters
No ratings yet
IS4242 W8 Similarity, NN and Clusters
29 pages
Clustering and Pattern Recognition Unit 5
No ratings yet
Clustering and Pattern Recognition Unit 5
21 pages
Lec09 Clustering
No ratings yet
Lec09 Clustering
27 pages
UNIT 3 ML Distance Based Learning
No ratings yet
UNIT 3 ML Distance Based Learning
19 pages
ML - Unit - 2
No ratings yet
ML - Unit - 2
13 pages
Clustering
No ratings yet
Clustering
75 pages
Week 10
No ratings yet
Week 10
41 pages
K Means Algorithm
No ratings yet
K Means Algorithm
4 pages
Unit4 Datascience
No ratings yet
Unit4 Datascience
43 pages
Clustering in Machine Learning
No ratings yet
Clustering in Machine Learning
20 pages
Clustering Algorithm: An Unsupervised Learning Approach
No ratings yet
Clustering Algorithm: An Unsupervised Learning Approach
23 pages
KNN VS Kmeans
No ratings yet
KNN VS Kmeans
3 pages
LSTM PPT
No ratings yet
LSTM PPT
22 pages
ML Unit-4-1
No ratings yet
ML Unit-4-1
39 pages
AD3461 ML Lab Manual
No ratings yet
AD3461 ML Lab Manual
32 pages
Chapter 5. Clustering Algorithms-Stud
No ratings yet
Chapter 5. Clustering Algorithms-Stud
44 pages
UCS 401 Unit-Lll Lect 13 Distance Based Models Neighbours and Examples
No ratings yet
UCS 401 Unit-Lll Lect 13 Distance Based Models Neighbours and Examples
20 pages
FML Unit4
No ratings yet
FML Unit4
14 pages
ML CH 4
No ratings yet
ML CH 4
51 pages
Clustering Part1
No ratings yet
Clustering Part1
79 pages
Unit 4 Clustering - K-Means and Hierarchical
No ratings yet
Unit 4 Clustering - K-Means and Hierarchical
40 pages
Final ML Unit3 May24
No ratings yet
Final ML Unit3 May24
154 pages
Unit - 4 (ML)
No ratings yet
Unit - 4 (ML)
13 pages
ML Unit-4
No ratings yet
ML Unit-4
14 pages
ML UNIT 4 Sir
No ratings yet
ML UNIT 4 Sir
42 pages
U20cs604 Machine Learning Unit III
No ratings yet
U20cs604 Machine Learning Unit III
23 pages
PART2
No ratings yet
PART2
61 pages
Practical # 12
No ratings yet
Practical # 12
3 pages
Medical Imabmnge Analysis
No ratings yet
Medical Imabmnge Analysis
41 pages
Clustering
No ratings yet
Clustering
84 pages
K-NN Algorithm and Clustering Analysis
No ratings yet
K-NN Algorithm and Clustering Analysis
93 pages
Cluster Evaluation Techniques: Atds Assignment
No ratings yet
Cluster Evaluation Techniques: Atds Assignment
4 pages
DW & DM Unit 4 Notes
No ratings yet
DW & DM Unit 4 Notes
40 pages
DSV - Unit 3 - Data Analysis in Depth
No ratings yet
DSV - Unit 3 - Data Analysis in Depth
53 pages
Clustering
No ratings yet
Clustering
34 pages
Unit 4
No ratings yet
Unit 4
74 pages
ML Unit-5
No ratings yet
ML Unit-5
30 pages
ML Unit-4 Final 2024-25
No ratings yet
ML Unit-4 Final 2024-25
28 pages
Unsupervised Learning 1
No ratings yet
Unsupervised Learning 1
40 pages
Unit 4
No ratings yet
Unit 4
40 pages
K Mean Clustering1
No ratings yet
K Mean Clustering1
23 pages
DMW Unit 5
No ratings yet
DMW Unit 5
10 pages
Clustering Algorithm
No ratings yet
Clustering Algorithm
17 pages
K Means Clustering Lecture
No ratings yet
K Means Clustering Lecture
32 pages
Graph Partitioning Advance Clustering Technique
No ratings yet
Graph Partitioning Advance Clustering Technique
14 pages
ML Unit III
No ratings yet
ML Unit III
82 pages
Unit-4 ML
No ratings yet
Unit-4 ML
16 pages
Unit 5
No ratings yet
Unit 5
5 pages
Lecture 3
No ratings yet
Lecture 3
32 pages
Lec 05 Unsupervised-Kmeans
No ratings yet
Lec 05 Unsupervised-Kmeans
50 pages
Clustering Explanation
No ratings yet
Clustering Explanation
8 pages
Clustering and K-Means Algorithm
No ratings yet
Clustering and K-Means Algorithm
81 pages
Unit 4
No ratings yet
Unit 4
125 pages
Agglomerative Hierarchical Clustering
No ratings yet
Agglomerative Hierarchical Clustering
22 pages
Multiple-Layer Networks Backpropagation Algorithms
No ratings yet
Multiple-Layer Networks Backpropagation Algorithms
46 pages
Artificial Intelligence Fundamentals Midterm Q1
No ratings yet
Artificial Intelligence Fundamentals Midterm Q1
4 pages
H2o Prot
No ratings yet
H2o Prot
359 pages
Prashanth2022 Article HandwrittenDevanagariCharacter
No ratings yet
Prashanth2022 Article HandwrittenDevanagariCharacter
30 pages
ANN Backpropagation Algorithm
No ratings yet
ANN Backpropagation Algorithm
4 pages
21CS743
100% (1)
21CS743
1 page
NN Matlab - Examples
No ratings yet
NN Matlab - Examples
14 pages
Pretrained Convolutional Neural Network - MATLAB & Simulink - MathWorks India
No ratings yet
Pretrained Convolutional Neural Network - MATLAB & Simulink - MathWorks India
3 pages
L13 Intro-Cnn Slides
No ratings yet
L13 Intro-Cnn Slides
65 pages
CVDL
No ratings yet
CVDL
3 pages
Chapter 05 - Sharda 11e Full Accessible PPT 05
No ratings yet
Chapter 05 - Sharda 11e Full Accessible PPT 05
31 pages
Machine Learning
No ratings yet
Machine Learning
31 pages
Unit 5 - Cluster Analysis
No ratings yet
Unit 5 - Cluster Analysis
14 pages
Hands On Machine Learning With Scikit Learn and TensorFlow Techniques and Tools to Build Learning Machines 1st Edition by AurÃ©lien GÃ©ron 9352135210 9789352135219 - Read the ebook online or download it for the best experience
100% (8)
Hands On Machine Learning With Scikit Learn and TensorFlow Techniques and Tools to Build Learning Machines 1st Edition by AurÃ©lien GÃ©ron 9352135210 9789352135219 - Read the ebook online or download it for the best experience
85 pages
ML Unit 4
No ratings yet
ML Unit 4
19 pages
Implementasi Data Mining Clustering Tingkat Kepuasan Konsumen Terhadap Pelayanan Go-Jek
No ratings yet
Implementasi Data Mining Clustering Tingkat Kepuasan Konsumen Terhadap Pelayanan Go-Jek
7 pages
Comparative Study On Spoken Language Identification Based On Deep Learning
No ratings yet
Comparative Study On Spoken Language Identification Based On Deep Learning
5 pages
AI Anomaly Detection in Network Traffic
No ratings yet
AI Anomaly Detection in Network Traffic
17 pages
Alex Net
No ratings yet
Alex Net
11 pages
2017 - Lecture 5 - Smaller Network - CNN - 2 (Ming Li) (10 Slides)
No ratings yet
2017 - Lecture 5 - Smaller Network - CNN - 2 (Ming Li) (10 Slides)
10 pages
Lesson 9
No ratings yet
Lesson 9
15 pages
A Text Classification Model Based On GCN and BiGRU Fusion
No ratings yet
A Text Classification Model Based On GCN and BiGRU Fusion
5 pages
Introduction To Neural Networks 67103 - 2019 Exam B
No ratings yet
Introduction To Neural Networks 67103 - 2019 Exam B
2 pages
DWDM Externallab2022for Student
No ratings yet
DWDM Externallab2022for Student
3 pages
ML 2022 Sheet 10
No ratings yet
ML 2022 Sheet 10
1 page
A Comparison of Document Clustering Techniques
No ratings yet
A Comparison of Document Clustering Techniques
3 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet

K-Means Clustering and K-Nearest Neighbors Algorithm

Uploaded by

K-Means Clustering and K-Nearest Neighbors Algorithm

Uploaded by

K-Means Clustering &

These methods consider the clusters as the dense region

The clusters formed in this method forms a tree type

 Agglomerative (bottom up approach)

These methods partition the objects into k clusters and

In this method the data space are formulated into a

 Find the mean closest to the item

You might also like