K-Means Clustering

Uploaded by

97 Haseeb

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views14 pages

K-Means Clustering

Uploaded by

97 Haseeb

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 14

K-Means Clustering

Introduction
• K-Means Clustering is an Unsupervised Machine
Learning algorithm, which groups the unlabeled dataset
into different clusters. The article aims to explore the
fundamentals and working of k mean clustering along
with the implementation.
Use of Clustering
• The goal of clustering is to divide the population or set
of data points into a number of groups so that the data
points within each group are more comparable to one
another and different from the data points within the
other groups. It is essentially a grouping of things based
on how similar and different they are to one another.
Algorithm
• The algorithm works as follows:

1. First, we randomly initialize k points, called means or

cluster centroids.
2. We categorize each item to its closest mean, and we
update the mean’s coordinates, which are the
averages of the items categorized in that cluster so
far.
3. We repeat the process for a given number of iterations
and at the end, we have our clusters.
• Initialize k means with random values
• --> For a given number of iterations:
• --> Iterate through items:
• --> Find the mean closest to the item by calculating
• the Euclidean distance of the item with each of the
means
• --> Assign item to mean
• --> Update mean by shifting it to the average of the
items in that cluster
• The elbow method is a technique used to determine
the optimal number of clusters (k) in k-means
clustering. It evaluates how well the data points fit
within the clusters by analyzing the within-cluster
sum of squares (WCSS), also known as inertia.
• Key Steps in the Elbow Method:
1.Run k-means for different values of k:
1.Start with a range of cluster numbers (e.g., 1 to 10).
2.Compute the WCSS for each k.
• Create a plot with 𝑘 values on the x-axis and the WCSS
• Plot the results:

on the y-axis.
• Look for the "elbow point":
• The elbow point is where the WCSS starts decreasing at
a slower rate, resembling an elbow.It indicates the
optimal number of clusters because adding more
clusters beyond this point yields diminishing returns.
• Why it Works:
• At low k, clusters are large and have higher WCSS
because many points are far from their cluster center.
• As k increases, WCSS decreases since clusters become
smaller and tighter.
• Beyond the optimal k, WCSS reduction slows as clusters
start to overfit (splitting data unnecessarily).
• Limitations:
• The elbow point may not always be distinct, making the
method subjective.
• The method does not guarantee the best clustering
structure but provides a useful heuristic.
• What is the Silhouette Score?
• Measures how similar a data point is to its own cluster
(cohesion) compared to other clusters (separation).
• The score ranges from −1to 1:
• +1: Perfect clustering (points are close to their cluster and far
from others).
• 0: Overlapping clusters.
• −1: Poor clustering (points assigned to the wrong cluster)

K Means
No ratings yet
K Means
26 pages
EML %TH Module
No ratings yet
EML %TH Module
40 pages
K-MEANS CLUSTERING PPT Kpu
No ratings yet
K-MEANS CLUSTERING PPT Kpu
4 pages
Clustering Analysis
No ratings yet
Clustering Analysis
12 pages
Lecture 18 K Means Clustering
No ratings yet
Lecture 18 K Means Clustering
77 pages
K Clustering
No ratings yet
K Clustering
28 pages
Unit 4 Aiml
No ratings yet
Unit 4 Aiml
24 pages
66 Yash DM PR9
No ratings yet
66 Yash DM PR9
4 pages
ML Practical 4
No ratings yet
ML Practical 4
2 pages
Data Mining-4
No ratings yet
Data Mining-4
9 pages
Elbow Method For Optimal Cluster Number in K-Means
No ratings yet
Elbow Method For Optimal Cluster Number in K-Means
8 pages
K Means Clustering
No ratings yet
K Means Clustering
27 pages
Machine Learning Unit 4
No ratings yet
Machine Learning Unit 4
22 pages
Unit 4
No ratings yet
Unit 4
63 pages
K-Means Clustering
No ratings yet
K-Means Clustering
7 pages
ML Ch-5 Clustering, Dimensionality Reduction and Recommender System
No ratings yet
ML Ch-5 Clustering, Dimensionality Reduction and Recommender System
13 pages
AI Week 11
No ratings yet
AI Week 11
21 pages
Assignment 4 A
No ratings yet
Assignment 4 A
15 pages
ML Unit III
No ratings yet
ML Unit III
82 pages
K Means Clustering
No ratings yet
K Means Clustering
13 pages
K-Means Clustering
No ratings yet
K-Means Clustering
5 pages
Data Mining
No ratings yet
Data Mining
10 pages
Machine Learning
No ratings yet
Machine Learning
3 pages
IDS Unit-3 L2
No ratings yet
IDS Unit-3 L2
26 pages
K-Mean Clustering
No ratings yet
K-Mean Clustering
8 pages
Kmeans Clustering
No ratings yet
Kmeans Clustering
3 pages
CSC649 Lecture 3 Unsupervised ML - KMeansClustering
No ratings yet
CSC649 Lecture 3 Unsupervised ML - KMeansClustering
22 pages
Unit 4
No ratings yet
Unit 4
22 pages
K-Means Algorithm
No ratings yet
K-Means Algorithm
6 pages
K-Means Clustering
No ratings yet
K-Means Clustering
8 pages
K-Means Clustering
No ratings yet
K-Means Clustering
3 pages
Determining Clusters
No ratings yet
Determining Clusters
4 pages
Clustering - K-Means: Prerequisite
No ratings yet
Clustering - K-Means: Prerequisite
8 pages
KMean Merged
No ratings yet
KMean Merged
13 pages
Algo
No ratings yet
Algo
59 pages
CPE412 Pattern Recognition (Week 7)
No ratings yet
CPE412 Pattern Recognition (Week 7)
48 pages
Unit 4 Aam
No ratings yet
Unit 4 Aam
26 pages
Unsupervised Learning 1
No ratings yet
Unsupervised Learning 1
40 pages
Kmeansfinal
No ratings yet
Kmeansfinal
16 pages
6 Clustering
No ratings yet
6 Clustering
15 pages
MODULE 4 Clustering
No ratings yet
MODULE 4 Clustering
23 pages
UNIT - 3 - Clustering
No ratings yet
UNIT - 3 - Clustering
21 pages
PART2
No ratings yet
PART2
61 pages
KMeans Clustering Report
No ratings yet
KMeans Clustering Report
2 pages
K Means Clustering
No ratings yet
K Means Clustering
22 pages
ML Unit-2
No ratings yet
ML Unit-2
31 pages
Introduction To The K-Means Clustering Algorithm Based On The Elbow
No ratings yet
Introduction To The K-Means Clustering Algorithm Based On The Elbow
4 pages
Clustering Kmeans
No ratings yet
Clustering Kmeans
6 pages
K-Means Clustering Algorithm - Javatpoint
No ratings yet
K-Means Clustering Algorithm - Javatpoint
21 pages
Unsupervised Machine Learning
No ratings yet
Unsupervised Machine Learning
10 pages
Lecture 11 K Means Clustering
No ratings yet
Lecture 11 K Means Clustering
8 pages
Mod4 - Unsupervised Learning
No ratings yet
Mod4 - Unsupervised Learning
9 pages
DWDM Unit5
No ratings yet
DWDM Unit5
14 pages
AI Chapter 3 Part 5
No ratings yet
AI Chapter 3 Part 5
30 pages
Clustering Algorithms
No ratings yet
Clustering Algorithms
19 pages
"These Are Just Rough Notes For References" What Is K-Means Clustering
No ratings yet
"These Are Just Rough Notes For References" What Is K-Means Clustering
9 pages
K-Means and PCA
No ratings yet
K-Means and PCA
69 pages
K-Means Clustering Algorithm
No ratings yet
K-Means Clustering Algorithm
13 pages
K Means Clustering Algorithm
No ratings yet
K Means Clustering Algorithm
12 pages
Algorithms Lab Viva Questions
No ratings yet
Algorithms Lab Viva Questions
2 pages
Prolog - Unification - Backtracking - Recursion - Lists - Cut
No ratings yet
Prolog - Unification - Backtracking - Recursion - Lists - Cut
78 pages
Bow - Stat Quarter Iii Sy 2023 2024
100% (1)
Bow - Stat Quarter Iii Sy 2023 2024
3 pages
EM9 U1 Lesson 2 PPT
No ratings yet
EM9 U1 Lesson 2 PPT
35 pages
BYJU'S Answer: Study Materials
No ratings yet
BYJU'S Answer: Study Materials
13 pages
Digital Certificates and Digital Signature
No ratings yet
Digital Certificates and Digital Signature
5 pages
Chapter-6 (Association Analysis Basic Concepts and Algorithms)
No ratings yet
Chapter-6 (Association Analysis Basic Concepts and Algorithms)
75 pages
IITK Gradesheet
No ratings yet
IITK Gradesheet
4 pages
Overfitting Vs Underfitting
No ratings yet
Overfitting Vs Underfitting
14 pages
Revision V5no
No ratings yet
Revision V5no
14 pages
AIDS - DM Using Python - Lab Programs
No ratings yet
AIDS - DM Using Python - Lab Programs
19 pages
Breast Cancer Classification and Prediction Using Machine Learning IJERTV9IS020280
No ratings yet
Breast Cancer Classification and Prediction Using Machine Learning IJERTV9IS020280
5 pages
Form Finding of Shells by Structural Optimization
No ratings yet
Form Finding of Shells by Structural Optimization
9 pages
Emotion Recognition Using Eeg Dignals
No ratings yet
Emotion Recognition Using Eeg Dignals
8 pages
Privasea Whitepaper
No ratings yet
Privasea Whitepaper
44 pages
K - Nearest Neighbor
No ratings yet
K - Nearest Neighbor
13 pages
Clustering
No ratings yet
Clustering
19 pages
7.1 First Order Differential Equation
No ratings yet
7.1 First Order Differential Equation
35 pages
Markov
No ratings yet
Markov
21 pages
Nabeel and Awais
No ratings yet
Nabeel and Awais
13 pages
Class 12 Maths
No ratings yet
Class 12 Maths
7 pages
WORD EMBEDDING Project
No ratings yet
WORD EMBEDDING Project
15 pages
Simple Equations Questions
No ratings yet
Simple Equations Questions
4 pages
Activation Function
No ratings yet
Activation Function
10 pages
Feed Forward Neural Network Presentation of Sami and Daim
No ratings yet
Feed Forward Neural Network Presentation of Sami and Daim
9 pages
DL Unit-3
No ratings yet
DL Unit-3
10 pages
Assignment
No ratings yet
Assignment
20 pages
Model Seirs Penyakit Malaria Dengan Vaksinasi
No ratings yet
Model Seirs Penyakit Malaria Dengan Vaksinasi
47 pages
School in Delhi
No ratings yet
School in Delhi
56 pages
Final Value Theorem PPT Electronics 1
No ratings yet
Final Value Theorem PPT Electronics 1
6 pages
SSL - C4.5 Rules
No ratings yet
SSL - C4.5 Rules
13 pages
IE 312-5.1-Location Problem Basic Models-Continuous II
No ratings yet
IE 312-5.1-Location Problem Basic Models-Continuous II
32 pages
Computer Network Applications in Fuzzy System - Ma...
No ratings yet
Computer Network Applications in Fuzzy System - Ma...
2 pages
Unit-I - ADS - IMP QP
No ratings yet
Unit-I - ADS - IMP QP
2 pages
Left Recursion
No ratings yet
Left Recursion
9 pages
Birla Institute of Technology & Science, Pilani Pilani Campus SECOND SEMESTER 2015 - 2016 Database Systems (Cs F212/ Is F243) Mid Semester Exam
No ratings yet
Birla Institute of Technology & Science, Pilani Pilani Campus SECOND SEMESTER 2015 - 2016 Database Systems (Cs F212/ Is F243) Mid Semester Exam
3 pages
Probabilities
No ratings yet
Probabilities
2 pages
M6 Check in Activity 4
No ratings yet
M6 Check in Activity 4
4 pages
The Tech Interview Playbook: From DSA to System Design
From Everand
The Tech Interview Playbook: From DSA to System Design
Chinmoy Mukherjee
No ratings yet

K-Means Clustering

Uploaded by

K-Means Clustering

Uploaded by

K-Means Clustering

1. First, we randomly initialize k points, called means or

You might also like