K-Means Clustering
k-means is one of the simplest unsupervised learning algorithms for the well-known clustering problem. The procedure follows a simple and easy way to classify a given data set through a certain number of clusters (assume k clusters) fixed a priori. The main idea is to define k centers, one for each cluster. These centers should be placed carefully, because different locations cause different results; the better choice is therefore to place them as far away from each other as possible. The next step is to take each point of the given data set and associate it with the nearest center. When no point is pending, the first step is complete and an early grouping is done. At this point we recalculate k new centroids as the barycenters of the clusters resulting from the previous step. Once we have these k new centroids, a new binding is made between the same data set points and the nearest new center. A loop has thereby been generated; as a result of this loop, the k centers change their location step by step until no more changes occur, in other words until the centers no longer move. Finally, this algorithm aims at minimizing an objective function known as the squared-error function, given by

J(V) = \sum_{i=1}^{c} \sum_{j=1}^{c_i} \left( \| x_i - v_j \| \right)^2
where
\| x_i - v_j \| is the Euclidean distance between x_i and v_j,
c_i is the number of data points in the ith cluster, and
c is the number of cluster centers.
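As a small illustration, this objective can be computed directly in NumPy. The following is a minimal sketch; the function and variable names are chosen for this example and are not part of the original text:

    import numpy as np

    def squared_error(X, centers, labels):
        # X: (n, d) data points; centers: (c, d) cluster centers;
        # labels: (n,) index of the cluster each point is assigned to.
        # Returns the sum of squared Euclidean distances between each
        # point and its assigned center, i.e. the J(V) defined above.
        return float(np.sum((X - centers[labels]) ** 2))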
Algorithmic steps for k-means clustering
Let X = {x_1, x_2, ..., x_n} be the set of data points and V = {v_1, v_2, ..., v_c} be the set of centers.
1) Randomly select c cluster centers.
2) Calculate the distance between each data point and the cluster centers.
3) Assign each data point to the cluster center whose distance from it is the minimum over all the cluster centers.
4) Recalculate the new cluster centers using

v_i = \frac{1}{c_i} \sum_{j=1}^{c_i} x_j

where c_i is the number of data points in the ith cluster.
5) Recalculate the distance between each data point and the newly obtained cluster centers.
6) If no data point was reassigned then stop, otherwise repeat from step 3).
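The steps above translate almost line for line into NumPy. The following is a minimal sketch of the loop under the stated steps, not a production implementation; all names (kmeans, max_iter, seed) are chosen for this example:

    import numpy as np

    def kmeans(X, k, max_iter=100, seed=0):
        rng = np.random.default_rng(seed)
        # Step 1: randomly pick k distinct data points as initial centers.
        centers = X[rng.choice(len(X), size=k, replace=False)].astype(float)
        labels = np.full(len(X), -1)
        for _ in range(max_iter):
            # Steps 2-3 (and 5 on later passes): distance from every point
            # to every center, then assign each point to its nearest center.
            dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
            new_labels = dists.argmin(axis=1)
            # Step 6: stop when no data point was reassigned.
            if np.array_equal(new_labels, labels):
                break
            labels = new_labels
            # Step 4: recalculate each center as the barycenter (mean)
            # of the points currently assigned to it.
            for j in range(k):
                if np.any(labels == j):
                    centers[j] = X[labels == j].mean(axis=0)
        return centers, labels

For example, kmeans(np.random.randn(200, 2), k=3) partitions 200 random 2-D points into three clusters. Running it with different seed values can yield different partitions, which is exactly the sensitivity to initialization and local optima listed under the disadvantages below.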
Advantages
1) Fast, robust, and easy to understand.
2) Relatively efficient: O(tknd), where n is the number of objects, k the number of clusters, d the dimension of each object, and t the number of iterations. Normally, k, t, d << n.
3) Gives the best result when the data sets are distinct or well separated from each other.
Disadvantages
1) The number of cluster centers, k, must be specified a priori.
2) The use of exclusive assignment: if two clusters of data overlap heavily, k-means will not be able to resolve that there are two clusters.
3) The learning algorithm is not invariant to non-linear transformations, i.e., with different representations of the data we get different results (data represented in Cartesian coordinates and in polar coordinates will give different results).
4) Euclidean distance measures can unequally weight underlying factors.
5) The learning algorithm finds only a local optimum of the squared-error function.
6) Randomly choosing the initial cluster centers may not lead to a fruitful result (refer to the figure).
7) Applicable only when the mean is defined, i.e., it fails for categorical data.
8) Unable to handle noisy data and outliers.
9) The algorithm fails for non-linearly separable data sets (see Fig. II and the sketch below).
Fig. II: A non-linearly separable data set on which the k-means algorithm fails.
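The failure mode shown in Fig. II can be reproduced with scikit-learn. This sketch assumes two concentric rings (generated with make_circles) as a stand-in for the non-linear data set in the figure; k-means separates the points with an essentially linear boundary instead of recovering the two rings:

    from sklearn.cluster import KMeans
    from sklearn.datasets import make_circles
    from sklearn.metrics import adjusted_rand_score

    # Two concentric rings: the clusters are not linearly separable.
    X, y_true = make_circles(n_samples=500, factor=0.3, noise=0.05, random_state=0)
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
    # An adjusted Rand score near 0 means the k-means partition bears
    # almost no resemblance to the two true rings.
    print(adjusted_rand_score(y_true, labels))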