What Is Cluster Analysis?

K-means clustering is an unsupervised learning algorithm that groups objects into K clusters, each defined by a centroid, where each object belongs to the cluster with the nearest centroid. It works by choosing initial centroids (often at random), assigning each object to its nearest centroid, and iteratively recomputing each centroid as the mean of the points in its cluster until the centroids stop changing. K-means has limitations when clusters differ in size, density, or shape, or when outliers are present, and determining the optimal K is challenging.

Uploaded by

tara345w


What is Cluster Analysis?

Finding groups of objects such that the objects in a group will be similar (or related) to one another and different from (or unrelated to) the objects in other groups.
Intra-cluster distances are minimized.
Inter-cluster distances are maximized.
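
The two distance notions above can be made concrete with a small sketch. It assumes Euclidean distance and averages over point pairs; the function names and the sample points are illustrative and do not come from the slides.

```python
from itertools import combinations, product
from math import dist


def mean_intra_cluster_distance(cluster):
    """Average Euclidean distance over all pairs of points inside one cluster."""
    pairs = list(combinations(cluster, 2))
    return sum(dist(p, q) for p, q in pairs) / len(pairs)


def mean_inter_cluster_distance(cluster_a, cluster_b):
    """Average Euclidean distance over all pairs with one point in each cluster."""
    pairs = list(product(cluster_a, cluster_b))
    return sum(dist(p, q) for p, q in pairs) / len(pairs)


# Hypothetical sample data: two compact, well-separated groups.
group_a = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0)]
group_b = [(10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]

intra_a = mean_intra_cluster_distance(group_a)
intra_b = mean_intra_cluster_distance(group_b)
inter_ab = mean_inter_cluster_distance(group_a, group_b)

print("intra A:", intra_a)
print("intra B:", intra_b)
print("inter A-B:", inter_ab)
```

A good grouping of these points keeps both intra-cluster averages small relative to the inter-cluster average.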

K-means Clustering
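Partitional clustering approach.
Each cluster is associated with a centroid (center point).
Each point is assigned to the cluster with the closest centroid.
The number of clusters, K, must be specified.
The basic algorithm is very simple: select K initial centroids, then repeatedly assign each point to its closest centroid and recompute each centroid, until the centroids stop changing.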
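
A minimal sketch of the basic algorithm, assuming Euclidean distance and points given as tuples of numbers. The function name k_means, its parameters, and the sample data are illustrative rather than taken from the slides. The initial centroids are passed in explicitly in the example so that the run is reproducible; if they are omitted, K distinct input points are sampled at random.

```python
import random
from math import dist


def k_means(points, k, initial_centroids=None, max_iters=100, seed=None):
    """Basic K-means on a list of equal-length numeric tuples."""
    if initial_centroids is None:
        # Random start: pick K distinct input points as the initial centroids.
        centroids = random.Random(seed).sample(points, k)
    else:
        centroids = list(initial_centroids)

    assignments = None
    for _ in range(max_iters):
        # Assignment step: each point joins the cluster with the closest centroid.
        new_assignments = [
            min(range(k), key=lambda j: dist(point, centroids[j]))
            for point in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the centroids are stable
        assignments = new_assignments

        # Update step: each centroid becomes the mean of its cluster's points.
        for j in range(k):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    return centroids, assignments


# Hypothetical sample data: two square groups of four points each.
points = [
    (0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
    (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0),
]
centroids, labels = k_means(
    points, k=2, initial_centroids=[(0.0, 0.0), (11.0, 11.0)]
)
print(centroids)
print(labels)
```

Keeping the previous centroid for an empty cluster is one simple choice among several; other implementations re-seed the empty cluster instead.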

K-means Clustering Details

Initial centroids are often chosen randomly.
    Clusters produced vary from one run to another.
The centroid is (typically) the mean of the points in the cluster.
Closeness is measured by Euclidean distance, cosine similarity, correlation, etc.
K-means will converge for the common similarity measures mentioned above.
Most of the convergence happens in the first few iterations.
    Often the stopping condition is changed to "Until relatively few points change clusters".
Complexity is O( n * K * I * d )
    n = number of points, K = number of clusters, I = number of iterations, d = number of attributes
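
The sketch below illustrates three of these details together, under stated assumptions: the relaxed stopping rule is written as a threshold on the fraction of points that change cluster (the 5% default is an arbitrary choice), random starts are driven by a seed, and a counter records the number of point-to-centroid distance evaluations, which comes to n * K per iteration, each costing O(d). All names and the sample data are illustrative.

```python
import random
from math import dist


def k_means_relaxed(points, k, max_changed_fraction=0.05, max_iters=100, seed=0):
    """K-means that stops once relatively few points change clusters.

    Returns (sse, iterations, distance_evals).
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # random initial centroids
    assignments = [None] * len(points)
    distance_evals = 0
    iterations = 0

    for iterations in range(1, max_iters + 1):
        new_assignments = []
        for point in points:
            distances = [dist(point, c) for c in centroids]
            distance_evals += len(distances)  # K evaluations per point
            new_assignments.append(distances.index(min(distances)))

        # Count how many points switched cluster in this iteration.
        changed = sum(old != new for old, new in zip(assignments, new_assignments))
        assignments = new_assignments

        for j in range(k):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )

        # Relaxed stopping condition: few enough points moved.
        if changed <= max_changed_fraction * len(points):
            break

    # Sum of squared distances from each point to its assigned centroid.
    sse = sum(dist(p, centroids[a]) ** 2 for p, a in zip(points, assignments))
    return sse, iterations, distance_evals


# Hypothetical sample data: two square groups of four points each.
points = [
    (0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
    (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0),
]

# Several runs with different random starts; results may differ between runs.
runs = [k_means_relaxed(points, k=2, seed=s) for s in range(5)]
for seed, (sse, iterations, distance_evals) in enumerate(runs):
    print(seed, sse, iterations, distance_evals)
```

Because the starting centroids differ between seeds, the final sum of squared distances can differ as well, which is why implementations commonly run several starts and keep the best result.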

Limitations of K-means
K-means has problems when clusters are of differing sizes or densities, or have non-globular shapes.

K-means has problems when the data contains outliers.
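
One reason outliers cause trouble is that the centroid is a mean, and a mean is pulled toward extreme values. A small sketch with made-up points shows how far a single outlier moves a centroid; the helper name centroid is illustrative.

```python
from math import dist


def centroid(points):
    """Mean of a list of equal-length numeric tuples."""
    return tuple(sum(axis) / len(points) for axis in zip(*points))


# Hypothetical sample data: a tight cluster and one distant outlier.
cluster = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
outlier = (50.0, 50.0)

clean = centroid(cluster)
with_outlier = centroid(cluster + [outlier])
shift = dist(clean, with_outlier)

print("centroid without outlier:", clean)
print("centroid with outlier:", with_outlier)
print("shift:", shift)
```

With the outlier included, the centroid lands far from every one of the four original points, so it no longer represents the cluster well.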

The number of clusters (K) is difficult to determine.
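
One way to see why K is hard to choose is to look at the sum of squared distances (SSE) between points and their centroids for several values of K. The sketch below is an illustration that is not part of the slides: it assumes Euclidean distance and, for reproducibility, starts from the first K points instead of a random sample. The names and data are made up.

```python
from math import dist


def k_means_sse(points, k, max_iters=100):
    """Run basic K-means and return the final sum of squared distances.

    Simplification for reproducibility: the first K points are the
    initial centroids, so the points must be distinct.
    """
    centroids = list(points[:k])
    assignments = None
    for _ in range(max_iters):
        new_assignments = [
            min(range(k), key=lambda j: dist(point, centroids[j]))
            for point in points
        ]
        if new_assignments == assignments:
            break
        assignments = new_assignments
        for j in range(k):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    return sum(dist(p, centroids[a]) ** 2 for p, a in zip(points, assignments))


# Hypothetical sample data: two loose groups of three points each.
points = [
    (1.0, 1.0), (1.0, 2.0), (2.0, 1.0),
    (8.0, 8.0), (8.0, 9.0), (9.0, 8.0),
]

sse_by_k = {k: k_means_sse(points, k) for k in range(1, len(points) + 1)}
for k, sse in sse_by_k.items():
    print(k, sse)
```

When K equals the number of points, every point is its own centroid and the SSE is zero, so minimizing SSE alone always favors the largest K. Some additional judgment is needed, such as looking for the K beyond which the SSE stops dropping sharply.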

