0% found this document useful (0 votes)

9 views3 pages

Kmean PGM

The document outlines a process for applying k-Means clustering to the Iris dataset, utilizing the 'cluster' and 'factoextra' libraries for analysis and visualization. It calculates the Silhouette Score to evaluate clustering quality, indicating reasonable performance with some overlap between clusters. The results are visualized through various plots, and a comparison with original species labels is provided to understand cluster assignments.

Uploaded by

Triveni Jayaram

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views3 pages

Kmean PGM

Uploaded by

Triveni Jayaram

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

# Load necessary libraries

#library(datasets) # For the iris dataset

library(cluster) # For kmeans

library(factoextra) # For visualization (optional, but recommended)

# Load the Iris dataset

data(iris)

# Apply k-Means clustering (assuming 3 clusters)

set.seed(42) # For reproducibility

kmeans_result <- kmeans(iris[, 1:4], 3) # Cluster based on the 4 features

# Evaluate clustering quality (Silhouette score)

silhouette_avg <- silhouette(kmeans_result$cluster, dist(iris[, 1:4]))

print("Silhouette Score:")

print(summary(silhouette_avg))

# Comment on clustering quality

cat("\nComment on Clustering Quality:\n")

cat("The Silhouette Score measures how similar a data point is to its own cluster compared
to other clusters. A score closer to +1 indicates better clustering, -1 indicates poor
clustering, and 0 indicates overlapping clusters.\n")

cat("The average Silhouette Score is", mean(silhouette_avg[,3]), ". This suggests a

reasonable clustering performance, although it's not exceptionally high. There is some
overlap between the clusters, which is expected with the Iris dataset.\n")

# Plot the clusters (using factoextra - highly recommended for k-means visualization)

fviz_cluster(kmeans_result, data = iris[, 1:4],

palette = c("#E41A1C", "#377EB8", "#4DAF4A"), # Color palette

geom = "point", # Show points

ellipse = TRUE, # Show ellipses around clusters

ggtheme = theme_bw() # Use a white background

# Plot clusters (base R - Sepal features)

plot(iris[, 1], iris[, 2], col = kmeans_result$cluster,

pch = 19, xlab = "Sepal Length", ylab = "Sepal Width",

main = "K-Means Clustering of Iris (Sepal Features)")

points(kmeans_result$centers[, 1], kmeans_result$centers[, 2],

col = 1:3, pch = 8, cex = 2) # Plot cluster centers

# Plot clusters (base R - Petal features)

plot(iris[, 3], iris[, 4], col = kmeans_result$cluster,

pch = 19, xlab = "Petal Length", ylab = "Petal Width",

main = "K-Means Clustering of Iris (Petal Features)")

points(kmeans_result$centers[, 3], kmeans_result$centers[, 4],

col = 1:3, pch = 8, cex = 2) # Plot cluster centers

# Compare with original labels (for understanding cluster assignments)

print("\nComparison with Original Labels:")

table(iris$Species, kmeans_result$cluster)

#Investigate which species are in which cluster

for (i in 1:3) { #loop through the 3 clusters

cluster_data <- iris[kmeans_result$cluster == i, ]

print(paste("\nCluster", i, ":"))

print(table(cluster_data$Species)) #show the counts of each species in this cluster

Document From Triveni
No ratings yet
Document From Triveni
2 pages
Cyber Scheme May 2024
No ratings yet
Cyber Scheme May 2024
20 pages
Lin Reg Outputs
No ratings yet
Lin Reg Outputs
1 page
p4 Outputs
No ratings yet
p4 Outputs
3 pages
Iris Flower Classification Project
No ratings yet
Iris Flower Classification Project
9 pages
Hierarchical Clustering
No ratings yet
Hierarchical Clustering
96 pages
Income (K-Means Clustering On A Sample Data Set)
No ratings yet
Income (K-Means Clustering On A Sample Data Set)
3 pages
Partition
No ratings yet
Partition
52 pages
Baidurya Debnath 4
No ratings yet
Baidurya Debnath 4
37 pages
Yogesh Siddiq Edited
No ratings yet
Yogesh Siddiq Edited
6 pages
Banner GPTT
No ratings yet
Banner GPTT
1 page
K-Means Cluter Analysis For IRIS Data Frame in R
No ratings yet
K-Means Cluter Analysis For IRIS Data Frame in R
3 pages
Week1 Notes
No ratings yet
Week1 Notes
5 pages
K-Means Clustering Numerical Example
No ratings yet
K-Means Clustering Numerical Example
5 pages
Gaussianmixture
No ratings yet
Gaussianmixture
2 pages
9 Ds
No ratings yet
9 Ds
5 pages
Kmeans Steps
No ratings yet
Kmeans Steps
3 pages
Rlab SS
No ratings yet
Rlab SS
25 pages
Ds Paper
No ratings yet
Ds Paper
35 pages
Unit IV
No ratings yet
Unit IV
51 pages
L08 Hierachical Agglomerative Clustering
No ratings yet
L08 Hierachical Agglomerative Clustering
41 pages
Vansh 3089 CA2
No ratings yet
Vansh 3089 CA2
13 pages
Density Based Clustering
No ratings yet
Density Based Clustering
70 pages
Anuj Khandelwal 3029 BCP A Business Analytics Continuous Assessment 2
No ratings yet
Anuj Khandelwal 3029 BCP A Business Analytics Continuous Assessment 2
20 pages
Iris HC Solution
No ratings yet
Iris HC Solution
31 pages
KNN - Jupyter Notebook
No ratings yet
KNN - Jupyter Notebook
8 pages
Experiment 4 1
No ratings yet
Experiment 4 1
4 pages
Bi 5to 8
No ratings yet
Bi 5to 8
6 pages
NJ - Corrected Final
No ratings yet
NJ - Corrected Final
27 pages
Unit 2 DMW
No ratings yet
Unit 2 DMW
26 pages
Clustering
No ratings yet
Clustering
7 pages
Datamininganddataware
No ratings yet
Datamininganddataware
25 pages
Kmeans
No ratings yet
Kmeans
2 pages
DATAMINING
No ratings yet
DATAMINING
24 pages
UAS Mechine Learning
No ratings yet
UAS Mechine Learning
5 pages
Clustering
No ratings yet
Clustering
1 page
Practical 7 1
No ratings yet
Practical 7 1
9 pages
K-Means Clustering Using PCA Analysis Lab Report
No ratings yet
K-Means Clustering Using PCA Analysis Lab Report
9 pages
K - Means - Clustering - Ipynb - Colaboratory
No ratings yet
K - Means - Clustering - Ipynb - Colaboratory
2 pages
Algoritma K-Means Clustering Dan Contoh Soal - KETUTRARE
No ratings yet
Algoritma K-Means Clustering Dan Contoh Soal - KETUTRARE
17 pages
Compute2
No ratings yet
Compute2
10 pages
Clustering R Codes
No ratings yet
Clustering R Codes
2 pages
Experiment 11ml
No ratings yet
Experiment 11ml
1 page
06 - Unsupervised Learning - 18 Dec 2023
No ratings yet
06 - Unsupervised Learning - 18 Dec 2023
50 pages
Ads Exp 3
No ratings yet
Ads Exp 3
7 pages
Import As Import As Import As From Import Import As Import
No ratings yet
Import As Import As Import As From Import Import As Import
7 pages
ML 7
No ratings yet
ML 7
2 pages
Clustering - With - Elbow - Plot - ML - 4 - Jupyter Notebook
No ratings yet
Clustering - With - Elbow - Plot - ML - 4 - Jupyter Notebook
6 pages
From Import Import As Import As From Import From Import From Import From Import
No ratings yet
From Import Import As Import As From Import From Import From Import From Import
9 pages
RDM Slides Clustering With R 1
No ratings yet
RDM Slides Clustering With R 1
64 pages
Pertemuan-X - Manajemen Data Bagian 2
No ratings yet
Pertemuan-X - Manajemen Data Bagian 2
31 pages
Fo DS
No ratings yet
Fo DS
9 pages
Iris Flower Classification
No ratings yet
Iris Flower Classification
47 pages
Metode Subtractive Fuzzy C-Means (SFCM) Dalam Pengelompokan
No ratings yet
Metode Subtractive Fuzzy C-Means (SFCM) Dalam Pengelompokan
13 pages
Minor Project by Ali (Intrainz)
No ratings yet
Minor Project by Ali (Intrainz)
17 pages
Zafira fk,+4 Vol11No1 855+ (36-47) +
No ratings yet
Zafira fk,+4 Vol11No1 855+ (36-47) +
12 pages
Demonstrate Clustering On Data Set
No ratings yet
Demonstrate Clustering On Data Set
15 pages
Chapter 4 PDF
No ratings yet
Chapter 4 PDF
89 pages
Department Of: Computer Science & Engineering
No ratings yet
Department Of: Computer Science & Engineering
4 pages
Kmeansrcode
No ratings yet
Kmeansrcode
2 pages
ISYE 6501 Georgia Tech hmwk4.2
No ratings yet
ISYE 6501 Georgia Tech hmwk4.2
4 pages
Implement Clustering Algorithms
No ratings yet
Implement Clustering Algorithms
4 pages
EastWestAirlines Cluster
100% (1)
EastWestAirlines Cluster
6 pages
Program 7-EM Algorithm-K Means Algorithm
No ratings yet
Program 7-EM Algorithm-K Means Algorithm
3 pages
Lec 06 Clustering
No ratings yet
Lec 06 Clustering
44 pages
Solution HW2
No ratings yet
Solution HW2
6 pages
K Means On IRIS Dataset
No ratings yet
K Means On IRIS Dataset
4 pages
Ex No 10
No ratings yet
Ex No 10
2 pages
ML Clustering
No ratings yet
ML Clustering
3 pages
Analisis Algoritma K-Medoids Clustering Dalam Pengelompokan Penyebaran Covid-19 Di Indonesia
No ratings yet
Analisis Algoritma K-Medoids Clustering Dalam Pengelompokan Penyebaran Covid-19 Di Indonesia
8 pages
K-Means Cluster
No ratings yet
K-Means Cluster
2 pages
Objective: For One Dimensional Data Set (7,10,20,28,35), Perform Hierarchical Clustering
No ratings yet
Objective: For One Dimensional Data Set (7,10,20,28,35), Perform Hierarchical Clustering
13 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Exp 7 PDF
No ratings yet
Exp 7 PDF
4 pages
10 - DBSCANClusteringOnIRIS-Copy1 - Jupyter Notebook
No ratings yet
10 - DBSCANClusteringOnIRIS-Copy1 - Jupyter Notebook
4 pages
R Lab Program
No ratings yet
R Lab Program
20 pages
Tara Venit Per Capita (US$) Rata de Alfabetizare (%) Rata de Mortalitate Infantila (%) Durata Medie de Viata (Ani)
No ratings yet
Tara Venit Per Capita (US$) Rata de Alfabetizare (%) Rata de Mortalitate Infantila (%) Durata Medie de Viata (Ani)
8 pages
KMeans
No ratings yet
KMeans
2 pages
Amazon-Fine-Food-Review - K-Means, Agglomerative & DBSCAN Clustering
No ratings yet
Amazon-Fine-Food-Review - K-Means, Agglomerative & DBSCAN Clustering
79 pages
Overview of Clustering:: UNIT-5
No ratings yet
Overview of Clustering:: UNIT-5
27 pages
Iris Species IB
No ratings yet
Iris Species IB
7 pages
American Journal of Physics Volume 53 Issue 9 1985 (Doi 10.1119/1.14356) MacKeown, P. K. - Evaluation of Feynman Path Integrals by Monte Carlo Methods
No ratings yet
American Journal of Physics Volume 53 Issue 9 1985 (Doi 10.1119/1.14356) MacKeown, P. K. - Evaluation of Feynman Path Integrals by Monte Carlo Methods
6 pages
Doucet, de Freitas, Gordon - An Introduction To Sequential Monte Carlo Methods
No ratings yet
Doucet, de Freitas, Gordon - An Introduction To Sequential Monte Carlo Methods
12 pages
Classification Using R
No ratings yet
Classification Using R
9 pages
AMR - Assignment 1-Sample Solutions
No ratings yet
AMR - Assignment 1-Sample Solutions
7 pages
PGM 7
No ratings yet
PGM 7
3 pages
Iris Visual Code
No ratings yet
Iris Visual Code
6 pages
03 - K Means Clustering On Iris Datasets
No ratings yet
03 - K Means Clustering On Iris Datasets
4 pages
Summary:: Petalwidthcm, Species) Samples: 150
No ratings yet
Summary:: Petalwidthcm, Species) Samples: 150
5 pages
Clustering - Jupyter Notebook
100% (1)
Clustering - Jupyter Notebook
11 pages
CSE 319 Pattern Recognition: Clustering
No ratings yet
CSE 319 Pattern Recognition: Clustering
58 pages
Materi Praktikum
No ratings yet
Materi Praktikum
7 pages
Cluster Analysis Usingr PDF
No ratings yet
Cluster Analysis Usingr PDF
0 pages
Clustering
No ratings yet
Clustering
8 pages
FullMarks - Clustering StudentSolution 2
No ratings yet
FullMarks - Clustering StudentSolution 2
13 pages

Kmean PGM

Uploaded by

Kmean PGM

Uploaded by

# Load necessary libraries

#library(datasets) # For the iris dataset

library(cluster) # For kmeans

library(factoextra) # For visualization (optional, but recommended)

# Load the Iris dataset

# Apply k-Means clustering (assuming 3 clusters)

set.seed(42) # For reproducibility

kmeans_result <- kmeans(iris[, 1:4], 3) # Cluster based on the 4 features

# Evaluate clustering quality (Silhouette score)

silhouette_avg <- silhouette(kmeans_result$cluster, dist(iris[, 1:4]))

# Comment on clustering quality

cat("\nComment on Clustering Quality:\n")

cat("The average Silhouette Score is", mean(silhouette_avg[,3]), ". This suggests a

fviz_cluster(kmeans_result, data = iris[, 1:4],

palette = c("#E41A1C", "#377EB8", "#4DAF4A"), # Color palette

ellipse = TRUE, # Show ellipses around clusters

ggtheme = theme_bw() # Use a white background

# Plot clusters (base R - Sepal features)

plot(iris[, 1], iris[, 2], col = kmeans_result$cluster,

pch = 19, xlab = "Sepal Length", ylab = "Sepal Width",

main = "K-Means Clustering of Iris (Sepal Features)")

points(kmeans_result$centers[, 1], kmeans_result$centers[, 2],

col = 1:3, pch = 8, cex = 2) # Plot cluster centers

# Plot clusters (base R - Petal features)

plot(iris[, 3], iris[, 4], col = kmeans_result$cluster,

pch = 19, xlab = "Petal Length", ylab = "Petal Width",

main = "K-Means Clustering of Iris (Petal Features)")

points(kmeans_result$centers[, 3], kmeans_result$centers[, 4],

col = 1:3, pch = 8, cex = 2) # Plot cluster centers

# Compare with original labels (for understanding cluster assignments)

print("\nComparison with Original Labels:")

#Investigate which species are in which cluster

for (i in 1:3) { #loop through the 3 clusters

print(table(cluster_data$Species)) #show the counts of each species in this cluster

You might also like