
DWM Exp. 4

Name – Mayank Vora


Class - TE-03 / 63
Batch - C

Aim: Implementation of Clustering Algorithm (K-means / Agglomerative) using Python

Introduction:

Clustering is an unsupervised machine learning technique used to group similar data points together. K-means clustering partitions the dataset into K clusters by minimizing intra-cluster variance, whereas Agglomerative clustering follows a hierarchical, bottom-up approach: each point starts in its own cluster, and the closest clusters are iteratively merged according to a distance metric.
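
To make "minimizing intra-cluster variance" concrete, the quantity K-means minimizes (the within-cluster sum of squares, which scikit-learn reports as inertia_) can be computed directly. A small sketch; the points and labels below are illustrative assumptions, not data from this experiment:

import numpy as np

# Illustrative points and a hypothetical assignment to two clusters.
points = np.array([[4, 21], [5, 19], [10, 24], [11, 25]])
labels = np.array([0, 0, 1, 1])

# WCSS: squared distance of each point to its cluster's centroid,
# summed over all clusters.
wcss = 0.0
for k in np.unique(labels):
    members = points[labels == k]
    centroid = members.mean(axis=0)
    wcss += ((members - centroid) ** 2).sum()
print(wcss)  # KMeans reports this same quantity as inertia_.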

Procedure:

1. Load the dataset.
2. Preprocess the data (if necessary).
3. Apply the K-means and Agglomerative clustering algorithms.
4. Evaluate the clusters formed.
5. Visualize the results.

Program Codes:

import matplotlib.pyplot as plt
from sklearn.cluster import KMeans, AgglomerativeClustering
import pandas as pd
import numpy as np

# K-means algorithm on predefined data values.
x = [4, 5, 10, 4, 3, 11, 14, 6, 10, 12]
y = [21, 19, 24, 17, 16, 25, 24, 22, 21, 21]
plt.scatter(x, y)
plt.show()

data = list(zip(x, y))
inertias = []

# Run K-means for k = 1..10 and record the inertia (WCSS) for the elbow plot.
for i in range(1, 11):
    kmeans = KMeans(n_clusters=i)
    kmeans.fit(data)
    inertias.append(kmeans.inertia_)

plt.plot(range(1, 11), inertias, marker='o')
plt.title('Elbow method')
plt.xlabel('Number of clusters')
plt.ylabel('Inertia')
plt.show()

kmeans = KMeans(n_clusters=2)
kmeans.fit(data)

plt.scatter(x, y, c=kmeans.labels_)
plt.show()

X = np.random.rand(100, 2)

kmeans = KMeans(n_clusters=3)
kmeans.fit(X)
labels = kmeans.labels_
centroids = kmeans.cluster_centers_

# Visualize the clusters and their centroids
plt.scatter(X[:, 0], X[:, 1], c=labels)
plt.scatter(centroids[:, 0], centroids[:, 1], marker='x', s=200,
            linewidths=3, color='r')
plt.title('K-means Clustering')
plt.xlabel('X')
plt.ylabel('Y')
plt.show()

# Agglomerative Clustering
agg_clustering = AgglomerativeClustering(n_clusters=3)
agg_labels = agg_clustering.fit_predict(X)

# Visualize the agglomerative clustering results
plt.scatter(X[:, 0], X[:, 1], c=agg_labels)
plt.title('Agglomerative Clustering')
plt.xlabel('X')
plt.ylabel('Y')
plt.show()
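
Because Agglomerative clustering is hierarchical, the full sequence of merges can also be inspected as a dendrogram. A minimal sketch using SciPy (an added dependency, not part of the original program) on the same X:

from scipy.cluster.hierarchy import dendrogram, linkage

# Build the full merge hierarchy with Ward linkage and draw the dendrogram.
linked = linkage(X, method='ward')
dendrogram(linked)
plt.title('Agglomerative Clustering Dendrogram')
plt.xlabel('Sample index')
plt.ylabel('Merge distance')
plt.show()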

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans, AgglomerativeClustering
from sklearn.preprocessing import StandardScaler

X, _ = make_blobs(n_samples=300, centers=4, cluster_std=1.0, random_state=42)
X = StandardScaler().fit_transform(X)

# K-Means Clustering
kmeans = KMeans(n_clusters=4, random_state=42)
kmeans_labels = kmeans.fit_predict(X)

# Agglomerative Clustering
agglo = AgglomerativeClustering(n_clusters=4)
agglo_labels = agglo.fit_predict(X)

# Plot results
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(12, 5))
ax1.scatter(X[:, 0], X[:, 1], c=kmeans_labels, cmap='viridis', marker='o')
ax1.set_title("K-Means Clustering")
ax2.scatter(X[:, 0], X[:, 1], c=agglo_labels, cmap='plasma', marker='o')
ax2.set_title("Agglomerative Clustering")
plt.show()
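
Step 4 of the procedure (evaluating the clusters) is not shown above; one common option is the silhouette score. A brief sketch, reusing X, kmeans_labels, and agglo_labels from the program above:

from sklearn.metrics import silhouette_score

# Silhouette ranges from -1 to 1; higher means better-separated clusters.
print("K-Means silhouette:", silhouette_score(X, kmeans_labels))
print("Agglomerative silhouette:", silhouette_score(X, agglo_labels))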
Implementation/Output snapshot:
Conclusion: Both K-means and Agglomerative clustering effectively group data points into clusters. K-means is computationally efficient but requires specifying the number of clusters in advance, while Agglomerative clustering builds a full hierarchy from which the number of clusters can be chosen afterwards (for example, by cutting the dendrogram at a chosen distance), though it is more computationally expensive on large datasets.

Review Questions:

1. What is the K-means clustering algorithm, and how does it work?

Answer: K-means is an unsupervised machine learning algorithm used for clustering data into K groups. It operates through the following steps (a minimal sketch of these steps follows the list):

• Randomly initialize K cluster centroids.
• Assign each data point to the closest centroid.
• Update each centroid by computing the mean of all points assigned to it.
• Repeat the assignment and centroid update process until the centroids remain unchanged or a stopping condition is met.
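
A minimal from-scratch sketch of these steps using NumPy (the function name kmeans and its parameters are illustrative, not from the original document; empty clusters are not handled here):

import numpy as np

def kmeans(points, k, n_iters=100, seed=0):
    # 1. Randomly initialize K centroids by sampling K distinct points.
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iters):
        # 2. Assign each point to its closest centroid.
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # 3. Update each centroid to the mean of its assigned points.
        new_centroids = np.array([points[labels == j].mean(axis=0) for j in range(k)])
        # 4. Stop once the centroids no longer change.
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids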

2. How do you determine the optimal number of clusters in K-means?

Answer: The ideal number of clusters can be identified using the following methods (a short example follows the list):

• Elbow Method: Plot the within-cluster sum of squares (WCSS) against the number of clusters and find the "elbow point" where the rate of decrease slows.
• Silhouette Score: Evaluates how well-separated clusters are, with higher scores indicating better-defined clusters.
• Gap Statistic: Compares clustering performance against a random reference dataset to determine the most suitable number of clusters.
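
A short example of the Silhouette Score approach with scikit-learn (the data array X here is a stand-in for the dataset under study):

import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

X = np.random.rand(100, 2)  # stand-in data; substitute the real dataset

# Silhouette needs at least two clusters, so the search starts at k = 2;
# the k with the highest score is preferred.
for k in range(2, 7):
    labels = KMeans(n_clusters=k, random_state=42).fit_predict(X)
    print(k, silhouette_score(X, labels))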

3. What are the common distance metrics used in Agglomerative Clustering?

Answer: Some widely used distance metrics include (a usage sketch follows the list):

• Euclidean Distance (default): Measures the straight-line distance between points.
• Manhattan Distance: Computes distance based on grid-like paths, summing absolute differences between coordinates.
• Cosine Similarity: Evaluates the cosine of the angle between vectors to measure similarity rather than direct distance.
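
A usage sketch showing how these metrics are selected in scikit-learn's AgglomerativeClustering (note that non-Euclidean metrics require a linkage other than the default 'ward', and that this parameter was named affinity before scikit-learn 1.2; X is again stand-in data):

import numpy as np
from sklearn.cluster import AgglomerativeClustering

X = np.random.rand(100, 2)  # stand-in data

# Ward linkage (the default) supports only Euclidean distance.
euclidean_labels = AgglomerativeClustering(n_clusters=3).fit_predict(X)

# Other metrics require a compatible linkage such as 'average' or 'complete'.
manhattan_labels = AgglomerativeClustering(
    n_clusters=3, metric='manhattan', linkage='average').fit_predict(X)
cosine_labels = AgglomerativeClustering(
    n_clusters=3, metric='cosine', linkage='average').fit_predict(X)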
