0% found this document useful (0 votes)

8 views11 pages

K Means Clustering

The document explains K-Means Clustering, an unsupervised learning algorithm used to partition data into clusters based on similarity. It describes the process of customer segmentation in a retail store using features like annual income and spending habits, detailing the steps involved in the K-Means algorithm. Additionally, it includes a Python code example for implementing K-Means clustering and visualizing the results.

Uploaded by

jeyaboopathi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views11 pages

K Means Clustering

Uploaded by

jeyaboopathi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 11

K – Means Clustering

Dr.J.Jeyaboopathiraja
Assistant Professor
Department of Computer Science
Sri Ramakrishna College of Arts & Science
Coimbatore
Machine Learning is broadly classified into Supervised and Unsupervised
Learning based on how the model learns from data.

Supervised Learning

🔹 Definition: The model is trained on labeled data, meaning each input has a
corresponding output.
🔹 Goal: Learn from the past data and make predictions on new data.
🔹 Types:
Classification (Predict categories) → e.g., Spam or Not Spam

Unsupervised Learning

🔹 Definition: The model is trained on unlabeled data (no predefined outputs).

🔹 Goal: Identify patterns, relationships, and hidden structures in data.
🔹 Types:
Clustering (Grouping similar data) → e.g., Customer Segmentation
K – Means Clustering

K-Means Clustering is a popular unsupervised learning algorithm

that partitions data into distinct clusters based on similarity.

Unsupervised learning is a type of machine learning that works with data

that has no labels or categories. The main goal is to find patterns and
relationships in the data without any guidance.

In this approach, the machine analyzes unorganized information and

groups it based on similarities, patterns, or differences.
Customer Segmentation in a Retail Store
A retail store wants to classify its customers into different groups based
on their annual income and spending habits.

Collect Data
Suppose we have a dataset with two key features for each customer:
Annual Income (in $1000s)
Spending Score (1–100) (how much they spend compared to their
income)
Initialize Centroids
Randomly select 3 points as initial centroids.
Let's assume:
Centroid 1: (15, 80)
Centroid 2: (40, 50)
Centroid 3: (80, 20)
Step 1: Choose the Number of Clusters (K)
The value of K (number of clusters) is chosen manually or using the
Elbow Method.
Here, we assume K = 3, meaning we divide customers into 3 groups.

Step 2: Initialize K Cluster Centroids

K-Means starts by selecting K random points from the dataset as initial
cluster centers.

Step 3: Assign Each Customer to the Nearest Centroid

Each customer is assigned to the nearest cluster using Euclidean distance

Step 4: Compute New Cluster Centroids

The centroid of each cluster is updated as the average of all customers in
that group.

Step 5: Repeat Steps 3-4 Until Clusters Stabilize

The algorithm stops when cluster centers no longer change.
Cluster 1: Budget Shoppers
Low income, high spending score
These customers spend a lot despite having a low income.
Example: Frequently shop for fashion but have limited earnings.

Cluster 2: Average Shoppers

Moderate income, moderate spending score
These customers spend proportionally to their earnings.
Example: Middle-class customers who buy regularly but not
excessively.

Cluster 3: Luxury Shoppers

High income, low spending score
These customers earn a lot but spend cautiously.
Example: Wealthy individuals who only buy high-end products
occasionally.
import numpy as np
import matplotlib.pyplot as plt from sklearn.cluster import KMeans

# Sample data: [Annual Income ($1000s), Spending Score (1-100)]

data = np.array([
[15, 80], [16, 75], [40, 50],
[42, 45], [80, 20], [85, 15]
])
# Apply K-Means clustering with 3 clusters
kmeans = KMeans(n_clusters=3, random_state=42, n_init=10)
kmeans.fit(data)
# Get cluster centers and labels
centroids = kmeans.cluster_centers_
labels = kmeans.labels_

# Plot the clusters

plt.scatter(data[:, 0], data[:, 1], c=labels, cmap='viridis', marker='o', edgecolors='k')
plt.scatter(centroids[:, 0], centroids[:, 1], c='red', marker='X', s=200, label='Centroids')
plt.xlabel("Annual Income ($1000s)")
plt.ylabel("Spending Score (1-100)")
plt.title("Customer Segmentation using K-Means Clustering")
plt.legend()
plt.show()
THANK YOU

Customer Segmentation Using Machine Learning
100% (1)
Customer Segmentation Using Machine Learning
28 pages
09.unsupervised Learning
No ratings yet
09.unsupervised Learning
50 pages
Słowacja Wszystko PDF
No ratings yet
Słowacja Wszystko PDF
379 pages
1694601073-Unit 3.1 Unsupervised Learning CU 2.0
No ratings yet
1694601073-Unit 3.1 Unsupervised Learning CU 2.0
35 pages
Unit II Final
No ratings yet
Unit II Final
152 pages
Artificial Intelligence Report
No ratings yet
Artificial Intelligence Report
23 pages
Customer Segmentation
No ratings yet
Customer Segmentation
43 pages
EAI13
No ratings yet
EAI13
19 pages
Week 9
No ratings yet
Week 9
66 pages
LP I Assignment A4 Clustering
No ratings yet
LP I Assignment A4 Clustering
13 pages
Report ML 2
No ratings yet
Report ML 2
10 pages
ML - K-Means
No ratings yet
ML - K-Means
12 pages
Unit 4
No ratings yet
Unit 4
125 pages
Group 11 Ba Presentation
No ratings yet
Group 11 Ba Presentation
11 pages
K Means
No ratings yet
K Means
9 pages
CE345 - Lecture #9 - Clustering
No ratings yet
CE345 - Lecture #9 - Clustering
56 pages
Som New
No ratings yet
Som New
21 pages
K Means - Ipynb - Colab
No ratings yet
K Means - Ipynb - Colab
10 pages
Lesson 5 - Unsupervised Learning
No ratings yet
Lesson 5 - Unsupervised Learning
11 pages
10.lab Activity
No ratings yet
10.lab Activity
11 pages
ML Unit 4
No ratings yet
ML Unit 4
110 pages
Unsupervised Learning Final
No ratings yet
Unsupervised Learning Final
17 pages
K-Means Clustering
No ratings yet
K-Means Clustering
6 pages
Facebook Live Seller
No ratings yet
Facebook Live Seller
8 pages
Minor Project
No ratings yet
Minor Project
10 pages
Final Synopsis
No ratings yet
Final Synopsis
9 pages
Presentation 1
No ratings yet
Presentation 1
47 pages
04-FSSR DS610 2024 2025T1 Kmeans
No ratings yet
04-FSSR DS610 2024 2025T1 Kmeans
57 pages
Data Science Laboratory Lab Manual: Prepared by Dr. R Obulakonda Reddy, Associate Professor
No ratings yet
Data Science Laboratory Lab Manual: Prepared by Dr. R Obulakonda Reddy, Associate Professor
35 pages
K Clustering
No ratings yet
K Clustering
28 pages
Da Exp 10
No ratings yet
Da Exp 10
6 pages
Da Exp 10
No ratings yet
Da Exp 10
6 pages
Python Machine Learning
No ratings yet
Python Machine Learning
19 pages
K Mean Clustering
No ratings yet
K Mean Clustering
59 pages
Class6 Unsupervised Learning Clustering
No ratings yet
Class6 Unsupervised Learning Clustering
13 pages
K Means Final
No ratings yet
K Means Final
10 pages
L7 Clustering
No ratings yet
L7 Clustering
58 pages
Chapter 3 p4
No ratings yet
Chapter 3 p4
18 pages
UNIT - 3 - Clustering
No ratings yet
UNIT - 3 - Clustering
21 pages
Machine Learning Chapter 3
No ratings yet
Machine Learning Chapter 3
12 pages
ML Unit5 Notes
No ratings yet
ML Unit5 Notes
18 pages
K - Means Clustering
No ratings yet
K - Means Clustering
13 pages
K Means Clustering
No ratings yet
K Means Clustering
22 pages
3.unsupervised Learning
No ratings yet
3.unsupervised Learning
9 pages
KMeans Clustering Report
No ratings yet
KMeans Clustering Report
2 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
23 pages
Customer Segemntation
No ratings yet
Customer Segemntation
26 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
12 pages
Mod4 - Unsupervised Learning
No ratings yet
Mod4 - Unsupervised Learning
9 pages
Clustering
No ratings yet
Clustering
18 pages
5 Minute Summary Lecture - 1
No ratings yet
5 Minute Summary Lecture - 1
2 pages
Tae1 A12
No ratings yet
Tae1 A12
1 page
Machine Learning Is Fun 1565131730
No ratings yet
Machine Learning Is Fun 1565131730
48 pages
K Means Clustering
No ratings yet
K Means Clustering
5 pages
Clustering Kmeans
No ratings yet
Clustering Kmeans
6 pages
Machine Learning - Iv
No ratings yet
Machine Learning - Iv
13 pages
Pilot
No ratings yet
Pilot
3 pages
Advanced Machine Learning Mastering Level Learning With Python
No ratings yet
Advanced Machine Learning Mastering Level Learning With Python
81 pages
Settlement Geog Notes
No ratings yet
Settlement Geog Notes
54 pages
Ds Un4
No ratings yet
Ds Un4
11 pages
Session4 KMeansClustering
No ratings yet
Session4 KMeansClustering
10 pages
BCS602 Module 1
No ratings yet
BCS602 Module 1
35 pages
13: Clustering: Unsupervised Learning - Introduction
No ratings yet
13: Clustering: Unsupervised Learning - Introduction
4 pages
Monaco Static MLC Sequencer Technical Reference (2.0)
No ratings yet
Monaco Static MLC Sequencer Technical Reference (2.0)
24 pages
SPE-134076-MS - Integrated Formation Evaluation Using A Combination of Image Logs, WFTs and Mini-DSTs
100% (1)
SPE-134076-MS - Integrated Formation Evaluation Using A Combination of Image Logs, WFTs and Mini-DSTs
17 pages
Ai Notes
No ratings yet
Ai Notes
7 pages
Thesis Topic On Web Mining
100% (3)
Thesis Topic On Web Mining
7 pages
How To Run Cluster Analysis in Excel
No ratings yet
How To Run Cluster Analysis in Excel
9 pages
Text Summarization Using Machine Learning LST M
No ratings yet
Text Summarization Using Machine Learning LST M
18 pages
Machine Learning Lab File (BTCS619-18)
No ratings yet
Machine Learning Lab File (BTCS619-18)
50 pages
Datamining Bits
No ratings yet
Datamining Bits
16 pages
Lecture4 Slides
No ratings yet
Lecture4 Slides
43 pages
Concepts and Techniques: - Chapter 7
No ratings yet
Concepts and Techniques: - Chapter 7
70 pages
To HPC With MPI For Data Science: Frank Nielsen
No ratings yet
To HPC With MPI For Data Science: Frank Nielsen
304 pages
6th Sem
No ratings yet
6th Sem
15 pages
EDA Mini Report
No ratings yet
EDA Mini Report
32 pages
KDD96 037
No ratings yet
KDD96 037
6 pages
Temporal Mining
No ratings yet
Temporal Mining
15 pages
Investigating AI-Powered Tutoring Systems That Ada
No ratings yet
Investigating AI-Powered Tutoring Systems That Ada
7 pages
Neural Network-Based Reversible Data Hiding For Medical Image
No ratings yet
Neural Network-Based Reversible Data Hiding For Medical Image
8 pages
Customer Behavior Model Using Data Mining: Milan Patel, Srushti Karvekar, Zeal Mehta
No ratings yet
Customer Behavior Model Using Data Mining: Milan Patel, Srushti Karvekar, Zeal Mehta
8 pages
E-Commerce Data: Topic-5.2: Text Mining/Analytics
No ratings yet
E-Commerce Data: Topic-5.2: Text Mining/Analytics
63 pages
Dissertation Thesis
No ratings yet
Dissertation Thesis
42 pages
Ajithkumar - Inframind Season
No ratings yet
Ajithkumar - Inframind Season
12 pages
Data Warehouse and Data Mining: Syllabus
No ratings yet
Data Warehouse and Data Mining: Syllabus
28 pages
Ijcirv13n8 08
No ratings yet
Ijcirv13n8 08
8 pages
Automatic Yield Management System
No ratings yet
Automatic Yield Management System
5 pages
An Ontology-Based NLP Approach To Semantic Annotation of Annual Report
No ratings yet
An Ontology-Based NLP Approach To Semantic Annotation of Annual Report
4 pages

K Means Clustering

Uploaded by

K Means Clustering

Uploaded by

K – Means Clustering

🔹 Definition: The model is trained on unlabeled data (no predefined outputs).

K-Means Clustering is a popular unsupervised learning algorithm

Unsupervised learning is a type of machine learning that works with data

In this approach, the machine analyzes unorganized information and

Step 2: Initialize K Cluster Centroids

Step 3: Assign Each Customer to the Nearest Centroid

Step 4: Compute New Cluster Centroids

Step 5: Repeat Steps 3-4 Until Clusters Stabilize

Cluster 2: Average Shoppers

Cluster 3: Luxury Shoppers

# Sample data: [Annual Income ($1000s), Spending Score (1-100)]

# Plot the clusters

You might also like