Clustering Mall Data Students

The document outlines a Python code implementation for clustering mall customer data using KMeans. It includes data loading, feature selection, scaling, and determining the optimal number of clusters through the Elbow Method and silhouette scores. The final clusters are visualized using PCA for dimensionality reduction.



In [3]: import numpy as np


import matplotlib.pyplot as plt
import pandas as pd
import sklearn

In [4]: df = pd.read_csv('Mall_Customers.csv')
df

Out[4]:      CustomerID   Genre  Age  Annual Income (k$)  Spending Score (1-100)
        0             1    Male   19                  15                      39
        1             2    Male   21                  15                      81
        2             3  Female   20                  16                       6
        3             4  Female   23                  16                      77
        4             5  Female   31                  17                      40
        ..          ...     ...  ...                 ...                     ...
        195         196  Female   35                 120                      79
        196         197  Female   45                 126                      28
        197         198    Male   32                 126                      74
        198         199    Male   32                 137                      18
        199         200    Male   30                 137                      83

        200 rows × 5 columns


In [5]: # 2. Select relevant features and scale


features = ['Annual Income (k$)', 'Spending Score (1-100)']
X = df[features]

In [6]: # Import the necessary class


from sklearn.preprocessing import StandardScaler


scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)
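
As a quick optional check (not part of the original notebook), the standardized features should come out with roughly zero mean and unit standard deviation:

# Sanity check (assumption: X_scaled is the array produced just above).
print(X_scaled.mean(axis=0).round(3))  # expected ~[0, 0]
print(X_scaled.std(axis=0).round(3))   # expected ~[1, 1]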

In [7]: # Import the necessary class


from sklearn.cluster import KMeans # Import KMeans
from sklearn.metrics import silhouette_score

# 3. Find optimal k using Elbow Method


inertia = []
silhouette_scores = []
k_range = range(2, 11)

for k in k_range:
    kmeans = KMeans(n_clusters=k, random_state=42)
    kmeans.fit(X_scaled)
    inertia.append(kmeans.inertia_)
    silhouette_scores.append(silhouette_score(X_scaled, kmeans.labels_))

# Plot Elbow Method


plt.figure(figsize=(10, 5))
plt.plot(k_range, inertia, marker='o')
plt.title('Elbow Method for Optimal k')
plt.xlabel('Number of Clusters (k)')
plt.ylabel('Inertia (Sum of Squared Distances)')
plt.grid()
plt.show()


In [8]: # Plot Silhouette Scores


plt.figure(figsize=(10, 5))
plt.plot(k_range, silhouette_scores, marker='o', color='orange')
plt.title('Silhouette Scores for Optimal k')
plt.xlabel('Number of Clusters (k)')
plt.ylabel('Silhouette Score')
plt.grid()
plt.show()
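
One optional way to read the silhouette curve programmatically is to take the k with the highest score; this is only a sketch, and the notebook still picks k by inspecting the plots:

# Sketch: choose the k with the maximum silhouette score
# (uses k_range and silhouette_scores from the loop above).
best_k = k_range[int(np.argmax(silhouette_scores))]
print(f'Best k by silhouette score: {best_k}')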


In [9]: # 4. Apply KMeans with the optimal k


optimal_k = 5 # Choose based on elbow/silhouette analysis
kmeans = KMeans(n_clusters=optimal_k, random_state=42)
df['Cluster'] = kmeans.fit_predict(X_scaled)
df


Out[9]:      CustomerID   Genre  Age  Annual Income (k$)  Spending Score (1-100)  Cluster
        0             1    Male   19                  15                      39        4
        1             2    Male   21                  15                      81        2
        2             3  Female   20                  16                       6        4
        3             4  Female   23                  16                      77        2
        4             5  Female   31                  17                      40        4
        ..          ...     ...  ...                 ...                     ...      ...
        195         196  Female   35                 120                      79        1
        196         197  Female   45                 126                      28        3
        197         198    Male   32                 126                      74        1
        198         199    Male   32                 137                      18        3
        199         200    Male   30                 137                      83        1

        200 rows × 6 columns
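
A small optional addition (not in the original notebook) is to profile the clusters on the unscaled columns, which makes the segments easier to interpret:

# Mean income/spending and size of each cluster, in original units.
print(df.groupby('Cluster')[features].mean().round(1))
print(df['Cluster'].value_counts().sort_index())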

In [10]: import matplotlib.pyplot as plt


import numpy as np

# Assuming you have optimal_k, X_scaled, and kmeans defined from your previous code

plt.figure(figsize=(10, 7))

# Define a list of colors for the clusters


colors = ['deeppink', 'green', 'red', 'purple', 'orange'] # Adjust colors as needed

# Plot each cluster with a different color


for cluster in range(optimal_k):
    cluster_points = X_scaled[df['Cluster'] == cluster]  # Select points in the current cluster
    plt.scatter(cluster_points[:, 0], cluster_points[:, 1],
                c=colors[cluster], label=f'Cluster {cluster}')

plt.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1],
            s=300, c='black', marker='*', label='Centroids')

plt.title('Customer Segments Visualization')
plt.xlabel('Annual Income (k$, scaled)')
plt.ylabel('Spending Score (scaled)')
plt.legend()
plt.grid()
plt.show()
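
The scatter above is in scaled units; as an optional sketch, the fitted scaler can map the centroids back to the original units (k$ and score points) for easier interpretation:

# Cluster centres expressed in original units via the inverse transform.
centers_original = scaler.inverse_transform(kmeans.cluster_centers_)
print(pd.DataFrame(centers_original, columns=features).round(1))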


In [11]: df = pd.read_csv('Mall_Customers.csv')

In [12]: # Assuming 'df' is your DataFrame and 'Genre' is the categorical column
encoded_columns = pd.get_dummies(df['Genre'], prefix='Genre') # 'Genre' is used as a prefix for new columns

# Concatenate the encoded columns to the DataFrame


df = pd.concat([df, encoded_columns], axis=1)

# Remove the original 'Genre' column (optional)


df = df.drop('Genre', axis=1)
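
As a side note, pd.get_dummies also accepts drop_first=True, which keeps a single gender indicator instead of two mutually redundant columns; a minimal sketch (not used in this notebook, re-reading the CSV since 'Genre' was just dropped):

# Alternative encoding sketch: one indicator column instead of two.
genre_onehot = pd.get_dummies(pd.read_csv('Mall_Customers.csv')['Genre'],
                              prefix='Genre', drop_first=True)
print(genre_onehot.head())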

In [13]: # 2. Select relevant features and scale


features = ['Genre_Male', 'Genre_Female', 'Age', 'Annual Income (k$)', 'Spending Score (1-100)'] # Update features
X = df[features]


In [14]: # Import the necessary class


from sklearn.preprocessing import StandardScaler

# Now your existing code should work:


scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

In [15]: # Import the necessary class


from sklearn.cluster import KMeans # Import KMeans
from sklearn.metrics import silhouette_score

# 3. Find optimal k using Elbow Method


inertia = []
silhouette_scores = []
k_range = range(2, 9)

for k in k_range:
    kmeans = KMeans(n_clusters=k, random_state=42)
    kmeans.fit(X_scaled)
    inertia.append(kmeans.inertia_)
    silhouette_scores.append(silhouette_score(X_scaled, kmeans.labels_))

# Plot Elbow Method


plt.figure(figsize=(10, 5))
plt.plot(k_range, inertia, marker='o')
plt.title('Elbow Method for Optimal k')
plt.xlabel('Number of Clusters (k)')
plt.ylabel('Inertia (Sum of Squared Distances)')

plt.grid()
plt.show()
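
The loop above also collected silhouette scores for this five-feature clustering; an optional sketch to print them alongside the elbow plot:

# Silhouette scores for k = 2..8 on the five scaled features.
for k, score in zip(k_range, silhouette_scores):
    print(f'k={k}: silhouette={score:.3f}')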

In [16]: # 4. Apply KMeans with the optimal k


optimal_k = 4 # Choose based on elbow/silhouette analysis
kmeans = KMeans(n_clusters=optimal_k, random_state=42)
df['Cluster'] = kmeans.fit_predict(X_scaled)
df


Out[16]:      CustomerID  Age  Annual Income (k$)  Spending Score (1-100)  Genre_Female  Genre_Male  Cluster
         0             1   19                  15                      39         False        True        3
         1             2   21                  15                      81         False        True        3
         2             3   20                  16                       6          True       False        2
         3             4   23                  16                      77          True       False        1
         4             5   31                  17                      40          True       False        2
         ..          ...  ...                 ...                     ...           ...         ...      ...
         195         196   35                 120                      79          True       False        1
         196         197   45                 126                      28          True       False        2
         197         198   32                 126                      74         False        True        3
         198         199   32                 137                      18         False        True        0
         199         200   30                 137                      83         False        True        3

         200 rows × 7 columns

In [17]: from sklearn.decomposition import PCA

# Apply PCA to reduce to 2 dimensions


pca = PCA(n_components=2)
X_pca = pca.fit_transform(X_scaled)
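
# Optional check (an addition, not in the original notebook): how much of the
# variance in the five scaled features the two principal components retain.
print(pca.explained_variance_ratio_.round(3), pca.explained_variance_ratio_.sum().round(3))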

# Modify the plotting part to use X_pca:


for cluster in range(optimal_k):
    cluster_points = X_pca[df['Cluster'] == cluster]  # Select points in the current cluster
    plt.scatter(cluster_points[:, 0], cluster_points[:, 1],
                c=colors[cluster], label=f'Cluster {cluster}')

plt.scatter(pca.transform(kmeans.cluster_centers_)[:, 0], pca.transform(kmeans.cluster_centers_)[:, 1],
            s=300, c='black', marker='*', label='Centroids')

plt.title('Customer Segments Visualization')
plt.xlabel('PCA Component 1')
plt.ylabel('PCA Component 2')
plt.legend()
plt.grid()
plt.show()
