0% found this document useful (0 votes)

3 views2 pages

Lab 10

The document outlines a Python script that performs K-Means clustering on the breast cancer dataset using libraries such as NumPy, Pandas, and Scikit-learn. It includes data scaling, clustering, and visualization of the results using PCA for dimensionality reduction. The script also prints a confusion matrix and classification report to evaluate the clustering performance.

Uploaded by

2022becs152

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views2 pages

Lab 10

Uploaded by

2022becs152

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

import seaborn as sns

from sklearn.datasets import load_breast_cancer

from sklearn.cluster import KMeans

from sklearn.preprocessing import StandardScaler

from sklearn.decomposition import PCA

from sklearn.metrics import confusion_matrix, classification_report

data = load_breast_cancer()

X = data.data

y = data.target

scaler = StandardScaler()

X_scaled = scaler.fit_transform(X)

kmeans = KMeans(n_clusters=2, random_state=42)

y_kmeans = kmeans.fit_predict(X_scaled)

print("Confusion Matrix:")

print(confusion_matrix(y, y_kmeans))

print("\nClassification Report:")

print(classification_report(y, y_kmeans))

pca = PCA(n_components=2)

X_pca = pca.fit_transform(X_scaled)

df = pd.DataFrame(X_pca, columns=['PC1', 'PC2'])

df['Cluster'] = y_kmeans

df['True Label'] = y

plt.figure(figsize=(8, 6))

sns.scatterplot(data=df, x='PC1', y='PC2', hue='Cluster', palette='Set1', s=100, edgecolor='black', alpha=0.7)

plt.title('K-Means Clustering of Breast Cancer Dataset')

plt.xlabel('Principal Component 1')

plt.ylabel('Principal Component 2')

plt.legend(title="Cluster")

plt.show()

plt.figure(figsize=(8, 6))

sns.scatterplot(data=df, x='PC1', y='PC2', hue='True Label', palette='coolwarm', s=100, edgecolor='black', alpha=0.7)

plt.title('True Labels of Breast Cancer Dataset')

plt.xlabel('Principal Component 1')

plt.ylabel('Principal Component 2')

plt.legend(title="True Label")

plt.show()

plt.figure(figsize=(8, 6))

sns.scatterplot(data=df, x='PC1', y='PC2', hue='Cluster', palette='Set1', s=100, edgecolor='black', alpha=0.7)

centers = pca.transform(kmeans.cluster_centers_)

plt.scatter(centers[:, 0], centers[:, 1], s=200, c='red', marker='X', label='Centroids')

plt.title('K-Means Clustering with Centroids')

plt.xlabel('Principal Component 1')

plt.ylabel('Principal Component 2')

plt.legend(title="Cluster")

plt.show()

Clustering
No ratings yet
Clustering
1 page
10 PRGM
No ratings yet
10 PRGM
3 pages
ML Lab Programs
No ratings yet
ML Lab Programs
23 pages
AML Lab
No ratings yet
AML Lab
14 pages
K Means
No ratings yet
K Means
2 pages
Linear SVM: 'Target'
No ratings yet
Linear SVM: 'Target'
13 pages
KMeans Clustering Bidimensional Daniel Ames Camayo
No ratings yet
KMeans Clustering Bidimensional Daniel Ames Camayo
15 pages
Lab Report6 - B21CI014
No ratings yet
Lab Report6 - B21CI014
8 pages
Suneel Varma
No ratings yet
Suneel Varma
11 pages
ML Lab Experiment Shortened With Same Output
No ratings yet
ML Lab Experiment Shortened With Same Output
6 pages
ML
No ratings yet
ML
7 pages
KnnClassifier - Jupyter Notebook
No ratings yet
KnnClassifier - Jupyter Notebook
2 pages
MLP Kmeans
No ratings yet
MLP Kmeans
3 pages
pgm9 & 10
No ratings yet
pgm9 & 10
5 pages
Breast Cancer Classification Using DTC
No ratings yet
Breast Cancer Classification Using DTC
1 page
PMA Experiment 2
No ratings yet
PMA Experiment 2
6 pages
Program 9
No ratings yet
Program 9
2 pages
ML Lab 5
No ratings yet
ML Lab 5
2 pages
Prog 10
No ratings yet
Prog 10
3 pages
Kmeans
No ratings yet
Kmeans
2 pages
K-Means Cluster
No ratings yet
K-Means Cluster
2 pages
Preductive Modelling Assignment
No ratings yet
Preductive Modelling Assignment
3 pages
Code 1
No ratings yet
Code 1
3 pages
MLL
No ratings yet
MLL
2 pages
Week 8. K-Means
No ratings yet
Week 8. K-Means
7 pages
ML 7
No ratings yet
ML 7
2 pages
Labaihw
No ratings yet
Labaihw
1 page
EX7
No ratings yet
EX7
3 pages
LAB9
No ratings yet
LAB9
3 pages
Mids Practical 5
No ratings yet
Mids Practical 5
2 pages
ML 1
No ratings yet
ML 1
11 pages
9 Ds
No ratings yet
9 Ds
5 pages
Code Examples in Space
No ratings yet
Code Examples in Space
13 pages
K Means
No ratings yet
K Means
3 pages
ML
No ratings yet
ML
11 pages
Assignment 6 ML
No ratings yet
Assignment 6 ML
4 pages
K-Means 10
No ratings yet
K-Means 10
2 pages
5 Clustering Algorithm 17-09-2024
No ratings yet
5 Clustering Algorithm 17-09-2024
2 pages
ML2 Practical List
No ratings yet
ML2 Practical List
80 pages
Import As Import As Import As Import As From Import
No ratings yet
Import As Import As Import As Import As From Import
3 pages
DMDW Lab8
No ratings yet
DMDW Lab8
3 pages
Program 10
No ratings yet
Program 10
3 pages
LAB7 Kmeans
No ratings yet
LAB7 Kmeans
11 pages
A Mini Rpoject
No ratings yet
A Mini Rpoject
7 pages
Appendix - Complete Code Implementation
No ratings yet
Appendix - Complete Code Implementation
8 pages
DWDM Lab 3
No ratings yet
DWDM Lab 3
10 pages
From Import: Dict - Keys ( ('Data', 'Target', 'Frame', 'Target - Names', 'DESCR', 'Feature - Names', 'Filename', 'Data - Module') )
No ratings yet
From Import: Dict - Keys ( ('Data', 'Target', 'Frame', 'Target - Names', 'DESCR', 'Feature - Names', 'Filename', 'Data - Module') )
4 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
Yogesh Siddiq Edited
No ratings yet
Yogesh Siddiq Edited
6 pages
Python Code For KNN Classifier 1. Initial Message
No ratings yet
Python Code For KNN Classifier 1. Initial Message
7 pages
Final ML Programs 075005
No ratings yet
Final ML Programs 075005
15 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Baidurya Debnath 4
No ratings yet
Baidurya Debnath 4
37 pages
Experiment 10
No ratings yet
Experiment 10
1 page
All in One
No ratings yet
All in One
13 pages
Experiment 10 Vtu ML
No ratings yet
Experiment 10 Vtu ML
5 pages
2.3 Aiml Rishit
No ratings yet
2.3 Aiml Rishit
7 pages
SVM K NN MLP With Sklearn Jupyter NoteBo
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
22 pages
Breast Cancer Classification
No ratings yet
Breast Cancer Classification
18 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet

Lab 10

Uploaded by

Lab 10

Uploaded by

import numpy as np

import matplotlib.pyplot as plt

import seaborn as sns

from sklearn.datasets import load_breast_cancer

from sklearn.cluster import KMeans

from sklearn.preprocessing import StandardScaler

from sklearn.decomposition import PCA

from sklearn.metrics import confusion_matrix, classification_report

kmeans = KMeans(n_clusters=2, random_state=42)

df = pd.DataFrame(X_pca, columns=['PC1', 'PC2'])

sns.scatterplot(data=df, x='PC1', y='PC2', hue='Cluster', palette='Set1', s=100, edgecolor='black', alpha=0.7)

plt.title('K-Means Clustering of Breast Cancer Dataset')

plt.xlabel('Principal Component 1')

sns.scatterplot(data=df, x='PC1', y='PC2', hue='True Label', palette='coolwarm', s=100, edgecolor='black', alpha=0.7)

plt.title('True Labels of Breast Cancer Dataset')

plt.xlabel('Principal Component 1')

plt.ylabel('Principal Component 2')

sns.scatterplot(data=df, x='PC1', y='PC2', hue='Cluster', palette='Set1', s=100, edgecolor='black', alpha=0.7)

plt.scatter(centers[:, 0], centers[:, 1], s=200, c='red', marker='X', label='Centroids')

plt.title('K-Means Clustering with Centroids')

plt.xlabel('Principal Component 1')

plt.ylabel('Principal Component 2')

You might also like