Clustering Code Explanation

Uploaded by

Mangesh P Joshi

Let's break down the code step by step:

Import Libraries:

import numpy as np

import pandas as pd

from sklearn.cluster import KMeans

import matplotlib.pyplot as plt

This imports the necessary libraries:

numpy for numerical computations.

pandas for handling data in a DataFrame.

KMeans from sklearn.cluster for performing K-means clustering.

matplotlib.pyplot for visualization.

Sample Data:

data = {
    'Player': ['Player 1', 'Player 2', 'Player 3', 'Player 4', 'Player 5',
               'Player 6', 'Player 7', 'Player 8', 'Player 9', 'Player 10'],
    'Runs Scored': [350, 280, 420, 200, 320, 380, 240, 400, 310, 360],
    'Wickets Taken': [15, 10, 20, 5, 12, 18, 8, 17, 14, 16]
}

This is the sample data representing runs scored and wickets taken by cricket players.

Create DataFrame:

df = pd.DataFrame(data)

This creates a DataFrame df using the sample data.

Select Features:

X = df[['Runs Scored', 'Wickets Taken']]

This selects the features 'Runs Scored' and 'Wickets Taken' for clustering.

Visualize the Data:

plt.scatter(X['Runs Scored'], X['Wickets Taken'], color='blue')

plt.xlabel('Runs Scored')

plt.ylabel('Wickets Taken')

plt.title('Cricket Players - Runs vs Wickets')

plt.show()

This visualizes the data points on a scatter plot.

Perform K-means Clustering:

k = 4 # Number of clusters

kmeans = KMeans(n_clusters=k)

kmeans.fit(X)

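This fits a K-means model with k=4 clusters to the two selected features. Because the initial centroids are chosen randomly, the cluster numbering (and occasionally the cluster membership) can differ between runs unless a random_state is passed to KMeans.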
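What the fit step does can be sketched in plain Python. The block below is only an illustration of the basic assign-and-update loop, not scikit-learn's actual implementation (which uses a smarter initialization and several restarts). The helper names nearest and simple_kmeans, and the seed and max_iter parameters, are hypothetical and do not appear in the original code.

```python
import math
import random

# Runs scored and wickets taken, copied from the sample data above.
points = [(350, 15), (280, 10), (420, 20), (200, 5), (320, 12),
          (380, 18), (240, 8), (400, 17), (310, 14), (360, 16)]


def nearest(point, centers):
    """Return the index of the center closest to point (Euclidean distance)."""
    return min(range(len(centers)), key=lambda j: math.dist(point, centers[j]))


def simple_kmeans(points, k, seed=0, max_iter=100):
    """Hypothetical helper: a plain K-means loop, for illustration only."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # start from k distinct data points
    for _ in range(max_iter):
        # Assignment step: label each point with its nearest center.
        labels = [nearest(p, centers) for p in points]
        # Update step: move each center to the mean of its assigned points.
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members)))
            else:
                new_centers.append(centers[j])  # keep the center of an empty cluster
        if new_centers == centers:  # converged: no center moved
            break
        centers = new_centers
    # Final labels are always computed against the returned centers.
    return centers, [nearest(p, centers) for p in points]


centers, labels = simple_kmeans(points, k=4)
print(centers)
print(labels)
```

Changing seed changes the starting centers, which is why two runs can number the clusters differently or settle on a different grouping.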

Get Cluster Centers and Labels:

centroids = kmeans.cluster_centers_

labels = kmeans.labels_

This retrieves the cluster centers and the label assigned to each data point.

Add Cluster Labels to DataFrame:

df['Cluster'] = labels

This adds the cluster labels to the DataFrame.

Visualize the Clusters:

colors = ['r', 'g', 'b', 'orange']

for i in range(k):
    # Rows of X that belong to cluster i
    cluster_points = X[df['Cluster'] == i]
    plt.scatter(cluster_points['Runs Scored'], cluster_points['Wickets Taken'],
                c=colors[i], label='Cluster {}'.format(i+1))

plt.scatter(centroids[:, 0], centroids[:, 1], marker='*', s=300, c='k', label='Centroids')

plt.xlabel('Runs Scored')

plt.ylabel('Wickets Taken')

plt.title('K-means Clustering of Cricket Players')

plt.legend()

plt.show()

This visualizes the clusters along with their centroids on a scatter plot.

Print Cluster Centers:

print("Cluster Centers:")

for i, centroid in enumerate(centroids):
    print("Cluster {}: {}".format(i+1, centroid))

This prints the coordinates of the cluster centers.

This code performs K-means clustering on the given dataset of cricket player statistics and visualizes
the resulting clusters. Adjustments can be made to the number of clusters (k) and the features
selected for clustering as needed.
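One common way to compare candidate values of k is to look at the fitted model's inertia_ attribute, the within-cluster sum of squared distances, for each value. The sketch below assumes the same ten players as above. The range of k values, n_init=10 and random_state=0 are illustrative choices, not part of the original code.

```python
from sklearn.cluster import KMeans

# Runs scored and wickets taken, copied from the sample data above.
X = [[350, 15], [280, 10], [420, 20], [200, 5], [320, 12],
     [380, 18], [240, 8], [400, 17], [310, 14], [360, 16]]

inertias = {}
for k in range(1, 7):
    # A fixed random_state makes the result repeatable between runs.
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    inertias[k] = model.inertia_  # within-cluster sum of squared distances

for k, value in inertias.items():
    print("k = {}: inertia = {:.1f}".format(k, value))
```

Inertia generally shrinks as k grows, so the lowest value is not automatically the best choice. A typical heuristic is to pick the k after which the decrease levels off.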
