0% found this document useful (0 votes)

46 views

Lab Assignment 3 Ai

Uploaded by

yashutank46

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views

Lab Assignment 3 Ai

Uploaded by

yashutank46

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

In [1]: #Write python script to implement KMeans Algorithm over a inputted dataset (Any data take of your own).

import pandas as pd
from sklearn.cluster import KMeans

data = pd.DataFrame({
"age": [25, 32, 40, 28, 35, 48, 38, 22, 27, 30],
"income": [50000, 70000, 85000, 62000, 78000, 95000, 82000, 45000, 52000, 65000]
})

k = 3

kmeans = KMeans(n_clusters=k, random_state=42)

kmeans.fit(data)

cluster_labels = kmeans.labels_

print("Cluster labels:", cluster_labels)

centroids = kmeans.cluster_centers_
print("Centroids:", centroids)

data["cluster"] = cluster_labels

print(data)

import matplotlib.pyplot as plt

plt.scatter(data["age"], data["income"], c=cluster_labels)

plt.xlabel("Age")
plt.ylabel("Income")
plt.title("Customer Clusters")
plt.show()

C:\tools\Anaconda3\lib\site-packages\sklearn\cluster\_kmeans.py:870: FutureWarning: The default value of `n_init` will change from 10 to 'auto' in 1.4. Set th
e value of `n_init` explicitly to suppress the warning
warnings.warn(
C:\tools\Anaconda3\lib\site-packages\sklearn\cluster\_kmeans.py:1382: UserWarning: KMeans is known to have a memory leak on Windows with MKL, when there are l
ess chunks than available threads. You can avoid it by setting the environment variable OMP_NUM_THREADS=1.
warnings.warn(
Cluster labels: [1 2 0 2 0 0 0 1 1 2]
Centroids: [[4.02500000e+01 8.50000000e+04]
[2.46666667e+01 4.90000000e+04]
[3.00000000e+01 6.56666667e+04]]
age income cluster
0 25 50000 1
1 32 70000 2
2 40 85000 0
3 28 62000 2
4 35 78000 0
5 48 95000 0
6 38 82000 0
7 22 45000 1
8 27 52000 1
9 30 65000 2

In [2]: #Write python script to implement Hierarchical clustering Algorithm over a inputted dataset (Any data take of your own).

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_blobs
from sklearn.cluster import AgglomerativeClustering
from scipy.cluster.hierarchy import dendrogram, linkage

np.random.seed(42)
data, _ = make_blobs(n_samples=300, centers=4, random_state=42)

k = int(input("Enter the number of clusters (K): "))

hc_model = AgglomerativeClustering(n_clusters=k, affinity='euclidean', linkage='ward')

hc_labels = hc_model.fit_predict(data)

plt.scatter(data[:, 0], data[:, 1], c=hc_labels, cmap='viridis', edgecolors='k', s=50)

plt.title('Hierarchical Clustering')
plt.xlabel('Feature 1')
plt.ylabel('Feature 2')
plt.show()

linked = linkage(data, 'ward')

dendrogram(linked, orientation='top', distance_sort='descending', show_leaf_counts=True)
plt.title('Hierarchical Clustering Dendrogram')
plt.xlabel('Sample Index')
plt.ylabel('Cluster Distance')
plt.show()

Enter the number of clusters (K): 3

C:\tools\Anaconda3\lib\site-packages\sklearn\cluster\_agglomerative.py:983: FutureWarning: Attribute `affinity` was deprecated in version 1.2 and will be remo
ved in 1.4. Use `metric` instead
warnings.warn(

In [3]: #Write python script to implement decision tree over a inputted dataset (Any data take of your own).

import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

np.random.seed(42)
data = pd.DataFrame({
'Feature1': np.random.rand(100),
'Feature2': np.random.rand(100),
'Label': np.random.choice([0, 1], size=100)
})

X = data[['Feature1', 'Feature2']]
y = data['Label']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

dt_classifier = DecisionTreeClassifier(random_state=42)

dt_classifier.fit(X_train, y_train)

y_pred = dt_classifier.predict(X_test)

accuracy = accuracy_score(y_test, y_pred)

conf_matrix = confusion_matrix(y_test, y_pred)
class_report = classification_report(y_test, y_pred)

print(f'Accuracy: {accuracy:.2f}')
print('Confusion Matrix:\n', conf_matrix)
print('Classification Report:\n', class_report)

Accuracy: 0.65
Confusion Matrix:
[[4 2]
[5 9]]
Classification Report:
precision recall f1-score support

0 0.44 0.67 0.53 6

1 0.82 0.64 0.72 14

accuracy 0.65 20
macro avg 0.63 0.65 0.63 20
weighted avg 0.71 0.65 0.66 20

In [ ]:

1.1 Read The Data and Do Exploratory Data Analysis. Describe The Data Briefly
100% (19)
1.1 Read The Data and Do Exploratory Data Analysis. Describe The Data Briefly
50 pages
Exam LTAM: You Have What It Takes To Pass
100% (1)
Exam LTAM: You Have What It Takes To Pass
12 pages
A Linear Program
No ratings yet
A Linear Program
58 pages
Exercice 1 TP K-Means
No ratings yet
Exercice 1 TP K-Means
1 page
customers-k-means
No ratings yet
customers-k-means
11 pages
k-means-clustering
No ratings yet
k-means-clustering
6 pages
program-8
No ratings yet
program-8
11 pages
1 Kmeans-Pratical-No-1
No ratings yet
1 Kmeans-Pratical-No-1
8 pages
Tugas Clustering - 132021012 - Kevin Gazkia Naufal
No ratings yet
Tugas Clustering - 132021012 - Kevin Gazkia Naufal
6 pages
Day59 K Means Clustering 1701989733
No ratings yet
Day59 K Means Clustering 1701989733
5 pages
ml lab
No ratings yet
ml lab
8 pages
ML 5
No ratings yet
ML 5
12 pages
Kmeansclustering Sales Dataset
No ratings yet
Kmeansclustering Sales Dataset
6 pages
KMeans
No ratings yet
KMeans
1 page
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
HW5 Clustering (50 PTS) : Test Algorithms
No ratings yet
HW5 Clustering (50 PTS) : Test Algorithms
5 pages
K Means
100% (2)
K Means
329 pages
ML2 Practical List
No ratings yet
ML2 Practical List
80 pages
23CC554
No ratings yet
23CC554
10 pages
Data Mining - Project
100% (2)
Data Mining - Project
11 pages
D3 docs
No ratings yet
D3 docs
6 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
21BCE5775 Clustering
No ratings yet
21BCE5775 Clustering
42 pages
6
No ratings yet
6
4 pages
Pa66 ML Exp6
No ratings yet
Pa66 ML Exp6
9 pages
machine learning lab
No ratings yet
machine learning lab
20 pages
JAVIER KMeans Clustering Jupyter Notebook
No ratings yet
JAVIER KMeans Clustering Jupyter Notebook
7 pages
S6 - Data Mining Lab Experiments (Except 1)
No ratings yet
S6 - Data Mining Lab Experiments (Except 1)
6 pages
K-Means Clustering - Jupyter Notebook
No ratings yet
K-Means Clustering - Jupyter Notebook
11 pages
Data Mining
No ratings yet
Data Mining
27 pages
Week 8 DS Practical (1)
No ratings yet
Week 8 DS Practical (1)
13 pages
01 K Means - Merged
No ratings yet
01 K Means - Merged
26 pages
SE_KMeansClustering
No ratings yet
SE_KMeansClustering
21 pages
assg 3
No ratings yet
assg 3
31 pages
SUMERA - Kmeans Clustering - Jupyter Notebook
No ratings yet
SUMERA - Kmeans Clustering - Jupyter Notebook
7 pages
K-Means in Python - Solution
No ratings yet
K-Means in Python - Solution
6 pages
AAM 7th prac
No ratings yet
AAM 7th prac
4 pages
Unsupervisd Learning Algorithm
No ratings yet
Unsupervisd Learning Algorithm
6 pages
PRAC9_23BME053
No ratings yet
PRAC9_23BME053
4 pages
Practical-8: Import As Import As Import As Import Import As
No ratings yet
Practical-8: Import As Import As Import As Import Import As
9 pages
Ass6(DMDS)
No ratings yet
Ass6(DMDS)
7 pages
Ex No: Date: K-Means Clustering Using Python: Scatter
No ratings yet
Ex No: Date: K-Means Clustering Using Python: Scatter
10 pages
Practical 5
No ratings yet
Practical 5
6 pages
Untitled Document
No ratings yet
Untitled Document
6 pages
TOO
No ratings yet
TOO
7 pages
Mall Customer Segmentation Using KMeans Clustering Algorithm and Classification Algorithm
No ratings yet
Mall Customer Segmentation Using KMeans Clustering Algorithm and Classification Algorithm
40 pages
Prac7 8 9 10
No ratings yet
Prac7 8 9 10
12 pages
DWM_EXP4
No ratings yet
DWM_EXP4
9 pages
K Means Clustering
No ratings yet
K Means Clustering
11 pages
Practical 03
No ratings yet
Practical 03
3 pages
Suneel Varma
No ratings yet
Suneel Varma
11 pages
K Means Clustering - Experiment 12
No ratings yet
K Means Clustering - Experiment 12
3 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
Aiml Unit 3 4
No ratings yet
Aiml Unit 3 4
19 pages
EXPERIMENT 9
No ratings yet
EXPERIMENT 9
10 pages
ML assignment
No ratings yet
ML assignment
11 pages
Import Numpy As NP Import Pandas As PD
No ratings yet
Import Numpy As NP Import Pandas As PD
7 pages
Data Mining
No ratings yet
Data Mining
18 pages
KMeans Clustering Bidimensional Daniel Ames Camayo
No ratings yet
KMeans Clustering Bidimensional Daniel Ames Camayo
15 pages
ML
No ratings yet
ML
11 pages
St. John College of Engineering and Management, Palghar - Maharashtra
No ratings yet
St. John College of Engineering and Management, Palghar - Maharashtra
11 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Ghadeer Haider - Effect of Sampling Freq On Steady State Error
No ratings yet
Ghadeer Haider - Effect of Sampling Freq On Steady State Error
8 pages
Tutorial Differentiation
No ratings yet
Tutorial Differentiation
10 pages
12.05.backtracking Algorithms
No ratings yet
12.05.backtracking Algorithms
37 pages
A Gentle Introduction To Graph Neural Networks
No ratings yet
A Gentle Introduction To Graph Neural Networks
9 pages
Chapter12 PDF
No ratings yet
Chapter12 PDF
5 pages
ECO465_SampleMidterm
No ratings yet
ECO465_SampleMidterm
2 pages
Eigen Values Eign Vectors - QB
No ratings yet
Eigen Values Eign Vectors - QB
8 pages
The Enigmatic Realm of Randomness - An Esoteric Exploration
No ratings yet
The Enigmatic Realm of Randomness - An Esoteric Exploration
2 pages
Unit Roots Tests Methods and Problems
No ratings yet
Unit Roots Tests Methods and Problems
28 pages
Richard Feynman: Simulating Physics With Computers
100% (1)
Richard Feynman: Simulating Physics With Computers
8 pages
The Schrodinger Equation: Fisika Kuantum
No ratings yet
The Schrodinger Equation: Fisika Kuantum
13 pages
Ncert Sol Cbse Class 10 Maths Chapt 2 Polynomials PDF
No ratings yet
Ncert Sol Cbse Class 10 Maths Chapt 2 Polynomials PDF
17 pages
Ai Lecture1
No ratings yet
Ai Lecture1
16 pages
Polyphase Merge
100% (1)
Polyphase Merge
9 pages
Spatial Filtering: CS474/674 - Prof. Bebis
No ratings yet
Spatial Filtering: CS474/674 - Prof. Bebis
55 pages
Aes Document 1 Final
No ratings yet
Aes Document 1 Final
102 pages
Mat1512 Assignment 03 2023 Edited Version
No ratings yet
Mat1512 Assignment 03 2023 Edited Version
3 pages
DSP All Labs 1-13 by NANGYAL KHAN
No ratings yet
DSP All Labs 1-13 by NANGYAL KHAN
141 pages
Rapid Miner Process - Getting Started With Assignment 2 and 3 (Fundraising Data)
No ratings yet
Rapid Miner Process - Getting Started With Assignment 2 and 3 (Fundraising Data)
7 pages
Industrial Applications Using Neural Networks
No ratings yet
Industrial Applications Using Neural Networks
11 pages
PPC Bits For Students
No ratings yet
PPC Bits For Students
4 pages
Matrix Chain Multiplication Example1
No ratings yet
Matrix Chain Multiplication Example1
8 pages
Lab Assignment 8: Nishiv Singh (B20MT029) Google Colab Notebooks Link: Task 1
No ratings yet
Lab Assignment 8: Nishiv Singh (B20MT029) Google Colab Notebooks Link: Task 1
4 pages
Jpeg Ls Loco
No ratings yet
Jpeg Ls Loco
16 pages
Project Assignment.2025
No ratings yet
Project Assignment.2025
2 pages
PDL Final Assignment-3 Aryan
No ratings yet
PDL Final Assignment-3 Aryan
8 pages
Plant Disease Detection Using AutoML and Deep Learning 1
No ratings yet
Plant Disease Detection Using AutoML and Deep Learning 1
12 pages
Soft Vs Hard Clustering
No ratings yet
Soft Vs Hard Clustering
5 pages

Lab Assignment 3 Ai

Uploaded by

Lab Assignment 3 Ai

Uploaded by

In [1]: #Write python script to implement KMeans Algorithm over a inputted dataset (Any data take of your own).

kmeans = KMeans(n_clusters=k, random_state=42)

print("Cluster labels:", cluster_labels)

import matplotlib.pyplot as plt

plt.scatter(data["age"], data["income"], c=cluster_labels)

k = int(input("Enter the number of clusters (K): "))

hc_model = AgglomerativeClustering(n_clusters=k, affinity='euclidean', linkage='ward')

plt.scatter(data[:, 0], data[:, 1], c=hc_labels, cmap='viridis', edgecolors='k', s=50)

linked = linkage(data, 'ward')

Enter the number of clusters (K): 3

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

accuracy = accuracy_score(y_test, y_pred)

0 0.44 0.67 0.53 6

You might also like