Dbscan Implementation in Python

This document details an experiment on Density-based spatial clustering (DBSCAN) conducted by Anvita Singh. It includes Python code for data preprocessing, applying PCA for dimensionality reduction, and implementing the DBSCAN algorithm to identify clusters and noise in a dataset. The results are visualized using scatter plots and count plots to illustrate the clustering output.

Uploaded by

Anvita Singh


Pattern Recognition & Anomaly Detection Lab

EXPERIMENT – 12
Density-based spatial clustering (DBSCAN)

NAME – ANVITA SINGH

ROLL NO – R2142221063

SAP_ID – 500107712

BATCH – 8

CODE –
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

from sklearn.model_selection import train_test_split

from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.cluster import DBSCAN
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

# Load dataset
df = pd.read_csv("/content/city_day.csv")
print("Initial Data Sample:")
print(df.head())

# Remove missing values

df.dropna(inplace=True)

# Feature Selection (only numeric columns)

X = df.select_dtypes(include=['float64', 'int64'])
# Feature Scaling
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# Apply PCA for dimensionality reduction

# You can choose the number of components (2 for 2D, or more for higher dimensions)
pca = PCA(n_components=2)
X_pca = pca.fit_transform(X_scaled)

# Explained variance ratio

print("\nExplained Variance Ratio of the PCA Components:")
print(pca.explained_variance_ratio_)

# Train-Test Split (optional for DBSCAN, but we'll do it for visualization)
X_train, X_test = train_test_split(X_pca, test_size=0.2, random_state=42)

# DBSCAN Model
# You can adjust eps and min_samples based on your data
dbscan = DBSCAN(eps=0.5, min_samples=5)
dbscan.fit(X_train)

# Predict Clusters
y_pred_train = dbscan.labels_  # DBSCAN assigns labels; -1 represents noise (outliers)

# Show sample predictions

print("\nSample DBSCAN Clusters (Noise = -1, Clusters = 0, 1, 2, ...):")
print(y_pred_train[:10])

# Count of Clusters vs Noise

unique, counts = np.unique(y_pred_train, return_counts=True)
result_counts = dict(zip(unique, counts))

print("\nCluster Counts:")
print(result_counts)

# Visualize Clusters and Noise

plt.figure(figsize=(8,5))
sns.countplot(x=y_pred_train)
plt.title("DBSCAN Clustering Output")
plt.xlabel("Cluster/Noise")
plt.ylabel("Count")
plt.show()
# Visualizing the PCA-reduced data with clusters highlighted
plt.figure(figsize=(10, 6))
sns.scatterplot(x=X_train[:, 0], y=X_train[:, 1], hue=y_pred_train,
palette="coolwarm", style=y_pred_train, legend="full")
plt.title("DBSCAN Clustering on PCA-reduced Data (Train Set)")
plt.xlabel("Principal Component 1")
plt.ylabel("Principal Component 2")
plt.show()

# Cluster the test set. DBSCAN has no predict() method, so fit_predict
# refits the model on X_test; the resulting labels are independent of
# the labels found on the train set.
y_pred_test = dbscan.fit_predict(X_test)

# Visualizing the PCA-reduced test data with clusters and noise highlighted
plt.figure(figsize=(10, 6))
sns.scatterplot(x=X_test[:, 0], y=X_test[:, 1], hue=y_pred_test,
palette="coolwarm", style=y_pred_test, legend="full")
plt.title("DBSCAN Clustering on PCA-reduced Data (Test Set)")
plt.xlabel("Principal Component 1")
plt.ylabel("Principal Component 2")
plt.show()

print("✅ DBSCAN Clustering Model Trained, Clusters Identified, and Visualized.")
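
Two choices in the script can be illustrated with a small, self-contained sketch. First, eps=0.5 is a fixed guess; a common heuristic is to sort the distance from each point to its min_samples-th nearest neighbour (counting the point itself) and pick eps near the bend of that curve. Second, scikit-learn's DBSCAN has no predict method, so one possible way to label held-out points without refitting is to give each point the label of its nearest core sample when that sample lies within eps, and -1 otherwise. The helper names k_distance_curve and assign_to_clusters and the synthetic X_demo data are assumptions made for this illustration; they are not part of the original experiment.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.neighbors import NearestNeighbors


def k_distance_curve(X, min_samples=5):
    """Sorted distance from each point to its min_samples-th neighbour.

    The point itself counts as the first neighbour (distance 0), which
    matches how scikit-learn's DBSCAN counts min_samples.
    """
    nn = NearestNeighbors(n_neighbors=min_samples).fit(X)
    distances, _ = nn.kneighbors(X)
    return np.sort(distances[:, -1])


def assign_to_clusters(dbscan, X_new):
    """Label new points by their nearest core sample, or -1 if none is within eps."""
    labels = np.full(len(X_new), -1, dtype=int)
    if len(dbscan.core_sample_indices_) == 0:
        return labels  # no clusters were found, so everything is noise
    core_points = dbscan.components_
    core_labels = dbscan.labels_[dbscan.core_sample_indices_]
    nn = NearestNeighbors(n_neighbors=1).fit(core_points)
    dist, idx = nn.kneighbors(X_new)
    within = dist[:, 0] <= dbscan.eps
    labels[within] = core_labels[idx[within, 0]]
    return labels


# Synthetic stand-in for the PCA-reduced data: two tight, well-separated blobs.
rng = np.random.default_rng(0)
X_demo = np.vstack([
    rng.normal(loc=(0, 0), scale=0.2, size=(100, 2)),
    rng.normal(loc=(5, 5), scale=0.2, size=(100, 2)),
])

# Points whose k-distance is <= eps are exactly the core samples at that eps.
kdist = k_distance_curve(X_demo, min_samples=5)

model = DBSCAN(eps=0.5, min_samples=5).fit(X_demo)
new_points = np.array([[0.0, 0.0], [5.0, 5.0], [20.0, 20.0]])
new_labels = assign_to_clusters(model, new_points)
print("Labels for new points:", new_labels)
```

In the experiment, X_train and X_test would take the place of X_demo and new_points, and plotting kdist with plt.plot would show where the curve bends.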

OUTPUT –
