0% found this document useful (0 votes)

8 views10 pages

Workshop Project Report

This project report details using DBSCAN and K-Means clustering algorithms to segment customers based on their purchasing behavior from transactional record data. The methodology included data preprocessing, implementing the clustering algorithms in Python, comparing model performance using metrics like silhouette score and inertia, and concluding that K-Means demonstrated simplicity and identified well-defined customer clusters while requiring parameter tuning for DBSCAN. Key results and recommendations are provided.

Uploaded by

Rajveer Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views10 pages

Workshop Project Report

Uploaded by

Rajveer Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Workshop Project Report

Year of Submission: - 2023-24

Submit by,
Divyanshu Khandelwal_2115500055_3S_Class roll no. :- 22
Suryansh Agrawal_2115500147_3S_Class roll no. :- 42
Sonal Mittal_2115500140_3S_Class roll no. :- 40
Anshika Singh_2115500024_3S_Class roll no. :- 10

Department of Computer Engineering and Applications

GLA University, Mathura
Project Report: Customer Segmentation through Clustering
Analysis

Introduction:
Customer segmentation is a crucial aspect of marketing
strategies. Clustering algorithms aid in identifying patterns
within data to categorize customers into groups with similar
traits. This project utilizes two clustering algorithms—DBSCAN
and K-Means—to segment customers based on their
purchasing behavior.

Dataset:
The dataset used in this project contains transactional records
from a retail store. It includes attributes such as customer ID,
purchase history, frequency of purchases, and total amount
spent.
Methodology:

Data Preprocessing

1. Data Cleaning: Removing duplicates, handling missing

values, and ensuring data consistency.

2. Feature Selection: Choosing relevant attributes for

clustering, such as purchase frequency and total
spending.

3. Feature Scaling: Normalizing numerical features to ensure

uniformity.
Clustering Algorithms

1. DBSCAN (Density-Based Spatial Clustering of Applications

with Noise)
- DBSCAN identifies clusters based on density. It groups
together points that are closely packed.
- Parameters: Epsilon (ε) and Minimum Points (MinPts).
- Advantages: Robust to outliers and doesn’t require
specifying the number of clusters.
- Implementation: Using scikit-learn's DBSCAN algorithm.

2. K-Means Clustering
- K-Means partitions data into K clusters based on centroids'
proximity.
- Parameters: Number of clusters (K).
- Advantages: Simple, scalable, and efficient for large
datasets.
- Implementation: Utilizing scikit-learn's KMeans algorithm.
Model Building and Evaluation

DBSCAN Model
- Identified clusters based on varying epsilon values and
minimum points.
- Evaluated silhouette scores and visualized clusters using
scatter plots.

K-Means Model
- Explored different K values to find optimal clusters.
- Assessed the inertia scores and visualized clusters using
scatter plots.
Comparative Study

Performance Metrics
- Silhouette Score: Measures the compactness and separation
between clusters. Higher scores indicate better-defined
clusters.
-Inertia: Measures how internally coherent clusters are. Lower
values represent better clustering.
Results and Observations

- DBSCAN: Showed varying performance with different

parameter settings. Achieved silhouette score of X.
- K-Means: Found an optimal number of clusters (K) with
silhouette score of Y and inertia value of Z.
Conclusion

- Both algorithms effectively segmented customers based on

purchasing behavior.
- DBSCAN proved robust to outliers but required careful
parameter tuning.
- K-Means demonstrated simplicity and scalability, providing
well-defined clusters with optimal K values.
Recommendations

- For datasets with clear cluster densities, DBSCAN can be a

suitable choice.
- In scenarios where scalability and simplicity are vital, K-
Means can be preferred.
Future Work

- Experiment with other clustering algorithms like Hierarchical

Clustering or Gaussian Mixture Models.
- Incorporate additional features or external data sources for
more robust segmentation.

---

This report provides an overview of customer segmentation

using DBSCAN and K-Means algorithms, highlighting their
strengths, weaknesses, and comparative performance.

Exercise - Analytical Exposition Text
40% (5)
Exercise - Analytical Exposition Text
3 pages
Customer Segmentation Report
No ratings yet
Customer Segmentation Report
31 pages
Low Code AIML USL Project CreditCardCustomerSegmentation Vijay Borade Aug23
67% (3)
Low Code AIML USL Project CreditCardCustomerSegmentation Vijay Borade Aug23
66 pages
Laptop Issue Form Sample
100% (1)
Laptop Issue Form Sample
3 pages
Mall Customer Segmentation Using Machine Learning Techniques
No ratings yet
Mall Customer Segmentation Using Machine Learning Techniques
17 pages
Segmentation Analysis
No ratings yet
Segmentation Analysis
17 pages
Numericals (Force)
No ratings yet
Numericals (Force)
22 pages
2629 Gembali Maneesh
No ratings yet
2629 Gembali Maneesh
59 pages
I Love Merge
No ratings yet
I Love Merge
56 pages
Customer Segmentation
No ratings yet
Customer Segmentation
7 pages
Preliminar Não Fabricar: Plan View From Above Showing Foundation Hole Drilling
No ratings yet
Preliminar Não Fabricar: Plan View From Above Showing Foundation Hole Drilling
1 page
Interships 10037
No ratings yet
Interships 10037
31 pages
Research Paper Mini Project
No ratings yet
Research Paper Mini Project
13 pages
UNIT II-Segmentation, Positioning, and Product Optimization
No ratings yet
UNIT II-Segmentation, Positioning, and Product Optimization
48 pages
Machine Learning Project Report - Customer Segmentation
No ratings yet
Machine Learning Project Report - Customer Segmentation
2 pages
Customer Segmentation Report
No ratings yet
Customer Segmentation Report
8 pages
288175101
No ratings yet
288175101
51 pages
CUSTOMER - MALL - SEGMENTATION.1 (1) (1) (Autosaved)
No ratings yet
CUSTOMER - MALL - SEGMENTATION.1 (1) (1) (Autosaved)
9 pages
Final
No ratings yet
Final
48 pages
ML Review PPT 2
No ratings yet
ML Review PPT 2
29 pages
Sousa Graphics Gems CryENGINE3
No ratings yet
Sousa Graphics Gems CryENGINE3
59 pages
Machine Learning Project Report - Customer Segmentation
No ratings yet
Machine Learning Project Report - Customer Segmentation
2 pages
Energy Consumption Prediction System
No ratings yet
Energy Consumption Prediction System
21 pages
ML Assignment 1
No ratings yet
ML Assignment 1
23 pages
Internship Report-1
No ratings yet
Internship Report-1
27 pages
Aiml Project Review
No ratings yet
Aiml Project Review
22 pages
Review2 A15
No ratings yet
Review2 A15
14 pages
Customer Segemntation
No ratings yet
Customer Segemntation
26 pages
DW&DM PROJECT Sawan
No ratings yet
DW&DM PROJECT Sawan
14 pages
Report
No ratings yet
Report
22 pages
Customer Segmentation Report
No ratings yet
Customer Segmentation Report
31 pages
Customer Segmentation Using K
No ratings yet
Customer Segmentation Using K
16 pages
Customer Segmentation Using Flying Fox Optimization Algorithm
No ratings yet
Customer Segmentation Using Flying Fox Optimization Algorithm
20 pages
ADS Phase4
No ratings yet
ADS Phase4
21 pages
ML Project Report
No ratings yet
ML Project Report
22 pages
DWDM PPT
No ratings yet
DWDM PPT
13 pages
A Comparative Analyis of K-Means and Its Varinats For Customer Segmentation
No ratings yet
A Comparative Analyis of K-Means and Its Varinats For Customer Segmentation
15 pages
DS MP
No ratings yet
DS MP
18 pages
Customer Segmentation New
No ratings yet
Customer Segmentation New
11 pages
5
No ratings yet
5
14 pages
Da cs-1
No ratings yet
Da cs-1
11 pages
WQD7005 Case Study - 17219402
No ratings yet
WQD7005 Case Study - 17219402
21 pages
Customer Segmentation Using Machine Learning With A Coupon Generator GUI
No ratings yet
Customer Segmentation Using Machine Learning With A Coupon Generator GUI
6 pages
CSUDS Project
No ratings yet
CSUDS Project
13 pages
IJCRT2407525
No ratings yet
IJCRT2407525
9 pages
Ads Phase 4
No ratings yet
Ads Phase 4
12 pages
Customer Segmentation Literature Review 1
No ratings yet
Customer Segmentation Literature Review 1
8 pages
IJCSP23D1055
No ratings yet
IJCSP23D1055
9 pages
Mall Customer Segmentation: Submitted By: Batch No:8
No ratings yet
Mall Customer Segmentation: Submitted By: Batch No:8
17 pages
DWDM Report
No ratings yet
DWDM Report
6 pages
Adm Final
No ratings yet
Adm Final
7 pages
Behavioural Customer Segmentation Based
No ratings yet
Behavioural Customer Segmentation Based
7 pages
Customer Segmentation Using K Means Clustering IJERTV11IS030152
No ratings yet
Customer Segmentation Using K Means Clustering IJERTV11IS030152
6 pages
Mall Customer Segmentation Kalash Daf
No ratings yet
Mall Customer Segmentation Kalash Daf
12 pages
Customer Segmentation Using Data Science
No ratings yet
Customer Segmentation Using Data Science
7 pages
DM Lab Report
No ratings yet
DM Lab Report
13 pages
Universiti Teknologi: Mohamad Amir Salihin
No ratings yet
Universiti Teknologi: Mohamad Amir Salihin
5 pages
JPSP202244
No ratings yet
JPSP202244
7 pages
Honey Research Paper
No ratings yet
Honey Research Paper
4 pages
Customer Categorization by Data Analysis Using Clustering Algorithms of Machine Learning
No ratings yet
Customer Categorization by Data Analysis Using Clustering Algorithms of Machine Learning
4 pages
IEEE Conference Template 5
No ratings yet
IEEE Conference Template 5
5 pages
Chapter 1,2 Report
No ratings yet
Chapter 1,2 Report
5 pages
Customer Segmentation IEEE Report
No ratings yet
Customer Segmentation IEEE Report
2 pages
VL2024250504566 Ast03
No ratings yet
VL2024250504566 Ast03
2 pages
Vlsi Term Paper Topics
100% (1)
Vlsi Term Paper Topics
7 pages
BS en 50164-6-2009
No ratings yet
BS en 50164-6-2009
18 pages
Internal Analysis of FedEx V
100% (1)
Internal Analysis of FedEx V
3 pages
HW SW Codesign
No ratings yet
HW SW Codesign
514 pages
Practice Problems: Paul Dawkins
No ratings yet
Practice Problems: Paul Dawkins
75 pages
ME 111 Thermodynamics 1
No ratings yet
ME 111 Thermodynamics 1
8 pages
CHP 19 Rehman-Et-Al-2022-Developing-The-Integrated-Marketing-Communication-Imc-Through-Social-Media-Sm-The-Modern-Marketing
No ratings yet
CHP 19 Rehman-Et-Al-2022-Developing-The-Integrated-Marketing-Communication-Imc-Through-Social-Media-Sm-The-Modern-Marketing
23 pages
Angular
No ratings yet
Angular
330 pages
CCS 2124 2202 Operating Systems I Course Outline January 2025 Se
No ratings yet
CCS 2124 2202 Operating Systems I Course Outline January 2025 Se
3 pages
LiFePO4 Battery Material For The Production of Lit
No ratings yet
LiFePO4 Battery Material For The Production of Lit
13 pages
Encyclopedia of Giftedness Creativity and Talent 1st Edition Barbara Kerr Download
No ratings yet
Encyclopedia of Giftedness Creativity and Talent 1st Edition Barbara Kerr Download
86 pages
04 CTTC Detailed Syllabus 2016
No ratings yet
04 CTTC Detailed Syllabus 2016
9 pages
Generation Y: Success in The Workplace
No ratings yet
Generation Y: Success in The Workplace
12 pages
CE341 - FCH - Civil Eng Communication Skills
No ratings yet
CE341 - FCH - Civil Eng Communication Skills
2 pages
Motherboard Labeling Designed by Fujitsu
No ratings yet
Motherboard Labeling Designed by Fujitsu
3 pages
18CSP83 - Project Phase 2 - Body
No ratings yet
18CSP83 - Project Phase 2 - Body
11 pages
Chapter4performanceparav2 28student 29
No ratings yet
Chapter4performanceparav2 28student 29
19 pages
Mscds Ad 2025
No ratings yet
Mscds Ad 2025
1 page
Improve English
No ratings yet
Improve English
3 pages
Module #2 Part 4 Gradient Series
No ratings yet
Module #2 Part 4 Gradient Series
15 pages
Malabanan, Edd Brandon G. March 07, 2020: Engr. Senen D. Fenomeno
No ratings yet
Malabanan, Edd Brandon G. March 07, 2020: Engr. Senen D. Fenomeno
17 pages
Đề thi học kì 2 2022 - 2023
No ratings yet
Đề thi học kì 2 2022 - 2023
3 pages
Water Jet Cutter
No ratings yet
Water Jet Cutter
7 pages
Development Length Tables
No ratings yet
Development Length Tables
1 page
Attitude Defines Our Altitude
No ratings yet
Attitude Defines Our Altitude
3 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet

Workshop Project Report

Uploaded by

Workshop Project Report

Uploaded by

Workshop Project Report

Year of Submission: - 2023-24

Department of Computer Engineering and Applications

1. Data Cleaning: Removing duplicates, handling missing

2. Feature Selection: Choosing relevant attributes for

3. Feature Scaling: Normalizing numerical features to ensure

1. DBSCAN (Density-Based Spatial Clustering of Applications

- DBSCAN: Showed varying performance with different

- Both algorithms effectively segmented customers based on

- For datasets with clear cluster densities, DBSCAN can be a

- Experiment with other clustering algorithms like Hierarchical

This report provides an overview of customer segmentation

You might also like