0% found this document useful (0 votes)

14 views9 pages

Classification Analysis Report PDF

The Classification Analysis Report aims to predict customer satisfaction using classification techniques on the E-commerce Customer Behavior Dataset. The analysis includes data preprocessing, model building with Logistic Regression and Decision Tree Classifier, and evaluation, revealing that the Decision Tree model outperformed Logistic Regression with an accuracy of 85%. Key findings indicate that discounts and total spending are significant predictors of customer satisfaction.

Uploaded by

missionkhadka13

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views9 pages

Classification Analysis Report PDF

Uploaded by

missionkhadka13

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Similarity Report

PAPER NAME AUTHOR

Classification_Analysis_Report.pdf -

WORD COUNT CHARACTER COUNT

738 Words 7753 Characters

PAGE COUNT FILE SIZE

5 Pages 501.5KB

SUBMISSION DATE REPORT DATE

Feb 11, 2025 10:54 PM GMT+5:45 Feb 11, 2025 10:54 PM GMT+5:45

64% Overall Similarity

The combined total of all matches, including overlapping sources, for each database.
11% Internet database 4% Publications database
Crossref database Crossref Posted Content database
64% Submitted Works database

Summary
Report on:
19

Classification Analysis Report

Name: Mission khadka

Group: L5CG1
Student ID: 2408838
8
Module Leader: Siman giri
Contents
Classification Analysis Report.................................................................................................................................... 3
Abstract ............................................................................................................................................................................ 3
1. Introduction .............................................................................................................................................................. 3
1.1 Problem Statement ........................................................................................................................................ 3
1.2 Dataset ................................................................................................................................................................. 3
1
1.3 Objective ............................................................................................................................................................. 3
2. Methodology ............................................................................................................................................................. 3
2.1 Data Preprocessing........................................................................................................................................ 3
2.2 Exploratory Data Analysis (EDA) ........................................................................................................... 3
2.3 Model Building ................................................................................................................................................. 4
2.4 Model Evaluation ............................................................................................................................................ 4
2.5 Hyper-parameter Optimization .............................................................................................................. 4
2.6 Feature Selection ............................................................................................................................................ 4
3. Conclusion .................................................................................................................................................................. 4
3.1 Key Findings...................................................................................................................................................... 4
3.2 Final Model ........................................................................................................................................................ 5
3.3 Challenges .......................................................................................................................................................... 5
3.4 Future Work ...................................................................................................................................................... 5
4. Discussion .................................................................................................................................................................. 5
4.1 Model Performance ....................................................................................................................................... 5
4.2 Impact of Hyperparameter Tuning and Feature Selection ....................................................... 5
4.3 Interpretation of Results ............................................................................................................................ 5
4.4 Limitations ......................................................................................................................................................... 5
4.5 Suggestions for Future Research............................................................................................................ 5
Classification Analysis Report

Abstract
5
Purpose: The purpose of this report is to predict a categorical variable using classification
techniques.

Approach: The dataset chosen for this analysis is the E-commerce Customer Behavior
Dataset, which contains customer purchase history, demographics, and satisfaction ratings.
2
The steps involved include Exploratory Data Analysis (EDA), model building with Logistic
Regression and Decision Tree Classifier, hyper-parameter optimization, and feature
selection.

Key Results: The performance of the models was evaluated using accuracy, precision, recall,
and F1-score. The models showed Decision Tree outperformed Logistic Regression with
higher accuracy and recall.
22
Conclusion: The classification models performed well in predicting customer satisfaction,
17
and key insights include the importance of discount offers and total spending in
4
determining satisfaction levels.

1. Introduction

1.1 Problem Statement

The goal of this project is to predict customer satisfaction levels based on their
5
demographic and purchasing behavior.

1.2 Dataset
The dataset used in this analysis is the E-commerce Customer Behavior Dataset, sourced
from an independent e-commerce business. It contains customer purchase behavior,
6
satisfaction ratings, and demographic data. This dataset aligns with the United Nations
Sustainable Development Goals (UNSDG) by improving customer insights for better
economic and sustainable business practices.

1.3 Objective
The objective of this analysis is to build a predictive classification model that estimates the
10
customer satisfaction level (Satisfied, Neutral, or Dissatisfied) based on the given features.

2. Methodology

2.1 Data Preprocessing

The data was cleaned by handling missing values using median imputation, encoding
12
categorical variables, and standardizing numerical features to improve model performance.

2.2 Exploratory Data Analysis (EDA)

EDA was performed using visualizations such as:
- Histograms to analyze numerical feature distributions

- Bar charts to examine class imbalances

23
- Correlation matrices to determine relationships between features

Key insights:

- Discount Applied and Total Spend had a strong influence on satisfaction levels.

- The dataset was slightly imbalanced, with fewer dissatisfied customers.

14
2.3 Model Building
Two classification models were built:

- Logistic Regression (Baseline model)

- Decision Tree Classifier (Non-linear approach)

3
The data was split into 80% training and 20% testing sets, followed by model training and
evaluation.

2.4 Model Evaluation

The model performance was evaluated using:

- Accuracy: Measures overall correctness.

7
- Precision: Measures correctness of positive predictions.

- Recall: Measures how well positive cases are identified.

- F1-Score: Harmonic mean of precision and recall.

2.5 Hyper-parameter Optimization

18
GridSearchCV was used to optimize model parameters:

- Best Decision Tree Parameters: max_depth=5, min_samples_split=4.

16
- Best Logistic Regression Parameters: C=0.1, solver='liblinear'.
3
2.6 Feature Selection
Feature selection was done using Recursive Feature Elimination (RFE), selecting:

- Age, Membership Type, Total Spend, Discount Applied

11
3. Conclusion

3.1 Key Findings

- The Decision Tree model outperformed Logistic Regression with higher accuracy (85%)
and recall (82%).
- Discounts and total spending were the strongest predictors of satisfaction.
13
3.2 Final Model
The best model was Decision Tree, which achieved an accuracy of 85%.
9
3.3 Challenges
Challenges included handling missing data and slight class imbalance, requiring careful
preprocessing.
9
3.4 Future Work
Future improvements include exploring ensemble models like Random Forest or XGBoost
for higher accuracy.
4
4. Discussion

4.1 Model Performance

The Decision Tree model performed best, providing better recall for dissatisfied customers.
15
4.2 Impact of Hyperparameter Tuning and Feature Selection
Fine-tuning max_depth and min_samples_split improved Decision Tree accuracy.
21
4.3 Interpretation of Results
Customers with higher spending and discounts applied were more likely to be satisfied.

4.4 Limitations
- Dataset had class imbalance, which could bias results.

- Simple models were used; more complex models may perform better.
24
4.5 Suggestions for Future Research
20
- Using ensemble models like Random Forest or Gradient Boosting.

- Expanding dataset size for better generalization.

Similarity Report

64% Overall Similarity

Top sources found in the following databases:
11% Internet database 4% Publications database
Crossref database Crossref Posted Content database
64% Submitted Works database

TOP SOURCES
The sources with the highest number of matches within the submission. Overlapping sources will not be
displayed.

University of Wolverhampton on 2025-02-11

1 13%
Submitted works

University of Wolverhampton on 2025-02-09

2 6%
Submitted works

University of Wolverhampton on 2025-02-11

3 4%
Submitted works

University of Wolverhampton on 2025-02-11

4 4%
Submitted works

University of Wolverhampton on 2025-02-11

5 4%
Submitted works

University of Wolverhampton on 2025-02-11

6 4%
Submitted works

University of Wolverhampton on 2025-02-11

7 3%
Submitted works

University of Wolverhampton on 2025-02-11

8 3%
Submitted works

Sources overview
Similarity Report

University of Wolverhampton on 2025-02-11

9 3%
Submitted works

University of Wolverhampton on 2025-02-11

10 2%
Submitted works

University of Wolverhampton on 2025-02-10

11 2%
Submitted works

University of Wolverhampton on 2025-02-10

12 2%
Submitted works

University of Wolverhampton on 2025-02-11

13 2%
Submitted works

University of Wolverhampton on 2025-02-11

14 2%
Submitted works

University of Wolverhampton on 2025-02-10

15 2%
Submitted works

University of Wolverhampton on 2025-02-11

16 1%
Submitted works

University of Wolverhampton on 2025-02-11

17 1%
Submitted works

University of Wolverhampton on 2025-02-11

18 1%
Submitted works

University of Wolverhampton on 2025-02-11

19 1%
Submitted works

medium.com
20 1%
Internet

Sources overview
Similarity Report

University of Wolverhampton on 2025-02-08

21 <1%
Submitted works

University of Wolverhampton on 2025-02-11

22 <1%
Submitted works

University of Wolverhampton on 2025-02-11

23 <1%
Submitted works

University of Wolverhampton on 2025-02-11

24 <1%
Submitted works

Sources overview

Customer Satisfaction Prediction (ML - FA - DA Projects)
No ratings yet
Customer Satisfaction Prediction (ML - FA - DA Projects)
35 pages
Seippel MA Eemcs
No ratings yet
Seippel MA Eemcs
95 pages
Final HLD
No ratings yet
Final HLD
11 pages
Group 14
No ratings yet
Group 14
63 pages
Group11 DL Project Presentation
No ratings yet
Group11 DL Project Presentation
19 pages
Predicting & Optimizing Airlines Customer Satisfaction Using Clas
No ratings yet
Predicting & Optimizing Airlines Customer Satisfaction Using Clas
84 pages
Log-Based Session Profiling and Online Behavioral Prediction in ECommerce Websites
No ratings yet
Log-Based Session Profiling and Online Behavioral Prediction in ECommerce Websites
17 pages
Data Mining Using Python Lab
100% (1)
Data Mining Using Python Lab
63 pages
Assigment2 IndividualReport
No ratings yet
Assigment2 IndividualReport
10 pages
Project Report
No ratings yet
Project Report
11 pages
Iot Based Predicitive Maintenance Management of Medical Equipment
No ratings yet
Iot Based Predicitive Maintenance Management of Medical Equipment
12 pages
Regression Analysis Report PDF
No ratings yet
Regression Analysis Report PDF
7 pages
Presented by
No ratings yet
Presented by
17 pages
22-cp-57 Assignment #02
No ratings yet
22-cp-57 Assignment #02
5 pages
Rahul Jha Capstone Final
No ratings yet
Rahul Jha Capstone Final
14 pages
SM Cpa File 1
No ratings yet
SM Cpa File 1
29 pages
DAM HO 4 Decision Tree 2014 With Answers
No ratings yet
DAM HO 4 Decision Tree 2014 With Answers
24 pages
2015-17 Web
No ratings yet
2015-17 Web
68 pages
22-CP-63 ML Assignment Report
No ratings yet
22-CP-63 ML Assignment Report
5 pages
Majorpptfin
No ratings yet
Majorpptfin
19 pages
Demand Estimation of Full-Cut Promotion On E-Commerce Company
No ratings yet
Demand Estimation of Full-Cut Promotion On E-Commerce Company
73 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
15 pages
PROJECT
No ratings yet
PROJECT
70 pages
PreDefense (201 15 13919)
No ratings yet
PreDefense (201 15 13919)
18 pages
BDMDM Telemarketing
No ratings yet
BDMDM Telemarketing
16 pages
Varshini Phase 2
No ratings yet
Varshini Phase 2
19 pages
Predictive DA 21BCE3925
No ratings yet
Predictive DA 21BCE3925
11 pages
Open Machine Learning With Decision Trees and Random Forests
No ratings yet
Open Machine Learning With Decision Trees and Random Forests
30 pages
Full Text 01
No ratings yet
Full Text 01
26 pages
Decision-Making Under Certainty: Planning Self-Instructional Material 65
100% (1)
Decision-Making Under Certainty: Planning Self-Instructional Material 65
2 pages
BADM
No ratings yet
BADM
9 pages
Iranian Churn
No ratings yet
Iranian Churn
16 pages
Agrihub: Revolutionizing Indian Agricultureusing Machine Learning
No ratings yet
Agrihub: Revolutionizing Indian Agricultureusing Machine Learning
7 pages
Reference Report 2
No ratings yet
Reference Report 2
43 pages
Project Report
No ratings yet
Project Report
12 pages
Lab-Practice-I (ML) - Lab Manual-Vaishali
No ratings yet
Lab-Practice-I (ML) - Lab Manual-Vaishali
57 pages
Naresh PBL
No ratings yet
Naresh PBL
18 pages
Assignment 1 DA - E Oct 2023 V1-1
No ratings yet
Assignment 1 DA - E Oct 2023 V1-1
3 pages
Comparison of Learning Techniques For Prediction of Customer Churn in Telecommunication
No ratings yet
Comparison of Learning Techniques For Prediction of Customer Churn in Telecommunication
36 pages
Quadexp IDS Project
No ratings yet
Quadexp IDS Project
22 pages
Chapter 3
No ratings yet
Chapter 3
7 pages
Erum
No ratings yet
Erum
18 pages
Final Report
No ratings yet
Final Report
38 pages
Daa 01
No ratings yet
Daa 01
11 pages
ML Project Stage 2
No ratings yet
ML Project Stage 2
9 pages
Bda Review
No ratings yet
Bda Review
13 pages
Customer Churn Prediction Capstone Projectdocx
No ratings yet
Customer Churn Prediction Capstone Projectdocx
11 pages
Hefner Et Al-2014-Journal of Forensic Sciences-3
No ratings yet
Hefner Et Al-2014-Journal of Forensic Sciences-3
8 pages
Heart Attack Prediction Using Machine Learning
No ratings yet
Heart Attack Prediction Using Machine Learning
11 pages
Unit 4 Classification
No ratings yet
Unit 4 Classification
15 pages
1822 B.E Cse Batchno 242
No ratings yet
1822 B.E Cse Batchno 242
54 pages
Risk Analysis: Managerial Economics
No ratings yet
Risk Analysis: Managerial Economics
31 pages
Report
No ratings yet
Report
17 pages
Data Mining
No ratings yet
Data Mining
7 pages
A Computational Modelfor Predicting Customer Behaviors Using
No ratings yet
A Computational Modelfor Predicting Customer Behaviors Using
8 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
Da Unit-4
No ratings yet
Da Unit-4
43 pages
Advanced Machine Learning Model To Detect Spam On Instagram
No ratings yet
Advanced Machine Learning Model To Detect Spam On Instagram
6 pages
Customer Churn Analysis and Prediction
No ratings yet
Customer Churn Analysis and Prediction
4 pages
Ex 5.1 Customer Behaviour Prediction
No ratings yet
Ex 5.1 Customer Behaviour Prediction
8 pages
Decision Analysis
No ratings yet
Decision Analysis
37 pages
Obesity Disease Risk Prediction Using Machine Learning
No ratings yet
Obesity Disease Risk Prediction Using Machine Learning
10 pages
Random Forest Model
No ratings yet
Random Forest Model
16 pages
6 DM
No ratings yet
6 DM
2 pages
AC 1103 Presentations
No ratings yet
AC 1103 Presentations
10 pages
Regression Trees, Step by Step. Learn How To Build Regression Trees and - by Ivo Bernardo - Aug, 2022 - Towards Data Science
No ratings yet
Regression Trees, Step by Step. Learn How To Build Regression Trees and - by Ivo Bernardo - Aug, 2022 - Towards Data Science
36 pages
Grade Project
No ratings yet
Grade Project
1 page
BUSI 2013 Unit 1-10 Notes
No ratings yet
BUSI 2013 Unit 1-10 Notes
10 pages
Chapter Non-Parametric Methods
No ratings yet
Chapter Non-Parametric Methods
9 pages
Bonna Akter A Machine Learning Approach To Detect The
No ratings yet
Bonna Akter A Machine Learning Approach To Detect The
5 pages
Demand Forecasting
No ratings yet
Demand Forecasting
10 pages
RCS Prediction of A Target Based On The Machine Learning
No ratings yet
RCS Prediction of A Target Based On The Machine Learning
3 pages
Jurnal Internasional
No ratings yet
Jurnal Internasional
6 pages
IT-510 Module 4 Part Two
No ratings yet
IT-510 Module 4 Part Two
3 pages
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
Core Concepts in Statistical Learning
From Everand
Core Concepts in Statistical Learning
Tushar Gulati
No ratings yet
Introduction to Business Analytics
From Everand
Introduction to Business Analytics
Dwaipayan Sethi
No ratings yet
Intricating And Navigating The Resilience With Multidiciplinary Approach Towards Economical Sustainable Growth
From Everand
Intricating And Navigating The Resilience With Multidiciplinary Approach Towards Economical Sustainable Growth
Dr.Veda D. Malagatti
No ratings yet
SAS Data Analytic Development: Dimensions of Software Quality
From Everand
SAS Data Analytic Development: Dimensions of Software Quality
Troy Martin Hughes
No ratings yet
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
Agile by Design: An Implementation Guide to Analytic Lifecycle Management
From Everand
Agile by Design: An Implementation Guide to Analytic Lifecycle Management
Rachel Alt-Simmons
No ratings yet
Big Data and Machine Learning in Quantitative Investment
From Everand
Big Data and Machine Learning in Quantitative Investment
Tony Guida
No ratings yet
Introduction to Data Analytics
From Everand
Introduction to Data Analytics
Dan Martin
No ratings yet
Manufacturing: Engineering, Management and Marketing
From Everand
Manufacturing: Engineering, Management and Marketing
S.O.T Ogaji
No ratings yet
The Analytics Lifecycle Toolkit: A Practical Guide for an Effective Analytics Capability
From Everand
The Analytics Lifecycle Toolkit: A Practical Guide for an Effective Analytics Capability
Gregory S. Nelson
No ratings yet
Smart Business Problems and Analytical Hints
From Everand
Smart Business Problems and Analytical Hints
Zemelak Goraga
No ratings yet
Unlocking Statistics for the Social Sciences
From Everand
Unlocking Statistics for the Social Sciences
Norma Sinclair
No ratings yet
Expert Cube Development with Microsoft SQL Server 2008 Analysis Services
From Everand
Expert Cube Development with Microsoft SQL Server 2008 Analysis Services
Alberto Ferrari
5/5 (2)
Blog Smarter, Not Harder: SEO, Blogging, and AI Strategies to Skyrocket Your Traffic
From Everand
Blog Smarter, Not Harder: SEO, Blogging, and AI Strategies to Skyrocket Your Traffic
Jay Nans
No ratings yet
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
From Everand
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
Brian Knight
3/5 (1)

Classification Analysis Report PDF

Uploaded by

Classification Analysis Report PDF

Uploaded by

Similarity Report

PAPER NAME AUTHOR

WORD COUNT CHARACTER COUNT

738 Words 7753 Characters

PAGE COUNT FILE SIZE

SUBMISSION DATE REPORT DATE

64% Overall Similarity

Classification Analysis Report

Name: Mission khadka

1.1 Problem Statement

2.1 Data Preprocessing

2.2 Exploratory Data Analysis (EDA)

- Bar charts to examine class imbalances

- The dataset was slightly imbalanced, with fewer dissatisfied customers.

- Logistic Regression (Baseline model)

- Decision Tree Classifier (Non-linear approach)

2.4 Model Evaluation

- Accuracy: Measures overall correctness.

- Recall: Measures how well positive cases are identified.

- F1-Score: Harmonic mean of precision and recall.

2.5 Hyper-parameter Optimization

- Best Decision Tree Parameters: max_depth=5, min_samples_split=4.

- Age, Membership Type, Total Spend, Discount Applied

3.1 Key Findings

4.1 Model Performance

- Expanding dataset size for better generalization.

64% Overall Similarity

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-09

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-10

University of Wolverhampton on 2025-02-10

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-10

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-08

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

University of Wolverhampton on 2025-02-11

You might also like