Classification Analysis Report PDF
Classification_Analysis_Report.pdf
5 pages, 501.5 KB
Feb 11, 2025 10:54 PM GMT+5:45
Summary
Report on:
Abstract
Purpose: The purpose of this report is to predict a categorical variable using classification
techniques.
Approach: The dataset chosen for this analysis is the E-commerce Customer Behavior
Dataset, which contains customer purchase history, demographics, and satisfaction ratings.
The steps involved include Exploratory Data Analysis (EDA), model building with Logistic
Regression and Decision Tree Classifier, hyper-parameter optimization, and feature
selection.
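The steps listed above can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the report's actual code: the data is a synthetic stand-in generated with `make_classification`, and the feature counts, parameter grids, and the use of `SelectKBest` for feature selection are all assumptions chosen for the example.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the real dataset (assumption): 3 classes, 8 numeric features.
X, y = make_classification(
    n_samples=600, n_features=8, n_informative=4, n_redundant=0,
    n_classes=3, random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Each candidate is a pipeline (feature selection, then the classifier)
# paired with a small, hypothetical hyper-parameter grid.
candidates = {
    "logistic_regression": (
        Pipeline([
            ("select", SelectKBest(f_classif)),
            ("scale", StandardScaler()),
            ("clf", LogisticRegression(max_iter=1000)),
        ]),
        {"select__k": [4, 8], "clf__C": [0.1, 1.0, 10.0]},
    ),
    "decision_tree": (
        Pipeline([
            ("select", SelectKBest(f_classif)),
            ("clf", DecisionTreeClassifier(random_state=0)),
        ]),
        {"select__k": [4, 8], "clf__max_depth": [3, 5, None]},
    ),
}

test_accuracy = {}
for name, (pipeline, grid) in candidates.items():
    # 5-fold cross-validated grid search on the training split only.
    search = GridSearchCV(pipeline, grid, cv=5, scoring="accuracy")
    search.fit(X_train, y_train)
    # Score the refit best pipeline on the held-out test split.
    test_accuracy[name] = search.score(X_test, y_test)
    print(name, search.best_params_, round(test_accuracy[name], 3))
```

Putting feature selection inside the pipeline means it is refit within each cross-validation fold, so the held-out folds do not influence which features are kept.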
Key Results: Model performance was evaluated using accuracy, precision, recall, and
F1-score. The Decision Tree outperformed Logistic Regression, achieving higher accuracy
and recall.
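For reference, the four metrics named above can be computed from scratch as in the sketch below. The labels are a made-up toy example, not the report's data or results, and the helper names `accuracy` and `per_class_metrics` are hypothetical.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def per_class_metrics(y_true, y_pred, label):
    """Precision, recall, and F1 for one class, treated one-vs-rest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == label and p == label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Toy labels for illustration only.
y_true = ["Satisfied", "Satisfied", "Neutral", "Dissatisfied", "Neutral", "Satisfied"]
y_pred = ["Satisfied", "Neutral", "Neutral", "Dissatisfied", "Satisfied", "Satisfied"]

print("accuracy:", round(accuracy(y_true, y_pred), 3))
for label in ("Satisfied", "Neutral", "Dissatisfied"):
    print(label, per_class_metrics(y_true, y_pred, label))
```

Reporting precision and recall per class, rather than accuracy alone, shows whether a model is neglecting one of the satisfaction levels.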
Conclusion: The classification models performed well in predicting customer satisfaction.
A key insight is the importance of discount offers and total spending in determining
satisfaction levels.
1. Introduction
1.2 Dataset
The dataset used in this analysis is the E-commerce Customer Behavior Dataset, sourced
from an independent e-commerce business. It contains customer purchase behavior,
satisfaction ratings, and demographic data. The analysis aligns with the United Nations
Sustainable Development Goals (UNSDG) in that better customer insights can support
more economically sustainable business practices.
1.3 Objective
The objective of this analysis is to build a predictive classification model that estimates the
customer satisfaction level (Satisfied, Neutral, or Dissatisfied) based on the given features.
2. Methodology
Key insights:
- Discount Applied and Total Spend had a strong influence on satisfaction levels.
4.4 Limitations
- The dataset had class imbalance, which could bias results toward the majority class.
- Only simple models were used; more complex models may perform better.
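One common way to reduce the effect of class imbalance is to weight each class inversely to its frequency. The sketch below computes such weights with the formula n_samples / (n_classes * class_count), which is the same heuristic scikit-learn applies for `class_weight="balanced"`. The label counts are invented for illustration and are not taken from the report's dataset.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical imbalanced label distribution (not the report's data).
labels = ["Satisfied"] * 6 + ["Neutral"] * 3 + ["Dissatisfied"] * 1

weights = balanced_class_weights(labels)
for cls, weight in weights.items():
    print(f"{cls}: {weight:.3f}")
```

Rare classes receive larger weights, so errors on them cost more during training. A dictionary like this can be passed as the `class_weight` argument of scikit-learn's `LogisticRegression` or `DecisionTreeClassifier`.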
4.5 Suggestions for Future Research
- Using ensemble models like Random Forest or Gradient Boosting.
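As a starting point for that suggestion, the sketch below compares the two ensemble models by cross-validated accuracy using scikit-learn. It runs on synthetic data from `make_classification` as a placeholder for the real dataset, and the settings shown are defaults or arbitrary choices, not tuned values.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score

# Synthetic 3-class placeholder data (assumption, not the report's dataset).
X, y = make_classification(
    n_samples=600, n_features=8, n_informative=4, n_redundant=0,
    n_classes=3, random_state=0,
)

ensembles = {
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "gradient_boosting": GradientBoostingClassifier(random_state=0),
}

mean_cv_accuracy = {}
for name, model in ensembles.items():
    # 5-fold cross-validation; the mean fold accuracy summarizes each model.
    scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
    mean_cv_accuracy[name] = scores.mean()
    print(f"{name}: {mean_cv_accuracy[name]:.3f}")
```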