Solution

Step 1: Identify at least 10 major KPIs that would be useful for the business

Based on the dataset, I have identified the following 10 major KPIs that would be useful for the business (a short Pandas sketch after the list shows how a few of them can be computed):

 Sales Revenue: Total sales revenue generated by the supermarket chain

 Customer Count: Number of unique customers who have made purchases

 Average Order Value (AOV): Average amount spent by customers in a single transaction

 Customer Retention Rate: Percentage of customers who have made repeat purchases

 Product Category Sales: Sales revenue generated by each product category (e.g. dairy,
bakery, etc.)

 Top-Selling Products: Products that have generated the highest sales revenue

 Region-wise Sales: Sales revenue generated by each region (e.g. Chennai, Coimbatore, etc.)

 State-wise Sales: Sales revenue generated by each state (e.g. Tamil Nadu, Karnataka, etc.)

 Gross Margin: Difference between revenue and cost of goods sold

 Inventory Turnover: Number of times inventory is sold and replaced within a given period
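As a quick illustration, several of these KPIs can be computed in a few lines of Pandas. This is a minimal sketch assuming the dataset has 'Sales', 'Customer Name', 'Order ID', and 'Region' columns; the names should be adjusted to match the actual file.

import pandas as pd

df = pd.read_csv('Supermart Grocery Sales - Retail Analytics Dataset.csv')

# Sales Revenue: total revenue generated
sales_revenue = df['Sales'].sum()

# Customer Count: number of unique customers
customer_count = df['Customer Name'].nunique()

# Average Order Value: revenue per distinct order
aov = df.groupby('Order ID')['Sales'].sum().mean()

# Region-wise Sales: revenue broken down by region
region_sales = df.groupby('Region')['Sales'].sum().sort_values(ascending=False)

print(sales_revenue, customer_count, round(aov, 2))
print(region_sales)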

Step 2: Load the dataset and perform Data Preprocessing, Outlier Detection, and Exploratory Data
Analysis

To perform data preprocessing, outlier detection, and exploratory data analysis, I will use Python with the Pandas, NumPy, SciPy, and Matplotlib libraries.

import pandas as pd
import numpy as np
from scipy import stats
import matplotlib.pyplot as plt

# Load the dataset
df = pd.read_csv('Supermart Grocery Sales - Retail Analytics Dataset.csv')

# Data Preprocessing
# Check for missing values
print(df.isnull().sum())

# Handle missing values (impute numeric columns with the mean;
# plain df.mean() fails on non-numeric columns in recent Pandas)
df.fillna(df.mean(numeric_only=True), inplace=True)

# Outlier Detection
# Use the Z-score method on the numeric columns only
numeric_cols = df.select_dtypes(include=np.number)
z_scores = np.abs(stats.zscore(numeric_cols))
print(df[(z_scores > 3).any(axis=1)])  # rows with any |z| > 3

# Exploratory Data Analysis
# Summary statistics
print(df.describe())

# Visualize sales revenue by product category
# ('Sales' is assumed to be the revenue column)
df.groupby('Item Category')['Sales'].sum().plot(kind='bar')
plt.ylabel('Sales Revenue')
plt.show()

Output:

 Summary statistics of the dataset

 Bar chart showing the distribution of sales revenue by product category
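If the flagged rows should be removed rather than just inspected, one common convention (an assumption here, not part of the original solution) is to keep only rows whose numeric z-scores are all below 3:

# Drop rows where any numeric value lies more than 3 standard deviations out
numeric_cols = df.select_dtypes(include=np.number)
z_scores = np.abs(stats.zscore(numeric_cols))
df_clean = df[(z_scores < 3).all(axis=1)]
print('Removed', len(df) - len(df_clean), 'outlier rows')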

Step 3: Use the Association Rule Mining technique to identify the items frequently bought together and their demand

To perform association rule mining, I will use the Apriori algorithm implemented in the Python
library mlxtend.

from mlxtend.preprocessing import TransactionEncoder
from mlxtend.frequent_patterns import apriori, association_rules

# Convert the dataset to a transactional format: one basket of items
# per order ('Order ID' is assumed to be the transaction identifier)
transactions = df.groupby('Order ID')['Item Name'].apply(list).tolist()

# One-hot encode the baskets, since apriori expects a boolean DataFrame
te = TransactionEncoder()
basket = pd.DataFrame(te.fit(transactions).transform(transactions),
                      columns=te.columns_)

# Perform association rule mining
frequent_itemsets = apriori(basket, min_support=0.01, use_colnames=True)
rules = association_rules(frequent_itemsets, metric='confidence', min_threshold=0.5)

# Print the top 10 rules
print(rules.head(10))

Output:

 Top 10 association rules showing which items are frequently bought together and their demand (support and confidence)
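To surface the strongest relationships first, the rules can also be sorted by lift (how much more often the items co-occur than chance alone would predict), with support serving as a rough proxy for demand:

# Sort rules by lift so the strongest associations appear first
top_rules = rules.sort_values('lift', ascending=False)
print(top_rules[['antecedents', 'consequents', 'support', 'confidence', 'lift']].head(10))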

Step 4: Use Classification techniques to develop a model and predict the item categories and sub-
categories that would provide the highest sales and profit region-wise/state-wise

To perform classification, I will use the Scikit-learn library in Python.

from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, classification_report

# Prepare the dataset for classification: drop the targets, then
# one-hot encode the remaining categorical columns, since random
# forests in scikit-learn require numeric inputs
X = pd.get_dummies(df.drop(['Item Category', 'Item Sub-Category'], axis=1))
y = df['Item Category']

# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Train a random forest classifier
rfc = RandomForestClassifier(n_estimators=100, random_state=42)
rfc.fit(X_train, y_train)

# Make predictions on the testing set
y_pred = rfc.predict(X_test)

# Evaluate the model
print('Accuracy:', accuracy_score(y_test, y_pred))
print('Classification Report:')
print(classification_report(y_test, y_pred))

Output:

 Accuracy and classification report of the random forest classifier
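The classifier predicts item categories, but the business question also asks which categories and sub-categories drive the highest sales and profit region-wise and state-wise. A direct aggregation answers that; this sketch assumes the dataset has 'Region', 'State', 'Sales', and 'Profit' columns:

# Rank item categories by total sales and profit within each region
region_perf = (df.groupby(['Region', 'Item Category'])[['Sales', 'Profit']]
               .sum()
               .sort_values('Profit', ascending=False))
print(region_perf.groupby(level='Region').head(3))  # top 3 categories per region

# The same idea applies state-wise, at sub-category level if desired
state_perf = df.groupby(['State', 'Item Sub-Category'])[['Sales', 'Profit']].sum()
print(state_perf.sort_values('Sales', ascending=False).head(10))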

Step 5: Modify the dataset to incorporate the Non-Volatile feature of a data warehouse

Non-volatile means that once data enters the warehouse it is never overwritten or deleted; changes are appended as new records instead. To incorporate this, I will create a new column, Version, to track changes to the data.

# Create a new column 'Version' to track changes
df['Version'] = 1

# Save the modified dataset to a new CSV file
df.to_csv('Supermart Grocery Sales - Retail Analytics Dataset_Modified.csv', index=False)
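To make the non-volatile behavior concrete, later updates should be appended as new rows with an incremented version rather than overwriting existing records, so history is preserved. A minimal sketch (the changed_rows frame and the 'Order ID' key are hypothetical):

# Non-volatile update: never overwrite, always append a new version
def append_new_version(warehouse, changed_rows):
    changed_rows = changed_rows.copy()
    # Look up the latest version per record ('Order ID' is the assumed key)
    latest = warehouse.groupby('Order ID')['Version'].max()
    changed_rows['Version'] = changed_rows['Order ID'].map(latest).fillna(0) + 1
    # Append instead of updating in place, preserving all history
    return pd.concat([warehouse, changed_rows], ignore_index=True)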
