DMV 6 Output

sppu dmv practical 6 output

Uploaded by

sachin ahankari

DMV PRACTICAL 6

Data Aggregation
Problem Statement: Analyzing Sales Performance by Region in a Retail Company
Dataset: "customer_shopping_data.csv"
Description: The dataset contains information about sales transactions in a retail company. It includes attributes such as transaction date, product category, quantity sold, and sales amount. The goal is to perform data aggregation to analyze the sales performance by region and identify the top-performing regions.
Tasks to Perform:
1. Import the "customer_shopping_data.csv" dataset.
2. Explore the dataset to understand its structure and content.
3. Identify the relevant variables for aggregating sales data, such as region, sales amount, and product category.
4. Group the sales data by region and calculate the total sales amount for each region.
5. Create bar plots or pie charts to visualize the sales distribution by region.
6. Identify the top-performing regions based on the highest sales amount.
7. Group the sales data by region and product category to calculate the total sales amount for each combination.
8. Create stacked bar plots or grouped bar plots to compare the sales amounts across different regions and product categories.
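Task 2 (exploring the dataset) can be sketched as follows. This is a minimal illustration, not part of the original practical: the `sample` rows are made up so that the block runs without the CSV, and `explore` is a hypothetical helper name. With the real file, `sample` would be replaced by the DataFrame returned by `pd.read_csv`.

```python
import pandas as pd

# Made-up rows for illustration only; the column names mirror those
# that appear in this practical's output.
sample = pd.DataFrame({
    "invoice_no": ["I001", "I002", "I003", "I004"],
    "category": ["Books", "Clothing", "Books", "Toys"],
    "quantity": [2, 1, 3, 1],
    "price": [30.0, 300.0, 45.0, 35.0],
    "shopping_mall": ["Kanyon", "Metrocity", "Kanyon", "Kanyon"],
})


def explore(df):
    """Return a small summary of a DataFrame's structure (hypothetical helper)."""
    return {
        "shape": df.shape,                                # (rows, columns)
        "dtypes": df.dtypes.astype(str).to_dict(),        # column -> dtype name
        "missing": df.isna().sum().to_dict(),             # column -> count of missing values
        "numeric_columns": df.select_dtypes("number").columns.tolist(),
    }


summary = explore(sample)
print(summary["shape"])
print(summary["numeric_columns"])
print(sample.head())  # first few rows, to eyeball the content
```

The numeric columns are the candidates for aggregation, while columns such as the mall and the category serve as grouping keys.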
PYTHON CODE :
import pandas as pd
import matplotlib.pyplot as plt

# Ensure this path points to the actual location of your CSV file
df = pd.read_csv("customer_shopping_data.csv")

# To check the count of records grouped by region/branch of the mall

print(df.groupby("shopping_mall").count())

# To check the count of records grouped by the product categories

print(df.groupby("category").count())

# Total sales for each mall branch
# numeric_only=True restricts the sum to the numeric columns (age, quantity, price)
# and avoids the FutureWarning about the changing default of numeric_only

branch_sales = df.groupby("shopping_mall").sum(numeric_only=True)

# Total sales for each category of product

category_sales = df.groupby("category").sum(numeric_only=True)

# To get the top performing branches

top_branches = branch_sales.sort_values(by="price", ascending=False)

# To get the top selling categories

top_categories = category_sales.sort_values(by="price", ascending=False)

# To get total sales for each combination of branch and product_category

combined_branch_category_sales = df.groupby(["shopping_mall", "category"]).sum(numeric_only=True)

# Pie chart for sales by branch

plt.pie(branch_sales["price"], labels=branch_sales.index, autopct='%1.1f%%',
shadow=True, startangle=140)
plt.title('Sales by Branch')
plt.axis('equal') # Equal aspect ratio ensures that pie is drawn as a circle
plt.show()

# Pie chart for sales by product category

plt.pie(category_sales["price"], labels=category_sales.index, autopct='%1.1f%%',
        shadow=True, startangle=140)
plt.title('Sales by Product Category')
plt.axis('equal') # Equal aspect ratio ensures that pie is drawn as a circle
plt.show()

# Pivot table for combined sales by branch and category

combined_pivot = df.pivot_table(index="shopping_mall", columns="category",
                                values="price", aggfunc="sum")

# Grouped bar chart for sales of different categories at different branches

combined_pivot.plot(kind='bar', figsize=(10, 6))
plt.title('Sales of Different Categories at Different Branches')
plt.ylabel('Sales')
plt.show()
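Task 8 also allows stacked bar plots, whereas the code above draws a grouped one. A minimal sketch of the stacked variant follows, assuming the same pivot layout (malls as rows, categories as columns). The `sample` rows are made up for illustration, and the `Agg` backend is selected only so the block runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs anywhere; omit in a notebook
import matplotlib.pyplot as plt
import pandas as pd

# Made-up rows for illustration only; not taken from customer_shopping_data.csv.
sample = pd.DataFrame({
    "shopping_mall": ["Kanyon", "Metrocity", "Kanyon", "Kanyon"],
    "category": ["Books", "Clothing", "Books", "Toys"],
    "price": [30.0, 300.0, 45.0, 35.0],
})

# One row per mall, one column per category; combinations with no sales become 0.
pivot = sample.pivot_table(index="shopping_mall", columns="category",
                           values="price", aggfunc="sum", fill_value=0)

# stacked=True piles the categories on top of each other within each mall's bar,
# so the bar height is the mall's total and the segments show the category mix.
ax = pivot.plot(kind="bar", stacked=True, figsize=(10, 6))
ax.set_title("Sales by Branch, Stacked by Category")
ax.set_ylabel("Sales")
plt.tight_layout()
```

A stacked plot makes the total per mall easy to compare, while the grouped plot makes it easier to compare one category across malls.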
OUTPUT :
                   invoice_no  customer_id  gender    age  category  quantity  \
shopping_mall
Cevahir AVM              4991         4991    4991   4991      4991      4991
Emaar Square Mall        4811         4811    4811   4811      4811      4811
Forum Istanbul           4947         4947    4947   4947      4947      4947
Istinye Park             9781         9781    9781   9781      9781      9781
Kanyon                  19823        19823   19823  19823     19823     19823
Mall of Istanbul        19943        19943   19943  19943     19943     19943
Metrocity               15011        15011   15011  15011     15011     15011
Metropol AVM            10161        10161   10161  10161     10161     10161
Viaport Outlet           4914         4914    4914   4914      4914      4914
Zorlu Center             5075         5075    5075   5075      5075      5075

                   price  payment_method  invoice_date
shopping_mall
Cevahir AVM         4991            4991          4991
Emaar Square Mall   4811            4811          4811
Forum Istanbul      4947            4947          4947
Istinye Park        9781            9781          9781
Kanyon             19823           19823         19823
Mall of Istanbul   19943           19943         19943
Metrocity          15011           15011         15011
Metropol AVM       10161           10161         10161
Viaport Outlet      4914            4914          4914
Zorlu Center        5075            5075          5075
                 invoice_no  customer_id  gender    age  quantity  price  \
category
Books                  4981         4981    4981   4981      4981   4981
Clothing              34487        34487   34487  34487     34487  34487
Cosmetics             15097        15097   15097  15097     15097  15097
Food & Beverage       14776        14776   14776  14776     14776  14776
Shoes                 10034        10034   10034  10034     10034  10034
Souvenir               4999         4999    4999   4999      4999   4999
Technology             4996         4996    4996   4996      4996   4996
Toys                  10087        10087   10087  10087     10087  10087

                 payment_method  invoice_date  shopping_mall
category
Books                      4981          4981           4981
Clothing                  34487         34487          34487
Cosmetics                 15097         15097          15097
Food & Beverage           14776         14776          14776
Shoes                     10034         10034          10034
Souvenir                   4999          4999           4999
Technology                 4996          4996           4996
Toys                      10087         10087          10087
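The count() output above repeats the same number in every column, because count() reports non-missing values per column and none are missing here. If only the number of records per group is wanted, a single column is enough. A minimal sketch, using made-up `sample` rows rather than the real dataset:

```python
import pandas as pd

# Made-up rows for illustration only; not taken from customer_shopping_data.csv.
sample = pd.DataFrame({
    "shopping_mall": ["Kanyon", "Metrocity", "Kanyon", "Kanyon"],
    "category": ["Books", "Clothing", "Books", "Toys"],
    "price": [30.0, 300.0, 45.0, 35.0],
})

# size() returns one number per group: the row count, regardless of missing values.
records_per_mall = sample.groupby("shopping_mall").size()

# value_counts() gives the same kind of tally for a single column,
# sorted from the most to the least frequent value.
records_per_category = sample["category"].value_counts()

print(records_per_mall)
print(records_per_category)
```

Note that size() counts rows, while count() skips missing values, so the two can differ on data with gaps.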
Warnings raised by the calls to sum() without numeric_only:
C:\Users\AI&DS\AppData\Local\Temp\ipykernel_12148\2859295099.py:14: FutureWarning: The default value of numeric_only in DataFrameGroupBy.sum is deprecated. In a future version, numeric_only will default to False. Either specify numeric_only or select only columns which should be valid for the function.
  branch_sales = df.groupby("shopping_mall").sum()
C:\Users\AI&DS\AppData\Local\Temp\ipykernel_12148\2859295099.py:17: FutureWarning: The default value of numeric_only in DataFrameGroupBy.sum is deprecated. In a future version, numeric_only will default to False. Either specify numeric_only or select only columns which should be valid for the function.
  category_sales = df.groupby("category").sum()
C:\Users\AI&DS\AppData\Local\Temp\ipykernel_12148\2859295099.py:26: FutureWarning: The default value of numeric_only in DataFrameGroupBy.sum is deprecated. In a future version, numeric_only will default to False. Either specify numeric_only or select only columns which should be valid for the function.
  combined_branch_category_sales = df.groupby(["shopping_mall", "category"]).sum()
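The FutureWarning names two remedies: specify numeric_only, or select only the columns that should be summed. A minimal sketch of the second option follows, which also reads off the top-performing branches (task 6). The `sample` rows are made up for illustration, and taking the top 2 is an arbitrary choice.

```python
import pandas as pd

# Made-up rows for illustration only; not taken from customer_shopping_data.csv.
sample = pd.DataFrame({
    "shopping_mall": ["Kanyon", "Metrocity", "Kanyon", "Kanyon"],
    "category": ["Books", "Clothing", "Books", "Toys"],
    "payment_method": ["Cash", "Credit Card", "Cash", "Debit Card"],
    "price": [30.0, 300.0, 45.0, 35.0],
})

# Selecting "price" before summing means no non-numeric column is ever aggregated,
# so the numeric_only default does not matter. The result is a Series.
branch_totals = sample.groupby("shopping_mall")["price"].sum()

# Sort from highest to lowest total and keep the first two branches.
top_branches = branch_totals.sort_values(ascending=False).head(2)

print(top_branches)
```

Selecting the column first also keeps the result small: only the totals that are actually plotted or ranked are computed.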
