How To Create Data Analytics Slides

The document presents an exploratory data analysis (EDA) highlighting sales trends across various product categories, revealing that technology products, particularly phones, have the highest sales. It also discusses seasonal sales patterns, indicating peaks during the last quarter and specific months like November and December. Additionally, the document covers data preprocessing steps, including balancing datasets through oversampling and undersampling techniques to improve model performance.

Uploaded by

Unicorn Spider

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views3 pages

How To Create Data Analytics Slides

Uploaded by

Unicorn Spider

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

EXPLORATORY DATA ANALYSIS

• We have also performed the category & product wise split up in fig(a)
• From EDA we have shown weekly sales graph in fig(b) which depicts the “Seasonality factors in sales”

CATEGORY BREAK-UP OF PRODUCTS fig(a)

• Chairs, Storage and Phones

SEASONALITY FACTORS IN SALES
are having the highest sale in There is general trend of high sales in the last quarter of the year
the Furniture, Office Supply
and Technology category During the festive season, sales peak in month of Nov & Dec
respectively.

o Chairs : $15,01,682
Due to summer discount season, there is peak in June of every year
o Storage : $11,26,813
o Phones : $17,06,824 Sales are at their lowest in a particular year in the month of July

• Technology segment has the Q3 & Q4 have more sales compared to Q1 & Q2
biggest chunk in sales of
$47,44,557
SEASONALITY FACTORS IN SALES fig(b)
VIDEO DISPLAYING DATAPOINTS IN 3D FIELD
The video shows the groups
plotted in 3D space where they
are clustered in 5 regions
• X-axis : Profit
• Y-axis : Sales
• Z-axis : Region
They are clustered on the basis
of RFM groups which we will
discuss later.
EXPLORATORY DATA ANALYSIS
VISUALIZING THE DATA AND HIGHLIGHTING THE STRIKING INSIGHTS

1 Ideal Duration 2 Price of Courses 3 Preferred Programs

Chart displaying duration of completed courses Chord Chart revealing Price ranges of courses Force Directed Chart revealing Preferred course subjects

MOST COMPLETED COURSES WERE BELOW MOST FREQUENT PURCHASES FOR COURSES HIGHEST PURCHASES MADE FOR MARKETING,
15 HOURS IN AVERAGE DURATION WORTH RS. 1,000 – RS. 5,000 FINANCE, IT AND HR COURSES

• An exploratory data analysis revealed that • To increase purchases company must release • The Force Directed Chart shows Marketing &
highest percentage of completed courses were in courses in range of Rs. 1,000 to Rs. 5,000 Technical courses most preferred
range of 10-15 hours
• Most customers willing to spend maximum of Rs. • Customer survey analysis revealed few
• Highest percentage of purchased courses, had 8,000 on courses but not more as many did not purchasers interested in courses outside their
average duration of less than 5 hours purchase courses above Rs. 8,000 field of interest or scope of job
DATA-PREPROCESSING
COMBINED DATA FROM VARIOUS FILES TO CREATE A SINGLE DATASET

1 MAJOR STEPS INVOLVED 2 BALANCING THE DATASET

Dropped irrelevant/repeated columns e.g. “Over 18”, “Std Hours”.
UNDERSAMPLING Data 0’s 1’s Total

Imputed Data
3,605 695 4,300
(Unbalanced)
Oversampled
3,605 3,605 7,210
Data
Undersampled
695 695 1,390
Data

UNDERSAMPLING & OVERSAMPLING

TO BALANCE THE DATASET
OVERSAMPLING

• Perform Label Encoding to convert ordinal data into Interval data • We balanced the dataset (attrition) using Imblearn’s -
Oversampling (RandomSampler) & Undersampling (N-v2)
• Multi-collinearity check & VIF to remove variables with high degree of
correlation among them which are dropped • Models performed good for under sampled data

• OLS logit regression performed to find statistically significant variables • Other methods (undersampling): condensed nearest neighbor,
(p-value < 5%) which are dropped nearmiss v1, v2, v3 were tried & nearmiss v2 performed best

• Data complexity is reduced by reducing the number of independent • Scaling numerical column decreases the spread & variance and
variables from 25 to 13 increase the computational efficiency for later models

Marketing & Retail Analytics-Milestone 1 - 300521
71% (14)
Marketing & Retail Analytics-Milestone 1 - 300521
18 pages
GST Notes by Riddhi Baghmar
100% (2)
GST Notes by Riddhi Baghmar
283 pages
Safe Work Method Statement: Solar PV System Installation
100% (6)
Safe Work Method Statement: Solar PV System Installation
13 pages
Lead Scoring Case Study - Summary
80% (5)
Lead Scoring Case Study - Summary
2 pages
Supermarket Sales Analysis and Prediction
100% (1)
Supermarket Sales Analysis and Prediction
34 pages
EM 1110-2-5025 - Dredging and Dredged Material Disposal - Web
No ratings yet
EM 1110-2-5025 - Dredging and Dredged Material Disposal - Web
94 pages
Leadership On The Line
No ratings yet
Leadership On The Line
7 pages
Templatepptpiechaart
No ratings yet
Templatepptpiechaart
3 pages
Sales Forecasting
100% (1)
Sales Forecasting
19 pages
MRA Project - Shehroz Khan
67% (3)
MRA Project - Shehroz Khan
19 pages
Abhishek Singh Report
No ratings yet
Abhishek Singh Report
9 pages
Complete Summer Internships Report
No ratings yet
Complete Summer Internships Report
37 pages
Rogers
No ratings yet
Rogers
5 pages
Data Analysis Portfolio Sample
No ratings yet
Data Analysis Portfolio Sample
25 pages
MKTM 697 Ca 1
No ratings yet
MKTM 697 Ca 1
11 pages
Data Analysis Portfolio
No ratings yet
Data Analysis Portfolio
21 pages
Q1063255 Jeromebasil VSTT Set Assignment
No ratings yet
Q1063255 Jeromebasil VSTT Set Assignment
24 pages
Delhi Institute of Higher Education: Summer Training Project Report
No ratings yet
Delhi Institute of Higher Education: Summer Training Project Report
35 pages
Final BDM
No ratings yet
Final BDM
22 pages
Data Analytics and Its Processess - Models - Methods
No ratings yet
Data Analytics and Its Processess - Models - Methods
55 pages
Sales Performance
No ratings yet
Sales Performance
41 pages
Day 1 Resources
No ratings yet
Day 1 Resources
6 pages
Data Presentation Tips
No ratings yet
Data Presentation Tips
28 pages
Ayush Report
No ratings yet
Ayush Report
17 pages
Data Analysis Report
No ratings yet
Data Analysis Report
27 pages
Report
No ratings yet
Report
17 pages
Stats Project 1
No ratings yet
Stats Project 1
27 pages
ZFL KM ICT702 Assessment 4
No ratings yet
ZFL KM ICT702 Assessment 4
7 pages
UNIT 05 - Data Science - Final
No ratings yet
UNIT 05 - Data Science - Final
36 pages
Justin Saunders Findings - v1
No ratings yet
Justin Saunders Findings - v1
14 pages
Olist Marketing Analytics
No ratings yet
Olist Marketing Analytics
5 pages
Assigment 2
No ratings yet
Assigment 2
12 pages
MAT4200 Documentation
No ratings yet
MAT4200 Documentation
62 pages
Report Shawari
No ratings yet
Report Shawari
10 pages
Sample Insights & Recommendation Slides
No ratings yet
Sample Insights & Recommendation Slides
3 pages
BI Class Test-2-AKEY
No ratings yet
BI Class Test-2-AKEY
12 pages
Hihi 3
No ratings yet
Hihi 3
4 pages
Data Analysis and Data Science Task - 2
No ratings yet
Data Analysis and Data Science Task - 2
3 pages
Sales Analysis and Forecasting in Shopping Mart: Amit Kumar, Kartik Sharma, Anup Singh, Dravid Kumar
No ratings yet
Sales Analysis and Forecasting in Shopping Mart: Amit Kumar, Kartik Sharma, Anup Singh, Dravid Kumar
4 pages
Data Visualization Techniques 1
No ratings yet
Data Visualization Techniques 1
27 pages
Data Analysis and Visualisation
No ratings yet
Data Analysis and Visualisation
3 pages
Approaches To Sales Forecasting
No ratings yet
Approaches To Sales Forecasting
3 pages
Final Chart It 1
No ratings yet
Final Chart It 1
18 pages
BRM Project
No ratings yet
BRM Project
65 pages
S&D Distribution Management
No ratings yet
S&D Distribution Management
49 pages
Coding and Communication in Statistics Presentation 2024
No ratings yet
Coding and Communication in Statistics Presentation 2024
11 pages
Session 1 - Marketing Business Analytics - 0621
No ratings yet
Session 1 - Marketing Business Analytics - 0621
68 pages
Sales Perfomance Analysis
No ratings yet
Sales Perfomance Analysis
24 pages
Week 1 Upload
No ratings yet
Week 1 Upload
43 pages
Data Presentation - Descriptive Stats - PGPEX
No ratings yet
Data Presentation - Descriptive Stats - PGPEX
87 pages
Data Analysis and Visualization Uncovering Insights
No ratings yet
Data Analysis and Visualization Uncovering Insights
10 pages
Marketing and Customer Analytics (MSCDA614) Cousrse Outline
No ratings yet
Marketing and Customer Analytics (MSCDA614) Cousrse Outline
5 pages
Business Analytics Course
No ratings yet
Business Analytics Course
11 pages
Sales
No ratings yet
Sales
19 pages
Data Driven For Business
No ratings yet
Data Driven For Business
16 pages
SEVILLA, JEREMY. - Pre-Assessment
No ratings yet
SEVILLA, JEREMY. - Pre-Assessment
5 pages
Hihi 2
No ratings yet
Hihi 2
2 pages
Fresh Foods Ordering Process
No ratings yet
Fresh Foods Ordering Process
5 pages
Marketing Analytics Instructional Manual Version 1.0
No ratings yet
Marketing Analytics Instructional Manual Version 1.0
14 pages
Debenhams Summer Sale QT
No ratings yet
Debenhams Summer Sale QT
18 pages
Mra 1
No ratings yet
Mra 1
48 pages
Day 1
No ratings yet
Day 1
3 pages
Scales & Balances World Summary: Market Values & Financials by Country
From Everand
Scales & Balances World Summary: Market Values & Financials by Country
Editorial DataGroup
No ratings yet
Cable Network Revenues World Summary: Market Values & Financials by Country
From Everand
Cable Network Revenues World Summary: Market Values & Financials by Country
Editorial DataGroup
No ratings yet
1741 Class Action Abuses and Recent Reforms in The United States Lessons For Europe
No ratings yet
1741 Class Action Abuses and Recent Reforms in The United States Lessons For Europe
33 pages
Leadershio Ad Mot
No ratings yet
Leadershio Ad Mot
40 pages
Auditing and Corporate Governance
No ratings yet
Auditing and Corporate Governance
124 pages
10yrs Audit
No ratings yet
10yrs Audit
74 pages
Vedic Math Notes Hand Written (Sscnotes - Com)
No ratings yet
Vedic Math Notes Hand Written (Sscnotes - Com)
228 pages
Methods of Training & Development
No ratings yet
Methods of Training & Development
4 pages
E-Filling of Returns (Shivdas 10 Years)
No ratings yet
E-Filling of Returns (Shivdas 10 Years)
122 pages
Excel
No ratings yet
Excel
2 pages
Trafficking
No ratings yet
Trafficking
3 pages
Adaptive Finite Element Methods: Lecture Notes Winter Term 2011/12
No ratings yet
Adaptive Finite Element Methods: Lecture Notes Winter Term 2011/12
144 pages
Non-Pharmacological Pain Management
No ratings yet
Non-Pharmacological Pain Management
19 pages
Health Problems
No ratings yet
Health Problems
10 pages
Distinctive Symbols in Heart of Darkness by Joseph Conrad
No ratings yet
Distinctive Symbols in Heart of Darkness by Joseph Conrad
21 pages
ÔN TẬP CK
No ratings yet
ÔN TẬP CK
3 pages
Advances in Engineering Software: M.J. Esfandiari, G.S. Urgessa, S. Sheikholare Fin, S.H. Dehghan Manshadi
No ratings yet
Advances in Engineering Software: M.J. Esfandiari, G.S. Urgessa, S. Sheikholare Fin, S.H. Dehghan Manshadi
12 pages
Velcro How To Make A Velcro Activity
No ratings yet
Velcro How To Make A Velcro Activity
3 pages
Daily Lesson Log Grade 10 - 3rd Week
100% (2)
Daily Lesson Log Grade 10 - 3rd Week
3 pages
Bulletin Personality and Social Psychology: Solitude Experiences: Varieties, Settings, and Individual Differences
No ratings yet
Bulletin Personality and Social Psychology: Solitude Experiences: Varieties, Settings, and Individual Differences
7 pages
Your Profile 16personalities
No ratings yet
Your Profile 16personalities
3 pages
Problem Definition - Software Engineering
No ratings yet
Problem Definition - Software Engineering
10 pages
Static Balancing
No ratings yet
Static Balancing
4 pages
VRF Catalog 2017-02-08
No ratings yet
VRF Catalog 2017-02-08
24 pages
Electrohydrodynamic Atomization (EHDA)
No ratings yet
Electrohydrodynamic Atomization (EHDA)
15 pages
Nexaura Magazine Final
No ratings yet
Nexaura Magazine Final
104 pages
Patrich Geddes Cities in Evolution
100% (2)
Patrich Geddes Cities in Evolution
442 pages
Computations of Flows For On Demand Irrigation Systems
No ratings yet
Computations of Flows For On Demand Irrigation Systems
52 pages
Homework Signs
100% (1)
Homework Signs
5 pages
Problem Solving - Pdca
No ratings yet
Problem Solving - Pdca
61 pages
Unit 1
No ratings yet
Unit 1
50 pages
Exercises - Chapter - 21 PDF
No ratings yet
Exercises - Chapter - 21 PDF
6 pages
A Project Report ON Competitive Analysis and Study of Zomato's Online Ordering Business
No ratings yet
A Project Report ON Competitive Analysis and Study of Zomato's Online Ordering Business
81 pages
Marketing Plan
0% (1)
Marketing Plan
48 pages
Beginning The Analysis: Investigating System Requirements: Systems Analysis and Design in A Changing World, 3 Edition
No ratings yet
Beginning The Analysis: Investigating System Requirements: Systems Analysis and Design in A Changing World, 3 Edition
37 pages
Engineering 23 06 2017
No ratings yet
Engineering 23 06 2017
137 pages
Multi-Attribute Evaluation of Flood Management in Japan: A Choice Experiment Approach
No ratings yet
Multi-Attribute Evaluation of Flood Management in Japan: A Choice Experiment Approach
10 pages
Crop Tool and Lasso Tool Lesson Plan
No ratings yet
Crop Tool and Lasso Tool Lesson Plan
2 pages

How To Create Data Analytics Slides

Uploaded by

How To Create Data Analytics Slides

Uploaded by

EXPLORATORY DATA ANALYSIS

CATEGORY BREAK-UP OF PRODUCTS fig(a)

• Chairs, Storage and Phones

1 Ideal Duration 2 Price of Courses 3 Preferred Programs

1 MAJOR STEPS INVOLVED 2 BALANCING THE DATASET

UNDERSAMPLING & OVERSAMPLING

You might also like