Data Analysis Report
Data Analysis Report
Student Name:
Student ID:
Course Name:
Date:
2
INTRODUCTION
Data Analysis is a process of inspecting, cleansing, transforming, and modeling data with the
making. Data analysis has multiple facets and approaches, encompassing diverse techniques
under a variety of names, while being used in different business, science, and social science
domains. The Analytics team of a Company anywhere in the world would want to design a
Sales and Performance dashboard to analyze the sales based on various product categories
and other factors which have a role to play in the running of the store. The store managing
head, or the owner wants to add user control for product category, so users can select a
category and can see the trend month-wise and product-wise accordingly. The Analytics team
would also want to analyze various other things like how many days the store takes to ship
the product, how many times a Customer orders a product, how much time is there between
the first and second order of the customer etc. The Company’s database keeps track of the
- Order Date and Ship Date – Date when the item was ordered and the date when the
Literature Review
The Company wants to see and analyze the sales trend month-wise and product-wise and
work upon the lagging segments and outperforming employees accordingly. The Analytics
team also wants to create analyze the database in depth to help the Company grow
Aim of this project is to answer the above objectives in the form of visualization by creating a
Methodology
In computing, extract, transform, load (ETL) is a process in database usage to prepare data
for analysis, especially in data warehousing. Data extraction involves extracting data from
transforming them into a proper storage format/structure for the purposes of querying and
analysis; finally, data loading describes the insertion of data into the final target database
such as an operational data store, a data mart, or a data warehouse. A properly designed ETL
system extracts data from the source systems, enforces data quality and consistency
standards, conforms data so that separate sources can be used together, and finally delivers
data in a presentation-ready format so that application developers can build applications and
end users can make decisions. Precisely, ETL is defined as a process that extracts the data
from different RDBMS source systems, then transforms the data (like applying calculations,
concatenations, etc.) and finally loads the data into the Data Warehouse system. ETL stands
for Extract, Transform and Load. Before ETL, the dataset looked like this. This data is taken
from Kaggle.
5
Through the process of ETL, we are going to clean the dataset and bring all the entities to
For this, select the whole dataset. Go to Find and Select in the Home tab of excel. Select Go
to Special from the drop-down menu and then tick the blank option. All the blank cells will
be selected. Then go to Delete option in the home tab again and select Delete Rows from the
drop-down menu. This will remove any rows with blank cells.
Step 2: Removing columns which are not properly defined or not crucial to our analysis.
6
For this we will columns which are redundant like the column with just the index numbers.
For this we will select that particular column and then go to delete option in the home tag and
The dataset does not have proper columns so our next step would be to giver proper column
We’ll be using Tableau prep for this work as it’ll make the work simple and faster because
we might not know how many null values could be there in this huge data set. Tableau helps
Without proper Data Formatting, proper analysis will not take place. So, we will bring down
certain columns to their proper format. For example, the dates should be in the date format
and price and sales should be in currency format for better results.
8
It might be possible that our data may be containing duplicate values which may hinder in
precise analysis. So, our last task in ETL will be removing duplicate values and making our
ANALYSIS OF DATASET
9
Description:
By knowing about sales and profit over month we can know about the months which
are more profitable for sales and hence customize our advertisement plan to increase
the sales even more. After finding out the sales and profit we visualize the result with
We have to create a pivot table. No specific functions are used. We then put the
priority c and count of their respective sales in the columns of the pivot table.
Results:
Visualization:
The results are then visualized in the form of a stacked bar graph for both profit and
sales
10
Description:
By knowing which segment of sales has themost number of sales and which has least
we can identify factors which affect the sales and thereby improve our strategy of
making sales.
We have to create a pivot table. No specific functions are used. We then put the
priority c and count of their respective sales in the columns of the pivot table.
Results:
Visualization:
Description:
Monthly sales can help us identify which month is more profitable and helps identify
the factor which helps us to do so. We can apply the identified the factors in other
We have to create a pivot table. No specific functions are used. We then put the
Results
Visualization:
The results are visualized with the help of line graph with a trend line displaying the
Description:
Every sale is going to have an order priority associated with it. Greater the priority,
We have to create a pivot table. No specific functions are used. We then put the
priority c and count of their respective sales in the columns of the pivot table.
Results:
Visualization:
We visualize the above results with the help a pie chart created using pivot charts.
Description:
By comparing sales of each product category side by side, we can come to know what
kind of products are sold the most and which the least. This information can help us
target customers more effectively to improve the sales and thus by increasing profits
We have to create a pivot table. No specific functions are used. Product category is
used as columns with summation of profit and sales of each product category.
Results:
Visualization:
6. Employee Performance
Description:
In this we analyze which regional manager is doing well and which one is performing
the least. It’ll help us giving them incentives, promoting them and training them for
better performance,
Results:
16
Visualization:
Description:
17
In this we analyze which particular region is having most amount of sales and which
is least. Furthermore we can look upon the factors which might be impacting the sales
and we can look upon them to increase the sales and invest in the areas of maximum
sales.
Results:
Visualization:
18
Description:
In this we analyze how much time the Company is taking to ship a product after
successful placement of order by the user. This can help us to improve the customer
We are using Tableau prep for this and simply creating a calculated field .
19
Results:
Description:
In this we analyze after how much time does a particular revisits us and places the
order again. We can create offers and Discounts accordingly and increase the
customer engagement for frequent visits. This will ultimately help us to improve the
customer service and improve the service quality provided to the customer.
We are using Tableau prep for this and applying aggregate function to extract the
first order date and the second order date and then finally joining them in a single
field.
20
Results:
21
ANALYSIS RESULTS
Sales was high in the Jan but still resulted in negative profit i.e. loss. Still the
Company managed to work well and increase the profit exponentially by the end
of Jun.
Office. We can offer them special discounts and can have tie ups to increase the
Sales were at peak once in mid Feb and again in Starting of the April followed by
June End. We can create offers for other times as well to increase the sales
growth.
We are having more of the orders without any priority followed by medium
the revenue.
It is clear that tables are our best selling products followed by chairs and chair
mats. We can work upon the one’s not performing well to increase their sales also.
6. Employee Performance
24
Erin was our Best employee for this quarter with maximum sales whereas Sam lagged
behind everyone with a huge margin and needs to perform well in the other half of the
year.
7. Regional Sales
We performed the best in California followed by New York and Texas. We might
The products are shipped within 2 days of the order date according to their
priority.
It can be seen that the frequency of Customer is quite low and needs to be
FINAL DASHBOARD
26
References
https://asialinkbusiness.com.au/china/business-practicalities-in-china/taxation-in-
china? doNothing=1
https://santandertrade.com/en/portal/establish-overseas/china/tax-system
Taxsummaries.pwc.com.https://taxsummaries.pwc.com/peoples-republic-of-china/
individual/taxes-on-personal-income
2022.