0% found this document useful (0 votes)
10 views15 pages

Business Analytics DA-1

The document presents various data visualizations including bar graphs, histograms, pie charts, tree plots, scatter plots, and box plots to analyze different datasets such as car profit margins, air pollution levels, energy source distribution, organizational salary structures, CO2 emissions, and rainfall distribution in India. Each visualization includes its purpose, merits and demerits, steps for preparation, and inferences drawn from the data. The document also provides reference links for the datasets used in the analyses.

Uploaded by

gigachadmaleme
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views15 pages

Business Analytics DA-1

The document presents various data visualizations including bar graphs, histograms, pie charts, tree plots, scatter plots, and box plots to analyze different datasets such as car profit margins, air pollution levels, energy source distribution, organizational salary structures, CO2 emissions, and rainfall distribution in India. Each visualization includes its purpose, merits and demerits, steps for preparation, and inferences drawn from the data. The document also provides reference links for the datasets used in the analyses.

Uploaded by

gigachadmaleme
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Business Analytics

Digital Assignment – 1
Santhosh.v
24BCE1447

1. Bar Graph

Purpose
The purpose of this Bar graph is to analyze which industry holds the
largest profit margin for selling cars in India from the data available in the
internet, after gathering, organizing and visualizing we can conclude
based upon the inference we gather

Merits and Demerits of this Graph


Merits:
 Clear representation of the average revenue per car of each brand
which is easy to comparing them and analyse.
 These simple visual representation of the data makes it easy to
understand even to the non technical person.
 Features like colour, sorting and data labeling in bar graph enhances
readability.

Demerits:
 If too many brands are included the graph becomes over
crowded and affects are readabilty of the bar graph.
 Inconsistent bar width or scale can distort the representation.

Steps involved in preparing this graph -


Step1: Open excel sheet, enter the data instances we want to visualize.

Step2 : Click on inset tab and select Clustered Bar graph

Step3: Click on chart elements and add chart titles, axis title and enable
data labels and select outside top for more pleasantable representation
and to avoid over crowding of data

Step4: Go to data tab and select sort followed by select largest to smallest
to make the data representation more clear and formal

Step5: Click on the graph and copy paste it in ms word (I took screenshot
of my graphs and charts)

Inference
From this we can conclude that the maruti owns the highest profit per car
in India compared to other top premium brands.

The dataset used for this representation -

Data
Avg
Price Revenu
Units Per Car e (₹ Avg Revenue
Brand Sold (₹ Lakh) Crore) Per Car (₹)
160000
Maruti Suzuki 0 7.5 120000 3500000
Hyundai Motor 557546 10.2 56870 1450000
Tata Motors 570955 9.8 55954 1380000
Mahindra &
Mahindra 416000 14.5 60320 1020000
Kia India 238523 13.8 32919 980000
Toyota 239802 35 83931 750000

Reference Link
https://www.team-bhp.com/forum/indian-car-scene/276257-2023-brand-
brand-analysis-indian-passenger-car-market.html

2. Histogram

Purpose
The purpose of this histogram is the analyse and represent the amount of
PM10 concentration in the air in the top 10 major metro Politian cities.

Merits and demerits:


Merits
 This histogram effectively shows how the PM10 concentration are
distributed across different cities in ranges.
 The shape of the histogram can be used to derive insights like
severity of the pollution, indicating whether the pollution levels are
low, moderately low, high or severe
 It shows whether PM10 levels are concentrated in a specific range, if
the data is skewed or if the data has extreme high values.

Demerits

 The bin ranges affect the representation of the data to the larger
instance if the bin is too small making the histogram to be too
detailed making the representation too crowded, if the bin size is
too large making the data representation more generalised and less
informative.
 Since histogram groups the concentration of the PM10 in ranges
slight fluctutation of the data points is not visible clearly as
compared to the bar graph.

Steps involved in preparing this graph -

 Step1: Open excel sheet, enter the data instances we want to


visualize.
 Step2 : Click on inset tab and select Histogram.
 Step3: Click on chart elements and add chart titles, axis title and
enable data labels.
 Step4: labels and select outside top for more pleasantable
representation and to avoid over crowding of data.
 Step5: Click on the chart elements followed by data labels and then
more data labels options, there under numbers column adjust the
bin size.
 Step6: Click on the graph and copy paste it in ms word (I took
screenshot of my graphs and charts)

Inference
From this histogram we can conclude that Delhi begins the most air
polluted city with PM10 concentration level compared to the other metro
polytan cities.
Data
Bangalor
124
e
Chennai 62
Delhi 268
Hyderaba
112
d
Jaipur 180
Kolkata 109
Mumbai 155
Pune 89
Surat 97

Reference Link
https://www.data.gov.in/search?title=air%20quality%20among%20major
%20cities&sortby=_score&type=resources

3. Pie chart
Purpose
The purpose of this chart is to understand the energy source distribution
in India using pie chart

Merits and Demerits


Merits
 Pie chart visually shows how different energy sources contribute to
the energy distribution in India, the shares of each such groups is
easy to compare along with their percentages.
 The circular representation of the data makes it visually intuitive
making it clear and readable for non-technical person.
 The biggest advantage of the pie chart over the other
representations is the easy of understandable, readability and its
simplicity.

Demerits
 If the data include too many energy sources the representation
become too crowded and unreadable.
 Unlike bar graph and histogram these pie charts cannot represent
the change in trends or increasing-decreasing behaviour of the data.

Steps involved in preparing this chart -

 Step1: Open excel sheet, enter the data instances we want to


visualize.
 Step2 : Click on inset tab and select Piechart.
 Step3: Click on chart elements and add chart titles, and enable
data lables under which tick the percentage and untick the values
(piechart usually represents the percentage distribution of the
dataset).
 labels and select outside top for more pleasantable representation
and to avoid over crowding of data.
 Step4: Click on the chart and copy paste it in ms word (I took
screenshot of my graphs and charts).
Inference
From this energy distribution data source of India we can conclude that
energy from the coal being the most abundantly used source.

Data
Coal 205235 MW
Lignite 6620 MW
Gas 24824 MW
Diesel 589 MW
Hydro 46850 MW
Wind, Solar & Other RE 125692 MW
Nuclear 6780 MW

Reference Link
https://en.wikipedia.org/wiki/Electricity_sector_in_India

4. Tree Plot
Purpose
The purpose of this Tree plot is to understand the organizational role and
its salary separated by the levels (years of experience), The visual
representation and the colour categorization makes the dataset simple
and more readable.

Merits and Demerits


Merits
 The clear representation of the Tree plot with multiple attributes
helps us to understand and visualize the data more easily
 It accepts and process with larger data than compared to Bar graph,
Histogram and Pie chart
 The table layout is convenient for the users to read
and understand the data. The salary of each
role is laid out clearly, and thus it becomes easy to compare
salaries across levels and in different departments.

Demerits
 While it is readable, it doesn’t give an immediate, intuitive sense of
relationships, patterns, or trends between the roles and their
salaries.
 With more roles, departments, and levels of hierarchy, a
table is clumsy to use. It may become hard to follow relationships
and work with as the dataset grows

Steps involved in preparing this graph -


 Step1: Open excel sheet, enter the data instances we want to
visualize.
 Step2 : Click on inset tab and select Tree plot in Hyrarchial chart.
 Step3: Click on chart elements and add chart titles and enable data
labels and tick category name and values for more pleasantable
representation and to avoid over crowding of data.
 Step4: Click on the graph and copy paste it in ms word (I took
screenshot of my graphs and charts)
Inference
This dataset reveals a clear hyrarchy of salary distribution in a
oraganization based on the individuals level and role, While CEO earning
the most while the level three HR earning the least according to the
dataset.

Data
Parent Child Leve Monthly Salary
Category Category l (INR)
Roo Executive Level 1,500,00
t Leadership 1 0
Executive CE Level 1,200,00
Leadership O 2 0
Executive CT Level 1,000,00
Leadership O 2 0
Roo Manageme Level 800,00
t nt 1 0
Manageme Project Level 400,00
nt Manager 2 0
Manageme Program Level 450,00
nt Manager 2 0
Roo Engineeri Level 600,00
t ng 1 0
Engineeri Senior Software Level 300,00
ng Engineer 2 0
Engineeri Software Level 200,00
ng Engineer 2 0
Engineeri Junior Software Level 120,00
ng Engineer 3 0
Engineeri QA Level 250,00
ng Engineer 2 0
Engineeri UI/UX Level 220,00
ng Designer 2 0
Roo Support & Level 400,00
t Operations 1 0
Support & IT Support Level 180,00
Operations Specialist 2 0
Support & Network Level 250,00
Operations Engineer 2 0
Support & System Level 300,00
Operations Administrator 2 0
Roo Sales & Level 500,00
t Marketing 1 0
Sales & Sales Level 250,00
Marketing Manager 2 0
Sales & Marketing Level 250,00
Marketing Manager 2 0
Sales & Sales Level 150,00
Marketing Executive 3 0
Sales & Marketing Level 150,00
Marketing Executive 3 0
Roo HR & Level 400,00
t Administration 1 0
HR & HR Level 250,00
Administration Manager 2 0
HR & HR Level 120,00
Administration Executive 3 0

Reference Link
https://economictimes.indiatimes.com

.
5. Scatter plot

Purpose
The purpose of the graph Is to find the relationship between the Co2
emission of a country and its GDP, in this representation I have used top
10 GDP highest country and its Co2 emission level.

Merits and Demerits


Merits

 The scatter plot effectively visualizes the co-relationship between


the country’s GDP and Co2 emission level, this type of
representation makes the work of prediction simpler and with a
simple gradient we can predict the data instances unlike pie charts.
 Usually unlike Bar graphs and Histogram, Scatter plot can handle
large number of datasets (up to several thousands) without making
the representation looks messier, hence has large number of
scalabilities.

Demerits

 If the dataset is too large then the datapoints may overlap making it
harder to understand
 Scatter plot represent the insights of the overall data but however If
we want to understand about individual data instance unlike bar
graph and Pie chart scatter plot is hard to understand about each
individual data points

Steps involved in preparing this graph -

Step1: Open excel sheet, enter the data instances we want to visualize.

Step2 : Click on inset tab and select Clustered Bar graph

Step3: Click on chart elements and add chart titles, axis title and enable
data labels and select top for more pleasantable representation and to
avoid over crowding of data

Step4: Go to data tab and select sort followed by select largest to smallest
to make the data representation more clear and formal

Step5: Click on the graph and copy paste it in ms word (I took screenshot
of my graphs and charts)

Inference

From this dataset using scatter plot we can conclude that there a exist a
relationship between the country’s GDP and its Co2 emission level and
china tops the graph in terms of GDP and highest Co2 emission.

Reference link
https://ourworldindata.org/grapher/co2-emissions-vs-gdp
Data
CO₂ Emissions
(million metric
Country tons)
China 11472.9
USA 4752.3
India 2597.4
Russia 1859.2
Japan 1024.6
Iran 744.7
Germany 644.1
Saudi
Arabia 626.1
South
Korea 611.2
Indonesia 590.3

6. Boxplot
Purpose
The purpose of this box plot is to compare the Rainfall distribution (in mm)
in North and South India. Box plot being the better way of representation
for this case because it provides a lot of attributes (minimum, first
quartile, median, third quartile, maximum, interquartile range, outliers) to
compare multiple datasets.

Merits and Demerits


Merits
 Shows median, quartiles, and variability in a single visualization.
 Identifies outliers, helping to analyse extreme rainfall events.
 Useful for comparing multiple datasets.

Demerits
 The visualization of data does not give the precise values, but only a
general sketch of the data.
 Representation is quite complex it is not as simple as Bar graph and
Pie chart which can make users understand just by looking at it.
 Less effective for smaller datasets.

Steps involved in preparing this graph -


Step1: Open excel sheet, enter the data instances we want to visualize.

Step2 : Click on inset tab and select Box plot.

Step3: Click on chart elements and add chart titles, axis title and enable
data labels and select outside top for more pleasantable representation
and to avoid over crowding of data

Step4: Click on the graph and copy paste it in ms word (I took screenshot
of my graphs and charts)

Inference
The median rainfall of South India is generally higher than the North India.
South India have wider interquartile range compared to North India.
The Box representing the South India is broad while for the North India it is
compact indicating South India receives larger rainfall the North India.

Reference Link
http://www.rainwaterharvesting.org/Urban/Rainfall.htm

Data
South
Indian
states North Indian states

1094 1011
3456 1251
3055 649
998 617
998 617
1515 617
1025
1326

You might also like