Business Analytics DA-1
Digital Assignment – 1
Santhosh.v
24BCE1447
1. Bar Graph
Purpose
The purpose of this bar graph is to analyze which car brand holds the
largest profit margin on cars sold in India, using data available on the
internet. After gathering, organizing and visualizing the data, we can draw
a conclusion based on the inference.
Demerits
If too many brands are included, the graph becomes overcrowded, which
affects the readability of the bar graph.
Inconsistent bar width or scale can distort the representation.
Step3: Click on chart elements, add the chart title and axis titles, enable
data labels and select outside top for a cleaner representation and to
avoid overcrowding of data.
Step4: Go to the Data tab, select Sort and then choose largest to smallest
to make the data representation clearer.
Step5: Click on the graph and copy and paste it into MS Word (I took
screenshots of my graphs and charts).
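The largest-to-smallest sort in Step4 can also be reproduced outside Excel. The following is a minimal Python sketch, assuming the plotted column is Avg Revenue Per Car (₹) from the Data table of this section. The variable names are illustrative and are not part of the original workbook.

```python
# Avg Revenue Per Car (in rupees) per brand, copied from this section's Data table.
# Assumption: this is the column plotted in the bar graph.
# The brands are deliberately listed out of order so the sort has work to do.
avg_revenue_per_car = {
    "Kia India": 980000,
    "Maruti Suzuki": 3500000,
    "Toyota": 750000,
    "Tata Motors": 1380000,
    "Hyundai Motor": 1450000,
    "Mahindra & Mahindra": 1020000,
}

# Sort largest to smallest, mirroring Step4.
ranked = sorted(avg_revenue_per_car.items(), key=lambda item: item[1], reverse=True)

for brand, value in ranked:
    print(f"{brand:<20} {value:>10,}")
```

The first entry of ranked is the tallest bar, which is the brand named in the inference.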
Inference
From this we can conclude that Maruti Suzuki has the highest profit per car
in India compared to the other top premium brands.
Data
Brand                Units Sold  Avg Price Per Car (₹ Lakh)  Revenue (₹ Crore)  Avg Revenue Per Car (₹)
Maruti Suzuki        1600000     7.5                         120000             3500000
Hyundai Motor        557546      10.2                        56870              1450000
Tata Motors          570955      9.8                         55954              1380000
Mahindra & Mahindra  416000      14.5                        60320              1020000
Kia India            238523      13.8                        32919              980000
Toyota               239802      35                          83931              750000
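As a quick consistency check on the table, the Revenue column appears to equal Units Sold multiplied by Avg Price Per Car, converted from lakh to crore (1 crore = 100 lakh). The sketch below is only an illustration under that assumption. It reads the Maruti Suzuki units as 1600000 and allows a small tolerance for rounding in the source figures; the function and variable names are hypothetical.

```python
# brand -> (units sold, avg price per car in lakh rupees, revenue in crore rupees)
# Values copied from the Data table; Maruti Suzuki units read as 1600000.
rows = {
    "Maruti Suzuki": (1600000, 7.5, 120000),
    "Hyundai Motor": (557546, 10.2, 56870),
    "Tata Motors": (570955, 9.8, 55954),
    "Mahindra & Mahindra": (416000, 14.5, 60320),
    "Kia India": (238523, 13.8, 32919),
    "Toyota": (239802, 35, 83931),
}

LAKH_PER_CRORE = 100  # 1 crore = 100 lakh


def implied_revenue_crore(units, price_lakh):
    """Revenue in crore rupees implied by units sold and average price in lakh."""
    return units * price_lakh / LAKH_PER_CRORE


mismatches = []
for brand, (units, price_lakh, revenue_crore) in rows.items():
    implied = implied_revenue_crore(units, price_lakh)
    # Assumed tolerance: 0.1% of the listed revenue, to absorb rounding.
    if abs(implied - revenue_crore) > 0.001 * revenue_crore:
        mismatches.append(brand)
    print(f"{brand:<20} listed={revenue_crore:>7} implied={implied:>10.1f}")

print("Rows outside tolerance:", mismatches)
```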
Reference Link
https://www.team-bhp.com/forum/indian-car-scene/276257-2023-brand-brand-analysis-indian-passenger-car-market.html
2. Histogram
Purpose
The purpose of this histogram is to analyse and represent the PM10
concentration in the air in the top 10 major metropolitan cities.
Demerits
The bin ranges strongly affect the representation of the data. If the bin
size is too small, the histogram becomes too detailed and crowded; if the
bin size is too large, the representation becomes more generalised and
less informative.
Since the histogram groups the PM10 concentrations into ranges, slight
fluctuations between data points are not as clearly visible as in a
bar graph.
Inference
From this histogram we can conclude that Delhi is the most air-polluted
city, with the highest PM10 concentration level among the metropolitan
cities.
Data
City       PM10 concentration
Bangalore  124
Chennai    62
Delhi      268
Hyderabad  112
Jaipur     180
Kolkata    109
Mumbai     155
Pune       89
Surat      97
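The bin-size trade-off described under Demerits can be seen directly on these values. The following is a small Python sketch, not the Excel procedure used for the chart. The helper name bin_counts and the three bin widths are choices made here for illustration.

```python
from collections import Counter

# PM10 values copied from the Data table above.
pm10 = {
    "Bangalore": 124, "Chennai": 62, "Delhi": 268,
    "Hyderabad": 112, "Jaipur": 180, "Kolkata": 109,
    "Mumbai": 155, "Pune": 89, "Surat": 97,
}


def bin_counts(values, bin_width):
    """Count values per bin; each bin is keyed by its lower edge."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))


# Illustrative widths: very narrow, moderate, very wide.
for width in (10, 50, 200):
    counts = bin_counts(pm10.values(), width)
    print(f"bin width {width:>3}: {len(counts)} non-empty bins -> {counts}")
```

With a width of 10 every city falls in its own bin, so the histogram carries no grouping at all. With a width of 200 all cities except Delhi collapse into a single bin.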
Reference Link
https://www.data.gov.in/search?title=air%20quality%20among%20major%20cities&sortby=_score&type=resources
3. Pie chart
Purpose
The purpose of this chart is to understand the distribution of energy
sources in India using a pie chart.
Demerits
If the data includes too many energy sources, the representation
becomes too crowded and unreadable.
Unlike bar graphs and histograms, pie charts cannot represent
changes in trend or the increasing or decreasing behaviour of the data.
Data
Coal 205235 MW
Lignite 6620 MW
Gas 24824 MW
Diesel 589 MW
Hydro 46850 MW
Wind, Solar & Other RE 125692 MW
Nuclear 6780 MW
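Each slice of the pie corresponds to a source's share of the total. A minimal Python sketch of that arithmetic, using the MW figures listed above (the variable names are illustrative):

```python
# MW figures copied from the Data table above.
source_mw = {
    "Coal": 205235,
    "Lignite": 6620,
    "Gas": 24824,
    "Diesel": 589,
    "Hydro": 46850,
    "Wind, Solar & Other RE": 125692,
    "Nuclear": 6780,
}

total_mw = sum(source_mw.values())

# Share of each source as a percentage of the total.
shares = {source: 100 * mw / total_mw for source, mw in source_mw.items()}

for source, pct in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{source:<24} {pct:5.1f}%")
```

Very small shares, such as Diesel, become slices that are hard to see, which is the crowding problem noted under Demerits.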
Reference Link
https://en.wikipedia.org/wiki/Electricity_sector_in_India
4. Tree Plot
Purpose
The purpose of this tree plot is to understand the organizational roles and
their salaries, separated by level (years of experience). The visual
representation and the colour categorization make the dataset simple
and more readable.
Demerits
While it is readable, it doesn’t give an immediate, intuitive sense of
relationships, patterns, or trends between the roles and their
salaries.
With more roles, departments, and levels of hierarchy, a
table is clumsy to use. It may become hard to follow the relationships
and to work with as the dataset grows.
Data
Parent Category       Child Category            Level    Monthly Salary (INR)
Root                  Executive Leadership      Level 1  1,500,000
Executive Leadership  CEO                       Level 2  1,200,000
Executive Leadership  CTO                       Level 2  1,000,000
Root                  Management                Level 1  800,000
Management            Project Manager           Level 2  400,000
Management            Program Manager           Level 2  450,000
Root                  Engineering               Level 1  600,000
Engineering           Senior Software Engineer  Level 2  300,000
Engineering           Software Engineer         Level 2  200,000
Engineering           Junior Software Engineer  Level 3  120,000
Engineering           QA Engineer               Level 2  250,000
Engineering           UI/UX Designer            Level 2  220,000
Root                  Support & Operations      Level 1  400,000
Support & Operations  IT Support Specialist     Level 2  180,000
Support & Operations  Network Engineer          Level 2  250,000
Support & Operations  System Administrator      Level 2  300,000
Root                  Sales & Marketing         Level 1  500,000
Sales & Marketing     Sales Manager             Level 2  250,000
Sales & Marketing     Marketing Manager         Level 2  250,000
Sales & Marketing     Sales Executive           Level 3  150,000
Sales & Marketing     Marketing Executive       Level 3  150,000
Root                  HR & Administration       Level 1  400,000
HR & Administration   HR Manager                Level 2  250,000
HR & Administration   HR Executive              Level 3  120,000
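The Parent Category and Child Category columns define a hierarchy. The sketch below shows one way to rebuild that hierarchy in Python and to total the salaries of the roles under each category. It is an illustration only; the names children, salary and child_totals are hypothetical and the chart itself was made in Excel.

```python
from collections import defaultdict

# (parent category, child category, level, monthly salary in INR), from the Data table.
rows = [
    ("Root", "Executive Leadership", 1, 1500000),
    ("Executive Leadership", "CEO", 2, 1200000),
    ("Executive Leadership", "CTO", 2, 1000000),
    ("Root", "Management", 1, 800000),
    ("Management", "Project Manager", 2, 400000),
    ("Management", "Program Manager", 2, 450000),
    ("Root", "Engineering", 1, 600000),
    ("Engineering", "Senior Software Engineer", 2, 300000),
    ("Engineering", "Software Engineer", 2, 200000),
    ("Engineering", "Junior Software Engineer", 3, 120000),
    ("Engineering", "QA Engineer", 2, 250000),
    ("Engineering", "UI/UX Designer", 2, 220000),
    ("Root", "Support & Operations", 1, 400000),
    ("Support & Operations", "IT Support Specialist", 2, 180000),
    ("Support & Operations", "Network Engineer", 2, 250000),
    ("Support & Operations", "System Administrator", 2, 300000),
    ("Root", "Sales & Marketing", 1, 500000),
    ("Sales & Marketing", "Sales Manager", 2, 250000),
    ("Sales & Marketing", "Marketing Manager", 2, 250000),
    ("Sales & Marketing", "Sales Executive", 3, 150000),
    ("Sales & Marketing", "Marketing Executive", 3, 150000),
    ("Root", "HR & Administration", 1, 400000),
    ("HR & Administration", "HR Manager", 2, 250000),
    ("HR & Administration", "HR Executive", 3, 120000),
]

children = defaultdict(list)  # parent -> list of child names
salary = {}                   # node name -> monthly salary in INR
for parent, child, level, monthly_inr in rows:
    children[parent].append(child)
    salary[child] = monthly_inr


def print_tree(node, depth=0):
    """Print every descendant of node, indented by depth."""
    for child in children.get(node, []):
        print("  " * depth + f"{child}: {salary[child]:,}")
        print_tree(child, depth + 1)


print_tree("Root")

# Sum of the role salaries listed under each top-level category.
child_totals = {cat: sum(salary[c] for c in children[cat]) for cat in children["Root"]}
```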
Reference Link
https://economictimes.indiatimes.com
5. Scatter plot
Purpose
The purpose of this graph is to find the relationship between the CO₂
emissions of a country and its GDP. In this representation I have used the
top 10 countries by GDP and their CO₂ emission levels.
Demerits
If the dataset is too large, the data points may overlap, making the plot
harder to understand.
A scatter plot gives insight into the overall data, but unlike a bar
graph or pie chart, it makes it hard to read off individual data
points.
Step1: Open an Excel sheet and enter the data we want to visualize.
Step3: Click on chart elements, add the chart title and axis titles, enable
data labels and select top for a cleaner representation and to avoid
overcrowding of data.
Step4: Go to the Data tab, select Sort and then choose largest to smallest
to make the data representation clearer.
Step5: Click on the graph and copy and paste it into MS Word (I took
screenshots of my graphs and charts).
Inference
From this dataset, using the scatter plot, we can conclude that there exists
a relationship between a country’s GDP and its CO₂ emission level, and that
China tops the graph in terms of GDP and has the highest CO₂ emissions.
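One way to put a number on such a relationship is the Pearson correlation coefficient. The sketch below is a generic helper and not part of the original Excel workflow. Because the GDP figures are not listed in this report, the call at the end uses clearly labelled toy numbers only to show how the function is used; they are not real GDP or emission values.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


# Toy values for illustration only (NOT real GDP or CO2 data).
toy_x = [1, 2, 3, 4]
toy_y = [2, 4, 6, 8]
r = pearson_r(toy_x, toy_y)
print(f"r = {r:.2f}")
```

A value of r close to +1 indicates a strong positive linear relationship, a value close to 0 indicates little linear relationship, and a value close to -1 indicates a strong negative one.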
Reference link
https://ourworldindata.org/grapher/co2-emissions-vs-gdp
Data
Country       CO₂ Emissions (million metric tons)
China         11472.9
USA           4752.3
India         2597.4
Russia        1859.2
Japan         1024.6
Iran          744.7
Germany       644.1
Saudi Arabia  626.1
South Korea   611.2
Indonesia     590.3
6. Boxplot
Purpose
The purpose of this box plot is to compare the rainfall distribution (in mm)
in North and South India. A box plot is the better way of representation
in this case because it provides many attributes (minimum, first
quartile, median, third quartile, maximum, interquartile range, outliers)
for comparing multiple datasets.
Demerits
The visualization does not give precise values, only a general sketch
of the data.
The representation is quite complex; it is not as simple as a bar graph
or pie chart, which users can understand just by looking at it.
It is less effective for smaller datasets.
Step3: Click on chart elements, add the chart title and axis titles, enable
data labels and select outside top for a cleaner representation and to
avoid overcrowding of data.
Step4: Click on the graph and copy and paste it into MS Word (I took
screenshots of my graphs and charts).
Inference
The median rainfall of South India is generally higher than that of North India.
South India has a wider interquartile range compared to North India.
The box representing South India is broad while the box for North India is
compact, indicating that rainfall in South India varies more widely than in
North India.
Reference Link
http://www.rainwaterharvesting.org/Urban/Rainfall.htm
Data
South Indian states  North Indian states
1094 1011
3456 1251
3055 649
998 617
998 617
1515 617
1025
1326
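The quantities a box plot draws (quartiles, median, interquartile range) can be computed from the table directly. The following is a minimal Python sketch under two assumptions. First, it uses only the six rows that list a value for both regions, because it is unclear which column the last two values (1025 and 1326) belong to. Second, it uses the "inclusive" quartile method of Python's statistics module, which may differ from the quartile convention Excel applied to the chart.

```python
from statistics import quantiles

# Rainfall in mm, from the six complete rows of the Data table.
# Assumption: the trailing unpaired values 1025 and 1326 are left out.
south = [1094, 3456, 3055, 998, 998, 1515]
north = [1011, 1251, 649, 617, 617, 617]


def five_number_summary(values):
    """Return min, Q1, median, Q3, max and IQR using the inclusive quartile method."""
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return {
        "min": min(values),
        "q1": q1,
        "median": q2,
        "q3": q3,
        "max": max(values),
        "iqr": q3 - q1,
    }


south_summary = five_number_summary(south)
north_summary = five_number_summary(north)

for name, summary in (("South", south_summary), ("North", north_summary)):
    print(name, summary)
```

On these rows the South Indian median and interquartile range both come out larger than the North Indian ones, which is consistent with the inference.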