Advanced Excel Charts Big Data Analytics (Prof. HR Ansari)
Advanced Excel Charts Big Data Analytics (Prof. HR Ansari)
CHARTS
Big Data & Analytics
I would like to dedicate the book to the students who will be
using it. I wish you future career success and hope you never
stop learning.
CONTENTS
Title Page
Dedication
Preface
Introduction
Waterfall Chart
Band Chart
Gantt Chart
Thermometer Chart
Gauge Chart
Bullet Chart
Funnel Chart
Waffle Chart
Heat Map
Step Chart
Box and Whisker Chart
Histogram
Pareto Chart
Organization Chart
Since you're still here....
PREFACE
Before you proceed with this book, we are assuming that you are
already aware of the basics of Microsoft Excel charts. If you are not
well aware of these concepts, then we will suggest you to go through
our short book on Excel charts.
INTRODUCTION
You are aware that charts are the efficient data visualization means
to convey the results. In addition to the chart types that are available
in Excel, some widely used application charts are popular. In this
tutorial, you will learn about these advanced charts and how you
can create them in Excel.
Waterfall Chart
A Waterfall chart is a form of data visualization that helps in
understanding the cumulative effect of sequentially introduced
positive or negative values.
Band Chart
A Band chart is a Line chart with added shaded areas to display the
upper and lower boundaries of the defined data ranges.
Gantt Chart
A Gantt chart is a chart in which a series of horizontal lines
depicting tasks, task duration and task hierarchy are used planning
and tracking projects.
Thermometer Chart
A Thermometer chart keeps track of a single task, for e.g.
completion of work, representing the current status as compared to
a Target. It displays the percentage of the task completed, taking
Target as 100%.
Gauge Chart
Gauge charts, also referred to as Dial charts or Speedometer
charts, use a pointer or a needle to show information as a reading
on a dial.
Bullet Chart
Bullet charts support the comparison of a measure to one or more
related measures with a linear design.
Funnel Chart
Funnel chart is used to visualize the progressive reduction of data
as it passes from one phase to another in a process.
Waffle Chart
Waffle chart is a 10 × 10 cell grid with the cells colored as per
conditional formatting to portray a percent value such % work
complete.
Heat Map
A Heat Map is a visual representation of data in a Table to highlight
the data points of significance.
Step Chart
A Step chart is a Line chart that uses vertical and horizontal lines to
connect the data points in a series, forming a step-like progression.
Box and Whisker Chart
Box and Whisker charts, also referred to as Box Plots are
commonly used in statistical analysis. In a Box and Whisker chart,
numerical data is divided into quartiles and a box is drawn between
the first and third quartiles, with an additional line drawn along the
second quartile to mark the median. The minimums and maximums
outside the first and third quartiles are depicted with lines, which are
called whiskers. Whiskers indicate variability outside the upper and
lower quartiles, and any point outside the whiskers is considered as
an outlier.
Histogram
A Histogram is a graphical representation of the distribution of
numerical data and is widely used in Statistical Analysis. A
Histogram is represented by rectangles with lengths corresponding
to the number of occurrences of a variable in successive numerical
intervals.
Pareto Chart
Pareto chart is widely used in Statistical Analysis for decision-
making. It represents the Pareto principle, also called 80/20 Rule,
which states that 80% of the results are due to 20% of the causes.
Organization Chart
An Organization chart graphically represents the management
structure of an organization.
Though some of these charts are included in Excel 2016, Excel
2013 and earlier versions do not have them as built-in charts. In this
tutorial, you will learn how to create these charts from the built-in
chart types in Excel.
Preparation of Data
You need to prepare the data from the given input data, so that it
can be portrayed as a Waterfall chart.
Consider the following data −
Prepare the data for the Waterfall chart as follows −
Add a column End − the last column in the chart with the
end value in the Net Cash Flow.
In the Float column, insert a row in the beginning and at the end.
Place an arbitrary value 50000. This is just to have some space to
the left and right sides of the chart.
The data will look as given in the following table −
The data is ready to create a Waterfall chart.
Preparation of Data
You need to prepare the data that can be used to create a Band
chart from the given input data.
Step 1 − Consider the following data that you have from the
customer survey for two regions – East and West across the
financial year April - March.
Suppose you want to compare this data across three intervals −
Low (<50%).
Medium (50% - 80%).
High (>80%).
Preparation of Data
Arrange your data in a table in the following way −
Note − When the Tasks are in a hierarchy, Start of any Task – Taskg
is Start of previous Task + it’s Duration. That is, Start of a Task
Taskh is the End of the previous Task, Taskg if they are in a
hierarchy, meaning that Taskh is dependent on Taskg . This is
referred to as Task Dependency.
Following is the data −
Creating a Gantt Chart
Step 1 − Select the data.
Step 2 − Insert a Stacked Bar chart.
Preparation of Data
Prepare the data in the following way −
As you can see, the Primary Axis and the Secondary Axis have
different ranges.
Step 9 − Right click on the Primary Axis. Select Format Axis from
the dropdown list.
Step 10 − Type the following in Bounds under AXIS OPTIONS in
the Format Axis pane −
0 for Minimum.
1 for Maximum.
Repeat the steps given above for the Secondary Axis to change the
Bounds to 0 and 1.
Both the Primary Axis and Secondary Axis will be set to 0% - 100%.
As you can observe, the Target Column hides the Actual Column.
Step 11 − Right click on the visible Column, i.e. Target.
Step 12 − Select Format Data Series from the dropdown list.
In the Format Data Series pane, select the following −
No fill under the FILL option.
Solid line under the BORDER option.
Blue under the Color option.
Step 13 − In Chart Elements, deselect the following −
Axis → Primary Horizontal.
Axis → Secondary Vertical.
Gridlines.
Chart Title.
Step 21 − Resize the Chart Area to get the Thermometer shape for
the chart.
You got your Thermometer chart, with the Actual Value as against
Target Value being shown.
Step 22 − You can make this Thermometer chart more appealing
with some formatting.
Insert a Rectangle shape superimposing the blue
rectangular part in the chart.
In the Format Shape options, select the following −
Gradient fill for FILL.
Linear for Type.
1800 for Angle.
Set the Gradient stops at 0%, 50% and 100%.
For the Gradient stops at 0% and 100%, choose the color
black.
For the Gradient stop at 50%, choose the color white.
Insert an oval shape at the bottom.
Format the oval shape with the same options as of
rectangle.
The result will be as shown below −
Your aesthetic Thermometer chart is ready. This will look good on a
dashboard or as a part of a presentation.
GAUGE CHART
A Gauge is a device for measuring the amount or size of something,
for example, fuel/rain/temperature gauge.
There are various scenarios where a Gauge is utilized −
To Gauge the temperature of a person, a thermometer is
used.
To Gauge the speed of an automotive, a speedometer is
used.
To Gauge the performance of a student, a mark sheet is
used.
Preparation of Data
Consider the following data −
As you can observe, the upper half of the Pie chart is what you will
convert to a Gauge chart.
Step 7 − Right click on the bottom Pie slice.
Step 8 − Click on Fill. Select No Fill.
You can see that the Pie slice on the right represents the Profit %.
Step 9 − Make the chart appealing as follows.
Change the Fill colors of the Pie slices.
Click on the right Pie slice, select 3-D FORMAT as Top
bevel, and choose Angle.
Click on the left Pie slice, select 3-D FORMAT as Top
bevel, and choose Divot.
Click on the right Pie slice, select 1% as Point Explosion
under SERIES OPTIONS.
Click on the right Pie slice and add Data Label.
Size and Position the Data Label.
Deselect Legend in Chart Elements.
Give the chart Title as Profit % and Position it.
Your Gauge chart is ready.
Preparation of Data
Arrange the data for values as given below.
This data will be used for Doughnut chart. Arrange the data for
Pointer as given below.
The value in the cell C9 is the Pointer size. You can take it
as 5 for brevity in formatting and later change to 1, to
make it a thin pointer.
Step 5 − Right click on the chart and select Format Data Series.
Step 6 − Type 271 in the box – Angle of first slice in the SERIES
OPTIONS in the Format Data Series pane.
Check the box Secondary Axis for the Pointer series. Click
OK.
<30% Poor
With the above values, the Bullet chart looks as shown below.
Though we used colors in the above chart, Stephen Few suggests the usage of
only Gray shades in the interest of color-blind people.
Bullet Charts are widely used by data analysts and dashboard vendors.
You can use Bullet chart to track the number of defects in Low, Medium
and High categories.
You can visualize the Revenue flow across the Fiscal year.
Preparation of Data
Arrange the data as given below.
As you can observe, the qualitative values are given in the column – Performance.
The Bands are represented by the column – Value.
Step 6 − As you can see, the Primary and the Secondary Vertical Axis have
different ranges. Make them equal as follows.
Right click on Primary Vertical Axis and select Format Axis.
Click on the AXIS OPTIONS tab in the Format Axis pane.
In AXIS OPIONS, under Bounds, type the following −
0.0 for Minimum
1.0 for Maximum
Repeat the above steps for Secondary Vertical Axis.
Step 9 − Right click on the column for Value (blue color in the above chart).
Step 10 − Select Format Data Series.
Step 11 − Change Gap Width to 500% under SERIES OPTIONS in Format Data
Series pane.
Step 1 − You can then define a Limit for number of defects and represent the
number of defects found by a Value. Add Value and Limit to the above table.
As you can see, the ranges are changed to correctly interpret the context.
FUNNEL CHART
Funnel chart is used to visualize the progressive reduction of data
as it passes from one phase to another. Data in each of these
phases is represented as different portions of 100% (the whole).
Like the Pie chart, the Funnel chart does not use any axes either.
For example, in a sales pipeline, there will be stages as shown
below.
Prospects → Qualified Prospects → Proposals → Negotiations →
Final Sales.
Typically, the values decrease gradually. Many prospects are
identified, but a part of them are validated and even lesser qualify
for Proposals. A still lesser number come for negotiations and in the
end, there is only a handful of deals that are won. This will make the
bars resemble a funnel.
Preparation of Data
Place the data values in a table.
You will get the following data. You will use this table to create the
Funnel chart.
Step 2 − Right click on the Shift Bars (blue color in the above chart)
and change Fill color to No Fill.
Step 2 − Select the data and insert a 3-D Stacked Column chart.
Step 6 − Right click on the Bars and select Format Data Series
from the dropdown list.
Step 7 − Click on SERIES OPTIONS in the Format Data Series
pane and type 50% for Gap Depth under SERIES OPTIONS.
Step 2 − Fill the cells with % values, starting with 1% in the left-
bottom cell and ending with 100% in the right-top cell.
Step 3 − Decrease the font size such that all the values are visible
but do not change the shape of the grid.
This is the grid that you will use for the Waffle chart.
Step 1 − Create a Waffle chart that displays the Profit% for the
Region East by applying Conditional Formatting to the Grid you
have created as follows −
Step 2 − Define another rule to format values > 85 % (give the cell
reference of the Profit %) with fill color and font color as light green.
Step 3 − Give the Chart Title by giving reference to the cell B3.
As you can see, choosing the same color for both Fill and Font
enable you not to display the %values.
Step 4 − Give a Label to the chart as follows.
Insert a Text box in the chart.
Give the reference to the cell C3 in the Text box.
You can also make Waffle charts for different regions distinctly, by
choosing a variation in the colors for Conditional Formatting.
As you can see, the colors chosen for the Waffle charts on the right
are varying from the colors chosen for the Waffle charts on the left.
HEAT MAP
Heat Map is normally used to refer to the colored distinction of
areas in a two dimensional array, with each color associated with a
different characteristic shared by each area.
In Excel, Heat Map can be applied to a range of cells based on the
values that they contain by using cell colors and/or font colors.
Excel Conditional Formatting comes handy for this purpose.
Preparation of Data
Arrange the data in a table.
As you can see, the data is for a fiscal year, April – March, month-
wise for each product. You can create a Heat Map to quickly identify
during what months the sales were high or low.
As you can see, the numbers are not visible. Next, you need to
highlight the top five and bottom five values without displaying the
numbers.
Step 2 − Select the data (which is not visible, of course).
Step 3 − Apply Conditional Formatting such that the top five values
are colored with green (both fill and font) and the bottom five values
are colored with red (both fill and font).
Step 1 − Select the data. Copy and paste the data below the last
row of the data.
Step 2 − Copy and paste the entire data on the right side of the
data. The data looks as given below.
Step 3 − Delete the cells highlighted in red that are depicted in the
table of second set of data given below.
Step 4 − Shift the cells up while deleting. The second set of data
looks as given below.
Step 5 − Copy the second set of data and paste it to the right side
of it to get the third set of data.
Step 6 − Select the third set of data. Sort it from the smallest to the
largest values.
You need to use this sorted third set of data to create the Step chart.
Step 4 − Click OK in the Select Data Source dialog box. Your chart
will look as shown below.
Step 2 − Create a third table from the second table, computing the
differences −
Retain the first row – Minimum Value as it is.
In the second row – compute values as First Quartile -
Minimum Value.
In the third row – compute values as Median Value - First
Quartile.
In the fourth row – compute values as Third Quartile -
Median Value.
In the fifth row – compute values as Maximum Value -
Third Quartile.
Step 5 − Right click on the bottom Data Series. Click Fill and select
No Fill.
Step 8 − Now, your Boxes are ready. Next, you have to create the
Whiskers.
Right click on the Top Data Series.
Change Fill to No Fill.
Click the DESIGN tab on the Ribbon.
Click Add Chart Element in the Chart layouts group.
Click Error Bars in the dropdown list and select Standard
Deviation.
Step 9 − You got the top Whiskers. Next, format Whiskers (Error
Bars) as follows −
Step 10 − Click the Fill & Line tab under ERROR BAR OPTIONS in
the Format Error Bars pane.
Select Solid line under LINE.
Select the color as dark blue.
Type 1.5 in the Width box.
Step 11 − Repeat the above given steps for the second lower
bottom Series.
Step 12 − Next, format the boxes as follows.
Right click on one of the Box series.
Click Fill.
Choose color as light blue.
Click Outline.
Choose the color as dark blue.
Click Weight.
Select 1½ pt.
Step 13 − Repeat the steps given above for the other Box series.
What is a Histogram?
A Histogram is represented by rectangles with lengths
corresponding to the number of occurrences of a variable in
successive numerical intervals. The numerical intervals are called
bins and the number of occurrences is called frequency.
The bins are usually specified as consecutive, non-overlapping
intervals of the variable. The bins must be adjacent and are of equal
size. A rectangle over a bin with height proportional to the frequency
of the bin depicts the number of cases in that bin. Thus, the
horizontal axis represents the bins whereas the vertical axis
represents the frequency. The rectangles are colored or shaded.
A Histogram will be as shown below.
Advantages of Histograms
Histogram is used to inspect the data for its underlying distribution,
outliers, skewness, etc. For example, Histogram can be used in
statistical analysis in the following scenarios −
Preparation of Data
Consider the data given below.
Create bins and calculate the number of values in each bin from the
above data as shown below −
This table is called a Frequency table and we will use it to create the
Histogram.
Creating a Histogram
Following are the steps to create a Histogram.
Step 1 − Select the data in the Frequency table.
Step 2 − Insert a Clustered Column chart.
Step 3 − Right click on the Columns and select Format Data Series
from the dropdown list.
Step 4 − Click SERIES OPTIONS and change the Gap Width to 0
under SERIES OPTIONS.
Preparation of Data
Consider the following data, where the defect causes and the
respective counts are given.
As you can observe, 80% of the defects are due to two causes.
ORGANIZATION CHART
You can illustrate the reporting relationships in your team or
organization using an organization chart. In Excel, you can use a
SmartArt graphic that uses an organization chart layout.
As you can observe, you can enter the text in the left pane and it
appears immediately on the chart on the right. The box that has a
bullet with line attached in the left pane indicates that it is Assistant
box in the chart. The boxes with bullets in the left pane indicate they
are part of hanging layout in the chart.
Step 7 − Enter the information in the Text pane.
Step 8 − Demote if there is reporting relationship.