Visualizing Descriptive Statistics and Analytics: Progress Check
Visualizing Descriptive Statistics and Analytics: Progress Check
LO 10-2
Explain how descriptive analytics
incorporates visualizations in
communicating its results.
Recall from Chapter 6 that descriptive statistics procedures summarize data to determine
what happened. As you recall, descriptive statistics are brief summaries (or factoids) of a
data set that provide a representation of the data set as a whole including basic statistics,
including the mean, standard deviation, minimums, and maximums of a data set. Exhibit 10.5
depicts a simple table showing Amazon’s net income and sales from 2008 through 2018.
Presented in tabular form, it is difficult to see the changes of the measures over time.
EXHIBIT 10.5
Table of Financial Performance for Amazon from 2008–2018 ($ in millions)
Source: Amazon Income Statements 2008–2018.
When that same data is visualized using a bar chart, as shown in Exhibit 10.6, you can
quickly recognize that Amazon’s sales have grown steadily each year. Note the chart
components:
EXHIBIT 10.6
Bar Chart of Financial Performance for Amazon from 2008–2018 ($ in millions)
Source: Amazon Income Statements 2008–2018.
A vertical axis with a scale from 0 to 250,000, with tick marks indicating each incremental
50,000 increase.
A horizontal axis indicating the years from 2008 through 2018.
Bars indicating the data series for each sales data point of each year.
page 531
Bar charts should be sorted logically. When using time series data (values taken on by a
variable are listed in time order: days, months, or years), it typically makes the most sense to
sort the data chronologically. However, bar charts also lend themselves to sorting based on
the numerical value of the measure. In this instance, sales increased each year, so it would
not change if sorted ascending based on sales.
Due to the time series nature of this data, a line chart would also work. Line charts should
only be sorted chronologically. Depicting Exhibit 10.1’s sales from five different companies
as a line chart would not make sense because there is nothing that inherently connects the
different companies’ sales totals from data point to data point. In contrast, the data in Exhibit
10.5 can be depicted as a line chart because the net income is provided over a 10-year period
as shown in Exhibit 10.7. Not only do line charts lend themselves to time series data, but they
also lend themselves well to numerical data that extends below 0. In this instance, 2012 and
2014 each had negative net income.
EXHIBIT 10.7
Line Chart of Net Income for Amazon from 2008–2018 ($ in millions)
Source: Amazon Income Statements 2008–2018.
This net income data could technically be depicted with a bar chart as shown in Exhibit
10.8; however, it is clunkier in its presentation than the sales data. The data point for 2012 is
not even visible on the bar chart due to the negative amounts. Line charts are also preferable
when the overall trend is most important to communicate rather than specific page 532
data points. Line charts provide more flexibility with scales.
EXHIBIT 10.8
Bar Chart of Net Income for Amazon from 2008–2018 ($ in millions)
Source: Amazon Income Statements 2008–2018.
Typically, ratio data should be visualized with 0 as the starting point, especially in a bar
chart. However, in a line chart, the trend will stand out the same way, even if your starting
point makes more sense at a point above or below 0.
Another type of chart to depict descriptive analytics is a pie chart. Pie charts can be
useful for showing proportion and visualizing categorical data. Recall that categorical data
tend to be represented by words, such as categorizing transaction types (e.g., sales versus
returns).
Pie charts can be used to show proportion when it is meaningful for numerical data.
Proportion is the number of observations in one particular category divided by the grand
total of observations. However, in Exhibit 10.9, the proportion of each year’s sales over a
period of 10 years is not particularly useful. It is also difficult to tell the year each slide
represents based on the legend. Pie charts are rarely preferred if there are more than six
categories (or “slices” of pie).
EXHIBIT 10.9
Pie Chart of Financial Performance for Amazon from 2008–2018 ($ in millions)
Source: Amazon Income Statements 2008–2018.
page 533
PROGRESS CHECK
5. When is a line chart preferred over a bar chart?
6. When you are visualizing data that is presented over time, which charts are the best options?
7. What type of data does a pie chart present?
LAB CONNECTION
Lab 10.1 provides an example of creating a dashboard in Excel with filters, and Lab 10.2 provides an
example of creating a dashboard in Tableau.
page 534
EXHIBIT 10.11
Histogram of Grade Distribution
An example of aged receivables (created using the Excel PivotTable in Lab 6-1) shows an
aging analysis, which can translate into a modified histogram (Exhibit 10.12). It is modified
for the following reasons:
EXHIBIT 10.12
Example of Aged Receivables from an Excel Pivot Chart (using Lab 6 data)
The vertical axis represents a sum of transaction amounts in each bin, instead of a count of
transactions.
The lowest bin starts at 1, instead of 9 (which is the actual lowest value in the range of data).
The data presents like a bar chart, due to the spaces between the bars.
page 536
Regardless of these differences, the way one would interpret this chart is similar to the way
one would interpret the results of a histogram. There does not seem to be a significant skew,
with the bulk of the receivables being in the middle intervals of the dataset.
For comparison’s sake, look at a count of the aged receivables transaction in the actual
histogram in Exhibit 10.13. The distribution is very similar.