How To Win With Your Data Visualizations The 5 Part Guide For Junior Analysts To Create Effective Data Visualizations And... (Clarke, Elizabeth) (Z-Library)
How To Win With Your Data Visualizations The 5 Part Guide For Junior Analysts To Create Effective Data Visualizations And... (Clarke, Elizabeth) (Z-Library)
YOUR DATA
VISUALIZATIONS
THE 5 PART GUIDE FOR JUNIOR
ANALYSTS TO CREATE EFFECTIVE
DATA VISUALIZATIONS AND
ENGAGING DATA STORIES
ELIZABETH CLARKE
© Copyright Elizabeth Clarke 2021 - All rights reserved.
The content contained within this book may not be
reproduced, duplicated, or transmitted without direct written
permission from the author or the publisher. Under no
circumstances will any blame or legal responsibility be held
against the publisher, or author, for any damages, reparation,
or monetary loss due to the information contained within this
book. Either directly or indirectly. You are responsible for
your own choices, actions, and results.
Legal Notice:
This book is copyright protected. This book is only for
personal use. You cannot amend, distribute, sell, use, quote
or paraphrase any part, or the content within this book,
without the consent of the author or publisher.
Disclaimer Notice:
Please note the information contained within this document
is for educational and entertainment purposes only. All effort
has been executed to present accurate, up-to-date, and
reliable, complete information. No warranties of any kind are
declared or implied. Readers acknowledge that the author is
not engaging in the rendering of legal, financial, medical or
professional advice. The content within this book has been
derived from various sources. Please consult a licensed
professional before attempting any techniques outlined in
this book.
By reading this document, the reader agrees that under no
circumstances is the author responsible for any losses, direct
or indirect, which are incurred as a result of the use of the
information contained within this document, including, but
not limited to, — errors, omissions, or inaccuracies.
TABLE OF C ONTENTS
Introduction
Conclusion
Resources
FREE DATA VISUALIZATION
CHECKLIST!
C H OOSI NG TH E R I GH T C H AR T
Now, do not get me wrong. I am not
knocking the tremendous contribution that a
good chart or well-dressed set of numbers can
have to a data presentation. The problem is
that too many people focus solely on this and
forget the narrative to support these data
presentation tools.
The correct chart presented at the right time
can take a simple analysis and give it a visual
form that people can use to develop a mental
image of what you are presenting. This will
help your presentation stick in their minds. It
will make you memorable.
Data visualization is another crucial
component of the data storytelling process
that you must nail on the head. You will
surely learn all you need to know about
presenting the visual aspect of your data
presentation in this part of the book.
A WI NNI NG DESI GN
Sight is the most used of all the human
senses. Use that knowledge to your benefit.
Do not just drop bland, tasteless charts and
graphs on your audience’s laps and call it a
day. That will only leave a bad taste in their
mouths and make you look less than the
competent business person you are.
The face of any data presentation is the
design. Having clean, concise, clutter-free
designs is crucial to keeping your audience
informed and engaged. As they say –
presentation is everything. When it comes to
data storytelling, you need to take the saying
literally and figuratively.
C R AFTI NG A WI NNI NG DATA
STOR Y
The parts discussed above are essential, but
none of them will give your data presentation
the edge it needs alone. You have to bring
them all together to amaze, astound, educate,
and convince your audience. This last part of
the book shows you how to do just that,
painlessly and cohesively.
I run into large amounts of data every day in
my career in marketing and social analytics.
If you are anything like me, you find yourself
fascinated with the components that make up
company growth statistics, consumer and
social analytics, sales figures, expenditure
reports, and all the other numbers that show a
company’s performance. While these figures
excite the senses of the data science nerd in
me, I also realized that they could look like a
foreign language to someone else. I have a
few failed data presentation stories of my
own that I can tell.
When I was new to my career, I struggled to
translate these figures to other people. Still,
experience and knowledge collection have
taught me the best ways to transform any
form of data into a language that anyone can
understand. Translation through storytelling
is a method that cannot be beaten.
Data storytelling is a vital part of any
company’s growth and management. Big
businesses need it and so do small ones.
Executives that have proved their mantle
need it no matter how much they advance in
their careers, and so too does the intern just
starting in a field. My passion is to help as
many people and businesses across the board
take raw data and translate those bytes into
stories that encourage change that allows
meeting goals and targets. Call me quirky,
but I always love when people reach out to
me with stories about how the words I have
written have helped inspire the change they
need to take their professional lives to the
next level.
I hope to hear a similar story from you. A
data story of how much you have achieved
using the words in this book, perhaps.
I suggest you get out a highlighter and mark
anything you find valuable and worthy of
remembering. One of my favourite tricks for
revisiting valuable information with ease.
The data shows that anyone who reads past
this page to the first chapter dramatically
increases their chances of becoming a data
storytelling superstar. So, what are you
waiting for? Turn the page!
1
C OUNTERINTUITIVE D ATA
This narrative shows surprising data that
contradicts what the norm is. The
counterintuitive nature of data invites further
exploration, especially in instances where it
has an adverse or alternatively, favorable
impact on the business. For example, the
analyst may show the executives a surprising
increase in website optimization software
outside of the peak season. This unexpected
data needs further exploration as to why this
happened to determine if more sales can be
derived from the revealed answer.
Demographic
Age
Knowledge of the subject matter that
will be prominent in your data
presentation
Careers and educational background
Bar charts
Pie charts
Road maps
Country maps
Area maps
Flowcharts
Tables
Pictographs
Scatterplots
Venn diagrams
Hierarchy diagrams
CAPTIVATING YOUR
AUDIENCE
— SIMON RAYBOULD
B ydatanow,storyyoupresentation’s
would have realized that your
success depends
on the depth of knowledge that you have
about your audience. You need to have a
profile of these people before you take even
the first step in putting your information
together and certainly before you start
designing that presentation. Doing otherwise
is like hunting in the dark and hoping to hit
the target - a nearly impossible task.
Ensure that you hit the mark as close to the
bullseye by putting in that preliminary
groundwork about the people you will be
presenting to. Ensure that you captivate these
people to have the highest chance of eliciting
the change you want from making that
presentation.
The question that has many junior analysts
abandoning this vital task is the how of it.
How exactly do you go about finding what
you need to know about your audience so that
you cater your data story to touch their
emotions and better get your point across?
Luckily, this is not a matter of guesswork.
You can implement proven methods to gather
the information you need to know about this
group. This chapter focuses on the “how” of
getting this information and using it to
maximum effect.
— ABRAHAM LINCOLN
— DANTE VITAGLIANO
Comparison
Composition
Relationship
Distribution
Column charts
Bar charts
Tables
Line charts
Column charts
Circular area charts
Composition
This type of data allows noting how part of a
data set can compare to the whole data set.
Data stories that show composition can be
static or show change over time. Also,
composition data may be expressed in
absolute numbers or in relative forms such as
percentages to show the variations of parts of
the whole. Just like comparison data,
composition data is a widely used type of
data.
Examples of visualizations that can be used
to show static composition include:
Pie charts
Waterfall charts
Column charts
In the case of composition data that changes
over time, commonly used charts include
Column charts
Area charts
Relationship
This type of data shows the connection
between at least two variables in a given set
of data. An example of a relationship with
only two variables may be children's height
relative to their age. Another data set
exploring relationships with multiple
variables includes website conversions from
specific demographics such as age, gender,
etc.
If someone just dumped a bunch of numbers
on you, there is no way of finding the
correlation between these numbers until they
are grouped to show relationships. Good
visualizations help determine these
relationships.
Scatter plots and bubble charts are typically
used to show relationships in data
presentations. Scatter plots are more
commonly used when there are only two
variables, while bubble charts are more
commonly used when there are more than
two variables.
Distribution
Useful in developing trends, this type of data
shows how variables in a set of data or
multiple steps of data are distributed over
time. With trends, probabilities can be
developed to offer predictions of possible
outcomes based on historical information. For
example, data from a swim club may show
that swimmers in different height categories
swim at different paces. This data can be used
to make future predictions as to who might be
the top competitors based on different swim
categories. Column charts, line histograms,
scatter plots, and 3D area charts commonly
show distribution data.
BAR GR APH S
This type of chart has a lot of aliases. It also
goes by the name of a column chart or bar
graph. It is so named because it allows data
visualizations where numeric values are
featured in the form of bars. The levels of
these bars are plotted on one axis while the
values are plotted on the other axis. Each
category of data is highlighted on one axis,
and the length of that bar corresponds to the
value on the other axis. Bar charts can make
use of either vertical or horizontal bars. The
categories are placed on the horizontal axis
when vertical bars are used, and the opposite
is true when horizontal bars are used. Vertical
bars are the norm, but horizontal bars are
good practice when working with long
category labels. Whether you use vertical
bars or horizontal bars, the thing you need to
ensure is that you accurately label each axis.
FIGURE 1 Bar graph
FIGURE 1.1 Horizontal bar graph
PI E C H AR TS
Pie charts are so commonly used in data
visualization that some people might say that
they are overused. We will delve into when it
is appropriate to use pie charts and times
when it is inappropriate. Also, we will look at
how to determine what is proper and
improper related to pie chart usage. However,
before we get to that, let's look at pie charts
and their value to data visualization.
Circular in nature, pie charts are data
visualization tools that use slice sizes to
depict parts of a whole or highlight the
relationship between multiple datasets.
Investors share percentage, for example.
The primary use of the pie charts is to
compare the groups contained within one set
of data. Suppose we reuse our investor share
example from above. In that case, we can
either develop one pie chart showcasing what
percentage each investor owns. Or, we can
create three separate pie charts to show each
investors shares in relation to the whole. In
this case, creating three different pie charts
doesn’t make all that much sense, and those
two types of data cannot be compiled into the
same pie chart as these data points would
confuse your audience. Compilation is best
done using bar charts.
FIGURE 1.3 Non effective chart - Not the
best option for this set of values as it is
doesn’t visually represent the data
effectively or draw an easy conclusion.
FIGURE 1.4 The more effective choice-
A better option for this set of values as
you can immediately distinguish who
owns what % of shares in relation to each
other.
LI NE GR APH S
Also called a line chart or a line plot, a line
graph is just what it sounds like. It is a type of
data visualization that shows continuous
progression using lines from left to right to
show changes in value. This constant
progress is shown on the chart’s horizontal
axis, while the vertical axis shows the value
metrics that highlight that change. For
example, a marketing agency might use a line
chart to show how the website traffic from
their top advertising campaigns for one of
their clients has been distributed over the first
3 quarters going into Q4. The line will show
the progression from January to September
on the horizontal axis, while the vertical axis
will show the metric value, which is the
number of website visitors and how it differs
throughout the first 3 quarter’s. With this
information they can see how their campaign
performance is and note some possible
projections going into Q4.
FIGURE 1.6 Comparison - An effective use of a line
graph comparing the performance between campaign
A and campaign B over the first three quarters.
AR EA C H AR TS
A slightly more complicated chart is the area
chat. What makes the area chart special is
that it combines a bar chart and a line chart to
show the progression of a variable compared
to another set of data. This progression is
usually demonstrated over intervals of time.
The difference between a line chart and an
area chart is the shading notable between the
lines and the horizontal axis.
Area charts are typically used to show
comparisons between multiple variables or
how one set of data is divided into different
proportions. Because of this, there are two
main types of area charts.
The first one is called an overlapping area
chart. This type of area chart shows the
comparison between variables and different
sets of data. This type of chart offers the
standard line, but each point plotted on the
vertical axis indicates the value for every
variable in the different datasets. Each plot
point has shading between the line and the
horizontal axis. Of course, this shading can
add a little panache to an area chart, but it
shows the greatest value in each variable and
differentiates each variable from the others.
As a result, such a chart will typically be
distinct as it has figures that look like
mountain peaks. An example of an
overlapping area chart could be monitoring
website traffic during a product launch
throughout the day from different sources.
SC ATTER PLOTS
Mainly used to show the relationship between
at least two variables, scatter plots use dots to
represent values based on these variables and
how they correlate in relation to one another.
These points allow for reporting the
relationship between two variables and show
patterns in the distribution of that data.
The relationship between the plotted points
can show various positive, linear, and strong
patterns. Such a pattern shows the
distribution of these dots in a line that has an
upward trend. On the other hand, the
distribution of dots may indicate no clear
relationship between the two variables as
these dots are plotted all over the chart. Of
course, other relationships can be shown
between these two extremes, such as one that
is non-linear but still strong. To make the
general trends that are developed by the plot
points in this type of chart easy to spot, it is
helpful to draw lines based on the distribution
of those points. This line is known as a trend
line.
Scatter plots are not just limited to the use of
only two variables. A third variable can be
added, but more variables are not
recommended as this will lead to creating a
cluttered chart that is hard to understand. The
plotting of too many variables is called
overplotting, and it is so-called because
having too many variables and dots makes it
challenging to understand the relationship
between them.
As great as scatter plots are, there is one
possible problem that you may run into when
using them. While you can note the
relationship between two variables, you
cannot determine what causes this
relationship based on the points plotted on a
scatter plot. The counter to this limitation
with this type of chart is that it invites further
investigation.
Here are some examples of possible scatter
plot outcomes:
FIGURE 1.9 No relationship known.
FIGURE 1.10 Moderate, negative, linear
relationship.
FIGURE 1.11 Strong, non linear
relationship.
FIGURE 1.12 Strong, positive, linear
relationship.
USI NG TABLES
At this point, I am sure that you might be
wondering one thing - so, what about tables?
Don't worry. I have not forgotten about them.
Tables are often the source of the data you
will develop into visualizations throughout
your career developing data stories. You
might even find that they make good data
visualizations when showcasing comparison,
composition, and relationship data types.
Especially when there are few variables or
data points to be outlined to your audience.
Other times when it would be appropriate to
use tables in your data presentation include:
Tableau
Qlikview
FusionCharts
— GARR REYNOLDS
Form
Color
Spatial position
Movement
Length
Width
Orientation
Size
Curvature
Shape
Added marks
Enclosure
Hue
Grouping
Positioning
Intensity
FIGURE 2 Pre-attentive attributes
Isolation
This law focuses on removing objects that are
not necessary to impact information so that
the focus remains one of the relevant
elements. As a data storyteller, this means
that you keep your visuals as simple as
possible. All information that is irrelevant to
the presentation should be kept off your
visuals. For example, if the marketing
manager wants the marketing team to focus
on 2021 sales, there is no reason why they
should also display the sales for 2020 in their
visualizations.
Grouping
This artistic law focuses on the similar
grouping of objects. This allows the audience
to note the commonalities that these features
share and the minor differences.
The marketing manager may choose to group
the sales of the best-performing months so
that the marketing team can see the
similarities between them. This can then lead
to further information on what causes sales to
be high during these particular times and
what caused the disparity between these
months.
Contrast
Contrast allows the audience to better note
the communicated information due to
different colors placed next to each other.
Data tends to become misconstrued by the
brain when it all looks the same, even when
there are differences. Therefore, using this
artistic law allows your audience to better
note your information as there are clear
disparities between each set.
The marketing manager may choose to
display 12 months of sales information on a
bar chart by utilizing colors such as red and
green so the marketing team can get a more
truthful perception of each month’s
advertising performance. Green can be used
for the profitable campaigns and red for the
not so profitable campaigns.
Perceptual Problem Solving
This artistic experience is about getting the
mind to actively work to reach a conclusion
about the visual information that is being
displayed. Such an artistic experience relies
on the artist creating a display that allows the
audience to follow a path of logic to reach
that conclusion. In other words, it relies on
that “aha” moment after a moment of
thought.
For example, the marketing manager may
display two sets of information side by side.
These datasets are the sales information and
changes made to the main website landing
page at certain times of the year. They aim to
get their audience to follow a path of logic
that shows that the placement of their best
sellers on the website landing page converts
better than showcasing the cheapest products.
The Generic Viewpoint
This artistic rule ensures that an audience
should not have to take a unique perspective
to gain information from watching a display.
This means not allowing the visual display to
remain up to interpretation. You do not want
your audience to be confused. Make the
visual display as straightforward as possible
so that the audience's innate understanding of
that data is supported no matter how this
person looks at it.
Visual Metaphors
In literature, a metaphor is defined as one
thing that is a representation of something
else entirely. From an artistic standpoint, this
is about relating seemingly unrelated ideas.
Using Visual metaphors gets your audience’s
mental juices flowing and allows them to be
more engaged with your visualizations. The
marketing manager's display of seemingly
unrelated information such as the sales data
and placement of landing page items can be
seen as a visual metaphor.
Symmetry
This artistic experience relies on making
objects proportionately pleasing to the eye.
Think of it like watching a drawn skyline.
Suppose there is a building that seems
crooked in relation to the others. In that case,
this makes for a visually displeasing
experience to most people, even if they are
unaware of what aspect is causing this
displeasure.
The same thing will happen if your
visualizations have aspects that seem like
they do not originate from a central axis.
Sequential Palettes
This type of color palette makes use of color
by adding variations with different saturation.
For example, there may be six variations that
need to be presented on a chart. The color
used on this chart to show these data
variations is pink. Each of these variations
will have its value represented by a different
shade of pink.
Diverging Palettes
When plotting variables with a central value
like zero, divergent palettes are typically the
go-to color palette. Two different sequential
palettes are combined to show the movement
of values. Values on either side of that central
value are assigned a different color gradient.
One of the most common situations where
divergent color palettes are used is when
negative and positive values are highlighted
in one chart. The same rules for designing
when using a sequential palette apply when
creating charts with diverging palettes.
— AL SHALLOWAY
M ovies,
They
novels, and even data stories.
are modern examples of
storytelling. However, storytelling is not
new to humanity. Cave paintings show that
our cavemen ancestors have been doing it a
millennium back. In fact, there are cave
paintings found that date back between
17,000 and 15,000 BCE.
Of course, with verbal communication came
oral story storytelling, and with written
language arose written storytelling. Written
stories have been found dated back as far as
3,400 BCE. Those stories were written on
clay tablets.
Luckily, some of the stories we find most
precious are not written on such fragile
things. With technological advancement, we
can now listen to, read, and tell stories on
radios, televisions, tablets, computers, and
even on the go with our smartphones. What a
way we have come!
To have withstood millennia of human
changes, storytelling must be powerful
indeed, and it has been shown that its powers
are unlimited. Storytelling has the power to
shape how we view the world and ourselves.
This one thing has the ability to determine the
prejudices that we develop (and yes, we all
have biases) and the values and morals that
we hold. It helps us understand and remember
valuable information. It helps us cohabitate
with each other via communication. Of
course, we can deliver communication via
hard facts, but stories give us context and
deliver valuable insight that would have
otherwise been lost.
All of these powers and more are yours to
harness because by using storytelling, you
can translate data into something
understandable, relatable, and actionable.
Through storytelling, data analysts can turn
numbers that mean nothing to an audience,
into a vehicle to drive change.
Just as there are steps and processes involved
in creating a magnificent story through a
novel or a movie, there are also steps and
processes involved in creating a magnificent
data story. Winging it will not do. Just as the
human body is composed of specific parts to
make complete anatomy, your data story
needs to hold certain features for it to be
complete and deliver value to your audience.
Without these components, you will have a
poorly executed data story.
This book was written to ensure that your
data story has complete anatomy. It was
written to give you the steps and processes
necessary for developing a magnificent data
story each and every time. This part of this
book shows you how to bring all the steps
and processes together beautifully.
H OW TO EXPLAI N YOUR C H AR T
As great as your visuals may be with
aesthetic colors, appropriate titles and labels,
and all the like, they need to be followed up
with verbal communication that reinforces
what is seen and adds relevant context.
I get it. Verbal presentations may not be some
of our strong suits. And left to our own
devices, we tend to make a muck of things.
But that will not be the case. You can fall
back on a methodology to make the most out
of explaining your charts, and it is called
Schneiderman’s mantra.
Ben Schneiderman developed this mantra.
Born in 1947, he is an American computer
scientist and a professor at the University of
Maryland. He proposed his mantra as a way
for data analysts to understand how people
visually engage content. An overview of this
visual engagement goes like this:
The Introduction
The rising action
The climax
Conclusion
TH E I NTR ODUC TI ON
In a book or film, this section is a
representation of the main characters going
about their daily lives before they are
ultimately thrown through the wringer to
come out as a changed person. It is the setup
to show where this character is now so that
the audience has context as to how they will
move through the main parts of the story. It is
what allows the audience reading or watching
to develop a connection with this character.
When the audience cares, the introduction has
performed its function and hooked these
people.
In data storytelling, the introduction has the
same function. It is a setup meant to give
context and hook the audience. This is where
the analyst will show the audience what
problem brought everyone to that setting on
that day, the benefits to solving that issue,
and possible solutions to solving the problem.
The audience needs to care about moving
through the guided tour that starts with the
problem and ends with the solution.
Let’s use an example to illustrate. A
marketing manager is presenting to the CEO
and CFO in regards to ad performance, and
they are introducing the problem by saying:
“Since we introduced our advertising
campaign in Quarter 2 of 2019, traffic to our
website has grown steadily by approximately
15% to 25% each quarter. We anticipated
that this trend would continue to be the case
for quite some time. However, that prediction
has not come to fruition. In the first quarter
of 2021, we observed a staggering 30%
decrease in website traffic, and it got us
wondering why…”
FIGURE 3.3 Ask the right questions - Ask the right
questions so your audience can draw the proper
solution on their own without you needing to persuade
them.
TH E C LI MAX
This step in storytelling represents that
turning point in the story where the changes
that will come to fruition become clear. It is
the highest point of tension in the story. It is
the height of the anticipation built in the
rising action step. Remember that this part
will not serve its purpose unless you set the
foundation with a good introduction and the
anticipation developed in the rising action.
In data storytelling, the climax is your ‘aha’
moment. It finally reveals the things that the
presenter was alluding to in the introduction
and rising action portions. Things should
become clear to the audience then, leaving
them with a sense of fulfillment.
Back to our example, the marketing manager
may say something like this:
“It took some digging, but then we found a
problem. At the time that the traffic decrease
was recorded, we had lowered the
advertising budget to test other markets
beyond our traditional audiences.
Additionally, we have found that due to this
budget drop, we started to lose out on front-
page bids. Due to this, our competitors' ads
were outbidding us for the top spots, gaining
more visibility with our primary audience.”
www.facebook.com/groups/junioranalystsmastermind/
FREE GOOD KARMA
— JAGAT SAIKIA
Introduction
Conclusion
Resources
RESOURCES