0% found this document useful (0 votes)
13 views96 pages

04 Data Communication

Uploaded by

Thyago Miranda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views96 pages

04 Data Communication

Uploaded by

Thyago Miranda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 96

Data Communication

Graphics that tell stories in an engaging way


Cédric Scherer // rstudio::conf // July 2022
Data Visualization
is any graphical representation

of information and data.

Cédric Scherer // rstudio::conf // July 2022


Data Visualization
converts information into visual 

forms as quantifiable features.

Cédric Scherer // rstudio::conf // July 2022


Data Visualization
helps to amplify cognition, gain insights,

discover, explain, and make decisions.

Cédric Scherer // rstudio::conf // July 2022


Visualize Your Data

Cédric Scherer // rstudio::conf // July 2022


Visualize Your Data

“When Dmitry Kobak and Sergey


Shpilkin [...] analysed the results,
they found that an unusually high
number of turnout and vote-share
results were multiples of five 

(eg, 50%, 55%, 60%), a tell-tale sign
of manipulation.”

“Russian elections once again had



a suspiciously neat result” 

by The Economist

Cédric Scherer // rstudio::conf // July 2022


Visualize Your Data

“When Dmitry Kobak and Sergey


Shpilkin [...] analysed the results,
they found that an unusually high
number of turnout and vote-share
results were multiples of five 

(eg, 50%, 55%, 60%), a tell-tale sign
of manipulation.”

“Russian elections once again had



a suspiciously neat result” 

by The Economist

Cédric Scherer // rstudio::conf // July 2022


Anscombe’s Quartet
each dataset has the 

same summary statistics 

mean, standard deviation, and correlation 

but are visually distinct.

“Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing”

by Justin Matejka & George Fitzmaurice, ACM SIGCHI Conference on Human Factors in Computing Systems 2017

Cédric Scherer // rstudio::conf // July 2022


Datasaurus Dozen

“Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing”

by Justin Matejka & George Fitzmaurice, ACM SIGCHI Conference on Human Factors in Computing Systems 2017
Cédric Scherer // rstudio::conf // July 2022
What makes it a good data visualization?

Soure:
“Yearly Fluctuations in Area of Arctic Covered by Ice” by Derek Watkins (New York Times)
Cédric Scherer // rstudio::conf // July 2022
What makes it a good data visualization

☛ INFORMATION (integrity)

Cédric Scherer // rstudio::conf // July 2022


What makes it a good data visualization

☛ INFORMATION (integrity)

☛ STORY (interestingness)

Cédric Scherer // rstudio::conf // July 2022


What makes it a good data visualization

☛ INFORMATION (integrity)

☛ STORY (interestingness)

☛ GOAL (usefulness)

Cédric Scherer // rstudio::conf // July 2022


What makes it a good data visualization

☛ INFORMATION (integrity)

☛ STORY (interestingness)

☛ GOAL (usefulness)

☛ VISUAL FORM (beauty)

Cédric Scherer // rstudio::conf // July 2022


Source:

Cédric Scherer // rstudio::conf // July 2022


INFORMATION
Understand your data and be accurate


Cédric Scherer // rstudio::conf // July 2022


Cédric Scherer // rstudio::conf // July 2022
Our data is never a perfect

reflection of the real world.

Cédric Scherer // rstudio::conf // July 2022


Our data is never a perfect

reflection of the real world.

→only a subset: not crime but reported crime

→collected by humans: guesstimation, precision and errors

→collected by machines: precisions and errors

Cédric Scherer // rstudio::conf // July 2022


Cédric Scherer // rstudio::conf // July 2022
Cédric Scherer // rstudio::conf // July 2022
“Much of the increase
of hazardous events
reported is probably
due to significant
improvements in
information access”

Cédric Scherer // rstudio::conf // July 2022


The best use of data is to


teach us what isn’t true.

Cédric Scherer // rstudio::conf // July 2022


The best use of data is to

teach us what isn’t true.
→don’t formulate a single statement

→confront yourself with a falsifiable universal statement

Source: inhomelandsecurity.com/risk-management-and-black-swan-events Cédric Scherer // rstudio::conf // July 2022


The best use of data is to

teach us what isn’t true.

Source: inhomelandsecurity.com/risk-management-and-black-swan-events cedricscherer.com @CedScherer z3tt


STORY
Be clear about the message of your visualization


Cédric Scherer // rstudio::conf // July 2022


Who is my audience?

Cédric Scherer // rstudio::conf // July 2022


Who is my audience?
Which story is interesting for them?

Cédric Scherer // rstudio::conf // July 2022


Who is my audience?
Which story is interesting for them?

What are relevant details to include?

Cédric Scherer // rstudio::conf // July 2022


Who is my audience?
Which story is interesting for them?

What are relevant details to include?

Which variables are meaningful to them?

Cédric Scherer // rstudio::conf // July 2022


Who is my audience?
Which story is interesting for them?

What are relevant details to include?

Which variables are meaningful to them?

How will they encounter the visualization?

Cédric Scherer // rstudio::conf // July 2022


Who is my audience?
Which story is interesting for them?

What are relevant details to include?

Which variables are meaningful to them?

How will they encounter the visualization?

Do I need a visualization at all??

Cédric Scherer // rstudio::conf // July 2022


Warming Stripes by Ed Hawkins

Cédric Scherer // rstudio::conf // July 2022


Warming Stripes by Ed Hawkins

Cédric Scherer // rstudio::conf // July 2022


showyourstripes.info/faq

Cédric Scherer // rstudio::conf // July 2022


These graphics are specifically 

designed to [...] start conversations
about our warming world and

the risks of climate change.

showyourstripes.info/faq

Cédric Scherer // rstudio::conf // July 2022


Perceiving Interpreting Comprehending

What do I see? What does it mean for the subject? What does it mean for me?

Visualiser Control Viewer Control

Scheme by Andy Kirk

Cédric Scherer // rstudio::conf // July 2022


GOAL
Select charts that successfully transport your story


Cédric Scherer // rstudio::conf // July 2022


“How maps in the media make us more negative about migrants” by Maite Vermeulen, Leon de Korte & Henk van Houtum

Cédric Scherer // rstudio::conf // July 2022


“How maps in the media make us more negative about migrants” by Maite Vermeulen, Leon de Korte & Henk van Houtum

Cédric Scherer // rstudio::conf // July 2022


“How maps in the media make us more negative about migrants” by Maite Vermeulen, Leon de Korte & Henk van Houtum

Cédric Scherer // rstudio::conf // July 2022


“How maps in the media make us more negative about migrants” by Maite Vermeulen, Leon de Korte & Henk van Houtum

Cédric Scherer // rstudio::conf // July 2022


“How maps in the media make us more negative about migrants” by Maite Vermeulen, Leon de Korte & Henk van Houtum

Cédric Scherer // rstudio::conf // July 2022


Typology of Information Graphics
by Juuso Koponen & Jonatan Hildén, "Data Visualization Handbook" (2020), p. 25

Is the information conceptual or measurable?

☛ Type of information: depict conceptual information <> convert information into visual forms

Cédric Scherer // rstudio::conf // July 2022


Typology of Information Graphics
by Juuso Koponen & Jonatan Hildén, "Data Visualization Handbook" (2020), p. 25

Is the information conceptual or measurable?

☛ Type of information: depict conceptual information <> convert information into visual forms

Is the purpose to explore or to explain the information?


☛ Purpose of the graphic: facilite discovery <> communicate information

Cédric Scherer // rstudio::conf // July 2022


Typology of Information Graphics
by Juuso Koponen & Jonatan Hildén, "Data Visualization Handbook" (2020), p. 25

Cédric Scherer // rstudio::conf // July 2022


“Visualizations can be designed and experienced 

in various ways, by people of various backgrounds, 

and in various circumstances. That's why 

reflecting on the purpose of a visualization is paramount 

before we design it—or before we critique it.”
Alberto Cairo

Excerpt from the foreword to “Data Sketches” by Nadieh Bremer & Shirley Wu (CRC Press 2021)

Cédric Scherer // rstudio::conf // July 2022


“A common truism about information visualization is 

that it is primarily about ‘showing the data’. [...]

While this might be true for scientific (or financial, or many other) 

application fields, there are many good uses of visualization 

that go beyond a precise, “neutral” display of data.“

Moritz Stefaner

Cédric Scherer // rstudio::conf // July 2022


The Vertices of Visualization
by Alberto Cairo, personal communication

Exploratory Explanatory
Discovery Communication

Affective
Emotion
Cédric Scherer // rstudio::conf // July 2022
The Vertices of Visualization
by Alberto Cairo, personal communication

Exploratory Explanatory Priority: 



Discovery Communication efficiency + effectiveness
Goal: 

functional

Affective
Emotion
Cédric Scherer // rstudio::conf // July 2022
The Vertices of Visualization
by Alberto Cairo, personal communication

Exploratory Explanatory Priority: 



Discovery Communication efficiency + effectiveness
Goal: 

functional

Priority: 

creativity + novelity
Affective Goal: 

Emotion emotional
Cédric Scherer // rstudio::conf // July 2022
Weissgerber et al. (2015) PLoS Biology

Cédric Scherer // rstudio::conf // July 2022


Weissgerber et al. (2015) PLoS Biology

Cédric Scherer // rstudio::conf // July 2022


Weissgerber et al. (2015) PLoS Biology

Cédric Scherer // rstudio::conf // July 2022


Modified from Weissgerber et al. (2015) PLoS Biology

Cédric Scherer // rstudio::conf // July 2022


Modified from Weissgerber et al. (2015) PLoS Biology

Cédric Scherer // rstudio::conf // July 2022


Weissgerber et al. (2015) PLoS Biology

Cédric Scherer // rstudio::conf // July 2022


Weissgerber et al. (2015) PLoS Biology

Cédric Scherer // rstudio::conf // July 2022


Cédric Scherer // rstudio::conf // July 2022
“Not my cup of coffee”, #TidyTuesday Contribution

Cédric Scherer // rstudio::conf // July 2022


Abb. 46 “Afrozensus 2020” by Citizens For Europe & EOTO e.V.

Cédric Scherer // rstudio::conf // July 2022


Cédric Scherer // rstudio::conf // July 2022
Cédric Scherer // rstudio::conf // July 2022
data-to-viz.com datavizproject.com visualizationuniverse.com

Cédric Scherer // rstudio::conf // July 2022


data-to-viz.com

Cédric Scherer // rstudio::conf // July 2022


data-to-viz.com

Cédric Scherer // rstudio::conf // July 2022


data-to-viz.com

Cédric Scherer // rstudio::conf // July 2022


The Power of Small Multiples

“Russia’s excess mortality soars since start of Covid pandemic” by John Burn-Murdoch (Financial Times)

Cédric Scherer // rstudio::conf // July 2022


“UK virus cases hit 6-week high but vaccines diminish threat” by John Burn-Murdoch (Financial Times)

Cédric Scherer // rstudio::conf // July 2022


“Escalating Drought”, together with Georgios Karamanis for Scientific American, Issue Nov 2021

Cédric Scherer // rstudio::conf // July 2022


“The Rise and Fall of Women’s College Basketball Dynasties”, #TidyTuesday Contribution

Cédric Scherer // rstudio::conf // July 2022


“How European countries generated electricity in 2018”, #TidyTuesday Contribution

Cédric Scherer // rstudio::conf // July 2022


Left: Choropleth Map by Die Zeit | Right: Tile Grid Map by Cédric Scherer & Ansgar Wolsing

Cédric Scherer // rstudio::conf // July 2022


VISUAL FORM
Follow design rules and data visualization principles


Cédric Scherer // rstudio::conf // July 2022


What is good DataViz design?

Cédric Scherer // rstudio::conf // July 2022


What is good DataViz design?

7 Clean layout — “less is more”

cedricscherer.com @CedScherer z3tt


What is good DataViz design?
often but not necc
e sarily!
7 Clean layout — “less is more”

cedricscherer.com @CedScherer z3tt


What is good DataViz
of
design?
ten but not neccesarily!
' Clean layout — “less is more
' Use direct annotations to ease readability + interpretability

Cédric Scherer // rstudio::conf // July 2022


What is good DataViz
of
design?
ten but not neccesarily!
- Clean layout — “less is more
- Use direct annotations to ease readability + interpretabilit
- Make use of hierarchy to guide the reader

Cédric Scherer // rstudio::conf // July 2022


What is good DataViz
of
design?
ten but not neccesarily!
1 Clean layout — “less is more
1 Use direct annotations to ease readability + interpretabilit
1 Make use of hierarchy to guide the reade
1 Consistent use of colors, spacing, typefaces, and weights

Cédric Scherer // rstudio::conf // July 2022


What is good DataViz
of
design?
ten but not neccesarily!
2 Clean layout — “less is more
2 Use direct annotations to ease readability + interpretabilit
2 Make use of hierarchy to guide the reade
2 Consistent use of colors, spacing, typefaces, and weight-
2 Use colors wisely and make sure they work for colorblind persons

Cédric Scherer // rstudio::conf // July 2022


What is good DataViz
of
design?
ten but not neccesarily!
3 Clean layout — “less is more
3 Use direct annotations to ease readability + interpretabilit!
3 Make use of hierarchy to guide the reade
3 Consistent use of colors, spacing, typefaces, and weight.
3 Use colors wisely and make sure they work for colorblind person.
3 Most important information should receive the main attention

Cédric Scherer // rstudio::conf // July 2022


Order your data

cedricscherer.com @CedScherer z3tt


Don’t rotate your text

Cédric Scherer // rstudio::conf // July 2022


Don’t rotate your text

Cédric Scherer // rstudio::conf // July 2022


Add direct labels

Cédric Scherer // rstudio::conf // July 2022


Use colors + annotations wisely

Cédric Scherer // rstudio::conf // July 2022


The Power of Annotations

“Is white space always your friend?” by Neil Richards

Cédric Scherer // rstudio::conf // July 2022


The Power of Annotations

“Is white space always your friend?” by Neil Richards

Cédric Scherer // rstudio::conf // July 2022


“The key thing we do is to add a title to the chart, as an entry point
and to explain what is going on. Text and other annotations add
enourmous value for non-chart people.”

~ John Burn-Murdoch, Financial Times

Cédric Scherer // rstudio::conf // July 2022


Annotated time-series chart by William Playfair from “The Commercial and Political Atlas and Statistical Breviary” (1786)

Cédric Scherer // rstudio::conf // July 2022


Wrap Up

Cédric Scherer // rstudio::conf // July 2022


“Clearin the Air” by Adam Ginsburg (Washington Post)

Cédric Scherer // rstudio::conf // July 2022


Notes by Francis Gagnon (Voilà)

Cédric Scherer // rstudio::conf // July 2022


Information

Understand your data and be accurate.

Story

Be clear about the message of your visualization.

Goal

Select charts that successfully transport your story.

Visual Form

Follow design rules and data visualization principles.

Cédric Scherer // rstudio::conf // July 2022


Your Turn!

We form groups and each group gets a number between 1 and 10

 Open the image file(s) with the according number in the folder

exercises/4-1-data-communicatio5

 Discuss the visualization with regard to the 4 levels of dataviz design

 Overall, do you think it is a good or a bad visualization?(

 What are details you like1

 How could one improve the chart?(

 Is there another (potentially better) way to tell the story?


→Sketch it (and think about how you could build it with ggplot2)

Cédric Scherer // rstudio::conf // July 2022

You might also like