0% found this document useful (0 votes)
46 views1 page

Data Visualization

The document discusses data visualization and best practices for creating effective visual representations of data. It describes several types of quantitative messages that can be communicated through data visualization, including time-series, ranking, part-to-whole, deviation, frequency distribution, correlation, nominal comparison, and geographic comparisons. Effective visualizations tell a story with the data, maximize the ratio of data to ink, and stand alone without needing additional context from a report. Misleading graphs that distort the message or support erroneous conclusions should be avoided.

Uploaded by

Gokul Krish
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views1 page

Data Visualization

The document discusses data visualization and best practices for creating effective visual representations of data. It describes several types of quantitative messages that can be communicated through data visualization, including time-series, ranking, part-to-whole, deviation, frequency distribution, correlation, nominal comparison, and geographic comparisons. Effective visualizations tell a story with the data, maximize the ratio of data to ink, and stand alone without needing additional context from a report. Misleading graphs that distort the message or support erroneous conclusions should be avoided.

Uploaded by

Gokul Krish
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Data visualization - Wikipedia https://en.wikipedia.

org/wiki/Data_visualization

For example, the Minard diagram shows the losses suffered by Napoleon's army in the 1812–1813 period. Six variables are plotted:
the size of the army, its location on a two-dimensional surface (x and y), time, direction of movement, and temperature. The line
width illustrates a comparison (size of the army at points in time) while the temperature axis suggests a cause of the change in
army size. This multivariate display on a two dimensional surface tells a story that can be grasped immediately while identifying
the source data to build credibility. Tufte wrote in 1983 that: "It may well be the best statistical graphic ever drawn."[10]

Not applying these principles may result in misleading graphs, which distort the message or support an erroneous conclusion.
According to Tufte, chartjunk refers to extraneous interior decoration of the graphic that does not enhance the message, or
gratuitous three dimensional or perspective effects. Needlessly separating the explanatory key from the image itself, requiring the
eye to travel back and forth from the image to the key, is a form of "administrative debris." The ratio of "data to ink" should be
maximized, erasing non-data ink where feasible.[10]

The Congressional Budget Office summarized several best practices for graphical displays in a June 2014 presentation. These
included: a) Knowing your audience; b) Designing graphics that can stand alone outside the context of the report; and c) Designing
graphics that communicate the key messages in the report.[11]

Quantitative messages
Author Stephen Few described eight types of quantitative messages that users may
attempt to understand or communicate from a set of data and the associated
graphs used to help communicate the message:

1. Time-series: A single variable is captured over a period of time, such as the


unemployment rate over a 10-year period. A line chart may be used to
demonstrate the trend.
2. Ranking: Categorical subdivisions are ranked in ascending or descending A time series illustrated with a line chart
order, such as a ranking of sales performance (the measure) by sales persons demonstrating trends in U.S. federal
(the category, with each sales person a categorical subdivision) during a spending and revenue over time.
single period. A bar chart may be used to show the comparison across the
sales persons.
3. Part-to-whole: Categorical subdivisions are measured as a ratio to the whole
(i.e., a percentage out of 100%). A pie chart or bar chart can show the
comparison of ratios, such as the market share represented by competitors in
a market.
4. Deviation: Categorical subdivisions are compared against a reference, such
as a comparison of actual vs. budget expenses for several departments of a
business for a given time period. A bar chart can show comparison of the
actual versus the reference amount.
5. Frequency distribution: Shows the number of observations of a particular
variable for given interval, such as the number of years in which the stock
market return is between intervals such as 0-10%, 11-20%, etc. A histogram,
a type of bar chart, may be used for this analysis. A boxplot helps visualize
key statistics about the distribution, such as median, quartiles, outliers, etc. A scatterplot illustrating negative
correlation between two variables
6. Correlation: Comparison between observations represented by two variables
(X,Y) to determine if they tend to move in the same or opposite directions. For (inflation and unemployment) measured
example, plotting unemployment (X) and inflation (Y) for a sample of months. at points in time.
A scatter plot is typically used for this message.
7. Nominal comparison: Comparing categorical subdivisions in no particular
order, such as the sales volume by product code. A bar chart may be used for this comparison.
8. Geographic or geospatial: Comparison of a variable across a map or layout, such as the unemployment rate by state or the
number of persons on the various floors of a building. A cartogram is a typical graphic used.[2][12]
Analysts reviewing a set of data may consider whether some or all of the messages and graphic types above are applicable to their
task and audience. The process of trial and error to identify meaningful relationships and messages in the data is part of
exploratory data analysis.

3 of 14 2/16/2018 10:26 AM

You might also like