What Is Visualization Really For
What Is Visualization Really For
MIN CHEN, University of Oxford
LUCIANO FLORIDI, University of Hertfordshire and University of Oxford
RITA BORGO, Swansea University
Whenever a visualization researcher is asked about the purpose of visualization, the phrase “gaining
insight” by and large pops out instinctively. However, it is not absolutely factual that all uses of
visualization are for gaining a deep understanding, unless the term insight is broadened to encompass all
types of thought. Even when insight is the focus of a visualization task, it is rather difficult to know what
insight is gained, how much, or how accurate. In this paper, we propose that “saving time” in
accomplishing a user’s task is the most fundamental objective. By giving emphasis to saving time, we can
establish a concrete metric, alleviate unnecessary contention caused by different interpretations of insight,
and stimulate new research efforts in some aspects of visualization, such as empirical studies, design
optimisation and theories of visualization.
General Terms: Visualization
1. INTRODUCTION
Visualization was already an overloaded term, long before it has become a
fashionable word in this era of data deluge. It may be used in the context of
meditation as a means for creative imagination, or in sports as a means for creating a
heightened sense of confidence. If we consider the term literally, as Robert Spence
said, “visualization is solely a human cognitive activity and has nothing to do with
computers” [Spence 2007].
In this article, we focused on visualization in computing, which may be referred to
technically as Computer-supported Data Visualization. In this context, the process of
visualization features both data and computer. These two essential components
differentiate this technological topic from those above-mentioned contexts. In the
remainder of this article, we will simply refer to “computer-supported data
visualization” as “visualization”.
Scott Owen [1999] compiled a collection of definitions and rationale for
visualization, most of which are still widely adopted or adapted today. These
definitions were intended to define the two questions, namely what is visualization
and what is it for?
“The goal of visualization in computing is to gain insight by using our visual
machinery.” [McCormick et al. 1987]
“Visualization is a method of computing. It transforms the symbolic into the
geometric, ... Visualization offers a method for seeing the unseen. It enriches
the process of scientific discovery and fosters profound and unexpected
insights.” [McCormick et al. 1987]
“Visualization is essentially a mapping process from computer representations
to perceptual representations, choosing encoding techniques to maximize
human understanding and communication.” [Owen 1999]
“Visualization is concerned with exploring data and information in such a way
as to gain understanding and insight into the data. The goal ... is to promote a
Authors’ addresses: M. Chen, Oxford e-Research Centre, University of Oxford, UK; R. Borgo, Department
of Computer Science, Swansea University, UK; L. Floridi, Department of Philosophy, University of
Hertfordshire, UK.
deeper level of understanding of the data under investigation and to foster new
insight into the underlying processes, relying on the humans’ powerful ability
to visualize”, [Earnshaw and Wiseman 1992]
“The primary objective in data visualization is to gain insight into an
information space by mapping data onto graphical primitives.” [Senay and
Ignatius 1990]
In addition to Scott Owen’s collection, there are other commonly cited definitions:
Visualization facilitates “the use of computer-supported, interactive, visual
representations of abstract data to amplify cognition.” [Card et al. 1999]
“Graphics reveal data. Indeed graphics can be more precise and revealing than
conventional statistical computations.” [Tufte 2001]
“Information visualization helps think.” [Few 2009]
“Information visualization utilizes computer graphics and interaction to assist
humans in solving problems.” [Purchase et al. 2008]
“The goal of information visualization is to translate abstract information into
a visual form that provides new insight about that information. Visualization
has been shown to be successful at providing insight about data for a wide
range of tasks.” [Hearst 2009]
“The goal of information visualization is the unveiling of the underlying
structure of large or abstract data sets using visual representations that utilize
the powerful processing capabilities of the human visual perceptual system.”
[Berkeley 2010]
“The purpose of visualization is to get insight, by means of interactive graphics,
into various aspects related to some processes we are interested in ...” [Telea
2008]
In the above definitions, there are many references to gaining insight, or likewise
phrases such as amplifying cognition, seeing the unseen, unveiling structure,
answering questions, solving problems, and so forth. It is unquestionable that these
are the benefits that visualization can bring about in many occasions. There has been
an abundance of evidence to confirm such goals are achievable. However, insight is a
non-trivial concept. It implies “accurate and deep intuitive understanding” according
to many dictionaries. While it is what everyone who creates or uses visualization is
inspired to achieve, it is an elusive notion and rather difficult to measure, evaluate,
or validate objectively.
Perhaps it is also because of its vagueness, it is relatively easier for people to
interpret the term insight differently. The charged debate about chart-junks a few
years ago was perhaps partly caused by the diverse interpretation of what insight to
be gained from visualization.
The debate started with a paper by Bateman et al. [2010], which reported an
empirical study on the effects of using visual embellishments in visualization. They
compared conventional plain charts with highly embellished charts drawn by Holmes
[1984]. The findings of the study suggest that embellishment may aid memorization.
Following this work, Hullman et al. [2011] proposed a possible explanation that
“introducing cognitive difficulties to visualization” “can improve a userʼs
understanding of important information.” Obviously this was a major departure from
the traditional wisdom of avoiding chart-junks in visualization. For example, in
[Tufte 2001], some of Holmes’s visual designs were shown as counter examples of this
wisdom.
These two pieces of work attracted much discussion in the blogosphere. Stephen
Few, the author of several popular books on visualization (e.g., [Few 2009]), wrote
two articles. On [Bateman et al. 2010], he concluded:
“At best we can treat the findings as suggestive of what might be true, but not
conclusive.” [Few 2011a]
Few was much more critical on [Hullman et al. 2011]:
“If they’re wrong, however, which indeed they are, their claim could do great
harm.” [Few 2011b]
In many ways, the two sides of the debate were considering different types of
insight to be gained in different modes of visualization. We will revisit this debate
later in Section 3.2.
Fig. 1. This is the earliest line graph found in the literature. It divides the 2D plane onto some
30 temporal zones across the x‐axis and uses horizontal lines to indicate zodiac zones across
the y‐axis. Seven time series were displayed in this chart. Source: [Funkhouser 1936].
Fig. 1 shows a line graph created by an unknown astronomer in the 10th (or
possibly 11th) century, depicting the “inclinations of the planetary obits as a function
of the time” [Funkhouser 1936]. More line graphs were found in the 17th century
records, noticeably the plot of “life expectancy vs. age” by Christiaan Huygens in
1669, and the plot of “barometric pressure vs. altitude” by Edmund Halley in 1686
[Friendly2007]. The 18th and 19th centuries saw the establishment of statistical
graphics as a collection of charting methods, attributed to William Playfair, Francis
Galton, Karl Pearson and others [Cleveland 1985]. The invention of coordinate
papers in the 18th century also helped make line graph a ubiquitous technique in
science and engineering.
Today, digitally stored data that captures or exhibits a functional relationship y =
f(x) is everywhere. For example, there are thousands or millions of real time data
feeds of financial information. Weather stations and seismic monitors around the
world generate an overwhelming amount of data in the form of y = f(t). In some cases,
we still use line graphs for visualization, and in other cases, we do not. What has
been the most fundamental factor that makes visualization users choose one visual
representation from another? Is it a more quantifiable factor, such as the number of
data series, the number of data points per data series, or another data-centric
attribute? Is it a less quantifiable factor such as the amount or type of insight, the
amount of cognitive load required, the level of aesthetic attraction, the type of
judgment to be made, or any other human-centric attribute?
Let us consider why a seismologist uses a seismograph, which is a type of line
graph that depicts the measured vibrations over time. (For the convenience of
referring, we use female pronouns for the seismologist.) The main task supported by
a seismograph is for a seismologist to make observation. Her first priority is simply to
see, or to know, the data stream in front of her, so she can confidentially say “I have
seen the data”. She may wish to observe some signature patterns of a potential
earthquake, relationships among several data series measured at different locations,
anomalies that may indicate malfunction of a device, and so on. The seismologist also
uses seismographs as a mechanism of external memorization, since they “remember”
the data for her. In real time monitoring, she does not have to stare at the
seismometer constantly and can have a break from time to time. In offline analysis,
she does not need to remember all historical patterns, and can recall her memory by
inspecting the relevant seismographs. Viewing seismographs simulates various
thoughts, such as hypotheses. After observing a certain signature pattern in a
seismograph, she may hypothesise that the vibrations would become stronger in the
next few hours. While the seismograph advances with newly arrived data, she
evaluates her hypothesis intuitively. When discussing with her colleagues, she draws
their attention to the visual patterns on the seismograph, and explains her
hypothesis and conclusion. In other words, she uses the seismograph to aid her
communication with others.
Perhaps the seismologist does not have to use seismographs. The vibration
measures could simply be displayed as a stream of numbers; after all viewing these
numbers would be more accurate than viewing the wiggly line on a seismograph.
Alternatively, to make more cost-effective use of the visual media, the stream of
number could be animated in real time as a dot moving up and down, accompanied
by a precise numerical reading updated dynamically. Let us have a close look at the
advantages of a seismograph over a stream of numbers or an animation.
the same animation repeatedly, and would eventually work out interesting patterns
in the movement of the dot and the variations of the numbers. It is no doubt much
slower than viewing a line graph.
Most analytical tasks (Sections 2.1-2.4) are likely to be conducted in modes (1), (2),
and (3). Only the tasks of knowledge dissemination are normally conducted in modes
(4) and (5). Mode (6) is relatively rare, but one can easily imagine that some
visualization tasks during disaster management may be performed in this mode. On
the other hand, mode (7) is rather common, but often has conflicting requirements
between the knowledge dissemination task and those analytical tasks.
4.1 Measurement
Firstly, time is much easier to measure and quantify than insight, knowledge or
cognitive load, especially in the case of analytical tasks. In many ways, time may also
be easier to measure than information, that is, the quantitative measures used in
information theory. While the measurement about insight or cognitive load may be
undertaken in a laboratory condition, it is usually far too intrusive for a practical
environment. Such a measurement would be uncertain as the measurement
introduces a significant amount of artefacts and distortion to a normal cognitive
process of gaining insight.
Potential user (doubtfully): Really, what kind of insights are we talking about?
Visualization researcher (anxiously): Patterns. (Pause, trying to recollect some
definitions of visualization.) Interesting patterns, such as various anomalies,
complex associations, warning signs, and potential risks.
Potential user (hopefully but cautiously): Can those pictures tell me all these
automatically?
Visualization researcher (truthfully but uneasily): Not quite automatically. The
mapping from data to visual representations will enable you see these patterns
more easily and help you to make decisions.
Potential user (disappointedly): I can understand my data with no problem. I
could not imagine how these pictures can help me make better decisions.
After a while, some of us learned a wisdom, i.e., never suggesting to potential
collaborators that visualization could offer them insight. It is much better to state
that visualization could save their time. As Sections 2 and 3 have shown,
visualization can indeed save time.
“Gaining insight” has been an elusive purpose of visualization for several decades.
It is perhaps the time to invigorate visualization as a scientific discipline by shining
the spotlight on a more concrete purpose, that is, to save the time required for
accomplish a visualization task.
REFERENCES
S. Bateman, R.L. Mandryk, C. Gutwin, A. Genest, D. McDine and C. Brooks. 2010. “Useful junk?: the
effects of visual embellishment on comprehension and memorability of charts”. In Proc. ACM CHI,
2573–2582.
R. Borgo, A. Abdul-Rahman, F. Mohamed, P.W. Grant, I. Reppa, L. Floridi, and M. Chen. 2012. “An
empirical study on using visual embellishments in visualization”. IEEE Transactions on Visualization
and Computer Graphics, 18, 12, 2759-2768.
S. Card, J. Mackinlay, and B. Shneiderman. 1999. Readings in Information Visualization: Using Vision to
Think, Morgan Kaufmann Publishers.
S. Card. 2007. “Information visualization”. In A. Sears and J.A. Jacko (Eds.), The Human-Computer
Interaction Handbook, Lawrence Erlbaum Assoc Inc, 2007.
M. Chen and H. Jänicke. 2010. “An information-theoretic framework for visualization”. IEEE Transactions
on Visualization and Computer Graphics, 16, 6, 1206–1215.
W.S. Cleveland. 1985. The Elements of Graphic Data. Brooks Cole.
R.A. Earnshaw and N Wiseman. 1992. “An Introduction to Scientific Visualization”. In Scientific
Visualization, Techniques and Applications, K.W. Brodlie et al. (Eds), Springer-Verlag.
S. Few. 2009. Now You See It, Analytics Press.
S. Few. 2011a. The Chartjunk Debate: A Close Examination of Recent Findings.
http://www.perceptualedge.com/articles/visual_business_intelligence/the_chartjunk_debate.pdf
S. Few. 2011b. Benefitting InfoVis with Visual Difficulties? Provocation Without a Cause.
http://www.perceptualedge.com/articles/visual_business_intelligence/visual_difficulties.pdf
R.M. Friedhoff and T. Kiley. 1990. “The eye of the beholder”. Computer Graphics World, 13, 8, 46.
M. Friendly. 2007. “A brief history of data visualization”. In Handbook of Data Visualization, Springer, 15-
56.
H.G. Funkhouser. 1936. “A Note on a tenth century graph”. Osiris, 1, 260–262.
S.R. Gomez, R. Jianu, C. Ziemkiewicz, H. Guo and D.H. Laidlaw. 2012, “Different strokes for different
folks: visual presentation design between disciplines”. IEEE Transactions on Visualization and
Computer Graphics, 18, 12, 2411-2420.
M.A. Hearst. 2009. Search User Interfaces, Cambridge University Press.
N. Holmes. 1984. Designer’s Guide to Creating Charts and Diagrams, Watson-Guptill Publications.
J. Hullman, E. Adar, and P. Shah. 2011. “Benefitting infovis with visual difficulties”. IEEE Transactions
on Visualization and Computer Graphics, 17, 12, 2213 –2222.
B.H. McCormick, T.A. DeFanti and M.D. Brown (Eds.). 1987. Visualization in Scientific Computing, ACM
G.S. Owen. 1999. HyperVis Teaching Scientific Visualization Using Hypermedia. ACM SIGGRAPH
SIGGRAPH Computer Graphics, 21, 6.
H.C. Purchase, N. Andrienko, T.J. Jankun-Kelly, and M. Ward. 2008. “Theoretical foundations of
information visualization. In Information Visualization: Human-Centered Issues and Perspectives, A.
Kerren et al. (Eds.) Lecture Notes In Computer Science, Vol. 4950. Springer-Verlag, 46-64.
H.Senay and E. Ignatius. 1994. “A knowledge-based system for visualization design”, IEEE Computer
Graphics and Applications, 14, 6, 36-47.
R. Spence. 2007. Information Visualization: Design for Interaction, Pearson.
E. Styles. 2006. The Psychology of Attention, Psychology Press.
A.C. Telea. 2008. Data Visualization, Principles and Practice, A K Peters.
J.J. Thomas and K.A. Cook (Eds.). 2005. Illuminating the Path: The Research and Development Agenda for
Visual Analytics, IEEE.
E.R. Tufte. 2001. The Visual Display of Quantitative Information, Graphics Press.
C. Ware. 2004. Information Visualization: Perception for Design, Morgan Kaufmann.