Watten InfoVis
Watten InfoVis
& 2004 Palgrave Macmillan Ltd. All rights reserved 1473-8716 $30.00
www.palgrave-journals.com/ivs
Introduction
The design of information graphics remains a poorly understood, hit-or-
miss process. Part of the difficulty is that models for how humans extract
information from visual displays remain incomplete. Indeed, seemingly
minor design variations can have dramatic effects on comprehensibility. As
a result, creating effective displays often requires expensive user tests, time-
consuming redesigns, and even a certain amount of guesswork.
Many researchers have recognized these problems and have investigated
guidelines and models for the perception of information graphics.1 Much
work has been done on the efficacy of different visual encodings,2,3
resulting in useful rules about the use of color, position, area, etc. to
represent different types of variables. Others, for example Healy et al.4 have
investigated how models of preattentive processing can be used in
designing visualizations.
But these lines of research do not address a key element in the efficacy of
an information graphic: the degree to which its perceptual organization
reflects the organization of the underlying data. Many authors have
stressed that to design successful information graphics, one must take into
account the effects of perceptual grouping. For instance, the work of
Kosslyn5 contains many examples in which unintentional grouping effects
lead to confusing displays. It would therefore be useful to have a tool that
helped designers assess the perceptual organization of their designs.
Some attempts have been made to model perceptual organization in
Received: 24 November 2003 information graphics. Tufte provides general guidelines, such as the
Revised: 30 January 2004 Macro/Micro principle.6 But quantitative models suitable for software
Accepted: 8 March 2004 implementation are rare. Several authors have analyzed special classes of
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
124
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
125
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
126
without additional steps. Third, the sign of the function but interesting visualization issue: informal tests showed
gs is useful in creating an algorithmic version of the that for complex segmentations, users found these scale
linking step below. Despite these advantages, it is space cartoons easier to interpret than the outline view
important to note some well-known drawbacks to this commonly seen in computer vision output. Note how
technique: poor localization, rounded corners, and over- closely the images in the bottom row match the hand-
sensitivity.27 A different edge detector would not, how- drawn diagrams of Figure 1.
ever, fundamentally alter the framework of our model. The scale space cartoons do raise some new issues,
Figure 4 shows the resulting segmentation at scales of however. One potential concern is that two adjacent
8, 16, and 44. In the top row the edges of segments are segments with similar average values may be difficult to
shown. In the bottom row, each segment has been filled distinguish. In many cases, however, this difficulty
with a single gray tone representing the average grayscale simply reflects the fact that the visual difference between
value of the pixels in the segment, a technique we call a the two segments is relatively unimportant. In situations
scale space cartoon. The scale space cartoon itself is a small where drawing attention strongly to all segmentations is
necessary, one might draw a faint outline around each
segment.
Edge detection is not the only way to locate structure at
a given scale. Probably the most common method one
used in many of the original scale space papers is to
analyze local maxima and minima of the function fs.21,28
Often this analysis is accompanied by some sort of
watershed segmentation.22,29 We tried several variants
of this technique but found they produced poor results,
possibly due to the non-generic nature of typical
Figure 3 Difference of Gaussians: fsf3s/2 , s 8, 16, 44. 50% information graphics. Compared to images of natural
gray is zero; dark gray is negative; light gray is positive. scenes, diagrams and visualizations have an unusual
number of areas of nearly uniform brightness. In many
cases we found that fs contained ridges, valleys, and
plateaus that were almost but not quite level, leading to a
proliferation of local extrema that did not correspond to
useful features in the image. Figure 5 shows an example.
On the left is a simple graph. On the right are the
boundaries of regions found by watershed segmentation
for local minima (as in Lindberg22) for scale s 16. It is
clear that the graph line itself has dissolved into many
individual segments, because the smoothed function has
many almost indistinguishable extrema in the area of the
main graph line in the image. This effect, which in no
way reflects the visual experience of viewing the graph, is
why we chose the edge-detection scheme described
above.
Figure 4 Algorithmically derived segmentation of the Dr. Seuss
image for s 8, 16, 44. Top: edges of segments. Bottom: filled Linking structures at different scales As described so far,
segments, or Gestalt cartoons. the model finds structure only at a single scale. But the
Figure 5 An image (left), and its boundaries as found by watershed segmentation (center). Contrast with the results of our edge-
detection algorithm (right).
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
127
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
128
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
129
The software
The software tool contains the following numerical
approximation of the model. We represented the image
functions fs as 2D arrays of floating-point values (one per
pixel in the original image), and computed fs for only a
few discrete values of s. To perform linking, we looked at
each pair of successive approximations to fs, and connect Figure 11 Gestalt cartoons showing differentiation of figure
any two segments that share a sign and which overlap. and ground in a graph. Left: thin grid lines. Right: thick grid
Our implementation is written in Java, and on a 700 MHz lines.
Pentium 3 PC requires up to a minute to perform a full
structural analysis on a 800 600 pixel image at 15 scales.
Once the analysis is performed, it is saved for viewing as human can segment the graph in the second diagram by
both a series of grayscale images and as a 3D VRML file. using orientation information, which our model ignores.
We describe the interface of the application in more Nonetheless, doing so places an additional cognitive
detail in Section 4. burden on the viewer, and in fact it is a standard principle
of information design that grid lines should be signifi-
Questions of scale An important consideration for cantly lighter than lines representing foreground data.
computational algorithms is to ensure that they are able Thus the model indicates, correctly, that there is a
to handle large data sets. Because our system examines problem with the second graph. This situation where
the output of a visualization algorithm the pixels it a minor visual change has a large effect on comprehen-
produces and not the underlying data set, it is limited sibility is exactly where it is useful to have a model.
by the size of the screen. The segmentation and linking
steps each require time and memory proportional to the A famous real-life example
number of pixels, since each pixel is marked once in an How does the model fare on a real-life example? Figure 12
offscreen buffer. For a constant s, computing gs is also shows scale space cartoons for a complex scatterplot, the
linear in the number of pixels. Since the algorithm is not famous astronomical HertzprungRussell diagram. This
affected by the number of items in the underlying data scatterplot, which displays data on stars with temperature
set for a visualization, and indeed works by progressively on the x-axis and absolute magnitude on the y-axis, plays
simplification of an image, it is in practice computation- a central role in scientists conception of stellar evolu-
ally feasible. In addition, the examples below show that tion. The HR diagram at the top left of Figure 12 is
often only a small subset of scaling levels need to be reproduced directly from Spence and Garrison,35 which
examined in order to reveal useful information about contains a detailed discussion of this historically sig-
perceptual grouping. nificant information graphic.
The segmentations in the scale space cartoons capture
A simple example: graphs and grid lines the intuitive experience of reading the diagram: the
Our first example shows scale space cartoons of two small-scale (s 4) view emphasizes the vertical structures,
versions of a simple graph (Figure 11). At top left is a while at s 8 and 16 the large-scale clusters stand out.
graph with thin gridlines, at top right is a graph with The areas highlighted for s 16 correspond nicely to the
overpoweringly thick ones. The segmented versions at standard organization given by human experts: Figure 13
scale s 4 are shown below. In the graph with thick shows how an astronomer structures the diagram.
gridlines the graph itself is not segmented from the One subtlety is that at high scales some regions of the
background. This is an interesting indication of both the scale space cartoon become extremely faint, making
strength of our model and one of its limitations. A them difficult to see. In Figure 12, for example, the area
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
130
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
131
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
132
contact sheets are convenient for printing. A second a completely different type of data. Consider a line graph.
menu option lets the user create a 3D VRML view of the As we showed above, our scale space model can provide
scale space analysis, viewable in a web browser with a insight into whether the graph stands out from a
VRML 2.0 plug-in (e.g. Parallelgraphics Cortona plug-in). background grid. The model does not, however, provide
The 3D view is good for detailed analysis and demonstra- any insight into the numerical values conveyed by that
tions, but is difficult to reproduce clearly in print. graph which is clearly the most important data it
conveys. It is therefore unlikely that scale space analysis,
Current limitations and future directions no matter how it is extended, will become the sole tool
We have a described a model that is by no means for evaluating the perception of information graphics.
complete. Here we discuss various limitations and Finally, although the scale space method may help
possibilities for addressing those limitations. Perhaps identify flaws in presentation, it currently does not give
most important, the model is at its core a psychological suggestions for fixing those flaws. A potentially fruitful
hypothesis and therefore cries out for experimental area of long-term investigation would be using the model
validation. There are several natural directions to in- to automatically optimize information graphics. That is,
vestigate. One tactic would be to compare the structures given a known data structure one could attempt to find a
generated by our model with self-reports of users method for displaying that structure in an optimal
perceptions. A more pragmatic validation would be to manner according to the model detailed here. This could
study whether, in using the software tool described here, ultimately involve either an algorithm that incrementally
creators of information graphics are able to modify their improved a representation, to find a local optimum, or
designs in a way that user studies show are beneficial. even some method of mapping a structure directly to a
Two obvious shortcomings of our model are that it globally optimal visual representation.
applies only to grayscale images and that it addresses
only one type of grouping mechanism. One of the
reasons to choose scale space analysis as the basis for Conclusion
our method is that there is a rich body of research We proposed a new technique for modeling multi-scale
extending the basic idea to more general aspects of image perceptual organization in information graphics. The
structure. Theories that handle color or orientation have model is based on a classical machine vision technique,
been proposed14,34,40,41,43 and could be applied to our scale space, with a novel method of creating links
model. In addition, increasingly sophisticated visual between structures at different scales. We demonstrated
perception models, based on psychology, are becoming how a software implementation of this model captures
increasingly available.30,39 Some of those models fruit- important aspects of design aesthetics for several infor-
fully address notions of connectivity and suggest a broad mation graphics, and gave an example of how it may be
set of ways that humans cluster information. Orienta- used to give input into questions of design. We believe
tion-sensitive models have the potential to address the there is sufficient evidence of promise that it is worth
fact that our method often confers insufficient saliency extending and validating the model.
on lines and curves, which can lead to unsatisfactory
analyses for graphics such as node-and-link diagrams. It Acknowledgments
may also be advantageous to use a more sophisticated Thanks to Ben Shneiderman for helpful comments on the
segmentation method than the difference-of-Gaussians purpose of modeling perception, and to Dan Gruen for
edge detection employed here, since in some complicated advice on cognitive psychology. Thanks to Kushal Dave,
images the simple segmentation algorithm described here Steve Rohall and Bernard Kerr for comments on this
can yield counterintuitive results. It would also be useful manuscript, and to the IBM Research CUE group for support
to investigate ways of optimizing the numerical algo- and encouragement. We also acknowledge the anonymous
rithm to run in an interactive timeframe. referees for several helpful suggestions and references.
A deeper limitation is that the model relies on a Figure 12 reprinted with permission from The American
directed graph to describe the perception of an image. Statistician. Copyright 1993 by the American Statistical
Obviously, in many cases an information graphic conveys Association. All rights reserved.
References
1 Ware C. Information Visualization: Perception for Design. Morgan- 4 Healey CG, Booth KS, Enns JT. Harnessing preattentive processes for
Kaufmann: San Francisco, 2000. multivariate data visualization. Proceedings Graphics Interface 93,
2 Cleveland W. The Elements of Graphing Data. Wadsworth: Monterey, 1993; 107117.
CA, 1980. 5 Kosslyn SM. Understanding charts and graphs. Applied Cognitive
3 Mackinlay JD. Applying a theory of graphical presentation to the Psychology, 1989; 3: 185226.
graphic design of user interfaces. Proceedings of the ACM SIGGRAPH 6 Tufte E. Envisioning Information. Graphics Press Cheshire, Connecti-
Symposium on User Interface Software (UIST 88), 1988; 179189. cut, 1990.
Information Visualization
Analyzing perceptual organization in information graphics Martin Wattenberg and Danyel Fisher
133
7 Tullis TS. A computer-based tool for evaluating alphanumeric 27 Parker JR. Algorithms for Image Processing and Computer Vision. John
displays. Proceedings of INTERACT84, 1984. Wiley & Sons: New York, 1997.
8 Shneiderman B, Chimera R, Jog N, Stimart R, White D. Evaluating 28 Witkin A. 1983. Scale-space filtering. Proceedings of the Eighth
spatial and textual style of displays. Proceedings of Getting the Best International Joint Conference on Artificial Intelligence (IJCAI 83).
from State-of-the-Art Display Systems 95. London 1995. Karlsruhe 812 August, 1983..
9 Saund E. Symbolic construction of a 2-D scale-space image. IEEE 29 Leung Y, Zhang JS, Xu ZB. Clustering by scale-space filtering. IEEE
Transactions on Pattern Analysis and Machine Intelligence, 1990; 12: Transactions on Pattern Analysis and Machine Intelligence, 2000; 22:
817830. 12.
10 Wattenberg M, Fisher D. A multi-scale model of perceptual 30 Palmer S, Rock I. Rethinking perceptual organization: the role of
organization in information graphics. InfoVis 2003, 2003. uniform connectedness. Psychonomic Bulletin and Review, 1994; 1:
11 Bertin J. The Semiology of Graphics. University of Wisconsin Press: 2955.
Madison, WI, 1983. 31 Antoine JP, Carrette P, Murenzi R, Piette B. Image analysis with two-
12 Wertheimer M. Gestalt theory. In: Ellis WD (Ed). A Sourcebook of Gestalt dimensional continuous wavelet transform. Signal Processing, 1993;
Psychology. The Humanities Press: New York, 1924/1950; 1110. 31: 241272.
13 Navon D. Forest before trees: the precedence of global features in 32 Scott DW. Multivariate Density Estimation. Wiley-Interscience: New
visual perception. Cognitive Psychology, 1977; 9: 353383. York, 1992.
14 Palmer SE. Hierarchical structure in perceptual representation. 33 Burt PJ, Adelson EH. The Laplacian pyramid as a compact
Cognitive Psychology, 1977; 9: 441474. image code. IEEE Transactions on Communications, 1983; 31:
15 Kubovy M. The perceptual organization of dot lattices. Psychonomic 532540.
Bulletin & Review, 1994; 1: 182190. 34 Itti L, Koch C. Computational modeling of visual attention. Nature
16 Li Z. A neural model of contour integration in the primary visual Reviews Neuroscience, 2001; 2: 194203.
cortex. Neural Computation, 1998; 10: 903940. 35 Spence I, Garrison RF. A remarkable scatterplot. The American
17 Logan GD. The CODE theory of visual attention. Psychological Review, Statistician, 1993; 47: 1219.
1996; 103: 603649. 36 Fix JD. Astronomy: Journey to the Cosmic Frontier. (Second edition,
18 Pirolli P, Card SK, van der Wege MM. Visual information foraging in a updated) McGraw-Hill: New York, 1999.
focus+context visualization. Proceedings, CHI01, Human Factors in 37 Wattenberg M. Visualizing the Stock Market. CHI99 Extended
Computing Systems, 2001. Abstracts, 1999.
19 Witkin A, Tenenbaum M. On the role of structure in vision. In: 38 Shneiderman B. Tree visualization with tree-maps: 2-d space-
Rosenfeld A (Ed). Human and Machine Vision. Academic Press: New filling approach. ACM Transactions on, 1992; 11: 9297.
York, 1983. 39 Geisler WS, Super BJ. Perceptual organization of two-dimensional
20 Iijima T. Basic theory of pattern observation. Papers of Technical Group patterns. Psychological Review, 2000; 107: 677708.
on Automata Control, IECE, Japan, 1959. 40 ter Haar Romeny B, Geusebroek JM, Van Osta P, van den Boomgaard
21 Koenderink JJ. The structure of images. Biological Cybernetics, 1984; R, Koenderink JJ. Color differential structure. Scale-Space 2001, 2001;
50: 363370. 353361.
22 Lindeberg T. Scale-Space Theory in Computer Vision. Kluwer: 41 Kalitzin B, ter Haar Romeny B, Viergever M. Invertible orientation
Dordrecht, 1994. bundles on 2d scalar images. Scale-Space Theories in Computer Vision.
23 Furnas GW, Bederson B. Space-scale diagrams: understanding Springer: Berlin, 1997; 7788.
multiscale interfaces. Proceedings CHI 95. Human Factors in Computing 42 Mozer M, Zemel RS, Behrmann M, Williams CKI. Learning to
Systems, 1995. segment images using dynamic feature binding. Neural Computation,
24 Marr D, Hildreth E. Theory of edge detection. Proceedings of the Royal 1992; 4: 650665.
Society of London, Series B, 1980; 207: 187217. 43 Perona P, Malik J. Scale-space and edge detection using anisotropic
25 Canny J. A computational approach to edge detection. IEEE diffusion. IEEE Transactions on Pattern Analysis and Machine Intelli-
Transactions on Pattern Analysis and Machine Intelligence, 1986; 8: gence, 1990; 12: 629639.
679698. 44 Wickens CD, Carswell CM. The proximity compatibility principle: its
26 Shen J, Castan S. An optimal linear operator for step edge detection. psychological foundation and relevance to display design. Human
Computer Vision, Graphics, and Image Processing, 1992; 34: 321343. Factors, 1995; 37: 473495.
Information Visualization
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.