Unit V DVT
Unit V DVT
• Issues of Data
Additional information regarding the data being shown (e.g., labels) and the
mapping (e.g., a color key) are also essential to facilitate interpretation, and
must be integrated into the visualization.
Design stages:
1. Intuitive Mappings from Data to Visualization
2. Selecting and Modifying Views
3. Information Density—When Is It Too Much or Too Little?
4. Keys, Labels, and Legends
5. Using Color with Care
6. The Importance of Aesthetics
Intuitive Mappings from Data to Visualization
For example, in Figure 12.1, images of planets are used to plot the relationship
between the distance from the planet to the sun and the duration of its orbit.
Using intuitive scatterplot symbols to show the distance from planets to the sun
versus the duration of a single orbit.
Except for fairly simple data sets, one view is rarely sufficient to convey all the
information contained in the data.
View modifications fall into a number of categories, and their inclusion as part
of the functionality should be considered, based on user priorities.
Scrolling and zooming operations are needed if the entire data set cannot be
presented at the resolution desired by the user.
4.Scale control permits the user to modify the range and distribution of values
for a particular data field prior to its mapping. Similarly, clipping and other
forms of filtering allow the user to focus on data subsets.
Three views of the IRIS data set (scatterplot matrix, star glyphs, and parallel
coordinates). (Image from XmdvTool.)
Levels of detail in maps. (Images courtesy of Google Maps c 2008 Google)
The first, which might be called “gratuitous graphics,” occurs when there is
very little information to present.
One method is to give the user the option of disabling or enabling different
components of the display
This supporting information should begin with a detailed caption indicating the
particular data fields being displayed, and the mappings that were used.
A complex visualization with and without captions/ticks/legends
• grid or tick marks should be displayed to convey the ranges and values of
interest for numeric fields when absolute judgments are important, and all
axes should be labeled with appropriate units.
• Figure 12.4 highlights the importance of this supplementary information.
• The use of grid and tick marks can be both a boon and a curse to the
visualization.
• The actual positions of the markings can also have a bearing on how readily
the data is interpreted.
• Based on the semantics of the data, certain gaps between markings may
make more sense to the user than others.
• Grid spacings: (a) illogical; (b) logical.
• Simplicity. Don’t try to cram too much information in one display (see
Section 12.1.3), and don’t use graphics gimmick simply because they are
available (e.g., using 3D Phong shaded histograms when a bar or line chart
could convey the same information).
• A useful procedure to follow once a visualization has been designed is to
iteratively remove features and measure the loss of information being
conveyed.
•
•
• The problem with data scrubbing: (a) raw data showing lack of correlation;
• (b) scrubbed data revealing false correlation.
• Unbalanced scaling:Scaling is a powerful tool in visualization, since careful
selection of scale factors can reveal patterns and structures not visible in
unscaled views.
• However, scaling can be used to deceive the viewer into believing that a
trend is stronger or weaker than supported by the data. This can lead to what
Tufte refers to as the “lie factor,” which is the ratio between the raw data
change and the change as depicted in the visualization.
•
• Vis Lies: perspective distorts size in favor of closer objects.
• Range distortion:As mentioned in an earlier section, viewers often have an
expectation about the ranges for a particular data dimension; by setting this
range to be significantly different from this expectation, the user may be
deceived into misinterpretation.
• Abusing dimensionality:that errors in interpretation rise with the power of
the dimensionality being portrayed.
• Thus, our errors in judging volume are much worse than those for area,
which in turn are worse than those for length
• Visual Nonsense—Comparing Apples and Oranges:Visualizations are
designed to convey information, and it is important that the information be
meaningful.
• Visualizations are often created by combining data sets from different
sources.
• The visualization designer should attempt to avoid creating nonsense
graphics before they are presented to users.
• Losing Data in the Chart Junk:the importance of including labeled grid or
tick marks on visualizations that require quantitative assessment.
• The excessive use of such markings is an example of what Tufte referred to
as “chart junk.”
• Chart junk can be defined as any supplementary (nondata) graphics in a
visualization that are not necessary for the accurate interpretation of the data.
• In some visualization tasks, users can switch between qualitative overviews
and quantitative analysis.
• Raw versus Derived Data:A common practice is to compute an analytic
model of the data using curve/surface fitting to obtain a more visually
appealing result.
• In some visualizations, it is common practice to throw out all of the raw data
and only show the smooth approximation derived from that data.
•