Unit V DVT Unit 5 Notes
Unit V DVT Unit 5 Notes
Unit VI
Research Direction in Visualizations
• Issues of Data
Additional information regarding the data being shown (e.g., labels) and the
mapping (e.g., a color key) are also essential to facilitate interpretation, and
must be integrated into the visualization.
Design stages:
1. Intuitive Mappings from Data to Visualization
2. Selecting and Modifying Views
3. Information Density—When Is It Too Much or Too Little?
4. Keys, Labels, and Legends
5. Using Color with Care
6. The Importance of Aesthetics
Intuitive Mappings from Data to Visualization
For example, in Figure 12.1, images of planets are used to plot the relationship
between the distance from the planet to the sun and the duration of its orbit.
Using intuitive scatterplot symbols to show the distance from planets to the sun
versus the duration of a single orbit.
Except for fairly simple data sets, one view is rarely sufficient to convey all the
information contained in the data.
View modifications fall into a number of categories, and their inclusion as part
of the functionality should be considered, based on user priorities.
Scrolling and zooming operations are needed if the entire data set cannot be
presented at the resolution desired by the user.
4.Scale control permits the user to modify the range and distribution of values
for a particular data field prior to its mapping. Similarly, clipping and other
forms of filtering allow the user to focus on data subsets.
Three views of the IRIS data set (scatterplot matrix, star glyphs, and parallel
coordinates). (Image from XmdvTool.)
The first, which might be called “gratuitous graphics,” occurs when there is
very little information to present.
One method is to give the user the option of disabling or enabling different
components of the display
This supporting information should begin with a detailed caption indicating the
particular data fields being displayed, and the mappings that were used.
sufficiently emphasized, viewers don’t have sufficient cues for guiding their
inspection (see Figure 12.11).
• Simplicity. Don’t try to cram too much information in one display (see
Section 12.1.3), and don’t use graphics gimmick simply because they are
available (e.g., using 3D Phong shaded histograms when a bar or line chart
could convey the same information).
• A useful procedure to follow once a visualization has been designed is to
iteratively remove features and measure the loss of information being
conveyed.
•
• The problem with data scrubbing: (a) raw data showing lack of correlation;
• (b) scrubbed data revealing false correlation.
• Unbalanced scaling:Scaling is a powerful tool in visualization, since careful
selection of scale factors can reveal patterns and structures not visible in
unscaled views.
• However, scaling can be used to deceive the viewer into believing that a
trend is stronger or weaker than supported by the data. This can lead to what
Tufte refers to as the “lie factor,” which is the ratio between the raw data
change and the change as depicted in the visualization.
•
• Vis Lies: perspective distorts size in favor of closer objects.
• Range distortion:As mentioned in an earlier section, viewers often have an
expectation about the ranges for a particular data dimension; by setting this
range to be significantly different from this expectation, the user may be
deceived into misinterpretation.
• Abusing dimensionality:that errors in interpretation rise with the power of
the dimensionality being portrayed.
• Thus, our errors in judging volume are much worse than those for area,
which in turn are worse than those for length
• Visual Nonsense—Comparing Apples and Oranges:Visualizations are
designed to convey information, and it is important that the information be
meaningful.
• Visualizations are often created by combining data sets from different
sources.
• The visualization designer should attempt to avoid creating nonsense
graphics before they are presented to users.
• Losing Data in the Chart Junk:the importance of including labeled grid or
tick marks on visualizations that require quantitative assessment.
• The excessive use of such markings is an example of what Tufte referred to
as “chart junk.”
restricted to only using a small number of distinct values for each graphical
attribute being used to convey information.
• Bounding boxes, grids, and tick marks are all excellent tools for converting
an absolute judgment task to one that depends more on relative judgment.
3. Issues of Data:
• Many of the current and proposed future research activities are centered on
expanding the characteristics of data that can be effectively visualized. Some
of these are discussed here.
• Scale:Perhaps the most frequently addressed problem dealing with data is
finding solutions to coping with ever-increasing sizes for data sets.
• Static versus dynamic:While most visualization techniques to date have
been developed with the assumption that data is static (e.g., in files or
databases), a growing interest is in the area of visual analysis of dynamic
data [405].
• An increasing number of streaming data sources are being studied in the
database and data mining communities, and efforts to perform visual
analysis on this type of data are starting to emerge.
• Spatial versus nonspatial data:A growing number of application areas for
visualization include both spatial and nonspatial data, including many
scientific and engineering fields.
• To provide analysts with a powerful environment for studying this data,
several recent efforts have focused on the integration of the spatial
visualization techniques normally found in scientific visualization with the
nonspatial techniques that are common in information visualization.
• Nominal versus ordinal:The graphical attributes to which we map data in
our visualizations, such as position, size, and color, are primarily
quantitative in nature, while it is quite common to have data that is not
quantitative, such as the name of a gene or the address of an employee.
• If this nominal data is to be used in the visualization, a mapping is needed.
• However, it is also important to ensure that relationships derived from visual
analysis are truly part of the data, and not an artifact of the mapping.
• Structured versus nonstructured:Data can be classified based on the
degree to which it follows a predictable structure For example, tables of
numbers would be considered highly structured, while newspaper articles
may be regarded as unstructured.