IDV-05-Visualization For Multivariate Data
05
Visualization Techniques for Multivariate Data
IDV 2019/2020
Notice
! Author
! This material can be freely used for personal or academic purposes without
prior authorization from the author, provided that this notice is kept
with it.
! For commercial purposes the use of any part of this material requires the
! Introduction
! Point-Based Techniques
! Line-Based Techniques
! Region-Based Techniques
! Combinations of Techniques
Introduction
Dataset Types
! Four basic dataset types: tables, networks, fields (continuous), and geometry.
! [Figure 2.4. The detailed structure of the four basic dataset types. Source: Tamara Munzner, "What: Data Abstraction"]
! [Figure 2.5. In a simple table of orders, a row represents an item, a column represents an attribute, and their intersection is the cell containing the value for that pairwise combination.]
! A multidimensional table has a more complex structure for indexing into a cell, with multiple keys.
! A synonym for networks is graphs. The word graph is also deeply overloaded in vis.
Multivariate Data
! Point-Based Techniques
" Records are projected into a k-dimensional display space, such that data records map to k-dimensional points (e.g., scatterplots).
! Line-Based Techniques
" Points corresponding to a particular record or dimension are linked together with lines.
! Region-Based Techniques
" Filled polygons are used to convey values, based on their size, shape, color, or other attributes.
Point-Based Techniques
! Their success stems from our innate abilities to judge relative position within a
bounded space
! As the dimensionality of the data increases, the choices for visual analysis consist of:
! dimension reduction (to transform the high-dimensional data to data of lower dimension).
! Scatterplots
y = mx + b; m = -12.5 and b = 50
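The slide does not say how the trend-line coefficients were obtained. As a minimal sketch, assuming an ordinary least-squares fit, the slope m and intercept b of a line through a scatterplot could be computed as follows. The function name `fit_line` and the sample points are hypothetical; the points were chosen to lie exactly on the line quoted above.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = m*x + b; returns (m, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    m = sxy / sxx
    b = mean_y - m * mean_x
    return m, b


# Hypothetical sample points lying exactly on y = -12.5x + 50.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [-12.5 * x + 50 for x in xs]
m, b = fit_line(xs, ys)
print(m, b)
```

With noisy data the fitted values would only approximate the underlying slope and intercept.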
...
# data is the data frame with all variables
# snc is the subset of numerical variables of interest
! Principal Component Analysis (PCA) - read more and see this implementation
https://en.wikipedia.org/wiki/Principal_component_analysis
http://www.nlpca.org/pca_principal_component_analysis.html
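To make the idea of dimension reduction with PCA concrete, here is a minimal sketch that finds only the first principal component, using power iteration on the covariance matrix. This is an illustrative choice, not the method used by the linked implementations; the function names and the sample data are hypothetical.

```python
import math


def first_principal_component(rows, iterations=200):
    """Return (column means, unit direction of greatest variance)."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d).
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    v = [1.0] * d  # arbitrary non-zero starting vector (an assumption)
    for _ in range(iterations):
        # Multiply by the covariance matrix, then renormalize to unit length.
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return means, v


def project(rows, means, direction):
    """Project each centered record onto the given direction (1D scores)."""
    return [sum((r[j] - means[j]) * direction[j] for j in range(len(direction)))
            for r in rows]


# Hypothetical 2D records lying close to the line y = 2x.
rows = [[1.0, 2.0], [2.0, 4.1], [3.0, 5.9], [4.0, 8.0]]
means, pc1 = first_principal_component(rows)
scores = project(rows, means, pc1)
print(pc1, scores)
```

The scores are the 1D coordinates of the records along the direction of greatest variance; further components would require deflation or a full eigendecomposition.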
! The key goal is to attempt to maintain the N-dimensional features and characteristics of the
data through the projection process, e.g., relationships that exist in the original data must also
exist after projection.
" The projection may also unintentionally introduce artifacts that may appear in the
visualization and are not present in the data.
! The projection is computed iteratively, where D is the M x M matrix of pairwise distances between the M original records:
" Create a coordinate matrix of size M x K (K display dimensions) and fill it randomly or with another method (e.g., PCA).
" Compute an M x M matrix L of pairwise distances from these coordinates, and compute the stress S as the difference between D and L.
" Shift the positions of the projected points in a direction that will reduce their individual stress levels.
" Repeat the last two steps.
" As in any optimization process, there is the potential to fall into a local minimum configuration that
still has a high level of stress.
" Common strategies to alleviate this include occasionally adding a random jump in the
position of a point to see if it will converge to a different location
" Obviously, the results are not unique: minor changes in the starting conditions can lead to
dramatically different results.
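The iterative procedure described above (random start, compare distance matrices, shift points to reduce stress) can be sketched as follows. This is a minimal illustration assuming Euclidean distances, a sum-of-squares stress, and a fixed step size; the names `mds`, `stress`, `rate`, and `steps`, and the sample records, are hypothetical and not taken from the slides.

```python
import math
import random


def pairwise_distances(points):
    """M x M matrix of Euclidean distances between all pairs of points."""
    return [[math.dist(p, q) for q in points] for p in points]


def stress(D, L):
    """Sum of squared differences between target (D) and layout (L) distances."""
    m = len(D)
    return sum((D[i][j] - L[i][j]) ** 2 for i in range(m) for j in range(i + 1, m))


def mds(D, k=2, steps=500, rate=0.05, seed=0):
    """Iteratively place M points in k dimensions so their distances approach D."""
    rng = random.Random(seed)
    m = len(D)
    coords = [[rng.uniform(-1, 1) for _ in range(k)] for _ in range(m)]  # random start
    for _ in range(steps):
        L = pairwise_distances(coords)
        moved = [row[:] for row in coords]
        for i in range(m):
            for j in range(m):
                if i == j or L[i][j] == 0:
                    continue
                # Push i away from j if the pair is too close, pull it in if too far.
                scale = rate * (D[i][j] - L[i][j]) / L[i][j]
                for a in range(k):
                    moved[i][a] += scale * (coords[i][a] - coords[j][a])
        coords = moved
    return coords


# Hypothetical 3D records: the corners of a unit square in the plane z = 0.
records = [[0, 0, 0], [1, 0, 0], [1, 1, 0], [0, 1, 0]]
D = pairwise_distances(records)
layout = mds(D)
print(stress(D, pairwise_distances(layout)))
```

Changing `seed` changes the random start and can change the resulting layout, which illustrates why the results are not unique.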
[Figure legend: Iris setosa, Iris versicolor, Iris virginica]
! RadViz is a force-driven point layout technique based on Hooke's law for equilibrium.
! For an N-dimensional data set, N anchor points are placed on the circumference of a circle to
represent the fixed ends of the N springs attached to each data point.
! Different placements and orderings of the anchors will give different results, and points
that are quite distinct in N dimensions may map to the same location in 2D.
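The spring equilibrium can be sketched as follows, assuming each value has already been normalized to [0, 1], acts as the stiffness of the spring to its anchor, and the anchors are spaced evenly on a unit circle. Under those assumptions the equilibrium position is the value-weighted average of the anchor positions. The function name `radviz_point` is hypothetical.

```python
import math


def radviz_point(values):
    """Map one record (values normalized to [0, 1]) to its 2D equilibrium position."""
    n = len(values)
    total = sum(values)
    if total == 0:
        return (0.0, 0.0)  # no spring pulls: leave the point at the center
    x = y = 0.0
    for i, v in enumerate(values):
        angle = 2 * math.pi * i / n  # anchor i on the unit circle
        x += v * math.cos(angle)
        y += v * math.sin(angle)
    return (x / total, y / total)


# Two distinct records that land on the same location (the center).
a = radviz_point([1.0, 1.0, 1.0, 1.0])
b = radviz_point([0.2, 0.2, 0.2, 0.2])

# The same record under two different dimension orderings.
p = radviz_point([1.0, 1.0, 0.0, 0.0])
q = radviz_point([1.0, 0.0, 1.0, 0.0])
print(a, b, p, q)
```

The first pair shows how distinct records can overlap in 2D; the second pair shows how reordering the dimensions moves the same record.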
! RadViz: different views of the same data set in RadViz, using manual reordering of dimensions.
Line-Based Techniques
! Line Graphs
! Parallel Coordinates
! Introduced by Inselberg in 1985
! Check https://eagereyes.org/techniques/parallel-coordinates
! Check https://syntagmatic.github.io/parallel-coordinates/
! Check http://www.xdat.org/
! Check http://www.parallelcoordinates.de/paco/#
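As a minimal sketch of the geometry behind parallel coordinates: each dimension gets its own vertical axis, each value is scaled to [0, 1] along its axis, and each record becomes a polyline joining its positions on consecutive axes. The function name, the choice of x = axis index, and the sample records are assumptions made for illustration.

```python
def parallel_coordinates_polylines(records):
    """Return one polyline (list of (x, y) vertices) per record."""
    n_dims = len(records[0])
    lows = [min(r[d] for r in records) for d in range(n_dims)]
    highs = [max(r[d] for r in records) for d in range(n_dims)]
    polylines = []
    for r in records:
        line = []
        for d in range(n_dims):
            span = highs[d] - lows[d]
            # Scale to [0, 1]; a constant dimension is drawn at mid-height.
            y = (r[d] - lows[d]) / span if span else 0.5
            line.append((d, y))  # axis d is the vertical line x = d
        polylines.append(line)
    return polylines


# Hypothetical records with three dimensions.
records = [[1, 10, 100], [2, 30, 50], [3, 20, 0]]
polylines = parallel_coordinates_polylines(records)
print(polylines)
```

Passing each polyline to any line-drawing routine yields the plot; reordering the dimensions changes which pairwise relationships appear between adjacent axes.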
! circular bar charts: like circular line graphs, but plotting bars on the base line;
! circular area graphs: like a line graph, but with the area under line filled in with a color
or texture;
! circular bar graphs: with bars that are circular arcs with a common center point and
base line.
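For the first variant in the list above (bars plotted on a circular base line), the geometry can be sketched with a polar-to-Cartesian conversion. This assumes non-negative values, evenly spaced angles, and bars that point outward from the base circle; `circular_bars`, `base_radius`, and `max_length` are hypothetical names.

```python
import math


def circular_bars(values, base_radius=1.0, max_length=1.0):
    """Return one ((x0, y0), (x1, y1)) segment per value, starting on the base circle."""
    n = len(values)
    top = max(values)
    bars = []
    for i, v in enumerate(values):
        angle = 2 * math.pi * i / n
        # The largest value gets the full bar length; others scale linearly.
        length = max_length * v / top if top else 0.0
        r0, r1 = base_radius, base_radius + length
        bars.append(((r0 * math.cos(angle), r0 * math.sin(angle)),
                     (r1 * math.cos(angle), r1 * math.sin(angle))))
    return bars


# Hypothetical values for four categories.
bars = circular_bars([2, 4, 1, 3])
print(bars)
```

A circular area graph would instead connect the outer end points and fill the region between them and the base circle.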
https://brilliant.org/wiki/polar-curves/
circular bar charts: like circular line graphs, but plotting bars on the base line
https://datavizcatalogue.com/methods/radial_bar_chart.html
circular bar graphs: with bars that are circular arcs with a common center point and base line.
https://www.r-graph-gallery.com/circular-barplot/
Interactive Data Visualization
Region-Based Techniques
! Bar Charts/Histograms
! Bar Charts
! Tabular Displays
" Heatmaps are created by displaying the table of record values using color rather than
text. All data values are mapped to the same normalized color space, and each is
! The table lens combines all these ideas and includes a level-of-detail mechanism for providing
panning and zooming capabilities to display whole table views, while still providing some detail
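The heatmap mapping described above, where all values share one normalized color space, can be sketched as follows. This is a minimal illustration assuming a linear gray scale from 0 to 255 and a single value range taken over the whole table; the function name and the sample table are hypothetical.

```python
def heatmap_gray_levels(table):
    """Map every cell to a gray level 0-255 using one shared value range."""
    cells = [v for row in table for v in row]
    low, high = min(cells), max(cells)
    span = high - low
    # A table with a single distinct value maps every cell to level 0.
    return [[round(255 * (v - low) / span) if span else 0 for v in row]
            for row in table]


# Hypothetical table of record values.
table = [[0, 4], [10, 2]]
levels = heatmap_gray_levels(table)
print(levels)
```

Normalizing each column separately instead would make attributes with different units comparable, at the cost of no longer sharing a single scale.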
! Dimensional Stacking
" Begin with data of dimension 2N + 1 (for an even number of dimensions there would be
" Choose one of the dimensions to be the dependent variable. The rest will be
considered independent
" Create ordered pairs of the independent dimensions (N pairs) and assign to each pair a
" The pair corresponding to speed 1 will create a virtual image whose size coincides with
the cardinality of the dimensions (the first dimension in the pair is oriented horizontally,
Figure 8.19. An example of 4D data visualized using dimensional stacking. The data consists
of drill-hole data, with three spatial dimensions, and the ore grade as the fourth
dimension.
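The nesting used by dimensional stacking can be sketched as a cell-index computation. This assumes the independent dimensions have been discretized into bins, that there is an even number of them, and that they are listed as horizontal/vertical pairs with the fastest-varying pair (speed 1) first; the steps above are partly cut off, so these details are assumptions. The function name `stacked_position` is hypothetical.

```python
def stacked_position(bins, cardinalities):
    """Return the (x, y) cell of one record.

    bins and cardinalities list the independent dimensions as
    [h1, v1, h2, v2, ...], fastest-varying pair first.
    """
    x = y = 0
    x_stride = y_stride = 1
    for k in range(0, len(bins), 2):
        # Each slower pair steps over whole copies of the faster pairs' image.
        x += bins[k] * x_stride
        y += bins[k + 1] * y_stride
        x_stride *= cardinalities[k]
        y_stride *= cardinalities[k + 1]
    return x, y


# Hypothetical setup: four independent dimensions with 3, 2, 4 and 5 bins,
# giving a display grid of 12 x 10 cells.
cardinalities = [3, 2, 4, 5]
last_cell = stacked_position([2, 1, 3, 4], cardinalities)
next_block = stacked_position([0, 0, 1, 0], cardinalities)
print(last_cell, next_block)
```

The value of the dependent variable would then set the color of the cell at the returned position.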
Combinations of Techniques
! Many others
" Interactive Data Visualization: Foundations, Techniques, and Applications, Matthew O. Ward
! Supplemental readings:
" Classical point-based techniques have limited dimensionality (scatter-based)
! Line-based
! Region-based
! Combination Techniques
" Dense
! Glyphs