IN4089 - Lecture 05 - Graphs and Dimensionality Reduction-Pdfjam
IN4089 - Lecture 05 - Graphs and Dimensionality Reduction-Pdfjam
Graphs
Parent
Depth
What is a graph 𝐺𝐺 ? • Graph that is
Child
𝐺𝐺 = (𝑉𝑉, 𝐸𝐸): 𝐸𝐸 ⊆ 𝑉𝑉 2 • Connected: all nodes can be reached
3 4
Networks/graphs and trees Node-link techniques
• Layout:
• Nodes should not overlap
• …
5 http://mbostock.github.com/d3/ex/force.html 6
Layout Layout
• Compute forces • Compute forces
• Move vertices according to forces • Move vertices according to forces
• Terminate on • Terminate on
• Fixed number of iteration • Fixed number of iteration
• Total energy below some • Total energy below some
threshold threshold
• Local minimum • Local minimum
• User input • User input
13 14
15 https://observablehq.com/@d3/tidy-tree 16
Idiom: treemap Idiom: icicle/sunburst
• data Icicle SunBurst
• tree
• 1 quantitative attribute at leaf nodes
• encoding
• area containment marks for hierarchical
structure
• Every level one row/circle
• rectilinear orientation
• size encodes quant attrib • No overlapping parent child – attributes easier
• tasks displayed
• query attribute (at leaf nodes)
• scalability • Space on interior nodes can be used
• 1M leaf nodes • Not as dense as Treemaps
https://observablehq.com/@d3/treemap 17 http://homes.cs.washington.edu/~jheer/files/zoo/ 18
19 20
Tree drawing idioms comparison Further reading
• data shown • Visual Analysis of Large Graphs: State-of-the-Art and Future Research Challenges. von
– link relationships Landesberger et al. Computer Graphics Forum 30:6 (2011), 1719–1749.
– tree depth • Simple Algorithms for Network Visualization: A Tutorial. McGuffin. Tsinghua Science and
– sibling order Technology (Special Issue on Visualization and Computer Graphics) 17:4 (2012), 383–398.
• design choices • Drawing on Physical Analogies. Brandes. In Drawing Graphs: Methods and Models, LNCS
– connection vs containment link marks Tutorial, 2025, edited by M. Kaufmann and D. Wagner, LNCS Tutorial, 2025, pp. 71–86.
What‘s a Springer-Verlag, 2001.
– rectilinear vs radial layout fitting data
– spatial position channels • http://www.treevis.net Treevis.net: A Tree Visualization Reference. Schulz. IEEE Computer
set for any
of these Graphics and Applications 31:6 (2011), 11–15.
• considerations
idioms? • Perceptual Guidelines for Creating Rectangular Treemaps. Kong, Heer, and Agrawala. IEEE
– information density?
Quantifying the Space-Efficiency of 2D Graphical Representations of Trees. Trans. Visualization and Computer Graphics (Proc. InfoVis) 16:6 (2010), 990–998.
• avoid wasting space McGuffin and Robert. Information Visualization 9:2 (2010), 115–140.
21 22
Further reading
• Visual Analysis of Large Graphs: State-of-the-Art and Future Research Challenges. von
Landesberger et al. Computer Graphics Forum 30:6 (2011), 1719–1749.
• Simple Algorithms for Network Visualization: A Tutorial. McGuffin. Tsinghua Science and
Technology (Special Issue on Visualization and Computer Graphics) 17:4 (2012), 383–398.
• Drawing on Physical Analogies. Brandes. In Drawing Graphs: Methods and Models, LNCS
Tutorial, 2025, edited by M. Kaufmann and D. Wagner, LNCS Tutorial, 2025, pp. 71–86.
Springer-Verlag, 2001.
• http://www.treevis.net Treevis.net: A Tree Visualization Reference. Schulz. IEEE Computer Questions!
Graphics and Applications 31:6 (2011), 11–15. Slides (partially) based on lectures by
• Perceptual Guidelines for Creating Rectangular Treemaps. Kong, Heer, and Agrawala. IEEE Anna Vilanova (TU Eindhoven)
Trans. Visualization and Computer Graphics (Proc. InfoVis) 16:6 (2010), 990–998. Tamara Munzner (University of British Columbia)
Thomas Höllt (TU Delft)
23
Recap: Idiom scatterplot matrix
• scatterplot matrix (SPLOM)
Dimensionality • rectilinear axes, point mark
Reduction •
•
all possible pairs of axes
scalability
• one dozen attributes
IN4089 Data Visualization
• dozen to hundreds of items
Martin Skrodzki – Computer Graphics & Visualization
• Interaction is crucial
3 4
Dimensionality Reduction Types of Dimensionality Reduction
• Filter attributes possible • Linear
Resulting attributes are linear combination of existing attributes (interpretable)
… but which ones?
• Principal Component Analysis (PCA)
• New space/embedding preserves • Linear Discriminant Analysis (LDA)
specific properties (e.g., variance, • …
5 6
PCA - Intuition Principal Component Analysis (PCA) in Brief
• Given a dataset with n attributes (n-dimensional problem)
• PCA:
• finds a new coordinate system obtained from the previous one by translation
and rotation only – changes the point of view
• moves the center of the coordinate system with center of the data
• moves the x-axis into the principal axis of variation
• orders axes by amount of variation (importance)
11 12
PCA PCA
• PCA transforms an n-dimensional space to an n-dimensional space
• In the new space dimensions are ordered by importance (highest variance)
PCA - Projection that best
• Dimensionality reduction: take the first m dimensions (m<n) represents the data variation
PC
13 14
16
MNIST dataset – Handwritten numbers t-SNE Intuition
0 • Non-linear dimensionality reduction 7
• Compute neighborhoods in hi-D 1
4
• Model low-D to preserve 9
neighborhoods 8
2
6
• Preserves local neighborhoods 3 5
17 18
distributions P/Q
• P: similarities in HD
• Q: random init
• Minimizie Kullback
Leiber Divergence
KLD(P,Q)
19 20
t-SNE in Brief t-SNE Parameters
• Computationally intensive • Perplexity
• compute high dimensional • Number of iterations
neighborhoods
• Learning rate
• optimize low dimensional neighborhoods
• Theta (for BH t-SNE)
(Many optimized implementations)
…
• Several parameters
• Some can severely impact results
https://distill.pub/2016/misread-tsne/
21 22
collaboration with
https://nicola17.github.io/tfjs-tsne-demo/
https://github.com/tensorflow/tfjs-tsne
23 T. Höllt et al.: Interactive Immune Cell Phenotyping for Large Single-Cell Datasets, EuroVis 2016
Dimensionality Reduction Theses Projects
• Dimensionality Reduction is an
active field of research
• Both application-driven and
theory-based projects possible
• Come talk to us if you are
interested!
• Tensorflow GPU t-SNE in javascript: https://nicola17.github.io/tfjs-tsne-demo/
• Many algorithms in javascript: https://github.com/saehm/DruidJS
• Further Read: Visualizing Dimensionally-Reduced Data: Interviews with Analysts and a
Characterization of Task Sequences, Brehmer, et al. In BELIV 2014
25
Questions!
Slides (partially) based on lectures by
Thomas Höllt (TU Delft)
Anna Vilanova (TU Eindhoven)
Tamara Munzner (University of British Columbia)