ML Visualization NeurIPS Tutorial

Visualization for Machine Learning is a document that discusses how data visualization can be applied to machine learning. It provides an overview of the history and goals of data visualization, how visualizations work by encoding data visually, and some common techniques like color scales, guiding attention, interactive exploration, and faceting. It also discusses opportunities for applying visualization to machine learning, such as visualizing training data, model performance, interpretability, and high-dimensional data. The document aims to understand the state of the art in visualization and how those techniques can help apply machine learning models and communicate their results.

Visualization for Machine Learning

Fernanda Viégas @viegasf


Martin Wattenberg @wattenberg
Google Brain
PAIR
People + AI Research
Bringing Design Thinking and HCI
to Machine Learning
google.ai/pair
Today's Agenda
What is data visualization?
How does it work? What are some best practices?

How has visualization been applied to ML?


Overview of the landscape
Special case: high-dimensional data
Goals
Understand state of the art
Known best practices in visualization
Broad survey of existing applications to ML

Apply visualizations in your own situation


References to tools and libraries
References to literature
What is data visualization?
Transform data into visual encodings

What is it good for?


Data exploration
Scientific insight
Communication
Education

How to ensure it works well?


Engage the visual system in smart ways
Take advantage of pre-attentive processing
What is data visualization?
Transform data into visual marks

What is it good for? How is it different from statistics?


Data exploration (Vis: no specific question necessary)
Scientific insight (Classic Stats: you investigate a specific question*)
Communication (Vis & Stats: wonderful, complementary partners)
Education

How to ensure it works well?


Engage the visual system in smart ways
Take advantage of pre-attentive processing
*OK, maybe not in EDA, but visualization is the
key technique there anyway!
Predates computers...
William Playfair (1786)
Line, bar, pie charts were all
invented by the same person!

Aside from revolutionizing


graphics, Playfair was an
economist, engineer, and even
a secret agent.

(Image: Wikipedia)
Florence Nightingale (1858)
These charts led to the
adoption of better hygiene /
sanitary practices in military
medicine, saving millions of
lives.

Arguably the most effective


visualization ever!

This particular visualization


technique would be frowned
on today. Lesson: technique is
less important than having
the right data and right
message.

(Image: Wikipedia)
W. E. B. Du Bois (1900)
For the 1900 World's Fair, a
compendium of
visualizations. Many new
chart types!

Excellent example of
visualization aimed at
political change.

(Quartz)
What do these have in common?
Using special properties of the visual system to help us think.
What do these have in common?
Using special properties of the visual system to help us think.

Our visual system is like a GPU


- Incredibly good at a few special tasks
- With work, can be repurposed for more general situations
What do these have in common?
Using special properties of the visual system to help us think.

Our visual system is like a GPU


- Incredibly good at a few special tasks
- With work, can be repurposed for more general situations

All visualizations are made from a series of compromises.


How do visualizations work?
How do visualizations work?
Find visual encodings that
● Guide viewer's attention
● Communicate data to the viewer
● Let viewer calculate with data

On computer
● Interactive exploration
How do visualizations work?
Find visual encodings that
● Guide viewer's attention
● Communicate data to the viewer
● Let viewer calculate with data

On computer
● Interactive exploration
Encodings: some examples

Edmund Halley, 1686


Comparison A (2012): US Wind Map

Comparison B (2013): Earth.nullschool


Encodings: some theory
From perceptual psychology:
different encodings have different properties.

32.1

59.7

20.8

Position Length Area Slope Brightness Hue Text


Encodings: some theory
Good for communicating exact values...

32.1

59.7

20.8

Position Length Area Slope Brightness Hue Text


Encodings: some theory
Good for communicating ratios...

32.1

59.7

20.8

Position Length Area Slope Brightness Hue Text


Encodings: some theory
Good for drawing attention...

32.1

59.7

20.8

Position Length Area Slope Brightness Hue Text
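One concrete coding consequence of these rankings: if you encode a value as area, the radius must grow as the square root of the value, or ratios get wildly exaggerated. A minimal Python sketch, using the three values from the figure:

```python
import numpy as np

values = np.array([32.1, 59.7, 20.8])

# Position / length encodings: map the value linearly onto the axis.
lengths = values / values.max()

# Area encoding: perceived size tracks area, so radius must be sqrt(value).
# Scaling the radius linearly would show the 59.7 : 20.8 ratio (~2.9x)
# as an ~8.2x difference in area.
radii = np.sqrt(values / values.max())
areas = np.pi * radii ** 2

# The circle areas now preserve the data ratios.
print(areas[1] / areas[2])   # ~2.87, same as 59.7 / 20.8
```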


Special case: color scales
Intensively studied for decades…
Rogowitz & Treinish (1996)
Web article:

“Why Should Engineers and


Scientists Be Worried About Color?”

Conclusions:
● Rainbow scales: bad
● There is no “best” scale
Practically speaking...
When in doubt, use the "Color Brewer" site:
http://colorbrewer2.org

(Built by Cynthia Brewer, a cartographer)
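In code, "avoid rainbow scales" usually means reaching for a perceptually uniform colormap; matplotlib ships several, plus many ColorBrewer palettes under their original names. A small sketch (the specific colormap choices here are illustrative):

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")   # headless backend; no display needed
from matplotlib import colormaps

values = np.linspace(0, 1, 5)

# Perceptually uniform sequential scale: brightness rises monotonically.
good = colormaps["viridis"](values)      # (5, 4) array of RGBA colors

# A ColorBrewer diverging scale, for data with a meaningful midpoint.
diverging = colormaps["RdBu"](values)

# The classic rainbow scale the literature warns against: its brightness
# is non-monotonic, which creates false boundaries in the data.
bad = colormaps["jet"](values)

print(good.shape)
```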


And study continues to this day...

A dive into a very recent paper (CHI 2018)


Color scales
Color scales
Uh oh, colorblindness… (very common!)

Red-blind protanopia. See http://www.color-blindness.com/coblis-color-blindness-simulator/


Guiding attention
Pre-attentive processing
Count the 5s
Count the 5s
Theory: attention
(Colin Ware, Visual Thinking for Design)

Pre-attentive processing / "popout"

Under the right circumstances, visual search


can be parallel, rather than serial

Time to find target does not increase as


number of distractors increases
Pre-Attentive Processing

Color Shape
Layering & separation

after Tufte
Layering & separation

after Tufte
Theory: calculation
Calculation
Example: we naturally average sizes.
“Seeing Sets: Representation by Statistical Properties.” Dan Ariely (2001)
Calculation
We can do weighted averages, too!
Example
Calculation
Hertzsprung-Russell diagram (via Wikipedia)

Your eye is doing something like kernel density


estimation...

Source: Wikipedia
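The analogy can be made literal: the density your eye reads off a scatter plot is roughly what a kernel density estimator computes. A sketch with synthetic two-cluster data (not the H-R diagram's data):

```python
import numpy as np
from scipy.stats import gaussian_kde

rng = np.random.default_rng(0)

# Two blobs of points, like two clusters in a scatter plot.
pts = np.vstack([rng.normal(0, 1, (200, 2)),
                 rng.normal(5, 1, (200, 2))])

# Fit a KDE to the 2-D point cloud (scipy expects shape (d, n)).
kde = gaussian_kde(pts.T)

# Density is high at the blob centers, low in the gap between them,
# which is exactly the structure a viewer perceives.
centers = kde([[0, 5], [0, 5]])   # density at (0, 0) and (5, 5)
gap = kde([[2.5], [2.5]])         # density at (2.5, 2.5)
print(centers.min() > gap[0])
```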
How do visualizations work
- on computers?
How do visualizations work
- on computers?
Beyond static representations
● Interaction
● Conversation and collaboration
Theory: interaction
Shneiderman “mantra”:
(1996: “The Eyes Have It: A Task by Data Type
Taxonomy for Information Visualizations”)
● Overview first
● Zoom and filter
● Details on demand
Theory: interaction
Shneiderman “mantra”:
(1996: “The Eyes Have It: A Task by Data Type
Taxonomy for Information Visualizations”)
● Overview first
● Zoom and filter
● Details on demand

Example: dot maps


The Racial Dot Map: One Dot Per Person for the Entire U.S.
demographics.virginia.edu/DotMap/
Recap: How do visualizations work?
Find visual encodings that
● Guide viewer's attention
● Communicate data to the viewer
● Let viewer calculate with data

On computer
● Interactive exploration
Some common techniques
That could help in the ML context…

From the simple...


Case study: the humble table
We've talked to many, many ML teams

Every one of them displayed data in tables

Good design can make a huge difference


Design thinking in action, a little movie:

Remove to improve data tables


Joey Cherdarchuk
DarkHorse Analytics
Key points
- Structure & hierarchy
- Alignment
- Typography
- Color

These all apply to more complicated visualizations!


Some common techniques
That could help in the ML context…
Data density:
small multiples

Drought’s Footprint
Haeyoun Park, Kevin Quealy
NY Times
Data
faceting

Across U.S. Companies,


Tax Rates Vary Greatly
M. Bostock, M. Ericson, D.
Leonhardt, B. Marsh
NY Times
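Both techniques reduce to one idea in code: split the data on a category and repeat the identical chart. A minimal matplotlib sketch with made-up data and hypothetical group names:

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")   # headless backend
import matplotlib.pyplot as plt

rng = np.random.default_rng(1)
groups = ["A", "B", "C", "D"]   # hypothetical facet categories

# Small multiples: shared axes so the panels are directly comparable.
fig, axes = plt.subplots(2, 2, sharex=True, sharey=True, figsize=(6, 4))
for ax, g in zip(axes.flat, groups):
    ax.plot(rng.normal(size=50).cumsum())   # same encoding in every panel
    ax.set_title(g)
fig.tight_layout()

print(axes.shape)
```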
Back to machine learning!
Opportunities for Vis

Vis Opportunities

Source: Yannick Assogba


Framework: visualization uses in ML
1. Training Data
2. Model Performance
3. Interpretability + model inspection
4. High-dimensional data
5. Education and communication
1. Visualizing training data
Visualizing CIFAR-10
CIFAR-10 Facets Demo
Facets
Open-source visualization
pair-code.github.io/facets
Google Creative Lab
https://quickdraw.withgoogle.com/
Quick Draw, the data
https://quickdraw.withgoogle.com/data
When things look alike
across cultures

Machine Learning for Visualization


Let’s Explore the Cutest Big Dataset
Ian Johnson
And when they don’t

South Africa Russia Korea Brazil United States Germany

Visual Averages by Country


Kyle McDonald
Outlets

Germany Japan Malaysia Sweden

Visual Averages by Country


Kyle McDonald
Finding Nemo:
small multiples

Visual Averages by Country


Kyle McDonald
2. Performance monitoring (very briefly!)
Monitoring dashboards - apply standard visualization tools!

TensorBoard Visdom

Two examples among many...


3. Interpretability + model inspection
Convolutional NNs
Image classification: interpretability petri dish
Image classifiers are effective in practice

Exactly what they're doing is somewhat mysterious


- And their failures (e.g. adversarial examples) add to mystery

But: Way easier to inspect what’s going on in artificial classifiers than in human
classifiers ;-)

Since these are visual systems, it's natural to use visualization to inspect them
- What features are these networks really using?
- Do individual units have meaning?
- What roles are played by different layers?
- How are high-level concepts built from low-level ones?
Saliency maps - examples

More comparisons: https://pair-code.github.io/saliency/


Saliency maps
(a.k.a. "Sensitivity maps")

Idea: consider sensitivity of class to each pixel


i.e. grad(f), where f is function from pixels to class score.

Many ways to extend basic idea!


- Layer-wise relevance propagation (Binder et al.)
- Integrated gradients (Sundararajan et al.)
- Guided backprop (Springenberg et al.)
- etc.

Yet interpretation is slippery (Adebayo et al., Kindermans et al.)


- Tend to be visually noisy. Are these sometimes Rorschach tests?
- Are some of these methods essentially edge detectors?
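The core recipe, sensitivity of the class score to each pixel, fits in a few lines even without a deep-learning framework. Below is a toy linear "model" with a finite-difference gradient; the weights, image, and sizes are all made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
H = W = 8
w = rng.normal(size=(H * W,))   # toy linear "model": score = w . x
x = rng.random(H * W)           # toy "image" of H x W pixels

def score(x):
    return w @ x

# Finite-difference approximation of grad(score) w.r.t. each pixel.
eps = 1e-4
grad = np.array([
    (score(x + eps * np.eye(H * W)[i]) - score(x - eps * np.eye(H * W)[i]))
    / (2 * eps)
    for i in range(H * W)
])

# Saliency map: gradient magnitude, reshaped to the image layout.
saliency = np.abs(grad).reshape(H, W)

# Sanity check: for a linear model, the gradient is the weight vector.
assert np.allclose(grad, w, atol=1e-6)
```

In a real network you would get the gradient from autodiff rather than finite differences; the extensions listed above then modify how that gradient is propagated or accumulated.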
Visualizing arbitrary neurons along the way to the top...

Gray: trying to maximize neural response. Colorful squares: maximal examples from an image data set
Visualizing and Understanding Convolutional Networks
Zeiler & Fergus, 2013
Understanding Neural Networks Through Deep Visualization
Yosinski et al. , 2015
http://yosinski.com/deepvis
drawNet
Torralba
Deep Dream

deepdream
Mordvintsev, Tyka, Olah
Combining these
interpretability
ideas to create new
visualizations

The Building Blocks of


Interpretability
Olah, Satyanarayan, Johnson, Carter,
Schubert, Ye, Mordvintsev
Interpreting Deep Visual Representations
Bau, Khosla, Oliva, Torralba
RNNs
Visualizing text sequences, colored by activations of a cell

The Unreasonable Effectiveness of Recurrent Neural Networks


Karpathy, 2015
The Unreasonable Effectiveness of Recurrent Neural Networks
Karpathy, 2015
Seq2Seq-Vis:
Visual Debugging
Tool for Sequence-
to- Sequence
Models
Strobelt, 2018

Examine model
decisions
Connect decisions to
previous examples
Test alternative
decisions
Linking multiple
views...

DQNViz: A Visual
Analytics Approach
to Understand
Deep Q-Networks

Wang et al.,
VAST 2018.
4. High-dimensional data
Why high-dimensional data?
Vector spaces are the lingua franca of much of ML these days
- Data such as images, audio, video is naturally high-dimensional
- Dense representations of discrete data (e.g. word embeddings) have had major
successes
Why is it hard? Because it's impossible
Why is it hard? Because it's impossible

See Every Map Projection, Bostock.


Main approaches
Linear
- Principal Component Analysis
- Visualization of Labeled Data Using Linear Transformations (Koren & Carmel)

Non-linear (just a few of many)


- Multidimensional scaling
- Sammon mapping
- Isomap
- t-SNE
- UMAP
Main approaches
Linear
- Principal Component Analysis (show as much variation in data as possible)
- Visualization of Labeled Data Using Linear Transformations (clusters match labels)

Non-linear (just a few of many)


- Multidimensional scaling
- Sammon mapping
- Isomap Minimize distortion, according to some metric
- t-SNE
- UMAP
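As a concrete starting point, here is a PCA projection to 2-D with scikit-learn; the data is synthetic, with sizes and variances chosen only for illustration:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)

# 500 points in 50-D, with variance concentrated in a few directions.
X = rng.normal(size=(500, 50)) * 10.0 ** np.linspace(1, -2, 50)

pca = PCA(n_components=2)
X2 = pca.fit_transform(X)   # 2-D coordinates to feed a scatter plot

# The explained-variance ratio tells you how faithful the 2-D picture is.
print(X2.shape, pca.explained_variance_ratio_.sum())
```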
t-SNE
t-SNE
Fairly complex non-linear technique

Uses an adaptive sense of "distance." Translates well between geometry of high- and
low-dimensional space

Has become a standard tool, so we'll spend some time discussing how to read it.
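A minimal way to try it: scikit-learn's implementation on its small built-in digits set, a stand-in for MNIST. The perplexity value here is just one choice; as the next slides argue, you should try several:

```python
from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

X, y = load_digits(return_X_y=True)   # 1797 8x8 digit images
X, y = X[:500], y[:500]               # subsample to keep the demo fast

# Perplexity sets the effective neighborhood size; it "really matters".
emb = TSNE(n_components=2, perplexity=30, init="pca",
           random_state=0).fit_transform(X)

print(emb.shape)   # 2-D coordinates for a scatter plot, colored by y
```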
Demo: MNIST visualization

Embedding Projector
Open Source visualization tool
Also available on Tensorboard
projector.tensorflow.org/
"Close reading" a visualization technique
What's the right way to understand
a "magic" visualization technique?

See Distill article


"Close reading" a visualization technique
What's the right way to understand
a "magic" visualization technique?

More visualization, of course!


Those hyperparameters really matter
Those hyperparameters really matter
Cluster sizes in a t-SNE plot mean nothing
Cluster sizes in a t-SNE plot mean nothing
Distances between clusters may not mean much
Distances between clusters may not mean much
You can see some shapes, sometimes
You can see some shapes, sometimes
Let's try this out with MNIST
Stopping too soon yields weird
artifacts.
The 4's may not be separated into
two clusters.

Clusters seem about equally far


apart in 3D; may not actually be.
The cluster of 1's probably is long
and thin.
UMAP: New kid on the block
UMAP: New kid on the block
Practical value
- Faster than t-SNE
- Can efficiently embed into high dimensions (i.e. useful not just for visualization)
- Often seems to capture global structure better
UMAP: New kid on the block
Practical value
- Faster than t-SNE
- Can efficiently embed into high dimensions (i.e. useful not just for visualization)
- Often seems to capture global structure better

Theory
- Roughly: manifold learning combined with explicit topology
- In detail: I don't completely understand the theory!
- This note does an amazing job of extracting key bits of UMAP paper:
https://www.math.upenn.edu/~jhansen/2018/05/04/UMAP/
UMAP: New kid on the block
Comparison of UMAP (left) and t-SNE (right) from McInnes
& Healy.

Global structure does seem to emerge more in UMAP.

For more
Let's compare in real-time on an audio data set!
Comparative Audio Analysis With Wavenet, MFCCs, UMAP,
t-SNE and PCA
(Leon Fedden)
Putting this together
The Beginner's Guide to Dimensionality Reduction
Matthew Conlen and Fred Hohman

https://idyll.pub/post/dimensionality-reduction-293e465c2a3443e8941b016d/
(just Google "Beginner's Guide to Dimensionality Reduction")
Pitfalls of high-dimensional space
Geometry of high-dimensional space holds many surprises…
Be careful about interpreting visualizations!

Adding "usually," "most," and "approximately" where appropriate:

- Two random vectors are perpendicular


- A standard Gaussian distribution is just a uniform distribution on a sphere
- A random matrix is a scalar multiple of an orthogonal matrix
- Random walks all have the same shape
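The first two surprises are easy to check numerically; a sketch with an arbitrary dimension d = 1000:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 1000

# "Two random vectors are (approximately) perpendicular."
u, v = rng.normal(size=d), rng.normal(size=d)
cos = u @ v / (np.linalg.norm(u) * np.linalg.norm(v))
print(abs(cos))   # close to 0; concentration tightens as d grows

# "A standard Gaussian is (approximately) uniform on a sphere."
# Equivalently: the norm of a d-dim standard Gaussian concentrates
# tightly around sqrt(d).
norms = np.linalg.norm(rng.normal(size=(200, d)), axis=1)
print(norms.std() / norms.mean())   # tiny relative spread
```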
Example: PCA of gradient descent trajectories

Lorch, Visualizing Deep Network Training Trajectories, 2017
Li et al., Visualizing the Loss Landscape of Neural Nets, 2018
How to interpret? Compare random walks
It turns out that principal components of a random walk in a
high-dimensional space are (probably, approximately) cosines of
various frequencies! (Antognini, Sohl-Dickstein)

Can also see this via Karhunen-Loeve theorem for Brownian


motion.

Important: This doesn't invalidate work that uses PCA to look at


SGD trajectories. But it changes how we read the visualizations:
the interesting parts are differences from Lissajous patterns,
not similarities.

Antognini, Sohl-Dickstein. 2018
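A toy reproduction of the effect (not the authors' code; the sizes are arbitrary):

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
T, D = 500, 100

# A random walk: cumulative sum of Gaussian steps in D dimensions.
walk = rng.normal(size=(T, D)).cumsum(axis=0)

# Project the trajectory onto its top principal component...
pc1 = PCA(n_components=1).fit_transform(walk)[:, 0]

# ...and compare against a half-period cosine over the trajectory.
t = np.arange(T)
cosine = np.cos(np.pi * (t + 0.5) / T)
corr = abs(np.corrcoef(pc1, cosine)[0, 1])
print(corr)   # typically close to 1, despite the walk being pure noise
```

Seeing a clean curve here is the baseline, not a discovery; deviations from it are what carry information about SGD trajectories.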


Lesson
If you see something interesting in
high-dimensional space…

compare to a random baseline!


Model interpretability example
Multi-lingual translation
What does the language embedding space look like?

https://arxiv.org/abs/1611.04558
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

Melvin Johnson, Mike Schuster, Quoc V. Le, Maxim Krikun, Yonghui Wu, Zhifeng Chen, Nikhil Thorat, Fernanda Viégas,
Martin Wattenberg, Greg Corrado, Macduff Hughes, Jeffrey Dean
Training: English ← → Japanese
English ← → Korean
Japanese ← → Korean (zero shot)
Visualize internal representation ("embedding space")
Research question
What does the multi language embedding space look like?

or

Note: not real data


What does a sentence look like in embedding space?
(points in 1024-dim space: the data that the decoder receives)

E.g. “The stratosphere extends from 10km to 50km in altitude”


What does a sentence look like in embedding space?

Note: simplification of real situation!


What does a sentence look like in embedding space?
What do parallel sentences look like in embedding space?
(same meaning, different language)

like this?
<2en>

<2pt>

English
Portuguese
What do parallel sentences look like in embedding space?
(same meaning, different language)

or like this?

English
Portuguese
Interlingua?
Sentences with the same meaning mapped to similar regions regardless of language!
Distance between bridge / non-bridge sentences is inversely related to translation quality
5. Education and communication
Education & communication
for technical audiences
TensorFlow Playground
playground.tensorflow.org
GAN Lab
https://poloclub.github.io/ganlab/
Distill.pub
Editors: Carter, Olah, Satyanarayan
Görtler, Kehlbeck,
Deussen. 2018
Education & communication
for non-technical audiences
Attacking discrimination with smarter machine learning
research.google.com/bigpicture/attacking-discrimination-in-ml

Transform math into a visual,


interactive simulation that can be
used by a broader set of stakeholders
such as policymakers and regulators.

Wattenberg, Viégas, Hardt. 2016


Google Creative Lab
https://quickdraw.withgoogle.com/
On Quickdraw, users draw
common objects (e.g.
avocado), then see if the
algorithm has correctly
recognized the object.

You were asked to draw avocado, and


the neural net did not recognize it.
After users see the
recognition result,
Quickdraw shows
visual examples to help
users understand the
algorithm’s reasoning.

For example, it shows


examples of what
typical avocados look like.
It also shows a visual diff
between the user’s
drawing and the
most-similar drawings
from alternative classes.
Compare user input to
classes system thought
were closest
Show examples of what the
system expected for the class
in question

Illustrate latent space to users


Visual Analytics in Deep
Learning: An Interrogative
Survey for the Next
Frontiers
Hohman, Kahng, Pienta, Chau
Resources

ML-specific
- Stanford CS 231
- Sequences: Seq2Seq-vis, LSTMvis
- Embedding Projector
- Facets
- Lobe.ai
- A Survey: Visual Analytics in Deep Learning (Hohman et al)

General visualization & design
- Tableau (desktop app): commercial, state of the art, industrial-strength
- RawGraphs (web)
- Flourish.studio (web)
- Color Brewer
- Coblis: colorblindness simulator

Implementation
- D3 (see also bl.ocks.org)
- Notebooks: Observable, Jupyter
- Matplotlib
- Three.js
- Kepler.gl
- Plotly
Visualization for Machine Learning

Fernanda Viégas @viegasf


Martin Wattenberg @wattenberg
Google Brain
