0% found this document useful (0 votes)
2 views23 pages

Slide 3 Ég

Part 3 of the document focuses on data visualization, covering topics such as data quality, interactive dashboards, and the use of Plotly for creating visual representations of data. It discusses the importance of identifying issues in data and outlines the process of building effective dashboards for data analysis. The section emphasizes the capabilities of Plotly as a tool for interactive graphing in Python.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views23 pages

Slide 3 Ég

Part 3 of the document focuses on data visualization, covering topics such as data quality, interactive dashboards, and the use of Plotly for creating visual representations of data. It discusses the importance of identifying issues in data and outlines the process of building effective dashboards for data analysis. The section emphasizes the capabilities of Plotly as a tool for interactive graphing in Python.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 23

ACCELERATING DATA

ENGINEERING
PIPELINES
Part 3: Data Visualization

1
Part 1: Data Formats

Part 2: ETL with


AGENDA NVTabular

Part 3: Data
Visualization
AGENDA – PART 3
• Data Quality
• Interactive Dashboards
• Plotly
DATA QUALITY
4
BECOMING A DATA DETECTIVE
What is wrong with the following data?

5
WHAT IS WRONG WITH THIS DATA?
North America Temperature

Mexico City, Washington, Ottawa,


Temperature
Mexico DC, USA Canada
Average 14 36 -7
Median 12 37 -8
Mode 12 36 -8
Min 5 25 -14
Max 22 43 -5

6
WHAT IS WRONG WITH THIS DATA?
Candy Bars Eaten in a Day
Frequency

Number of Candy Bars Eaten 7


WHAT IS WRONG WITH THIS DATA?
Candy Bars Eaten in a Day
Frequency

Number of Candy Bars Eaten 8


WHAT IS WRONG WITH THIS DATA?
Hours Playing Video Games

3
Hours Played

0
0 1 2 3

Day 9
WHAT IS WRONG WITH THIS DATA?
Hours Playing Video Games

3
Hours Played

0
0 1 2 3

Mike Jacqui Marina

Day 10
WHAT IS WRONG WITH THIS DATA?
United States Precipitation

11
INTERACTIVE
DASHBOARDS
12
THE INTERNET
The world’s largest system of digital data?

13
A DASHBOARD DATA PIPELINE
A Simple Server

1: User asks for


information

Client
2: Server returns Server
webpage with all
data

3: User’s
computer filters
data for
interactivity

14
DASHBOARDS FOR BIG DATA
A More Complex Server

1: User asks
for information

2: Server
returns
webpage Server Server
Client CPU 4: CPU GPU
3: User requests
requests computation
filtered data from GPU
6: server
5: GPU returns
returns filtered
result
data

15
MAKING A GOOD DASHBOARD

Date: 1969-07- Show


20 Zeros

16
PLOTLY DASH
17
PLOTLY DASH
The front end for ML and data science models

plotly.py is an interactive, open-


source, and browser-based graphing
library for Python

Supports line plots, scatter plots, area


charts, bar charts, error bars, box plots,
histograms, heatmaps, subplots,
multiple-axes, polar charts, and bubble
charts.

https://github.com/plotly/plotly.py

18
MAKING A GOOD DASHBOARD

Div
Date: 1969-07- Show
20 Zeros Div
DatePickerSingl BooleanSwitch
e id=“show-
id=“date- zeros”
picker”
Graph
id=“precipitation-map”

19
MAKING A GOOD DASHBOARD

Div
Div
DatePickerSingl BooleanSwitch
e id=“show-
id=“date- zeros”
picker”
Graph
id=“precipitation-map”

20
MAKING A GOOD DASHBOARD

@app.callback(
Output(‘precipitation-map`, `figure`),
[Input(‘date-picker’, ‘date’), # 1st
input
Input(‘show-zeros’, ‘on’)]) # 2nd
input
def make_graph(first_input,
second_input):

regular python function stuff

return fig # Must match Output data


type

21
LET’S GO!

22
28

You might also like