IISE 2021 - Data Processing and Visualization
IISE 2021 - Data Processing and Visualization
Visualization Training
Contents
1 What is Datviz?
Why Datviz?
2
3 How to do Datviz?
Datviz
Fundamentals 5
4 Do’s and Don’ts
Anomaly
Detection 6
The discrepancy of data processing and
data visualization
1 2
Data Processing Data Visualization
Data is collected and translated into usable The graphic representation of data.
information.
Data Processing Life Cycle Most Data Visualization Application for Daily
Working Use
By using visual elements, data visualization tools provide an accessible way to see and
understand trends, outliers, and patterns in data.
Advantages of Data Processing and Data
Visualization
Source: 1 Source: 2
https://www.elprocus.com/data-processing-types-and-its-applications/
Data Visualization Starts with ‘Why’
“A beautiful chart ● The choice of what type of visualization to use isn’t purely aesthetic,
that no one can read nor is it entirely personal.
is just ● The wrong choice can lead your viewer to boredom, confusion, or both.
● Even worse, visualizing data inaccurately can constitute a breach of trust
abstract art.” between you and your audience.
Source: https://www.kaggle.com/getting-started/160583
Data Visualization Grammar and Vocabulary
https://github.com/ft-interactive/chart-doctor/tre https://www.kdnuggets.com/2018/08/data-visua
e/master/visual-vocabulary lization-cheatsheet.html
Data Visualization: ‘I can read this!’ > ‘Wow Cool!’
Concern Details Visualized
Although donut chart looks correct at first glance, it causes major problems.
Problems with ● Comparing sizes of different groups is error-prone.
donut chart ● Viewers are focused in the center of the chart, while useful data is
provided in the background.
A chart/plot looking great can have flaws. Some of the usual flaws include:
Good looking does
● Skewing perceptions, making it difficult to rate proportions, or even
not mean more
getting more value out of the plot.
readable!
● Adding additional objects in the chart may be obstructive.
Just as how bar chart is more often, if not always, better than pie chart, all
Heights are easier
height-characterized charts are usually easier to read, more informative and
to compare
has more value compared to non-height-characterized charts.
harder to interpret
color fill
DO
SCAN HERE
Don't mix too many colors together (2-3 is the most optimal number) and, if you
absolutely need to use colors, use palettes designed to be color blind friendly
Why should you bother if there is a solid set of colours ready to be used?
Early Anomaly Detection & Insight: Findings
Pattern, Trend and Outlier
Trend Pattern Outlier
In order to know the future, Sometimes we can gained insight In real life, we often facing several
sometimes we need to look back from certain behavior from the data abnormalities (noises) in a dataset
from past data itself
Outlier are the data points that need to
Trend usually used for knowing whether In order to predict the future outcomes,
be handled in data processing.
the future would be uptrend or we need to analyse the pattern in
downtrend dataset
https://towardsdatascience.com/15-data-exploration-techniques-to-go-from-data-to-insights-93f66e6805df
Our Experiences