C6 - Comm213
C6 - Comm213
“Visualizations gives you answers to questions you didn’t know you had” – Schneiderman
Basics of data visualization:
93% of all communication is nonverbal
70% of all sensory receptors are in eye
90% of information transmitted to brain is visual
40% of people respond better to visual than other
50% of your brain is active in visual processing
Tools:
Excel: add a chart to numerical analysis
Power BI & Tableau: large datasets, blend/merge datasets, charts, dashboards
3. Which type of data is being visualized, categorical or numerical? art > science
Charts for categorical:
1. Comparison:
- Pie chart: 5 or less categories, slides is equal to proportion (# observation/ grand total) of
each category
- Bar chart: proportion of each category, easy to interpret b/c brain is more skilled at
comparing the height of columns, stacked bar charts, discrete (whole number+ finite)
- Stacked bar chart: difficult to read, confuse audience if more than 4/5 layers or too many
bars, take less space. Good when break down and compart part of the whole
- Tree map: use size and color, +: pattern, outliers, many categories. - : not exact
proportion/number (need to add labels showing number
- Clustered bar: two categorical variables. Bares are grouped by positions for 1 categorical,
and color indicating second categorical
Heat map: +: visualize data across multiple categorical, patterns, correlation. Different
shapes of colors to indicate different values so no exact proportion or number
2. Geographic data: little dots unlike filled which is shading
- Symbol map: +: proportions across geographical areas, - : not exact proportion or number
3. Text data: word cloud: frequency of each word mentioned in a data
base, high frequency, large + bold font. Good for open ended
questions
Charts for numerical: above but word cloud works for them too since more
complex the following are best:
1. Outlier detection:
- Box and whisker plots and histogram: distribution and identify outliers
Answers why
Deceptive Charts:
- Representations intentionally or unintentionally designed to mislead the viewer about
data being presented: Manipulates scales, axes, or data to create false perceptions
Bars must always have 0 start and give wrong impression if divided into very large or very small
increments
Be careful with dual axis: arbitrarily choosing axis ranges, can make different data series look as
correlated as we like
Make sure scatter plot is standard direction
3D graphs can be deceptive b/c may distort the scale even when number are accurate