0% found this document useful (0 votes)
6 views

UNIT 3B Data Visualization

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

UNIT 3B Data Visualization

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 42

Data Visualization

Lecture Notes Unit III B

Lea B. Villanueva
Course Facilitator
B. Data Visualization
Introduction
• Data visualization involves:
• Creating a summary table for the data
• Generating charts to help interpret, analyze, and learn from the data
• Uses of data visualization:
• Helpful for identifying data errors
• Reduces the size of your data set by highlighting important relationships
and trends in the data

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Overview of Data Visualization
Data-ink ratio: Measures the proportion of what Tufte terms “data-ink”
to the total amount of ink used in a table or chart
• Edward R. Tufte first described the data-ink ratio
• Helpful for creating effective tables and charts for data visualization
• Data-ink: Ink used in a table or chart that is necessary to convey the meaning of the data
to the audience
• Non-data-ink: Ink used in a table or chart that serves no useful purpose in conveying the
data to the audience

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Table 3.1: Example of a Low Data-Ink Ratio Table
Figure 3.3: Example of a Low Data-Ink Ratio Chart

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Tables
Table Design Principles
Crosstabulation
Pivot Tables in Excel
Recommended Pivot Table in Excel
Tables
Tables should be used when:
1. The reader needs to refer to specific numerical values
2. The reader needs to make precise comparisons between
different values and not just relative comparisons
3. The values being displayed have different units or very
different magnitudes

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Table 3.3: Table Showing Exact Values for Costs and Revenues by Month for Gossamer
Industries
Figure 3.5: Line Chart of Monthly Costs and Revenues at Gossamer Industries

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.6: Combined Line Chart and Table for Monthly Costs and Revenues at
Gossamer Industries

2024 Cengage. All Rights Reserved. May not be scanned,


copied or duplicated, or posted to a publicly accessible
website, in whole or in part.
Tables
Table Design Principles
• Avoid using vertical lines in a table unless they are necessary for
clarity
• Horizontal lines are generally necessary only for separating column
titles from data values or when indicating that a calculation has taken
place

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Comparing Different Table Designs

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Tables
• Crosstabulation: A useful type of table for describing data of two
variables
• PivotTable: A crosstabulation in Microsoft Excel

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Quality Rating and Meal Price for 300 Los Angeles Restaurants

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Crosstabulation of Quality Rating and Meal Price for 300 Los
Angeles Restaurants

The greatest number of restaurants in the sample (64) have a very good rating and
a meal price in the $20–29 range
Only two restaurants have an excellent rating and a meal price in the $10–19 range
The right and bottom margins of the crosstabulation give the frequency of quality
rating and meal price separately

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Excel Worksheet Containing Restaurant Data

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Initial PivotTable Field List and PivotTable Field Report for the
Restaurant Data
Completed PivotTable Field List and A Portion of the PivotTable
Report for the Restaurant Data (Columns H:AK are Hidden)

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Final PivotTable Report for the Restaurant Data

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
PivotTable Report for the Restaurant Data with
Average Wait Times Added

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Charts
Scatter Charts
Recommended Charts in Excel
Line Charts
Bar Charts and Column Charts
A Note on Pie Charts and Three Dimensional Charts

Bubble Charts
Heat Maps
Additional Charts for Multiple Variables
PivotCharts in Excel
Charts
• Charts (or graphs): Visual methods of displaying data
• Scatter chart: Graphical presentation of the relationship between
two quantitative variables
• Trendline: A line that provides an approximation of the relationship
between the variables
• Line chart: A line connects the points in the chart
• Useful for time series data collected over a period of time (minutes, hours,
days, years, etc.)

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in
Table 3.8: Sample Data for the San Francisco Electronics Store

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.17: Scatter Chart for the San Francisco Electronics Store

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Table 3.9: Monthly Sales Data of Air
Compressors at Kirkland Industries

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.19: Scatter Chart and Line Chart for Monthly Sales Data at
Kirkland Industries

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Table 3.10: Regional Sales Data by Month for Air Compressors at
Kirkland Industries

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.21: Line Chart of Regional Sales Data at Kirkland Industries

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Charts
Sparkline: Special type of line chart
• Minimalist type of line chart that can be placed directly into a
cell in Excel
• Contain no axes; they display only the line for the data
• Take up very little space and they can be effectively used to
provide information on overall trends for time series data

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Charts
• Bar Charts: Use horizontal bars to display the magnitude of the
quantitative variable
• Column Charts: Use vertical bars to display the magnitude of the
quantitative variable
• Bar and column charts are very helpful in making comparisons
between categorical variables

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.23:
Bar Charts for Accounts Managed Data

Gentry manages the greatest number of accounts and Williams


the fewest

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Charts
• Pie charts: Common form of chart used to compare categorical data
• Bubble chart:
• Graphical means of visualizing three variables in a two-dimensional
graph
• Sometimes a preferred alternative to a 3-D graph
• Heat map: A two-dimensional graphical representation of data that
uses different shades of color to indicate magnitude

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in
Charts
Additional Charts for Multiple Variables
• Stacked column chart: Allows the reader to compare the relative
values of quantitative variables for the same category in a bar
chart
• Clustered column (or bar) chart: An alternative chart to stacked
column chart for comparing quantitative variables
• Scatter chart matrix: Useful chart for displaying multiple
variables

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Charts
PivotCharts in Excel
PivotChart: To summarize and analyze data with both a
crosstabulation and charting, Excel pairs PivotCharts with
PivotTables

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.32: PivotTable and PivotChart for the Restaurant Data

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Advanced Data Visualization
• Advanced Charts
• Geographic Information Systems Charts

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Advanced Data Visualization
• Parallel-coordinates plot: Chart for examining data with more than
two variables
• Includes a different vertical axis for each variable
• Each observation is represented by drawing a line on the parallel
coordinates plot connecting each vertical axis
• The height of the line on each vertical axis represents the value taken by
that observation for the variable corresponding to the vertical axis
• Treemap: Useful for visualizing hierarchical data along multiple
dimensions

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.33: Parallel Coordinates Plot for
Baseball Data

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in
part.
Figure 3.34: SmartMoney’s Map of the Market as
an Example of a Treemap

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Advanced Data Visualization
Geographic Information Systems Charts
• Geographic Information Systems (GIS): A system that merges
maps and statistics to present data collected over different
geographies
• Helps in interpreting data and observing patterns

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Figure 3.35:
GIS Chart for Cincinnati Zoo Member Data

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Data Dashboards
• Data dashboard: Data visualization tool that illustrates multiple
metrics and automatically updates these metrics as new data
become available
• Key performance indicators (KPIs) in dashboards:
• Automobile dashboard: Current speed, Fuel level, and oil pressure
• Business dashboard: Financial position, inventory on hand, customer
service metrics

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Data Dashboards
Principles of Effective Data Dashboards
• Should provide timely summary information on KPIs that are important to
the user
• Should present all KPIs as a single screen that a user can quickly scan to
understand the business’s current state of operations
• The KPIs displayed in the data dashboard should convey meaning to its user
and be related to the decisions the user makes
• A data dashboard should call attention to unusual measures that may
require attention
• Color should be used to call attention to specific values to differentiate
categorical variables, but the use of color should be restrained

2024 Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
END OF PRESENTATION

2024 Cengage. All Rights Reserved. May not be scanned,


copied or duplicated, or posted to a publicly accessible
website, in whole or in part.

You might also like