Chap003 - Vizualize & Describe Data
Chap003 - Vizualize & Describe Data
McGraw-Hill/Irwin Copyright © 2015 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3
Describing Data Visually
Chapter Contents
3.1 Stem-and-Leaf Displays and Dot Plots
3.2 Frequency Distributions and Histograms
3.3 Effective Excel Charts
3.4 Line Charts
3.5 Bar Charts
3.6 Pie Charts
3.7 Scatter Plots
3.8 Tables
3.9 Deceptive Graphs
3-2
Chapter 3
Describing Data Visually
3-3
Chapter 3
Describing Data Visually
3-4
Chapter 3
3.1 Dot Plots
3-5
Chapter 3
3.1 Dot Plots
3-6
Chapter 3
3.1 Dot Plots
• Preliminary Assessment
• Look at the data and visualize how they were collected
and measured.
• Sorting (Example: Price/Earnings Ratios)
• Sort the data as a first step and then summarize in a
graphical display. Here are the sorted P/E ratios (values
from Table 3.2).
3-7
Chapter 3
LO3-1 3.1 Stem-and-leaf Displays and
Dot Plots
The type of graph you use to display your data is dependent on
the type of data you have. Some charts are better suited for
quantitative data, while others are better for displaying
categorical data.
LO3-1: Make a stem-and-leaf or dot plot.
Stem-and-Leaf Plot
One simple way to visualize small data sets is a stem-and-leaf
plot. The stem-and-leaf plot is a tool of exploratory data
analysis (EDA) that seeks to reveal essential data features in an
intuitive way. A stem-and-leaf plot is basically a frequency tally,
except that we use digits instead of tally marks. For two-digit or
three-digit integer data, the stem is the tens digit of the data, and
the leaf is the ones digit. 3-8
Chapter 3
LO3-1 3.1 Stem-and-Leaf Displays and
Dot Plots
For the 44 P/E ratios, the stem-and-leaf plot is given below.
For example, the data values in the fourth stem are 31, 37, 37, 38. We always use
equally spaced stems (even if some stems are empty). The stem-and-leaf can reveal
central tendency (24 of the 44 P/E ratios were in the 10–19 stem) as well as
dispersion (the range is from 7 to 59). In this illustration, the leaf digits have been
sorted, although this is not necessary. The stem-and-leaf has the advantage that we
can retrieve the raw data by concatenating a stem digit with each of its leaf digits. For
example, the last stem has data values 50 and 59.
3-9
Chapter 3
LO3-1 3.1 Dot Plots
Dot Plots
• A dot plot is the simplest graphical display of n individual values of
numerical data.
- Easy to understand.
- It reveals dispersion, central tendency, and the shape of the distribution .
Note: If more than one data value lies at about the same axis
location, the dots are stacked vertically.
3-10
Chapter 3
LO3-1 3.1 Dot Plots
3-11
Chapter 3
LO3-1 3.1 Dot Plots
Comparing Groups
• A stacked dot plot compares two or more groups using a common
X-axis scale.
3-12
Chapter 3
LO3-2
3.2 Frequency Distributions and
Histograms
LO3-2: Create a frequency distribution for a data set
3-13
Chapter 3
LO3-2
3.2 Frequency Distributions and
Histograms
LO3-2: Create a frequency distribution for a data set
3-14
Chapter 3
3.2 Frequency Distributions and
LO3-2 Histograms
Constructing a Frequency Distribution
- Herbert Sturges proposed the following rule:
3-15
Chapter 3
3.2 Frequency Distributions and
LO3-2 Histograms
3-16
Chapter 3
3.2 Frequency Distributions and
LO3-2 Histograms
Histograms
3-17
Chapter 3
3.2 Frequency Distributions and
LO3-3 Histograms
LO3-3: Make a histogram with appropriate bins.
• Consider 3 histograms for the P/E ratio data with different bin
widths. What do they tell you?
3-18
Chapter 3
3.2 Frequency Distributions and
LO3-3 Histograms
3-19
Chapter 3
3.2 Frequency Distributions and
LO3-3 Histograms
Modal Class
• A histogram bar that is higher than those on either side.
• Unimodal – a single modal class.
• Bimodal – two modal classes.
• Multimodal – more than two modal classes.
• Modal classes may be artifacts of the way bin limits are chosen.
3-20
Chapter 3
3.2 Frequency Distributions and
LO3-4 Histograms
LO3-4: Identify skewness, modal classes, and outliers in a histogram.
Shape
• A histogram may suggest the shape of the population.
• It is influenced by the number of bins and bin limits.
• Skewness – indicated by the direction of the longer tail of the
histogram.
• Left-skewed – (negatively skewed) a longer left tail.
• Right-skewed – (positively skewed) a longer right tail.
• Symmetric – both tail areas are the same.
• 3-21
Chapter 3
3.2 Frequency Distributions and
LO3-4 Histograms
3-22
Chapter 3
3.3 Effective Excel Charts
3-23
Chapter 3
LO3-5 3.4 Line Charts
LO3-5: Make an effective line chart.
3-24
Chapter 3
LO3-5 3.4 Line Charts
3-25
Chapter 3
LO3-5 3.4 Line Charts
Log Scales
• Arithmetic scale – distances on the Y-axis are proportional to the
magnitude of the variable being displayed.
• Use a log scale for the vertical axis when data vary over a wide
range, say, by more than an order of magnitude.
3-26
Chapter 3
LO3-5 3.4 Line Charts
Log Scales
A log scale is useful for time series data that might be expected to grow at a
compound annual percentage rate (e.g., GDP, the national debt, or your
future income). It reveals whether the quantity is growing at an
increasing percent (concave upward),
constant percent (straight line), or
declining percent (concave downward)
3-27
Chapter 3
LO3-6 3.5 Column and Bar Charts
LO3-6: Make an effective column chart or bar chart.
3-28
Chapter 3
LO3-6 3.5 Bar Charts
Pareto Charts
• Special type of bar chart used in quality management to display the
frequency of defects or errors of different types.
• Categories are
displayed in
descending order
of frequency.
• Focus on
significant few
(i.e., few
categories that
account for most defects or errors).
3-29
Chapter 3
LO3-6 3.5 Bar Charts
3-30
Chapter 3
LO3-7 3.6 Pie Charts
Pie Chart
• A pie chart can only convey a general idea of the data.
• Pie charts should be used to portray data which sum to a total
(e.g., percent market shares).
• A pie chart should only have a few (i.e., 2 to 5) slices.
• Each slice can be labeled with data values or percents.
3-31
Chapter 3
LO3-7 3.6 Pie Charts
Pie Chart
• A simple 2-D pie chart is best as shown in Figure 3.19.
3-32
Chapter 3
LO3-7 3.6 Pie Charts
Pie Chart
• The 3-D pie chart adds visual interest, but the sizes of the
pie slices are harder to assess.
3-33
Chapter 3
LO3-7 3.6 Pie Charts
Pie Chart
• A simple bar chart can be used to display the same data, and
would be preferred by many statisticians.
3-34
Chapter 3
LO3-8 3.7 Scatter Plots
LO3-8: Make and interpret a scatter plot.
• Scatter plots can convey patterns in data pairs that would not be
apparent from a table.
• A scatter plot is a starting point for bivariate data analysis in which we
investigate the association and relationship between two variables.
• View the next slide for an example.
3-35
Chapter 3
LO3-8 3.7 Scatter Plots
LO3-8: Make and interpret a scatter plot.
3-36
Chapter 3
LO3-8 3.7 Scatter Plots
LO3-8: Make and interpret a scatter plot.
• Figure 3.23 shows some scatter plot patterns similar to those that you
might observe when you have a sample of (X, Y) data pairs.
• A scatter plot can convey patterns in data pars that would not be
apparent from a table.
3-37
Chapter 3
LO3-8 3.7 Scatter Plots
LO3-8: Make and interpret a scatter plot.
• Other examples of scatter plots
3-38
Chapter 3
LO3-8 3.7 Scatter Plots
LO3-8: Make and interpret a scatter plot.
• Other examples of scatter plots
3-39
Chapter 3
LO3-9 3.8 Tables
• The data can be viewed by focusing on the time pattern (down the
columns) or by comparing the variables (across the rows).
3-40
Chapter 3
LO3-9 3.8 Tables
Example: School Expenditures
3-41
Chapter 3
LO3-9 3.8 Tables
1. Keep the table simple, consistent with its purpose. Put summary
tables in the main body of the written report and detailed tables in
an appendix.
2. Display the data to be compared in columns rather than rows.
3. For presentation purposes, round off to three or four significant
digits.
4. Physical table layout should guide the eye toward the
comparison you wish to emphasize.
5. Row and column headings should be simple yet descriptive.
6. Within a column, use a consistent number of decimal digits.
3-42
Chapter 3
LO3-10 3.9 Deceptive Graphs
LO3-10: Recognize deceptive graphing techniques.
Deceptive Correct
3-43
Chapter 3
LO3-10 3.9 Deceptive Graphs
3-44
Chapter 3
LO3-10 3.9 Deceptive Graphs
3-45
Chapter 3
LO3-10 3.9 Deceptive Graphs
3-46
Chapter 3
LO3-10 3.9 Deceptive Graphs
Error 8: Complex Graphs
3-47
Chapter 3
LO3-10 3.9 Deceptive Graphs
3-48
Chapter 3
LO3-10 3.9 Deceptive Graphs
3-49