Describing Data Graphically: Exercise 1
Describing Data Graphically: Exercise 1
This answer key provides solutions to the corresponding student activity sheet.
Exercise 1
(a) Construct a histogram of this data in Minitab.
Solution: The histogram will have bins of width 8 with the first bin starting at 76 and the last bin
ending at 140. The midpoints of the bars will be displayed in this first histogram. If x is the film
length, then the bin with midpoint 80 ranges from 76 to 84 and contains x’s such that 76 x <
84.
5
Frequency
0
80 88 96 1 04 112 1 20 1 28 1 36
Film Lengths (min)
(b) Change the histogram bins to cutpoints (boundary values), instead of midpoints.
WWW.MINITAB.COM/ACADEMIC
Solution: The histogram will have bins of width 8 with the first bin starting at 80 and the last bin
ending at 144. The cutpoints of the bars will be displayed in this histogram. If x is the film length,
then the bin with cutpoint 80 ranges from 80 to 88 and contains x’s such that 80 x < 88.
4
Frequency
0
80 96 112 1 28 1 44
Film Lengths (min)
(c) Add the following enhancements to the histogram from part (b).
Solution: The histogram will have bins of width 8 with the first bin starting at 80 and the last bin
ending at 144. The cutpoints of the bars will be displayed in this histogram. If x is the film length,
then the bin with cutpoint 80 ranges from 80 to 88 and contains x’s such that 80 x < 88. The
correct histogram is displayed below on the left (the histogram on the right pertains to part (d)
below).
Solution: Although we haven’t formally defined outliers, it appears that the film Rope has an
unusually small film length with respect to the other film lengths.
(e) Construct a stem-and-leaf plot of the “Film Lengths (min)” data in Minitab. Let Minitab
choose the increment value.
(f) Use your plot in part (e). Ignore the first column for now [1, 1, 1, 1, 4, 8, 10, (4), 8, 5, 3, 1], and
interpret row 6. What are the lengths of these Hitchcock films?
(g) What is the longest film length from this sample of Hitchcock films?
(h) Are there any Hitchcock films in this sample that have lengths between 85 and 100 minutes?
Solution: No
(i) Now let’s use the first column, or the “count” column, of the stem-and-leaf plot for Hitchcock
film lengths. How many of the sample Hitchcock films have lengths less than 110 minutes?
Solution: 8 films
(j) What is the mode or modes of the sample of Hitchcock film lengths?
Solution: The data is bimodal: 108 minutes and 120 minutes, each with a frequency of 3
(k) This is a personal preference question. Which graph do you prefer for gathering information
about the length of Hitchcock films—the histogram or stem-and-leaf plot? Briefly state why.
Histogram plusses:
Histogram minuses:
The histogram does not show the values of the data contained within each of the bins.
It can be time-consuming to determine the binning structure for a histogram.
Different binning structures give different views of the same data set, which can create
confusion when reading the graph.
In a stem-and-leaf plot, the actual data values are part of the graphic. You can compute
the mean, median, mode, standard deviation, and range of a data set by using the data
in the plot.
A stem-and-leaf plot is easy to construct once you determine the bin increments.
In general, many non-statistical people have never seen a stem-and-leaf plot. Detailed
instructions need to be provided before the person reading the graphic can understand
and construct one.
It can be hard to determine the best increment for the plot’s rows.
Different incrementing can give different views of the same data set, which can create
confusion when reading the graph.
What is revealed about the data by the second stem-and-leaf plot (with an increment of 5) that
is not visibly apparent in the first stem-and-leaf plot (with an increment of 10)?
What does the second stem-and-leaf plot (with an increment of 5) reveal about the data that is
not visibly apparent in the first stem-and-leaf plot (with an increment of 10)?
Solution: 11 teams
(b) What is the median team salary for the 30 NBA teams?
Solution: Since there are 30 teams, then the median is the average of the 15 th and 16th data
points. Thus, the median team salary is 67.5 million dollars.
Exercise 4
(a) Construct a histogram in Minitab of your professor’s commute times.
Solution:
20
Frequency
15
10
0
15 20 25 30 35
Commute Times (in minutes)
Bins contain their left endpoints.
(b) The 21st commute time, 37.4 minutes, reflects a day when your professor left home without
his laptop computer and had to turn around to retrieve it. Remove this outlier from the data set
and reconstruct the histogram.
Solution: Depending on the bin width chosen, most students will have one of the graphs below.
Solution: 11 days
Exercise 5
(a) Construct a histogram of this data using cutpoints.
Solution: The graph should look fairly identical to one of the histograms below. Make sure the
title is clear, time units are provided, and there is a note about the bin structure.
5
Frequency
0
10 20 30 40 50 60 70
News Reading Times (in minutes)
Bins contain their left endpoints
Solution: The graph should look fairly identical to the histogram below if a bin width of 0.02 is
used. Make sure the title indicates that the histogram was constructed with transformed data.
Time units should still be provided, as well as a note about bin structure.
6
Frequency
0
0.1 2 0.1 6 0.20 0.24 0.28
News Reading Times (mins) Transformed by 1 /SQRT(x)
Bins contain their left endpoints
Solution: The transformation has affected the positive skewness of the data. The transformed
data appears to be symmetric and even bell-shaped.