0% found this document useful (0 votes)
216 views9 pages

Describing Data Graphically: Exercise 1

This document provides solutions to exercises describing data graphically. It includes solutions for constructing histograms and stem-and-leaf plots of various data sets and interpreting the results. Key points extracted include that using a smaller increment of 5 instead of 10 for a stem-and-leaf plot of exam scores revealed there were no scores between 75-79, and the median salary for 30 NBA teams was $67.5 million.

Uploaded by

rgererg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
216 views9 pages

Describing Data Graphically: Exercise 1

This document provides solutions to exercises describing data graphically. It includes solutions for constructing histograms and stem-and-leaf plots of various data sets and interpreting the results. Key points extracted include that using a smaller increment of 5 instead of 10 for a stem-and-leaf plot of exam scores revealed there were no scores between 75-79, and the median salary for 30 NBA teams was $67.5 million.

Uploaded by

rgererg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

ANSWER KEY: DESCRIBING DATA GRAPHICALLY

This answer key provides solutions to the corresponding student activity sheet.

Describing Data Graphically


The data for these exercises are in the Minitab file DescribingDataGraphically_Activity.mtw.

Exercise 1
(a) Construct a histogram of this data in Minitab.

Solution: The histogram will have bins of width 8 with the first bin starting at 76 and the last bin
ending at 140. The midpoints of the bars will be displayed in this first histogram. If x is the film
length, then the bin with midpoint 80 ranges from 76 to 84 and contains x’s such that 76 x <
84.

You should notice the following items about this graph:

1. Correct histogram will have bins with midpoints


2. Correct labeling of axes, including time units

Histogram of Film Lengths (min)

5
Frequency

0
80 88 96 1 04 112 1 20 1 28 1 36
Film Lengths (min)

(b) Change the histogram bins to cutpoints (boundary values), instead of midpoints.

WWW.MINITAB.COM/ACADEMIC
Solution: The histogram will have bins of width 8 with the first bin starting at 80 and the last bin
ending at 144. The cutpoints of the bars will be displayed in this histogram. If x is the film length,
then the bin with cutpoint 80 ranges from 80 to 88 and contains x’s such that 80  x < 88.

You should notice the following items about this graph:

1. Correct histogram will have bins with cutpoints


2. Correct labeling of axes, including time units

Histogram of Film Lengths (min)

4
Frequency

0
80 96 112 1 28 1 44
Film Lengths (min)

(c) Add the following enhancements to the histogram from part (b).

1. Add a footnote to your histogram.


2. Include a more descriptive title with the histogram.

Solution: The histogram will have bins of width 8 with the first bin starting at 80 and the last bin
ending at 144. The cutpoints of the bars will be displayed in this histogram. If x is the film length,
then the bin with cutpoint 80 ranges from 80 to 88 and contains x’s such that 80  x < 88. The
correct histogram is displayed below on the left (the histogram on the right pertains to part (d)
below).

You should notice the following items about this graph:

1. Correct histogram will have bins with cutpoints


2. Correct labeling of axes, including time units
3. More descriptive title than “Histogram of Film Lengths (min)”
4. Footnote explaining the binning in the histogram

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 2


(d) Which film appears to be an outlier with respect to film length?

Solution: Although we haven’t formally defined outliers, it appears that the film Rope has an
unusually small film length with respect to the other film lengths.

(e) Construct a stem-and-leaf plot of the “Film Lengths (min)” data in Minitab. Let Minitab
choose the increment value.

Solution: Minitab uses an increment of 5 minutes.

(f) Use your plot in part (e). Ignore the first column for now [1, 1, 1, 1, 4, 8, 10, (4), 8, 5, 3, 1], and
interpret row 6. What are the lengths of these Hitchcock films?

Solution: 105, 108, 108, and 108 minutes, respectively

(g) What is the longest film length from this sample of Hitchcock films?

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 3


Solution: 136 minutes

(h) Are there any Hitchcock films in this sample that have lengths between 85 and 100 minutes?

Solution: No

(i) Now let’s use the first column, or the “count” column, of the stem-and-leaf plot for Hitchcock
film lengths. How many of the sample Hitchcock films have lengths less than 110 minutes?

Solution: 8 films

(j) What is the mode or modes of the sample of Hitchcock film lengths?

Solution: The data is bimodal: 108 minutes and 120 minutes, each with a frequency of 3

(k) This is a personal preference question. Which graph do you prefer for gathering information
about the length of Hitchcock films—the histogram or stem-and-leaf plot? Briefly state why.

Solution: Some possible plusses and minuses of each:

Histogram plusses:

 A histogram is easy to construct once you determine the bin widths.


 A histogram is easy to read and understand if the binning method is explained clearly.
 In general, many non-statistical people have seen a histogram and get the basic idea of
what it’s representing.
 The histogram displays the basic shape, center, and spread of the data, in addition to
outliers.
 The histogram provides a quick visual “summary” of the data.

Histogram minuses:

 The histogram does not show the values of the data contained within each of the bins.
 It can be time-consuming to determine the binning structure for a histogram.
 Different binning structures give different views of the same data set, which can create
confusion when reading the graph.

Stem-and-leaf plot plusses:

 In a stem-and-leaf plot, the actual data values are part of the graphic. You can compute
the mean, median, mode, standard deviation, and range of a data set by using the data
in the plot.
 A stem-and-leaf plot is easy to construct once you determine the bin increments.

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 4


 A stem-and-leaf plot is easy to read and understand once it is explained to the person
reading the graph.
 The stem-and-leaf plot displays the basic shape, center, and spread of the data, in
addition to outliers.

Stem-and-leaf plot minuses:

 In general, many non-statistical people have never seen a stem-and-leaf plot. Detailed
instructions need to be provided before the person reading the graphic can understand
and construct one.
 It can be hard to determine the best increment for the plot’s rows.
 Different incrementing can give different views of the same data set, which can create
confusion when reading the graph.

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 5


Exercise 2
Below are stem-and-leaf plots of n = 40 Statistics Exam 1 scores. One plot uses an increment of
10 and the other uses an increment of 5, where the increment indicates the difference in value
between stems.

What is revealed about the data by the second stem-and-leaf plot (with an increment of 5) that
is not visibly apparent in the first stem-and-leaf plot (with an increment of 10)?

What does the second stem-and-leaf plot (with an increment of 5) reveal about the data that is
not visibly apparent in the first stem-and-leaf plot (with an increment of 10)?

Solution: There are NO test scores between and including 75 to 79.

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 6


Exercise 3
(a) How many teams had salaries of at least 70 million dollars?

Solution: 11 teams

(b) What is the median team salary for the 30 NBA teams?

Solution: Since there are 30 teams, then the median is the average of the 15 th and 16th data
points. Thus, the median team salary is 67.5 million dollars.

Exercise 4
(a) Construct a histogram in Minitab of your professor’s commute times.

Solution:

Histogram of Commute Times from Home to Work


25

20
Frequency

15

10

0
15 20 25 30 35
Commute Times (in minutes)
Bins contain their left endpoints.

(b) The 21st commute time, 37.4 minutes, reflects a day when your professor left home without
his laptop computer and had to turn around to retrieve it. Remove this outlier from the data set
and reconstruct the histogram.

Solution: Depending on the bin width chosen, most students will have one of the graphs below.

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 7


(c) How many days did your professor’s commute time fall between 18 minutes (inclusive) and
18.5 minutes (exclusive); i.e. 18 ≤ commute time < 18.5?

Solution: 11 days

Exercise 5
(a) Construct a histogram of this data using cutpoints.

Solution: The graph should look fairly identical to one of the histograms below. Make sure the
title is clear, time units are provided, and there is a note about the bin structure.

Histogram of News Reading Times


7

5
Frequency

0
10 20 30 40 50 60 70
News Reading Times (in minutes)
Bins contain their left endpoints

(b) What type of skewness, if any, does this data display?

Solution: Positively skewed or right skewed.

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 8


(c) Calculate 1/√𝑥 for each data value and put the new transformed data values in a new
column in Minitab.

(d) Construct a histogram of the transformed data using cutpoints.

Solution: The graph should look fairly identical to the histogram below if a bin width of 0.02 is
used. Make sure the title indicates that the histogram was constructed with transformed data.
Time units should still be provided, as well as a note about bin structure.

Histogram of Transformed Reading Times


9

6
Frequency

0
0.1 2 0.1 6 0.20 0.24 0.28
News Reading Times (mins) Transformed by 1 /SQRT(x)
Bins contain their left endpoints

(e) What is the effect of the transformation on the data?

Solution: The transformation has affected the positive skewness of the data. The transformed
data appears to be symmetric and even bell-shaped.

ANSW ER KEY: DESCRIBING DATA GRAPHICALLY 9

You might also like