Sampling Activity
Sampling Activity
This activity sheet includes exercises to assess students’ understanding of important concepts
presented in the Sampling lesson.
Sampling
A data set is not provided for these exercises.
Exercise 1
Every freshman at Star Wars Academy is issued the same laptop when they start school. Since
laptops are continually evolving, the following year’s freshmen are all issued the same, but
newer, laptops than the previous year’s class. Each class from freshman to senior year has the
same computer, but the computers vary across class year.
Students are allowed to keep their laptops after they graduate. Unfortunately, some of the
laptop batteries fail before graduation. Consider a study to estimate the average lifetime of
laptop batteries for Star Wars students. One hundred laptops are randomly sampled and the
battery lifetime for each is recorded.
C. The average lifetime of the laptop batteries belonging to the sample of 100 students
D. The average lifetime of all laptop batteries belonging to Star Wars Academy students
(c) True or False. Using a simple random sample is the most appropriate sampling method for
obtaining 100 laptop batteries.
True False
Exercise 2
A simple random sample of 500 college students is selected from all students registered at ABC
College, and of these 500 students, 100 of them participate in the annual on-campus Spelling
Bee.
(a) True or False. The proportion of students at ABC College who participate in the Spelling Bee
is 20%.
True False
(b) True or False. The proportion of students at ABC College who participate in the Spelling Bee
is likely to be close to 20%, but not equal to 20%.
True False
Exercise 3
In 2012, CBS News reported that “worrying about a math test can quite literally hurt.” Before we
discuss the validity of this claim, read the short article found on the following website:
http://www.cbsnews.com/news/fear-of-math-can-cause-real-pain/
The article begins “Worrying about a math test can quite literally hurt.” This claim alone implies
that anyone who worries about a math test could feel pain. Is this claim justified by the results of
the article? Explain your answer. Hint: Consider the population to which the results generalize;
that is, of what population is this sample representative?
Exercise 4
Lorem Ipsum is dummy text that is used in printing. It has been an industry standard since the
1500’s. For more information, see lipsum.com. A set of 200 words – 20 rows, each containing 10
words – from this filler text is printed below. We are interested in estimating the average
number of letters per word, so let’s take 3 simple random samples (SRS), each of size 5.
Minitab 17
Minitab Express
Each number in columns C1-C3 represents a word in the passage below. Look up that word and
record the corresponding number of letters in columns C4-C6.
Note: The numbers at the beginning of the line tell you which word begins that line. For
example, 51 tells you that et is word number 51 and libero is word number 53.
1 Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut aliquam
11 nunc vitae neque aliquam dictum. Suspendisse et odio felis. Vivamus
21 posuere imperdiet pretium. Donec vel lacus luctus massa blandit gravida.
31 Nunc in nisi non elit cursus blandit. Nam vulputate elit
41 ut tortor porttitor rhoncus. Morbi posuere mauris eget lectus lobortis
51 et suscipit libero egestas. Fusce est felis, lobortis vitae vestibulum
61 at, vestibulum et dolor. Integer iaculis neque et est semper
71 eu sollicitudin turpis posuere. Cum sociis natoque penatibus et magnis
81 dis parturient montes, nascetur ridiculus mus. Donec bibendum vestibulum tellus,
91 vitae dignissim lorem vehicula non. Vestibulum vitae elit a velit
(a) Compute the average word length for your three random samples.
Minitab 17
(b) Suppose we were to repeat this exercise by drawing samples of size 25 (you do not have to
do this). Which averages would you trust more: those from part (a) or these new ones? Explain
your answer.
Exercise 5
Quality engineers at an industrial fastener company, Tighten Up, are responsible for assessing
product quality. One product of interest is a certain type of bolt produced in the tens of
thousands and sold to auto manufacturers and their suppliers across the country. To assess the
quality of the current shipment of 10,000 bolts being sent to My Motors, inspectors obtain 150
bolts from the shipment and subject them to various quality tests and measurements to
determine thread pitch, which is the distance from one thread groove to the next, measured
from crest to crest.
Exercise 6
For each of the following, select the best description of the sampling strategy employed in the
given situation. If you are uncertain of your answer, provide an explanation for your best guess.
(a) Body Mass Index (BMI) is a useful measure of overall fitness in adults. In an effort to assess
the overall fitness level of residents in a certain community, a researcher selects a sample of the
first 20 citizens entering the local gym wearing white socks on a given day.
(b) There are 11,200 students enrolled at Cavalier College. The registrar draws a sample of 100
students to question about the online course registration process. He obtains an alphabetical
listing of all 11,200 students, numbered from 1 to 11,200, and uses a random number generator
to generate 100 random integers between 1 and 11,200. He then surveys the 100 students
corresponding to those numbers.
(c) A pollster is interested in gauging how a given state’s Governor would perform in his home
state if he ran for President. The pollster randomly selects a sample of voters from a list of all
eligible voting citizens in that state. Because nearly 75% of the state’s voters claim a party
affiliation (e.g. Democrat, Republican), the pollster decides to include approximately 25 non-
affiliated voters for every 75 affiliated voters in his sample.
(d) A professor wants to know how his students performed on an exam from a total population
of 100 students. Since he’s eager to calculate sample statistics for the scores, he uses the first 20
exams that are turned in and begins his computations.
(e) Citrus trees are usually grown in orderly arrangements of rows to facilitate automated
farming and harvesting practices. Suppose 1000 trees are grown down the sides of a small hill
and are laid out uniformly about the hill in 40 rows of 25 trees each. To test the crop weight of
fruit from these trees, a sample of 32 trees will be selected. Because growing conditions are
different on the four sides of the hill, the hill should be divided into four quadrants, and 8 trees
should be selected from each quadrant.
(h) A Starbucks coffee shop wants to identify the most popular beverages. On a Saturday
morning, the manager samples the first 20 customers who order drinks.
(i) On the 35th anniversary of the (alleged) death of Elvis Presley, a record company sponsored a
national call-in survey. Listeners at thousands of radio stations were asked to call a 1-900
number (at a charge of $2.50) to voice an opinion as to whether or not Elvis is really dead. It
turned out that 56% of the callers felt that Elvis was still alive.
(j) A professor at Cape Fear College is interested in studying the effects of math anxiety on
student math exam scores. The professor is currently teaching an advanced calculus course to
college freshmen and a few sophomores. She decides to use her class as the study sample and
passes out surveys during class.
Exercise 7
A researcher is studying the age of U.S. freshmen. She randomly samples 250 students from all
U.S. colleges. The average age of the college students in her sample is an example of a:
Exercise 8
In order to capture the opinions of an entire college campus as to whether or not Steve Jobs
should have been TIME magazine’s “person of the year,” an electronic survey is sent to the entire
campus.
(a) Give one example of why the population the “entire college campus” is ambiguous. That is,
what is at least one questionable aspect of this definition of the population?
(b) What type of sampling method may be creating bias in this case?
Exercise 9
Email spam filters are based on statistical analysis. Consider a simple spam filter that obtains a
sample of size n words from an email. It then compares the sample to a list of questionable
words. If more than 75% of the sample appears in the list, the email is determined to be spam.
Below are two sampling implementations.
(a) Sampling Method 1: Put all the words in a “bin” and randomly select n words. Or, number
all of the words in the email and then use a random number generator to select words. Every
word or every group of words of size n has an equally likely chance of being selected.
True or False. The above sampling method is an example of simple random sampling.
True or False
(b) Sampling Method 2: Separate the words in the email into two “bins”, or strata, based on
word length. Consider small words to consist of 3 or fewer letters and big words to consist of 4
or more letters. Pick a simple random sample from each bin corresponding to the proportion of
small and big words. For example, if 40% of the words in the email are small, then randomly
choose 0.4*n of the small words and 0.6*n of the big words.
True or False
Exercise 10
Ann Landers, who wrote a popular daily newspaper advice column, once asked her readers “If
you had it to do over again, would you have children?” Of the more than 10,000 readers who
responded, 70% said no. What does this show?
C. The survey would have been more meaningful if she had picked a random sample of the
10,000 readers who responded.
D. The survey would have been more meaningful if she had used a control group.
E. This was a legitimate sample, randomly drawn from her readers and of sufficient size to allow
the conclusion that most of her readers who are parents would have second thoughts about
having children.
Exercise 11
Every NBA basketball team has 12 players. A sample of players is to be chosen as follows. Each
team will be asked to place all 12 players’ names into a hat and randomly draw 2 names. The 2
names from each team will be combined to make up the sample. Will this method result in a
simple random sample of basketball players?
A. Yes, because each player has the same chance of being selected.
B. Yes, because each team is equally represented.
C. Yes, because this is an example of stratified sampling, which is a special case of simple
random sampling.
Exercise 12
To survey the opinions of baseball bleacher fans at Wrigley Field, a surveyor plans to select every
one-hundredth fan who enters the bleachers one afternoon. Will this result in a simple random
sample of fans who sit in the stadium’s bleachers?
A. Yes, because each bleacher fan has the same chance of being selected.
C. Yes, because the 99 out of 100 bleacher fans who are not selected will form a control group.
D. Yes, because this is an example of systematic sampling, which is a special case of simple
random sampling.
E. No, because not every sample of the intended size has an equal chance of being selected.
Exercise 13
Read the following sampling design scenarios. What fault do they all have in common?
I. The Wall Street Journal plans to make a prediction for a presidential election based on a
survey of its readers.
II. A radio talk show asks people to phone in their views on whether the United States should
pay off its huge debt to the United Nations.
III. A police detective, interested in determining the extent of drug use by teenagers, randomly
selects a sample of high school students and interviews each one about any illegal drug use by
the student during the past year.
A. I only B. I and II C. I and III D. I, II, and III E. None of the above
Exercise 15
A researcher planning a survey of heads of households in a particular state has census lists for
each of the 23 counties in that state. He plans to obtain a random sample from each of the
counties rather than grouping all census lists together and obtaining a sample from the entire
group. Which of the following statements about the resulting stratified sample are true?
II. It will help to avoid samples that do not represent the population.
III. It provides comparative information that a simple random sample would not provide.
A. I only B. I and II C. I and III D. I, II, and III E. None of the above
Exercise 16
To conduct a survey of long-distance calling patterns, a researcher opens a telephone book to a
random page, closes his eyes, puts his finger down on the page, and then reads off the next 50
names. Which of the following statements about this scenario are true?
I. The survey design incorporates chance.
A. I only B. I and II C. I and III D. I, II, and III E. None of the above