Population Mean Hypothesis Testing For Large Samples: Exercise 1
Population Mean Hypothesis Testing For Large Samples: Exercise 1
Population Mean Hypothesis Testing For Large Samples: Exercise 1
SAMPLES
This activity sheet includes exercises to assess students’ understanding of important concepts
presented in the Population Mean Hypothesis Testing for Large Samples lesson.
Exercise 1
For the following multiple choice problems, choose the best answer.
(a) The test statistic and p-value corresponding to the hypothesis test H0: μ = 2 versus H1: μ > 2
are 𝑥̅ = 2.076 and p = 0.065, respectively. Which of the following is an appropriate interpretation
of this p-value?
(b) A pudding manufacturer packages its product into bags weighing 1 kilogram, on average.
The manufacturer’s statistician has discovered that the setting of the machine is causing the fill
weights to drift. The statistician needs to detect shifts in the mean weight as quickly as possible
and reset the machine when appropriate. In order to detect shifts in the mean weight, he
collects a random sample of 50 bags periodically, weighs them, and calculates the mean and
standard deviation. The data from this afternoon’s sample yields a sample mean of 1.03 kg and a
sample standard deviation of 0.08 kg. Determine the p-value for the hypothesis test H0: μ = 1
versus H1: μ 1.
WWW.MINITAB.COM/ACADEMIC
A. 0.008
B. 0.004
C. 0.011
D. 0.005
E. 0.704
F. 0.996
G. We cannot determine the p-value because the data does not come from a normal
distribution.
(c) The pain reliever currently used at General Hospital brings relief to patients in a mean time of
3.5 minutes. To compare a new pain reliever with the current one, the new pain reliever is
administered to a random sample of n = 50 patients. The mean time to feel relief for this sample
of patients is 2.8 minutes with a standard deviation of 1.14 minutes.
What are the appropriate null and alternative hypotheses to determine if the mean time for
patients to feel relief from the new pain reliever is less than the time required
for the hospital’s current pain reliever?
A. Yes
B. No
C. There is not enough information to answer this question.
(e) We are conducting the following hypothesis test: H0: μ = 9.5 versus H1: μ > 9.5. The Minitab
output for this 1-sample Z-test is provided below.
One-Sample Z
Descriptive Statistics
N Mean SE Mean
50 9.728 0.122
μ: mean of Sample
Known standard deviation = 0.86
Test
Null hypothesis H₀: μ = 9.5
Alternative hypothesis H₁: μ > 9.5
The approximate p-value for this hypothesis test is:
A. 0.015
B. 0.031
C. 0.058
D. 0.267
E. 0.395
F. 0.971
G. We cannot determine the p-value because we don’t know if the data is from a normal
distribution.
(f) A certain brand of orange juice is advertised to contain 85% fruit juice per bottle. A random
sample of 32 bottles of this juice is selected in order to perform the hypothesis test:
One-Sample Z
Descriptive Statistics
N Mean SE Mean
32 0.8228 0.0335
μ: mean of Sample
Known standard deviation = 0.189505
Test
Z-Value P-Value
-0.81 0.417
(g) A quality control specialist takes several measurements to test H 0: μ = 2 versus H1: μ 2. She
computes a p-value of 0.01 for the hypothesis test. Which interpretation is correct?
(h) Which of the following statements is not true regarding hypothesis testing?
A. The alternative hypothesis is the assertion that is contradictory to the null hypothesis.
B. The null hypothesis is the claim that is assumed to be true, or the “status quo” hypothesis.
(i) In performing the hypothesis test H0: = 8 versus H1: 8, the resulting p-value is 0.016.
Thus, if we construct a 95% confidence interval for using the exact same data, = 8 will not be
included in the 95% confidence interval.
A. True
B. False
C. We do not have enough information to determine this.
(j) You perform the hypothesis test H0: = 180 versus H1: > 180 for the average number of
minutes per day that students at your school watch TV. Which of the following statements is
correct regarding the p-value?
A. Assuming the null hypothesis is true, an extremely small p-value indicates that the sample
mean calculated from the sample data is extremely different from null mean = 180.
B. The p-value measures the probability that the alternative hypothesis is true.
C. The p-value measures the probability that the null hypothesis is true.
D. The larger the p-value, the stronger the evidence against the null hypothesis.
E. A large p-value indicates that the data supports the alternative hypothesis.
(k) Under normal environmental conditions, adult catfish in Dog Lake have an average length of
µ = 13.9 cm with a standard deviation σ = 2.1 cm. Students who frequently fish at Dog Lake claim
that the catfish are smaller than usual this year. Suppose your statistics class takes a random
sample of adult catfish from Dog Lake. Which of the following provides the strongest evidence
to support the claim that students are catching smaller than average length (13.9 cm) catfish this
year?
A. True
B. False
C. We do not have enough information to determine this.
(m) The Central Limit Theorem, the key to performing a hypothesis test for the population mean
μ with a Z test, states that:
A. We can always use a normal curve to approximate the distribution of the sample mean 𝑋.
B. If n is large (e.g. n > 30) and the original population distribution is normal, then the distribution
of the sample mean 𝑋 can be approximated by a normal curve.
C. We can always use a normal curve to approximate the distribution of a random variable X.
D. If n is large (e.g. n > 30) then the distribution of the sample mean 𝑋 can be approximated
closely by a normal curve even if the original distribution is not normal.
E. If n is large (e.g. n > 30), then the distribution of the random variable X can be approximated
closely by a normal curve even if the original distribution is not normal.
Exercise 2
Putting Puzzles Together
If students are helping to conduct this exercise in class, at least 30 puzzle times are needed to be
able to apply the concepts, namely a 1-sample Z hypothesis test for the population mean.
Instructions for creating the data set are below. If you do not have time or puzzles to collect
class data or do not have at least 30 students to perform this activity, then you can use the
‘Puzzle Times (secs)’ data found in HypTestForMean_LargeSample_Activity.mtw. For this data,
the population mean is μ0 = 90 with sample size n = 43.
At the beginning of class, hand out the puzzles to the students. You can have groups of students
share one puzzle and take turns. Tell them to record (in seconds) the amount of time it takes to
complete the puzzle. Other group members should not watch each other to avoid biasing the
results. Times can be recorded using a cell phone. Alternatively, have students bring their
laptops to class and send them the link to a given online puzzle, which typically measures the
time for you.
As students finish the puzzle, ask them to go to the board to record their puzzle construction
times. Once everyone has recorded his or her data, the instructor can enter the data in Minitab.
The instructor can then email the worksheet data to students to ensure that everyone is working
with the same data set.
Activity “Story”: At a circuit board company, employees are required to assemble circuit
boards (puzzles) correctly and completely in a desired amount of time. If they are too slow or
inaccurate, then the entire assembly line is affected.
Based on former employee records, the company states that these circuit boards can be
assembled in 0 = 90 seconds. [Instructors may need to provide a different μ0 based on their
students’ puzzle times.]
You are the manager of these employees. You suspect that the true mean time is not accurate.
Specifically, you think the true mean time to assemble these boards is more than 90 seconds,
and employees are being unfairly assessed. In order to test the hypothesis:
you randomly select n workers to assemble these boards (puzzles). Their average time to
assemble the boards is 𝑥̅ seconds with standard deviation s seconds. [Instructors will need to
provide values for n, 𝑥̅ , and s based on the class’s data. The data provided in the Minitab
worksheet has n = 43, 𝑥̅ = 95.65 seconds, and s = 28.09 seconds.]
(a) Can we assume that the distribution of the mean time 𝑋 for n = 43 employees is normally
distributed? Why or why not? Be specific.
(b) Determine the standardized test statistic z and its p-value by hand.
(c) Determine the standardized test statistic z and its p-value in Minitab.
Exercise 3
Cutting Paper Strips
Reference: “The Blind Paper Cutter: Teaching about Variation, Bias, Stability, and Process Control,”
by Richard A. Stone (August 1998). The American Statistician, 52(3), pp. 244-247.
If you do not have the time or materials required to collect class data, then you can use the ‘Cut
Lengths (cm)’ data found in HypTestForMean_LargeSample_Activity.mtw. For this data, the
mean is μ0 = 14 cm (half the length of the 28 cm paper strips). There are only 25 lengths in the
dataset, but the data in that column is normally distributed.
Activity Instructions: To obtain data for this exercise, groups of 3-4 students will cut 30 strips of
paper of equal length. For example, you can use 8.5 x 11 inch scrap paper to create the strips. It
doesn’t matter how long the strips are as long as they are all the same length. The mean μ0 for
the questions below is equal to ½ of the length.
It is recommended that the instructor or students create the strips of paper prior to class. During
class, each group will need a ruler, as well as a pair of scissors to cut the paper. Each group will
need to select one person to be the cutter, and then do the following:
Before cutting the strips, measure the length. Every strip should be the same length. Then
divide the length by 2 to calculate the value of μ0. For example, if the strips are 28cm, then
μ0 = 14 cm.
Enter the minimum strip lengths into Minitab and email the data to your instructor.
Perform the following three hypothesis tests in Minitab, estimating by calculating the standard
deviation of the data. What conclusions can you draw from the p-values reported for each
hypothesis test?
Exercise 4
A drug manufacturer claims a given type of medicine contains 2.5 milligrams of a certain active
ingredient per capsule. An independent laboratory takes a random sample of 20 of these capsules
and measures the amount of the active ingredient in each.
The laboratory has been hired to determine if the true mean amount of the active ingredient is
actually less than 2.5.
(a) Using the proper statistical notation, write down the null and alternative hypotheses.
A. The true mean amount of the ingredient in all capsules of this type.
B. The true mean amount of the ingredient in the 20 capsules of the laboratory’s sample.
C. The difference between the true mean amount of the ingredient in the population of all
capsules and the sample mean amount of the ingredient in the 20 capsules in our sample.
D. The true proportion of all capsules of this type that contain less than 2.5 milligrams of the
ingredient.
(c) Using Minitab, determine the mean and standard deviation of the sample data provided. The
data is in the Minitab column “Active Ingredient.”
(d) Assume that the amount of the ingredient in capsules is normally distributed. Also, assume
that the laboratory is told that the population standard deviation is = 0.8 mg. By hand, calculate
the z statistic and p-value for the hypothesis test in part (a) based on the sample data provided.
(e) Verify the z statistic and p-value using Minitab. From the 1-Sample Z dialog box, click Options
to select the appropriate alternative hypothesis.
Exercise 5
A waiter (with a statistics background) who works at the “World of Sushi” restaurant is asked to
estimate the true mean time to clean and set tables on a busy Saturday night. This knowledge will
help the restaurant owners determine how much time they should schedule between reservations.
Suppose the waiter randomly samples times for n = 40 tables on busy Saturday nights and obtains
a sample mean of 𝑥̅ = 4.2 minutes with a sample standard deviation of s = 0.8 minutes. The sample
data is used to construct the following 95% confidence interval for the true mean clean-up
time µ:
[3.952, 4.448]
Based on the 95% confidence interval, can the null hypothesis for the following hypothesis test
be rejected at an = 0.05 level of significance? Explain why or why not.
Exercise 7
According to a Google search, the average height of male soccer players in
the U.S. is normally distributed with mean 1.79 m with a standard deviation
of 0.04 m.
(a) For a randomly selected soccer team of 11 players, what is the probability that the average
height of the players is less than 1.77 m? Calculate the probability by hand or in Minitab.
(b) In view of the small sample size, must you make any additional assumptions to justify the
answer to part (a)? Please provide a short explanation.
(c) Set up the hypothesis test to test the true average height of male U.S. soccer players.
(d) Determine the z statistic and p-value for the hypothesis test in part (c).
Exercise 8
According to a study, the U.S. mean family income is $63,091 with a standard deviation of
$21,000.
(a) If a consulting agency surveys 49 families at random, what is the probability that it finds a
mean family income that is more than $71,500? Calculate the probability by hand or in Minitab.
(c) Suppose the median income in the U.S. was $55,000. Why do many websites use the median
as their indicator of income level instead of the mean?
Exercise 9
Your neighbor grows and sells cucumbers in the summer. She packages them in plastic storage
bags and claims that the true mean weight of one of these bags is 1 pound. To test her claim,
you take a random sample of 64 of
these cucumber- filled bags, weigh
them, and record their weights in the
Minitab column “Bag Weights (lbs).”