SLNotes 09
H0: x 3003 = x
Ha: x 3003 x
}
H0 Ha Hypothesis test ()
Alternative hypothesis:
Two-tailed test -- Ha: μ ≠ μ0,
Left-tailed test -- Ha: μ < μ0,
Right-tailed test -- Ha: μ > μ0.
Examples P.385 ~ P.387 Ex9.1~3, for the following examples:
a. Determine the null hypothesis for the hypothesis test.
b. Determine the alternative hypothesis for the hypothesis test.
c. Classify the hypothesis test as two-tailed, left-tailed, or right-tailed.
1. Quality Assurance. A snack-food company produces 454g bag of pretzels. Although the
actual net weights deviate slightly from 454g and vary from one bag to another, the
company insists that the mean net weight of the bags be kept at 454g. Indeed, if the mean
net weight is less than 454g, the company will be short-changing its customers; and if the
mean net weight exceeds 454g, the company will be unnecessarily overfilling the bags.
As part of its program, the quality assurance department periodically performs a
hypothesis test to decide whether the packaging machine is working properly, that is, to
decide whether the mean net weight of all bags packaged is 454g.
Ans: Let μ denote the mean net weight of all bags packaged.
(a) The packaging machine is working properly, or symbolically, H0: μ = 454 g.
(b) The packaging machine is not working properly, or symbolically, Ha: μ ≠ 454 g.
(c) Since the ≠ sign appears in the alternative hypothesis, the test is two-tailed.
2. Prices of History Books. The R.R. Bowker Company of New York collects information
on the retail prices of books and publishes the data in Publishers Weekly. In 1997, the
mean retail price of history books was $43.50. Suppose that we want to perform a
hypothesis test to decide whether this year's mean retail price of history books has
increased from the 1997 mean.
Ans: Let μ denote this year's mean retail price of history books.
(a) This year's mean retail price of history books equals the 1997 mean of $43.50, i.e.
H0: μ = $43.50.
(b) This year's mean retail price of history books is greater than the 1997 mean of
$43.50, that is, Ha: μ > $43.50.
(c) Since the > sign is for the alternative hypothesis, the test is right-tailed.
3. Poverty and Calcium. Calcium is the most abundant mineral in the body and also one of
the most important. It works with phosphorus to build and maintain bones and teeth.
According to the Food and Nutrition Board of the National Academy of Sciences, the
recommended daily allowance (RDA) of calcium for adults is 800 milligrams (mg).
Suppose that we want to perform a hypothesis test to decide whether the average person
with an income below the poverty level gets less than the RDA of 800 mg.
Ans: Let μ denote the mean calcium intake (per day) of all people with incomes below the
poverty level.
(a) The mean calcium intake of all people with incomes below the poverty level equals
800 mg per day, i.e. H0: μ = 800 mg.
(b) The mean calcium intake of all people with incomes below the poverty level is less
than the RDA of 800 mg per day; i.e. Ha: μ < 800 mg.
(c) Since the < sign is for the alternative hypothesis, the test is left-tailed.
The Logic of Hypothesis Testing:
Basic logic of Hypothesis Testing:
Take a random sample from the population. If the sample data are consistent with the null
hypothesis, do not reject the null hypothesis; if the sample data are inconsistent with the
null hypothesis (in the direction of the alternative hypothesis), reject the null hypothesis
and conclude that the alternative hypothesis is true.
Example P.388 Ex9.4, Quality Assurance, A company that produces snack-food uses a
machine to package 454g bags of pretzels. We assume that the net weights are normally
distributed and that the population standard deviation of all such weights is σ = 7.8 g. A
random sample of 25 bags of pretzels has the net weights, in grams, displayed in the
table shown.
465  449  468  446  447
456  442  433  447  456
438  449  454  456  456
454  446  463  452  435
447  447  450  444  450
Mean (x̄) = 450
Do the data provide sufficient evidence to conclude that the packaging machine is not
working properly? We use the following steps in order to answer the question.
a. State the null and alternative hypotheses for the hypothesis test.
b. Discuss the logic behind carrying out the hypothesis test.
c. Identify the distribution of the variable x̄, that is, the sampling distribution of the
sample mean for samples of size 25.
d. Obtain a precise criterion for deciding whether to reject the null hypothesis in favor of
the alternative hypothesis.
e. Apply the criterion in part (d) to the sample data and state the conclusion.
Ans: Let μ denote the mean net weight of all bags packaged.
(a) The null and alternative hypotheses for the hypothesis test are
H0: μ = 454 g (the packaging machine is working properly)
Ha: μ ≠ 454 g (the packaging machine is not working properly)
(b) If the null hypothesis is true, that is, if μ = 454 g, the mean weight, x̄, of
the sample of 25 bags of pretzels should approximately equal 454 g. We say
"approximately equal" because we cannot expect a sample mean to equal exactly the
population mean; some sampling error is to be anticipated. However, if the sample
mean weight differs too much from 454 g, we would be inclined to reject the null
hypothesis and conclude that the alternative hypothesis is true. As we shall show in
part (d), we can use our knowledge of the sampling distribution of the sample mean
to decide how much difference is too much.
(c) Because the net weights are normally distributed, x̄ is normally distributed.
With n = 25 and σ = 7.8 g,
$$\mu_{\bar{x}} = \mu \ \text{(which we don't know)}, \qquad \sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} = \frac{7.8}{\sqrt{25}} = 1.56.$$
(d) The 68.26-95.44-99.74 rule states that, for a normally distributed variable, 95.44%
of all possible observations lie within two standard deviations to either side of the
mean. Applying this part of the rule to the variable x̄ and referring to part (c), we see
that 95.44% of all samples of 25 bags of pretzels have mean weights within 2(1.56)
= 3.12 g of μ. Or, equivalently, only 4.56% of all samples of 25 bags of pretzels
have mean weights that are not within 3.12 g of μ.
If the mean weight, x̄, of the 25 bags of pretzels sampled is more than two standard
deviations from 454 g, reject the null hypothesis, μ = 454 g, and conclude that the
alternative hypothesis, μ ≠ 454 g, is true. Otherwise, do not reject the null hypothesis.
(e) The mean weight, x̄, of the sample of 25 bags of pretzels is 450 g. Therefore,
$$z = \frac{\bar{x} - 454}{1.56} = \frac{450 - 454}{1.56} = -2.56.$$
Because the mean weight of the 25 bags of pretzels sampled is more than two
standard deviations from 454 g, we reject the null hypothesis, μ = 454 g, and
conclude that the alternative hypothesis, μ ≠ 454 g, is true.
The data provide sufficient evidence to conclude that the packaging machine is not
working properly.
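The arithmetic in parts (c) through (e) can be checked with a short script. This is only a sketch: the variable names are illustrative, and it assumes σ = 7.8 g, the value that appears in the computation of the standard deviation of x̄ in part (c).

```python
import math
from statistics import mean

# Net weights (g) of the 25 sampled bags, as listed in the example.
weights = [
    465, 449, 468, 446, 447,
    456, 442, 433, 447, 456,
    438, 449, 454, 456, 456,
    454, 446, 463, 452, 435,
    447, 447, 450, 444, 450,
]

mu0 = 454     # hypothesized mean net weight under H0 (g)
sigma = 7.8   # assumed population standard deviation (g), as used in part (c)

n = len(weights)
x_bar = mean(weights)              # sample mean
se = sigma / math.sqrt(n)          # standard deviation of the sample mean
z = (x_bar - mu0) / se             # number of standard deviations from 454 g

# Criterion from part (d): reject H0 if x_bar is more than 2 SDs from mu0.
reject_h0 = abs(z) > 2

print(f"n = {n}, x_bar = {x_bar}, se = {se:.2f}, z = {z:.2f}, reject H0: {reject_h0}")
```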
[Figure: rejection regions and nonrejection region for two-tailed tests. "Reject H0" in each tail; "Do not reject H0" in the central nonrejection region.]
The alternative hypotheses:
                    Two-tailed test    Left-tailed test    Right-tailed test
Sign in Ha          ≠                  <                   >
Rejection region    Both sides         Left side           Right side
                    H0 is True          H0 is False
Do not reject H0    Correct decision    Type II error
Reject H0           Type I error        Correct decision
Example P.394 Ex 9.5, Quality Assurance, consider once again the pretzel packaging
hypothesis test. The null and alternative hypotheses are
H0: μ = 454 g (the packaging machine is working properly)
Ha: μ ≠ 454 g (the packaging machine is not working properly),
where μ is the mean net weight of all bags of pretzels packaged. Explain what each of
the following terms would mean.
a. Type I error
b. Type II error
c. Correct decision
Recall that the results of sampling 25 bags of pretzels led to rejection of the null
hypothesis, μ = 454 g, that is, to the conclusion that μ ≠ 454 g. Classify that conclusion
by error type or as a correct decision if
d. the mean net weight, μ, is in fact 454 g.
e. the mean net weight, μ, is in fact not 454 g.
Ans: (a) In fact, μ = 454 g, but the results of the sampling lead to the conclusion that μ ≠ 454 g.
In fact, the packaging machine is working properly, but we conclude that it is not.
(b) In fact, μ ≠ 454 g, but the results of the sampling do not lead to rejection of μ = 454 g.
In fact, the packaging machine is not working properly, but we fail to conclude that.
(c) A correct decision can occur in either of 2 ways:
1. When in fact μ = 454 g, the results of the sampling do not lead to rejection of μ = 454 g. The
packaging machine is working properly, and we do not conclude otherwise.
2. When in fact μ ≠ 454 g, the results of the sampling lead to the conclusion that μ ≠ 454 g. The
packaging machine is not working properly, and we conclude that it is not.
(d) Type I error. In fact μ = 454 g, but we have rejected that hypothesis.
(e) A correct decision. In fact μ ≠ 454 g, and that is what we have concluded.
Probabilities of Type I and Type II errors -- The probabilities of making Type I and Type II
errors are denoted α and β, respectively.
Significance level (α) -- The probability of making a Type I error,
that is, of rejecting a true null hypothesis, is called the significance level, α, of a
hypothesis test.
Relation between Type I and Type II Error Probabilities -- For a fixed sample size, the
smaller we specify the significance level, α, the larger will be the probability, β, of not
rejecting a false null hypothesis.
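The trade-off between α and β can be illustrated numerically. The sketch below reuses the pretzel setting (μ0 = 454 g, n = 25) and assumes σ = 7.8 g. It also assumes, purely for illustration, that the true mean is 450 g; that value and the function name `type_ii_error` are hypothetical and not taken from the notes.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution

def type_ii_error(alpha, true_mean, mu0=454.0, sigma=7.8, n=25):
    """Return beta = P(do not reject H0) for a two-tailed z-test
    when the population mean is actually `true_mean` (an assumed value)."""
    se = sigma / math.sqrt(n)
    z_crit = STD_NORMAL.inv_cdf(1 - alpha / 2)   # critical values are -z_crit, +z_crit
    shift = (true_mean - mu0) / se               # mean of the test statistic when mu = true_mean
    # The test statistic is Normal(shift, 1); beta is the area between the critical values.
    return STD_NORMAL.cdf(z_crit - shift) - STD_NORMAL.cdf(-z_crit - shift)

betas = {alpha: type_ii_error(alpha, true_mean=450.0) for alpha in (0.10, 0.05, 0.01)}
for alpha, beta in betas.items():
    print(f"alpha = {alpha:.2f} -> beta = {beta:.4f}")
```

With the sample size fixed, the printed β values grow as α shrinks, which is the relation stated above.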
1
Region of
(Value of =
c. right-tailed.
Ans:
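[Figure: rejection regions ("Reject H0"), nonrejection regions ("Do not reject H0"), and critical values for two-tailed (left diagram), left-tailed (middle diagram), and right-tailed (right diagram) tests]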
(a) The left diagram (two-tailed): for α = 0.05, $z_{\alpha/2} = z_{0.025} = 1.96$, and the critical values are ±1.96.
(b) The middle diagram (left-tailed): for α = 0.05, $-z_{\alpha} = -z_{0.05} = -1.645$, and the critical value is
-1.645.
(c) The right diagram (right-tailed): for α = 0.05, $z_{\alpha} = z_{0.05} = 1.645$, and the critical value is 1.645.
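As a cross-check on the table lookups, the same critical values can be obtained from the inverse of the standard normal cumulative distribution function. This sketch uses Python's standard library in place of Table II; the variable names are illustrative.

```python
from statistics import NormalDist

alpha = 0.05
std_normal = NormalDist()  # mean 0, standard deviation 1

# Two-tailed: area alpha/2 in each tail, critical values are -z and +z.
z_two_tailed = std_normal.inv_cdf(1 - alpha / 2)
# Left-tailed: area alpha in the left tail.
z_left_tailed = std_normal.inv_cdf(alpha)
# Right-tailed: area alpha in the right tail.
z_right_tailed = std_normal.inv_cdf(1 - alpha)

print(f"two-tailed:   +/-{z_two_tailed:.3f}")
print(f"left-tailed:  {z_left_tailed:.3f}")
print(f"right-tailed: {z_right_tailed:.3f}")
```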
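The most commonly used significance levels are α = 0.10, 0.05, and 0.01; together with their two-tailed halves they give five distinct tail areas (0.10, 0.05, 0.025, 0.01, and 0.005). (Why not six?)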
The One-Sample z-Test for a Population Mean (Critical-Value Approach):
Assumptions
1. Normal population or large sample.
2. σ is known.
Step 0. Define what μ is.
Step 1. The null hypothesis is H0: μ = μ0, and the alternative hypothesis is
Ha: μ ≠ μ0 (Two-tailed) or Ha: μ < μ0 (Left-tailed) or Ha: μ > μ0 (Right-tailed).
Step 2. Decide on the significance level α.
Step 3. Compute the value of the test statistic
$$z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}.$$
Step 4. The critical value(s) are
$\pm z_{\alpha/2}$ (Two-tailed) or $-z_{\alpha}$ (Left-tailed) or $z_{\alpha}$ (Right-tailed).
Use Table II to find the critical value(s).
Step 5. If the value of the test statistic falls in the rejection region, reject H0; otherwise, do
not reject H0.
Step 6. Interpret the results of the hypothesis test.
The hypothesis test is exact for normal populations and is approximately correct for large
samples from nonnormal populations.
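The steps of the critical-value approach can be sketched as a small function. The function name `z_test_critical` and its `tail` argument are illustrative choices, not part of the notes, and the critical values come from the standard library rather than Table II. The usage lines apply it to the calcium summary statistics quoted in these notes (x̄ = 747.4, μ0 = 800, σ = 188, n = 18, left-tailed, α = 0.05).

```python
import math
from statistics import NormalDist

def z_test_critical(x_bar, mu0, sigma, n, alpha, tail):
    """One-sample z-test, critical-value approach.

    tail is "two", "left", or "right".
    Returns (z, critical_values, reject_h0).
    """
    std_normal = NormalDist()
    z = (x_bar - mu0) / (sigma / math.sqrt(n))      # Step 3: test statistic
    if tail == "two":                               # Step 4: critical value(s)
        c = std_normal.inv_cdf(1 - alpha / 2)
        critical, reject = (-c, c), abs(z) >= c
    elif tail == "left":
        c = std_normal.inv_cdf(alpha)
        critical, reject = (c,), z <= c
    elif tail == "right":
        c = std_normal.inv_cdf(1 - alpha)
        critical, reject = (c,), z >= c
    else:
        raise ValueError("tail must be 'two', 'left', or 'right'")
    return z, critical, reject                      # Step 5: decision

# Usage: calcium-intake summary statistics (left-tailed test at the 5% level).
z, critical, reject = z_test_critical(747.4, 800, 188, 18, 0.05, "left")
print(f"z = {z:.2f}, critical value = {critical[0]:.3f}, reject H0: {reject}")
```

Here z is about -1.19, which does not fall at or below -1.645, so under these inputs H0 is not rejected.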
38.29  39.92  53.74  42.93  42.98  48.20  42.94  39.38
46.86  39.07  44.40  52.74  44.37  55.78  46.03  47.77
54.72  42.99  64.42  43.74  44.46  33.12  67.41  45.80
56.97  48.52  64.21  49.48  61.08  53.30  46.13  34.38
34.69
Mean (x̄) = $46.91
s = $8.11
$$z = \frac{46.91 - 43.50}{7.61 / \sqrt{40}} = 2.85.$$
433  620  743  574  647  634  734  850
641  858  992  775  1113  672  879  609
Mean (x̄) = 747.4
s = 178.8
At the 5% significance level, do the data provide sufficient evidence to conclude that the
mean calcium intake of all people with incomes below the poverty level is less than the
RDA of 800 mg? Assume that σ = 188 mg.
Ans: The normal probability plot reveals no outliers and a roughly normal distribution.
Although the sample size is only n = 18, the z-test procedure applies.
Step 1. State the null and alternative hypotheses.
Let μ denote the mean calcium intake (per day) of all people with incomes below the
poverty level. The null and alternative hypotheses are
H0: μ = 800 mg (mean calcium intake is not less than the RDA)
Ha: μ < 800 mg (mean calcium intake is less than the RDA)
Step 2. Decide on the significance level α.
The significance level is 5%, i.e. α = 0.05.
Step 3. Compute the value of the test statistic
$$z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}.$$
The known data are: μ0 = 800 mg, x̄ = 747.4 mg, σ = 188 mg, and n = 18, so
$$z = \frac{747.4 - 800}{188 / \sqrt{18}} = -1.19.$$
One common estimate of mean top speed for cheetahs is 60 mph. The following table
gives the top speeds, in mph, over a quarter mile for a sample of 35 cheetahs. At the 5%
significance level, do the data provide sufficient evidence to conclude that the mean top
speed of all cheetahs differs from 60 mph? Assume that the population standard
deviation of top speeds is 3.2 mph.
57.3  65.0  65.2  60.9  59.8  57.5  60.1
54.8  75.3  63.4  59.0  59.7  55.4  60.6
54.7  56.5  62.6  55.5  58.1  60.2  61.3
52.6  57.8  55.9  52.4  57.6  60.7  58.7
61.6  58.3  59.2  62.3  57.8  59.6  66.0
Mean (x̄) = 59.5
s = 4.3
Ans: A frequency histogram for the data suggests that the top speed of 75.3 mph is an outlier.
Thus, we first apply the z-test procedure to the full data set and then apply it again to the
data set without the outlier.
Step 1. State the null and alternative hypotheses.
Let μ denote the mean top speed of all cheetahs.
The null and alternative hypotheses are
H0: μ = 60 mph (mean top speed of cheetahs is 60 mph)
Ha: μ ≠ 60 mph (mean top speed of cheetahs is not 60 mph)
Step 2. Decide on the significance level α.
The significance level is 5%, i.e. α = 0.05.
Step 3. Compute the value of the test statistic
$$z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}.$$
The known data are: μ0 = 60 mph, x̄ = 59.5 mph (59.526 before rounding), σ = 3.2 mph, and n = 35, so
$$z = \frac{59.526 - 60}{3.2 / \sqrt{35}} = -0.88.$$
After removing the outlier (new x̄ = 59.06), we find that the value of the test statistic is z =
-1.71, which still lies in the nonrejection region, although it is much closer to the critical
value. In this case, removing the outlier does not affect the conclusion of the hypothesis
test. We can probably accept that the mean top speed of all cheetahs is roughly 60 mph.
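The effect of the outlier can be reproduced directly from the 35 speeds listed in the table. This is a sketch with illustrative names; the critical values ±1.96 are the two-tailed values for α = 0.05.

```python
import math
from statistics import mean

# Top speeds (mph) of the 35 sampled cheetahs, as listed in the table.
speeds = [
    57.3, 65.0, 65.2, 60.9, 59.8, 57.5, 60.1,
    54.8, 75.3, 63.4, 59.0, 59.7, 55.4, 60.6,
    54.7, 56.5, 62.6, 55.5, 58.1, 60.2, 61.3,
    52.6, 57.8, 55.9, 52.4, 57.6, 60.7, 58.7,
    61.6, 58.3, 59.2, 62.3, 57.8, 59.6, 66.0,
]
mu0, sigma, z_crit = 60.0, 3.2, 1.96   # two-tailed test at the 5% level

def z_statistic(data):
    """z = (x_bar - mu0) / (sigma / sqrt(n)) for the given sample."""
    return (mean(data) - mu0) / (sigma / math.sqrt(len(data)))

z_full = z_statistic(speeds)
without_outlier = [v for v in speeds if v != 75.3]   # drop the 75.3 mph observation
z_trimmed = z_statistic(without_outlier)

for label, z in (("full data", z_full), ("without outlier", z_trimmed)):
    print(f"{label}: z = {z:.2f}, reject H0: {abs(z) >= z_crit}")
```

Both statistics lie inside the nonrejection region, so the decision is the same with or without the outlier.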
Statistical significance versus practical significance:
Statistical significance means that the data provide sufficient evidence to conclude that
the truth is different from the stated H0. However, it does not necessarily mean that the
difference is important in any practical sense.
From the formula
$$z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}} = \frac{\bar{x} - \mu_0}{\sigma}\,\sqrt{n},$$
we see that |x̄ - μ0| may be small, but if n is large, the value of z may still be large
enough to fall into the rejection region. In such a case, the difference |x̄ - μ0| may not be
practically significant even though it is statistically significant.
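A quick numerical illustration of this point: hold a small difference fixed and let n grow. The setting borrows μ0 = 454 g and assumes σ = 7.8 g from the pretzel example; the sample mean of 453.5 g and the sample sizes are hypothetical values chosen only for illustration.

```python
import math

mu0, sigma = 454.0, 7.8   # pretzel setting (sigma assumed to be 7.8 g)
x_bar = 453.5             # hypothetical sample mean: only 0.5 g below mu0

def z_stat(n):
    """Test statistic for the fixed difference x_bar - mu0 at sample size n."""
    return (x_bar - mu0) / sigma * math.sqrt(n)

for n in (25, 400, 10_000):
    z = z_stat(n)
    print(f"n = {n:>6}: z = {z:6.2f}, beyond +/-1.96: {abs(z) > 1.96}")
```

The 0.5 g difference never changes, yet it moves from clearly inside the nonrejection region to far inside the rejection region as n increases.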
9.5 P-values
Critical-value approach -- use the critical value in a hypothesis test; this is the
approach we used above.
P-value approach -- use the observed value (sample value) of the test statistic as the critical value in a
hypothesis test.
P-value (observed significance level or probability value):
1. The percentage of samples that would yield a value of the test statistic as extreme
as or more extreme than that observed if the null hypothesis is true.
2. The probability of observing a value of the test statistic as extreme as or more extreme
than that observed. To obtain the P-value of a hypothesis test, we assume that the null
hypothesis is true. By "extreme" we mean far from what we would expect to observe if
the null hypothesis is true. We use the letter P to denote the P-value.
Small P-values provide evidence against the null hypothesis; large P-values do not. The
smaller (closer to 0) the P-value, the stronger the evidence is against the null hypothesis.
Obtaining P-values for a one-sample z-test -- the P-value depends on whether the test is two-tailed or one-tailed (left-tailed or right-tailed).
Example P.422 Ex9.12, Prices of History Books, consider the history book hypothesis test
where we wanted to decide whether this year's mean cost of all history books has
increased from the 1997 mean of $43.50. Recall that the null and alternative hypotheses
are (let μ denote this year's mean retail price of all history books),
H0: μ = $43.50 (mean price has not increased)
= 0.1894, = 0.3789.
speed at least as far from 60 mph as that of our sample (60 - 59.526 = 0.474 mph) more than
37% of the time.
(b) Without the outlier, the test statistic becomes -1.71, and the resulting P-value is
0.0872 (two-tailed). The interpretation is similar to (a).
(c) Parts (a) and (b) indicate the strength of the evidence against the null
hypothesis. If the outlier is retained, there is virtually no evidence against the null
hypothesis; if the outlier is removed, there is moderate evidence against the null
hypothesis.
P-value approach to hypothesis testing:
P-value as the Observed Significance Level:
The P-value of a hypothesis test equals the smallest significance level at which the null
hypothesis can be rejected, that is, the smallest significance level for which the observed
sample data results in rejection of H0.
Decision Criterion for a Hypothesis Test Using the P-value:
If the P-value is less than or equal to the specified significance level, reject the null
hypothesis; otherwise, do not reject the null hypothesis.
The One-Sample z-Test for a Population Mean (P-value Approach):
Assumptions
1. Normal population or large sample
2. σ is known
Step 1. The null hypothesis is H0: μ = μ0, and the alternative hypothesis is
Ha: μ ≠ μ0 (Two-tailed) or Ha: μ < μ0 (Left-tailed) or Ha: μ > μ0 (Right-tailed).
Step 2. Decide on the significance level α.
Step 3. Compute the value of the test statistic
$$z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}} = z_0$$
(denote that value as z0).
Step 4. Use Table II to obtain the P-value from
$-z_P = z_0$ (Left-tailed) or $z_P = z_0$ (Right-tailed) or $z_{P/2} = |z_0|$ (Two-tailed).
(Notice that the P subscript follows the $z_\alpha$ notation: $z_P$ is the z-value with area P to its right.)
Step 5. If P ≤ α, reject H0; otherwise, do not reject H0.
Step 6. Interpret the results of the hypothesis test.
The hypothesis test is exact for normal populations and is approximately correct for large
samples from nonnormal populations.
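Step 4 amounts to reading a tail area of the standard normal curve. A minimal sketch using the standard library in place of Table II follows; the function name `z_p_value` and the `tail` labels are illustrative, not from the notes. The usage lines feed in test statistics quoted elsewhere in these notes.

```python
from statistics import NormalDist

def z_p_value(z0, tail):
    """P-value of a one-sample z-test with observed test statistic z0.

    tail is "left", "right", or "two".
    """
    std_normal = NormalDist()
    if tail == "left":
        return std_normal.cdf(z0)                    # area to the left of z0
    if tail == "right":
        return 1 - std_normal.cdf(z0)                # area to the right of z0
    if tail == "two":
        return 2 * (1 - std_normal.cdf(abs(z0)))     # both tails beyond |z0|
    raise ValueError("tail must be 'left', 'right', or 'two'")

# Usage with test statistics quoted in these notes.
p_calcium = z_p_value(-1.19, "left")    # calcium intake, left-tailed
p_cheetah = z_p_value(-0.88, "two")     # cheetah speeds, two-tailed
alpha = 0.05
print(f"calcium: P = {p_calcium:.4f}, reject H0: {p_calcium <= alpha}")
print(f"cheetah: P = {p_cheetah:.4f}, reject H0: {p_cheetah <= alpha}")
```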
433  620  775  743  574  1113  647  634
672  734  850  879  641  858  609
Mean (x̄) = 747.4
s = 178.8
Ans: A normal probability plot of the above data reveals no outliers and roughly a normal
distribution. We can apply the z-test procedure (P-value approach).
Step 1. State the null and alternative hypotheses.
Let μ denote the mean calcium intake (per day) of all people with incomes below the
poverty level. The null and alternative hypotheses are
H0: μ = 800 mg (mean calcium intake is not less than the RDA)
Ha: μ < 800 mg (mean calcium intake is less than the RDA).
The test is left-tailed because the < sign appears in the alternative hypothesis.
Step 2. Decide on the significance level, α.
The significance level is given as 5%, i.e. α = 0.05.
Step 3. Compute the value of the test statistic
$$z_0 = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}} = \frac{747.4 - 800}{188 / \sqrt{18}} = -1.19.$$
Critical-value approach
P-value approach
, 3 )
Using the P-value to assess the evidence against the null hypothesis:
P-value             Evidence against H0
P > 0.10            Weak or none
0.05 < P ≤ 0.10     Moderate
0.01 < P ≤ 0.05     Strong
P ≤ 0.01            Very strong
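The guideline table translates directly into a lookup function. In this sketch the function name is illustrative, and a P-value of exactly 0.10 is assumed to belong to the "Moderate" band. The usage lines classify P-values quoted elsewhere in these notes.

```python
def evidence_against_h0(p):
    """Classify the strength of evidence against H0 from a P-value,
    following the guideline table (P = 0.10 is treated as 'Moderate')."""
    if not 0 <= p <= 1:
        raise ValueError("a P-value must lie between 0 and 1")
    if p > 0.10:
        return "Weak or none"
    if p > 0.05:
        return "Moderate"
    if p > 0.01:
        return "Strong"
    return "Very strong"

# P-values quoted elsewhere in these notes.
for p in (0.3789, 0.0872, 0.046, 0.00192):
    print(f"P = {p}: {evidence_against_h0(p)}")
```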
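$$t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}.$$
Step 4 The critical value(s) are
$\pm t_{\alpha/2}$ (Two-tailed) or $-t_{\alpha}$ (Left-tailed) or $t_{\alpha}$ (Right-tailed)
with df = n - 1. Use Table IV to find the critical value(s).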
Step 5 If the value of the test statistic falls in the rejection region, reject H0; otherwise, do not
reject H0.
Step 6 Interpret the results of the hypothesis test.
The hypothesis test is exact for normal populations and is approximately correct for large
samples from nonnormal populations.
The One-mean t-Test for a Population Mean (P-Value Approach)
Assumptions
1. Normal population or large sample
2. σ is unknown
or Ha: μ > μ0 (Right-tailed)
7.3  6.3  6.9  6.1  5.5  6.7
6.9  6.3  7.9  6.6  6.5  5.8
Mean = 6.6
s = 0.672
Ans: A normal probability plot of the data reveals no outliers and is quite linear.
Step 1 State the null and alternative hypotheses.
Let μ denote the mean pH level of all high mountain lakes in the Southern Alps.
H0: μ = 6 (mean pH level is not greater than 6)
Ha: μ > 6 (mean pH level is greater than 6)
Since the > sign appears in Ha, it is a right-tailed test.
Critical-Value Approach
Step 4 The critical value for a right-tailed test is $t_{\alpha}$, with df = n - 1.
From df = 14 and α = 0.05, Table IV gives $t_{0.05}$ = 1.761.
Step 5 If the value of the test statistic falls in the rejection region, reject H0; otherwise, do
not reject H0.
P-Value Approach
Step 4 The t-statistic has df = n - 1. Use Table IV to estimate the P-value, or obtain it exactly
by using technology.
From $t_P = t_0$ = 3.458 and df = 14: the smallest tail area listed in Table IV is 0.005, with
$t_{0.005}$ = 2.977, and t0 exceeds that value, so P < 0.005. By using Excel, P = 0.00192.
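The test statistic t0 = 3.458 quoted above can be reproduced from the summary statistics. This sketch assumes n = 15 (inferred from df = 14) and takes the critical value 1.761 from the table lookup in the notes rather than computing it; the variable names are illustrative.

```python
import math

n = 15          # assumed sample size, inferred from df = n - 1 = 14
x_bar = 6.6     # sample mean pH
s = 0.672       # sample standard deviation
mu0 = 6.0       # hypothesized mean under H0

t0 = (x_bar - mu0) / (s / math.sqrt(n))   # one-sample t statistic
df = n - 1

t_crit = 1.761  # t_0.05 with df = 14, as read from Table IV in the notes
reject_h0 = t0 >= t_crit                  # right-tailed rejection region

print(f"t0 = {t0:.3f}, df = {df}, critical value = {t_crit}, reject H0: {reject_h0}")
```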
Review Problems
Understanding the Concepts and Skills
1. Explain the meaning of each term.
a. Null hypothesis
b. Alternative hypothesis
c. Test statistic
d. Rejection region
e. Nonrejection region
f. Critical value(s)
Ans: a. The null hypothesis is a hypothesis to be tested.
b. The alternative hypothesis is a hypothesis to be considered as an alternate to the null
hypothesis.
c. The test statistic is the statistic used as a basis for deciding whether the null
hypothesis should be rejected.
d. The rejection region is the set of values for the test statistic that leads to rejection of
the null hypothesis.
e. The nonrejection region is the set of values for the test statistic that leads to
nonrejection of the null hypothesis.
d. Because it is the smallest significance level for which the observed sample data result
in rejection of the null hypothesis.
13. Discuss the differences between the critical-value and P-value approaches to hypothesis
testing.
14. Identify two advantages of nonparametric methods over parametric methods. When is a
parametric procedure preferred? Explain your answer.
15. Cheese Consumption. The U.S. Department of Agriculture reports in Food
Consumption, Prices, and Expenditures that the average American consumed 30.0 lb of
cheese in 2001. Cheese consumption has increased steadily since 1960 when the average
American ate only 8.3 lb of cheese annually. Suppose that you want to decide whether
last year's mean cheese consumption is greater than the 2001 mean.
a. Identify the null hypothesis.
b. Identify the alternative hypothesis.
c. Classify the hypothesis test as two tailed, left tailed, or right tailed.
Ans: Let μ denote last year's mean cheese consumption by Americans.
a. H0: μ = 30.0 lb
b. Ha: μ > 30.0 lb
c. Right tailed
16. The following graph portrays the decision criterion for a hypothesis test about a
population mean, μ. The null hypothesis for the test is H0: μ = μ0, and the test statistic is
$$z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}.$$
The curve shown in the graph reveals the implications of the decision criterion if in fact
the null hypothesis is true.
Determine the
a. rejection region.
b. nonrejection region.
c. critical value(s).
d. significance level.
e. Draw a graph that depicts the answers you obtained in parts (a)-(d).
f. Classify the hypothesis test as two tailed, left tailed, or right tailed.
Ans: a. z > 1.28
b. z < 1.28
c. z = 1.28
d. α = 0.10
f. Right tailed
17. Cheese Consumption. The null and alternative hypotheses for the hypothesis test in
Problem 15 are:
H0: μ = 30.0 lb (mean has not increased)
Ha: μ > 30.0 lb (mean has increased),
where μ is last year's mean cheese consumption for all Americans. Explain what each of
the following would mean.
a. Type I error
b. Type II error
c. Correct decision
Now suppose that the results of carrying out the hypothesis test lead to non-rejection of
the null hypothesis. Classify that decision by error type or as a correct decision if in fact
last year's mean cheese consumption
d. has not increased from the 2001 mean of 30.0 lb.
e. has increased from the 2001 mean of 30.0 lb.
Ans: a. A Type I error would occur if in fact μ = 30.0 lb, but the results of the sampling lead
to the conclusion that μ > 30.0 lb.
b. A Type II error would occur if in fact μ > 30.0 lb, but the results of the sampling fail
to lead to that conclusion.
c. A correct decision would occur if in fact μ = 30.0 lb and the results of the sampling
do not lead to the rejection of that fact, or if in fact μ > 30.0 lb and the results of the
sampling lead to that conclusion.
d. Correct decision
e. Type II error.
*18. Cheese Consumption. Refer to Problem 15. Suppose that you decide to use a z-test
with a significance level of 0.10 and a sample size of 35. Assume that σ = 6.9 lb.
a. Determine the probability of a Type I error.
b. If last year's mean cheese consumption was 30.5 lb, identify the distribution of the
variable x̄, that is, the sampling distribution of the mean for samples of size 35.
c. Use part (b) to determine the probability, β, of a Type II error if in fact last year's
mean cheese consumption was 30.5 lb.
d. Repeat parts (b) and (c) if in fact last year's mean cheese consumption was 31.0 lb,
31.5 lb, 32.0 lb, 32.5 lb, 33.0 lb, 33.5 lb, and 34.0 lb.
e. Use your answers from parts (c) and (d) to construct a table of selected Type II error
probabilities and powers similar to Table 9.8 on page 433.
f. Use your answer from part (e) to construct the power curve.
Using a sample size of 60 instead of 35, repeat
g. part (b).
h. part (c).
i. part (d).
j. part (e).
k. part (f).
l. Compare your power curves for the two sample sizes and explain the principle being
illustrated.
Ans: Note: The answers obtained to many of the parts of this problem may vary depending on
when and how much intermediate rounding is done. We used statistical software to get
the answers to most parts of this problem.
a. 0.10
b. Approximately normal with a mean of 30.5 and a standard deviation of 6.9/√35 ≈
1.17.
c. 0.8031
d. Approximately normal with the specified mean and a standard deviation of 6.9/√35 ≈
1.17. The Type II error probabilities, β, are shown in the table in part (e).
g. Approximately normal with a mean of 30.5 and a standard deviation of 6.9/√60 ≈
0.89.
h. 0.7643
i. Approximately normal with the specified mean and a standard deviation of 6.9/√60 ≈
0.89. The Type II error probabilities, β, are shown in the table in part (j).
l. For a fixed significance level, increasing the sample size increases the power.
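The Type II error probabilities in parts (c), (d), (h), and (i) can be sketched with the standard library. The function name `beta_right_tailed` is illustrative, and the printed values may differ in the last digit from software that rounds intermediate results differently.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()

def beta_right_tailed(true_mean, n, mu0=30.0, sigma=6.9, alpha=0.10):
    """P(Type II error) for a right-tailed z-test when mu = true_mean."""
    se = sigma / math.sqrt(n)
    # H0 is rejected when x_bar >= x_crit.
    x_crit = mu0 + STD_NORMAL.inv_cdf(1 - alpha) * se
    # beta = P(x_bar < x_crit) when x_bar is Normal(true_mean, se).
    return NormalDist(true_mean, se).cdf(x_crit)

true_means = [30.5, 31.0, 31.5, 32.0, 32.5, 33.0, 33.5, 34.0]
print("true mean   beta(n=35)  power(n=35)   beta(n=60)  power(n=60)")
for mu in true_means:
    b35, b60 = beta_right_tailed(mu, 35), beta_right_tailed(mu, 60)
    print(f"{mu:9.1f}   {b35:10.4f}  {1 - b35:11.4f}   {b60:10.4f}  {1 - b60:11.4f}")
```

For every listed true mean, β is smaller (and the power 1 - β larger) with n = 60 than with n = 35, which is the principle stated in part (l).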
19. Cheese Consumption. Refer to Problem 15. The following table provides last year's
cheese consumption, in pounds, for 35 randomly selected Americans.
42  29  32  40  33  25  28
28  29  33  29  32  41  22
32  34  24  20  33  18  38
43  35  23  40  36  22  24
27  32  30  38  29  32  25
a. At the 10% significance level, do the data provide sufficient evidence to conclude that
last year's mean cheese consumption for all Americans has increased over the 2001
mean? Assume that σ = 6.9 lb. For your hypothesis test, use a z-test and the critical-value approach. (Note: The sum of the data is 1078 lb.)
b. Given the conclusion in part (a), if an error has been made, what type must it be?
Explain your answer.
Ans: a. H0: μ = 30.0 lb, Ha: μ > 30.0 lb; α = 0.10; z = 0.69; critical value = 1.28; do not reject
H0; at the 10% significance level, the data do not provide sufficient evidence to
conclude that last year's mean cheese consumption for all Americans has increased
over the 2001 mean of 30.0 lb.
b. A Type II error because, given that the null hypothesis was not rejected, the only error
that could be made is the error of not rejecting a false null hypothesis.
20. Cheese Consumption. Refer to Problem 19.
a. Repeat the hypothesis test, using the P-value approach to hypothesis testing.
b. Use Table 9.12 on page 444 to assess the strength of the evidence against the null
hypothesis.
Ans: a. H0: μ = 30.0 lb, Ha: μ > 30.0 lb; α = 0.10; z = 0.69; P = 0.2451; do not reject H0; at
the 10% significance level, the data do not provide sufficient evidence to conclude
that last year's mean cheese consumption for all Americans has increased over the
2001 mean of 30.0 lb.
b. The data provide at most weak evidence against the null hypothesis.
21. Purse Snatching. The U.S. Federal Bureau of Investigation (FBI) compiles information
on robbery and property crimes, by type and selected characteristic, and publishes its
findings in Population-at-Risk Rates and Selected Crime Indicators. According to that
document, the mean value lost to purse snatching was $332 in 2002. For last year, 12
randomly selected purse-snatching offenses yielded the following values lost, to the
nearest dollar.
207  237  422  226  272  205
362  348  165  266  269  430
Use a t-test with either the critical-value approach or the P-value approach to decide, at
the 5% significance level, whether last year's mean value lost to purse snatching has
decreased from the 2002 mean. The mean and standard deviation of the data are $284.1
and $86.9, respectively.
Ans: H0: μ = $332, Ha: μ < $332; α = 0.05; t = -1.909; critical value = -1.796; 0.025 < P < 0.05;
reject H0; at the 5% significance level, the data provide sufficient evidence to conclude
that last year's mean value lost to purse snatching has decreased from the 2002 mean of
$332.
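The summary statistics and the t statistic in this answer can be recomputed from the 12 listed values. In this sketch the variable names are illustrative, and the critical value -1.796 (df = 11) is taken from the answer above rather than computed.

```python
import math
from statistics import mean, stdev

# Values lost (to the nearest dollar) in the 12 sampled purse-snatching offenses.
losses = [207, 237, 422, 226, 272, 205, 362, 348, 165, 266, 269, 430]
mu0 = 332                     # 2002 mean value lost ($)

n = len(losses)
x_bar = mean(losses)          # sample mean
s = stdev(losses)             # sample standard deviation (n - 1 in the denominator)
t0 = (x_bar - mu0) / (s / math.sqrt(n))

t_crit = -1.796               # -t_0.05 with df = 11, as quoted in the answer
reject_h0 = t0 <= t_crit      # left-tailed rejection region

print(f"x_bar = {x_bar:.1f}, s = {s:.1f}, t = {t0:.3f}, reject H0: {reject_h0}")
```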
*22. Purse Snatching. Refer to Problem 21.
a. Perform the required hypothesis test, using the Wilcoxon signed-rank test.
b. In performing the hypothesis test in part (a), what assumption did you make about the
distribution of last year's values lost to purse snatching?
c. In Problem 21, we used the t-test to perform the hypothesis test. The assumption in
that problem is that last year's values lost to purse snatching are normally distributed.
If that assumption is true, why is it permissible to perform a Wilcoxon signed-rank
test for the mean value lost?
Ans: a. H0: μ = $332, Ha: μ < $332; α = 0.05; W = 17; critical value = 17; P = 0.046; reject
H0; at the 5% significance level, the data provide sufficient evidence to conclude that
last year's mean value lost to purse snatching has decreased from the 2002 mean of
$332.
b. It is symmetric.
c. Because a normal distribution is symmetric.
*23. Purse Snatching. Refer to Problems 21 and 22. If in fact last year's values lost to purse
snatching are normally distributed, which is the preferred procedure for performing the
hypothesis test: the t-test or the Wilcoxon signed-rank test? Explain your answer.
Ans: t-test
24. Betting the Spreads. College basketball, and particularly the NCAA basketball
tournament, is a popular venue for gambling, from novices in office betting pools to the
high roller. To encourage uniform betting across teams, Las Vegas oddsmakers assign a
point spread to each game. The point spread is the oddsmakers' prediction for the number
of points by which the favored team will win. If you bet on the favorite, you win the bet
provided the favorite wins by more than the point spread; otherwise, you lose the bet. Is
the point spread a good measure of the relative ability of the two teams? H. Stern and B.
Mock addressed this question in the paper "College Basketball Upsets: Will a 16-Seed
Ever Beat a 1-Seed?" (Chance, Vol. 11(1), pp. 27-31). They obtained the difference
between the actual margin of victory and the point spread, called the point-spread error,
for 2109 college basketball games. The mean point-spread error was found to be -0.2
point with a standard deviation of 10.9 points. For a particular game, a point-spread error
of 0 indicates that the point spread was a perfect estimate of the two teams' relative
abilities.
a. If, on average, the oddsmakers are estimating correctly, what is the (population) mean
point-spread error?
b. Use the data to decide, at the 5% significance level, whether the (population) mean
point-spread error differs from 0.
c. Interpret your answer in part (b).
Ans: a. 0 points
b. H0: μ = 0 points, Ha: μ ≠ 0 points; α = 0.05; t = -0.843; critical values = ±1.96;
P > 0.20; do not reject H0.
c. At the 5% significance level, the data do not provide sufficient evidence to conclude
that the population mean point-spread error differs from 0. In fact, because P > 0.20,
there is virtually no evidence against the null hypothesis that the population mean
point-spread error equals 0.
Problems 25 and 26 each include a normal probability plot and either a frequency histogram
or a stem-and-leaf diagram for a set of sample data. The intent is to use the sample data to
perform a hypothesis test for the mean of the population from which data were obtained. In
each case, consult the graphs provided to decide whether to use the z-test, the t-test, or
neither. Explain your answer.
25. The normal probability plot and histogram of the data are depicted in Fig. 9.44; σ is
known.
Ans: It is probably okay to use the z-test because the sample size is large and σ is known.
However, it does appear from the normal probability plot that there may be outliers, so
one should proceed cautiously in using the z-test.
26. The normal probability plot and stem-and-leaf diagram of the data are depicted in Fig.
9.45; σ is unknown.
Ans: It appears that the variable under consideration is far from being normally distributed
and, in fact, has a left-skewed distribution. However, the sample size is large and the
plots reveal no outliers. Keeping in mind that σ is unknown, it is probably reasonable to
use the t-test.
*27. Refer to Problems 25 and 26.
a. In each case, consult the appropriate graphs to decide whether using the Wilcoxon
signed-rank test is reasonable for performing a hypothesis test for the mean of the
population from which the data were obtained. Give reasons for your answers.
b. For each case where using either the z-test or the t-test is reasonable and where using
the Wilcoxon signed-rank test is also appropriate, decide which test is preferable. Give
reasons for your answers.
Ans: a. In view of the graphs, it appears reasonable to assume that, in Problem 25, the
variable under consideration has (approximately) a symmetric distribution but not so
in Problem 26. Consequently, it would be reasonable to use the Wilcoxon signed-rank
test in the first case, but not the second.
b. In Problem 25, it is a tough call between the Wilcoxon signed-rank test and the z-test
but, considering the possible outliers, the Wilcoxon signed-rank test is probably the
better one to use.
*28. Nursing-Home Costs. The cost of staying in a nursing home in the United States is
rising dramatically, as reported in the August 5, 2003 issue of The Wall Street Journal. In
May 2002, the average cost of a private room in a nursing home was $168 per day. For
August 2003, a random sample of 11 nursing homes yielded the following daily costs, in
dollars, for a private room in a nursing home.
73  199  192  181  182  250  159  182  208  129  282
a. Apply the t-test to decide at the 10% significance level whether the average cost for a
private room in a nursing home in August 2003 exceeded that in May 2002.
b. Repeat part (a) by using the Wilcoxon signed-rank test.
c. Obtain a normal probability plot, a boxplot, a stem-and-leaf diagram, and a histogram
of the sample data.
d. Discuss the discrepancy in results between the t-test and the Wilcoxon signed-rank
test.
Ans: a. H0: μ = $168, Ha: μ > $168; α = 0.10; t = 1.03; critical value = 1.372; P > 0.10; do not
reject H0; at the 10% significance level, the data do not provide sufficient evidence to
conclude that the average cost for a private room in a nursing home in August 2003
exceeded that in May 2002.
b. H0: μ = $168, Ha: μ > $168; α = 0.10; W = 48; critical value = 48; P = 0.099; reject
H0; at the 10% significance level, the data provide sufficient evidence to conclude
that the average cost for a private room in a nursing home in August 2003 exceeded
that in May 2002.
d. From part (c), we find that the variable under consideration appears to be symmetric,
but that the data contain outliers. This explains the discrepancy between the results of
the two tests. In view of the small sample size, the Wilcoxon signed-rank test is
preferable to the t-test.
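The two test statistics in parts (a) and (b) can be reproduced from the 11 sample costs. The following is a minimal sketch using only the Python standard library; the variable names and the helper avg_rank are illustrative, not part of the original notes. Tied absolute differences receive the average of the ranks they span, and the critical values (1.372 and 48) are taken from the answers above rather than computed.

```python
import math
from statistics import mean, stdev

costs = [73, 199, 192, 181, 182, 250, 159, 182, 208, 129, 282]
mu0 = 168  # May 2002 mean daily cost, in dollars

# One-mean t statistic: (x-bar - mu0) / (s / sqrt(n))
n = len(costs)
t_stat = (mean(costs) - mu0) / (stdev(costs) / math.sqrt(n))

# Wilcoxon signed-rank statistic: rank the absolute differences from mu0,
# then sum the ranks that belong to positive differences.
diffs = [x - mu0 for x in costs if x != mu0]  # zero differences are dropped
abs_sorted = sorted(abs(d) for d in diffs)

def avg_rank(value):
    """Average rank of value among the sorted absolute differences (handles ties)."""
    first = abs_sorted.index(value) + 1
    last = first + abs_sorted.count(value) - 1
    return (first + last) / 2

w_stat = sum(avg_rank(abs(d)) for d in diffs if d > 0)

print(f"t = {t_stat:.2f}")  # compare with the critical value 1.372
print(f"W = {w_stat:g}")    # compare with the critical value 48
```

The t statistic falls short of 1.372, while W reaches the critical value 48, which matches the opposite decisions reported in parts (a) and (b).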
than the 2002 mean of 64.5 lb. Apply the one-mean t-test.
c. The sample data contain four potential outliers: 0, 0, 8, and 20. Remove those four
observations, repeat the hypothesis test in part (b), and compare your result with that
obtained in part (b).
d. Assuming that the four potential outliers are not recording errors, comment on the
advisability of removing them from the sample data before performing the hypothesis
test.
e. What action would you take regarding this hypothesis test?
*30. Beef Consumption. Use the technology of your choice to do the following.
a. Repeat parts (b) and (c) of Problem 29 by using the Wilcoxon signed-rank test.
b. Compare your results from part (a) with those in Problem 29.
c. Discuss the reasonableness of using the Wilcoxon signed-rank test here.
31. Body Mass Index. Body Mass Index (BMI) is a measure of body fat based on height and
weight. According to the document Dietary Guidelines for Americans, published by the
U.S. Department of Agriculture and the U.S. Department of Health and Human Services,
for adults, a BMI of greater than 25 indicates an above healthy weight. The BMIs of 75
randomly selected U.S. adults provided the data on the WeissStats CD. Use the
technology of your choice to do the following.
a. Obtain a normal probability plot, a boxplot, and a histogram of the data.
b. Based on your graphs from part (a), is it reasonable to apply the one-mean z-test to the
data? Explain your answer.