Chapter 8 Exercises - Lyman Ott Michael Longnecker An Introduc
Chapter 8 Exercises - Lyman Ott Michael Longnecker An Introduc
Chapter 8 Exercises - Lyman Ott Michael Longnecker An Introduc
8.9 Exercises
8.1 Introduction
Med. 8.1 For the port-wine stains research study, answer the following:
a. What are the populations of interest?
b. What are some factors besides change in skin color that may be of interest to the
investigators?
Med. 8.2 For the port-wine stains research study, do the following:
a. Describe how the subjects in this experiment could have been selected so as to satisfy
the randomization requirements.
b. State several research hypotheses that may have been of interest to the researchers.
8.2 A Statistical Test about More Than Two Population Means: An Analysis of Variance
Theory 8.3 Four techniques of teaching algebra are to be compared in five high schools. Four mathe-
matics teachers are randomly selected in each of the five schools and the four techniques are
randomly assigned to the teachers. The students are given a standardized algebra exam at the
end of the semester with the average score in each classroom used as the measure of the effec-
tiveness of the teaching technique. Therefore, five measurements of the teaching effectiveness
are obtained for each of the four teaching techniques. Would it be appropriate to use the AOV
F test to evaluate whether there is a difference in the average scores of the four teaching
techniques?
Theory 8.4 In Example 8.2, suppose the psychologist wanted to compare method 1 to method 2. What
is the advantage of using a t test having s 2W in the denominator as opposed to using the conven-
tional pooled t test of Chapter 5, with s2p, the average of the sample variances from method 1 and
method 2 data?
Theory 8.5 For an experiment comparing t treatments with sample sizes n1, n2, . . . , nt and sample vari-
ances s21, s22 , . . . , s2t , consider the following questions.
a. If the sample sizes satisfy n1 . . . nt, show the s 2W is the average of the t sample
variances, s21, s22 , . . . , s2t .
b. Does this hold if the sample sizes are not equal? If not, why not just use the average?
Ag. 8.6 A large laboratory has four types of devices used to determine the pH of soil samples. The
laboratory wants to determine whether there are differences in the average readings given by
these devices. The lab uses 24 soil samples having known pH in the study, and randomly assigns
six of the samples to each device. The soil samples are tested and the response recorded is the dif-
ference between the pH reading of the device and the known pH of the soil. These values, along
with summary statistics, are given in the following table.
Sample
Sample Standard
Device 1 2 3 4 5 6 Size Mean Deviation
a. Based on your intuition, is there evidence to indicate any difference among the mean
differences in pH readings for the four devices?
b. Run an analysis of variance to confirm or reject your conclusion of part (a). Use a .05.
c. Compute the p-value of the F test in part (b).
d. What conditions must be satisfied for your analysis in parts (b) and (c) to be valid?
e. Suppose the 24 soil samples have widely different pH values. What problems may
occur by simply randomly assigning the soil samples to the different devices?
8.9 Exercises 439
Bus. 8.7 A cigarette manufacturer has advertised that it has developed a new brand of cigarette,
LowTar, that has a lower average tar content than the major brands. To evaluate this claim, a con-
sumer testing agency randomly selected 100 cigarettes from each of the four leading brands of
cigarettes and 100 from the new brand. The tar content (milligrams) of the cigarettes gave the fol-
lowing results:
Brand yi si ni
A boxplot of the data used to produce the table are given here.
*
12 **
11
10
9
LowTar A B C D
Brand
a. Based on the information contained in the boxplot, does the LowTar brand appear to
have a lower average tar content than the other brands?
b. Using the computer output shown here, is there a significant (a .01) difference in
the average tar content of the five brands of cigarettes?
c. What is the p-value of the test statistic in (b)?
d. What are the practical consequences of making a Type I error with respect to your test
in (b)?
A 34 28 50 193 55 4 7 174 76 10 0 10 84 0 9 1 0 62
26 15 226 54 46 128 4 105 40 4 273 164 7 55 41 26 6
B 15 2 11 75 6 1 12 4 6 64 3 0 3 1 20 78
C 17 142 110 3 273 54 32 3 40 23 30 17 7 12 6 12 7 5
D 8 64 29 1 3 8 29 4 60
E 12 139 21 214 67 174 1 9 2 119 237 110 71
F 7 18 108 9 8 17 88 28
G 10 0 6 0 16 1 58 13 36 33 19
442 Chapter 8 Inferences about More Than Two Population Central Values
a. Do the conditions necessary for conducting the AOV F test appear to be satisfied
by these data?
b. Because the data are counts of number of successes for the EDGs, the
Poisson model may be an alternative to the normal based analysis. Apply a
transformation to the data and then apply the AOV F test to the transformed
data.
c. As a second alternative analysis which has fewer restrictions, answer the agencys
question by applying the KruskalWallis test to the reliability data.
d. Compare your conclusions to parts (a)(c). Which of the three procedures do you
feel more confident with its conclusion?
Engin. 8.27 In the manufacture of soft contact lenses, the actual strength (power) of the lens needs to
be very close to the target value for the lenses to properly fit the customers needs. In the paper,
An ANOM-type test for variances from normal populations, Technometrics (1997), 39:
274 283, a comparison of several suppliers is made relative to the consistency of the power of the
lenses. The following table contains the deviations from the target power of lenses produced
using materials from three different suppliers:
Lens
Supplier 1 2 3 4 5 6 7 8 9
a. Using the appropriate tests and plots given here, assess whether the data meet the
necessary conditions to use an AOV to determine whether there is a significant
difference in the mean deviations for the three suppliers.
b. Conduct an AOV with a .05 and compare your results with the conclusions from (a).
c. Apply the KruskalWallis test to evaluate the research hypothesis that the three
suppliers have different distributions of deviations.
d. Suppose that a difference in mean deviation of 20 units would have commercial
consequences for the manufacture of the lenses. Does there appear to be a practical
difference in the three suppliers?
8.9 Exercises 443
220
210
200
Deviations
190
180
170
160
150
A B C
Suppliers
.999
.99
Probability
.95
.80
.50
.20
.05
.01
.001
-10 0 10
Residuals
Average: -0.0000000 Anderson-Darling Normality Test
StDev: 5.53971 A-Squared: 0.718
N: 27 P-Value: 0.054
Ag. 8.28 The Agricultural Experiment Station of a university tested two different herbicides and
their effects on crop yield. From 90 acres set aside for the experiment, the station used herbicide
1 on a random sample of 30 acres, herbicide 2 on a second random sample of 30 acres, and they
used the remaining 30 acres as a control. At the end of the growing season, the yields (in bushels
per acre) were as follows:
Sample Standard
Sample Mean Deviation Sample Sizes
a. Use these data to conduct a one-way analysis of variance to test whether there is a
difference in the mean yields. Use a .05.
b. Construct 95% condence intervals on the mean yields mi.
c. Which of the mean yields appear to be different?
444 Chapter 8 Inferences about More Than Two Population Central Values
Hort. 8.29 Researchers from the Department of Fruit Crops at a university compared four different
preservatives to be used in freezing strawberries. The researchers prepared the yield from a straw-
berry patch for freezing and randomly divided it into four equal groups. Within each group they
treated the strawberries with the appropriate preservative and packaged them into eight small plas-
tic bags for freezing at 0C. The bags in group I served as a control group, while those in groups II,
III, and IV were assigned one of three newly developed preservatives. After all 32 bags of straw-
berries were prepared, they were stored at 0C for a period of 6 months. At the end of this time, the
contents of each bag were allowed to thaw and then rated on a scale of 1 to 10 points for discol-
oration. (Note that a low score indicates little discoloration.) The ratings are given here:
a. Use the following plots of the residuals and a test of the homogeneity of variances to
assess whether the conditions needed to use AOV techniques are satised with this
data set.
b. Test whether there is a difference in the mean ratings using a .05.
c. Place 95% condence intervals on the mean ratings for each of the groups.
d. Conrm your results with the computer output given here.
6
5
4
3
2
1
0
I II III IV
Group
8.9 Exercises 445
Normal probability plot of
residuals for Exercise 8.29 .999
.99
.95
Probability
.80
.50
.20
.05
.01
.001
-1 0 1
Residuals
Average: 0 Anderson-Darling Normality Test
StDev: 0.927840 A-Squared: 0.503
N: 32 P-Value: 0.191
8.30 Refer to Exercise 8.29. In many situations in which the response is a rating rather than an
actual measurement, it is recommended that the KruskalWallis test be used.
a. Apply the KruskalWallis test to determine whether there is a shift in the distribution
of ratings for the four groups.
b. Is the conclusion reached using the KruskalWallis test consistent with the conclusion
reached in Exercise 8.29 using AOV?
H.R. 8.31 Salary disputes and their eventual resolutions often leave both employers and employ-
ees embittered by the entire ordeal. To assess employee reactions to a recently devised salary
and fringe benets plan, the personnel department obtained random samples of 15 employees
from each of three divisions in the company: manufacturing, marketing, and research. The per-
sonnel staff asked each employee sampled to respond (in condence) to a series of questions.
Several employees refused to cooperate, as reected in the unequal sample sizes. The data are
given here:
Sample size 12 14 11
Sample mean 25.2 32.6 28.1
Sample variance 3.6 4.8 5.3
Group I, student /family prompt: Students were prompted to seek assistance from a family
member and the family was encouraged to provide assistance to students.
Group II, student prompt: Students were prompted to seek assistance from a family
member but there was no specic encouragement of family members to provide
assistance to students.
Group III, no prompts: Students were not prompted to seek assistance from a family
member nor were family members encouraged to provide assistance to students.
Thus, one class was assigned to each of the three treatment groups. The researchers gave
the students a posttest, with the results given here:
The researchers concluded that higher levels of family involvement were not associated with
higher student achievement in this study.
a. What is the population of interest in this study?
b. Based on the data collected, to what population can the results of this study be
attributed?
c. What is the effective sample for each of the treatment groups; that is, how many
experimental units were randomly assigned to each of the treatment groups?
d. What criticisms would you have for the design of this study?
e. Suggest an improved design for addressing the research hypothesis that family
involvement improves student performance in mathematics classes.
Gov. 8.35 In a 1994 Senate subcommittee hearing, an executive of a major tobacco company
testied that the accusation that nicotine was added to cigarettes was false. Tobacco company
scientists stated that the amount of nicotine in cigarettes was completely determined by the size
of tobacco leaf, with smaller leaves having greater nicotine content. Thus, the variation in nico-
tine content in cigarettes occurred due to a variation in the size of the tobacco leaves and was not
due to any additives placed in the cigarettes by the company. Furthermore, the company argued
that the size of the leaves varied depending on the weather conditions during the growing sea-
son, for which they had no control. To study whether smaller tobacco leaves had a higher nico-
tine content, a consumer health organization conducted the following experiment. The major
factors controlling leaf size are temperature and the amount of water received by the plants dur-
ing the growing season. The experimenters created four types of growing conditions for tobacco
8.9 Exercises 447
plants. Condition A was average temperature and rainfall amounts. Condition B was lower than
average temperature and rainfall conditions. Condition C was higher temperatures with lower
rainfall. Finally, condition D was higher than normal temperatures and rainfall. The scientists
then planted 10 tobacco plants under each of the four conditions in a greenhouse where tem-
perature and amount of moisture were carefully controlled. After growing the plants, the scien-
tists recorded the leaf size and nicotine content, which are given here:
Leaf size
30
20
10
0
A B C D
Group
.80
.50
.20
.05
.01
.001
-10 0 10
Leaf size residuals
9
8
7
6 *
A B C D
Group
8.9 Exercises 449
Probability plot of residuals
nicotine content for .999
Exercise 8.36 .99
Probability
.95
.80
.50
.20
.05
.01
.001
-4 -3 -2 -1 0 1 2 3
Nicotine content residuals
Ag. 8.37 Scientists conducted an experiment to test the effects of ve different diets in turkeys. They
randomly assigned six turkeys to each of the ve diet groups and fed them for a xed period of time.
8.40 Refer to the data of Exercise 8.37. To illustrate the effect that an extreme value can have
on conclusions from an analysis of variance, suppose that the weight gained by the fth turkey in
the level 2, additive B group was 15.8 rather than 9.8.
a. What effect does this have on the assumptions for an analysis of variance?
b. With 9.8 replaced by 15.8, if someone unknowingly ran an analysis of variance, what
conclusions would he or she draw?
8.41 Refer to Exercise 8.40. What happens to the KruskalWallis test if you replace the value
9.8 by 15.8? Might there be a reason to run both a KruskalWallis test and an analysis of vari-
ance? Justify your answer.
Engin. 8.42 A small corporation makes insulation shields for electrical wires using three different types of
machines. The corporation wants to evaluate the variation in the inside diameter dimension of the
shields produced by the machines. A quality engineer at the corporation randomly selects shields
produced by each of the machines and records the inside diameters of each shield (in millimeters).
She wants to determine whether the means and standard deviations of the three machines differ.
a. Conduct a test for the homogeneity of the population variances. Use a .05.
b. Would it be appropriate to proceed with an analysis of variance based on the results
of this test? Explain.
c. If the variances of the diameters are different, suggest a transformation that may alle-
viate their differences and then conduct an analysis of variance to determine whether
the mean diameters differ. Use a .05.
d. Compare the results of your analysis in (c) to the computer output given here, which
was an analysis of variance on the original diameters.
e. How could the engineer have designed her experiment differently if she knew that the
variance of machine B and machine C were so much larger than that of machine A?
Analysis of Variance
Source DF SS MS F P
Factor 2 4141 2071 2.73 0.094
Error 17 12907 759
Total 19 17048 Individual 95% CIs for Mean
Based on Pooled StDev
Level N Mean StDev --------+---------+---------+--------
Machine 5 8.32 6.52 (---------*----------)
Machine 5 16.78 22.43 (----------*---------)
Machine 10 40.70 34.52 (------*-------)
--------+---------+---------+--------
Pooled StDev = 27.55 0 25 50
8.43 The KruskalWallis test is not as highly affected by unequal variances as the AOV test.
Demonstrate this result by applying the KruskalWallis test to both the original and transformed
data and comparing the conclusions reached in this analysis for the data of Exercise 8.42.