Docx
Docx
A Preview of
Business Statistics
Test Bank
TRUE/FALSE
1. True or False
Regardless of your eventual career destination, whether it is accounting or
marketing, finance or human resource management, you will find statistics
relevant to your field.
2. True or False
Business statistics can be described as the collection, summarization,
analysis,and reporting of numerical findings relative to a business decision or
situation.
3. True or False
The sole purpose of the U.S. Census Bureau is to simply count how many people
live in the United States.
4. True or False
ATP singles rankings for tennis players rank Rafael Nadal as 2 and Roger Federer
as 1. This is an example of an interval scale.
5. True or False
Quantitative variables usually represent membership in groups or categories.
7. True or False
The population is sometimes referred to as the universe and is the entire set of
people or objects of interest.
8. True or False
Prior to making decisions, companies often collect information through a series
of steps called the research process.
9. True or False
Improperly used, statistics can become an effective weapon with which to
persuade or manipulate others into beliefs or behaviors that we would like them
to adopt.
MULTIPLE CHOICE
5. The difference between the highest and lowest values in the sample is known
as the sample:
a. mean
b. median
c. range
d. proportion
ANS: C PTS: 1 OBJ: Section 1.3
7. When data are collected in a statistical study for only a portion or subset of all
elements of interest we are using a:
a. sample
b. parameter
c. statistic
d. population
ANS: A PTS: 1 OBJ: Section 1.3
10. The statistic that describes the midpoint of the data which has just as many
values above it as below it is known as the sample:
a. mean
b. median
c. mode
d. range
e. standard deviation
ANS: B PTS: 1 OBJ: Section 1.3
11. Daily ambient temperatures in degrees Fahrenheit are being recorded for a
global warming study. This scale of measurement for this data would be:
a. nominal
b. ordinal
c. interval
d. ratio
ANS: C PTS: 1 OBJ: Section 1.4
13. A customer service survey asks respondents for their birthdates, which are
used to calculate their ages. Which scale of measurement does age represent?
a. nominal
b. ordinal
c. interval
d. ratio
ANS: D PTS: 1 OBJ: Section 1.4
COMPLETION
ANS: census
2. The temperature of the room in which you are taking this test is a
____________________ quantitative variable.
ANS: continuous
ANS: gaps
MATCHING
Match the terms with the definitions. (Note: there are more definitions than
terms)
a. a characteristic of the population
b. indicates that a person or object belongs in a category
c. the entire set of people or objects of interest
d. the arithmetic average of the data
e. a measurement scale with an absolute zero
f. a measured characteristic of a sample
g. helps us determine how much of something is possessed
h. a subset of people or objects that exist within the entire set
1. population
2. sample
3. parameter
4. statistic
5. quantitative variable
6. qualitative variable
SHORT ANSWER
1. A recent Wall Street Journal poll asked a group of American voters to rate
economic conditions in the United States. 22% of the respondents rated
economic conditions as “favorable.” Identify the population, sample, sample
statistic, and population parameter in this survey.
Population: ________________________________________
Sample: ________________________________________
ANS:
All eligible voters in the U.S.; Voters selected to participate in the survey.; 22%;
The percentage of all voters who would rate economic conditions as favorable.
ESSAY
1. As a business person, in what ways may you find yourself involved with
statistics?
ANS:
You may find yourself involved with statistics in at least one of the following
ways: (a) as a practitioner collecting, analyzing, and presenting findings based
on statistical data, or (b) as a consumer of statistical claims and findings offered
by others, some of whom may be either incompetent or unethical.
ANS:
Business statistics can be described as the collection, summarization, analysis,
and reporting of numerical findings relevant to a business decision or situation.
ANS:
In ancient times, statistics was mainly employed for counting people or
possessions to facilitate taxation.
PTS: 1 OBJ: Section 1.2
ANS:
Statistics can be used to describe information, to analyze data, to reach
conclusions, and to make decisions in all facets of business today.
ANS:
a. Defining the problem in specific terms that can be answered by research b.
Deciding on the type of data required c. Determining through what means the
data will be obtained d. Planning for the collection of data and, if necessary,
selection of a sample e. Collecting and analyzing the data f. Drawing conclusions
and reporting the findings g. Following through with decisions that take the
findings into consideration
6. The authors of the text claim that a company might actually benefit when one
of its employees uses statistics incorrectly. How can this be?
ANS:
The company may manipulate the data to show what it wants to show, for
example, by distorting the scale on a graph to make a large deficit appear very
small. Another example is to use a sample that is not a good representative of
the population of interest.
7. Car dealers sometimes provide surveys so that customers can evaluate their
purchasing experience at the establishment. What kinds of decisions might be
made on the basis of this information?
ANS:
The car dealers might be able to make decisions on the quality of service, if
there are enough salesmen, and the cleanliness of the show room.
8. What kinds of statistical data play a role in a life insurance firm’s decision on
the annual premium you will pay for your policy?
ANS:
The amount of money paid out in premiums and the number of policies
purchased in the previous year would play a role in how much you pay for your
policy. Data such as your age, and whether you are a smoker could also play a
role.
9. A store’s inventory of party supplies consists of 600 red balloons, 200 white
balloons, and 200 blue balloons. Your boss asks you to choose a representative
sample of 75 balloons for a window display. How many red balloons should you
place in the display? Explain your answer.
ANS:
45; A sample is representative if its members tend to have the same
characteristics as the population from which they were selected. In this case, we
would want our sample to contain a similar percentage of red balloons as the
store’s inventory.
10. A businesswoman who is running for the vacant City Commission seat with
20,000 registered voters conducts a survey. In the survey, 56% of the 500
registered voters interviewed say they planned to vote for her.
A) What is the population of interest?
B) What is the sample?
C) Is the 56% a parameter or a statistic?
ANS:
The 20,000 registered voters.; The 500 registered voters interviewed.; Statistic
12. A manufacturer of children toys claims that less than 5% of his products are
defective. When 500 toys were drawn from a large production run, 7% were
found to be defective.
A) What is the population of interest?
B) What is the sample?
C) What is the parameter?
D) What is the statistic?
E) Does the value 5% refer to the parameter or the statistic?
F) Is the value 7% a parameter or a statistic?
G) Explain briefly how the statistic can be used to make inferences about the
parameter to test the claim.
ANS:
All toys in the production run; 500 toys; Proportion of the production run that is
defective; Proportion of sample toys that are defective; Parameter; Statistic;
Because the sample proportion is more than 5%, we may conclude that the
claim is false.
13. For each of the following indicate whether the variable of interest would be
qualitative or quantitative.
A) Whether you are a US citizen
B) Your marital status
C) Number of cars in a parking lot
D) Amount of time you spend per week on your homework
E) Lily’s travel time from her apartment to the student union at Central Michigan
University
F) Heidi’s favorite brand of tennis balls
ANS:
Qualitative; Qualitative; Quantitative; Quantitative; Quantitative; Qualitative
ANS:
Continuous; Continuous; Discrete; Discrete
15. A research firm observes that men are twice as likely as women to hunt.
Does this information represent descriptive statistics or inferential statistics?
Why?
ANS:
Inferential; This information would represent inferential statistics, since a sample
is used to make generalizations about the population.
TRUE/FALSE
1. True or False
Raw data are converted to frequency distributions and visual displays that
provide us with a “big picture” of the information collected.
2. True or False
The appropriate methods for representing the data do not depend on whether
the variable of interest is quantitative or qualitative.
3. True or False
Contingency tables are used primarily for counts involving qualitative data.
4. True or False
In a raw data listing, data are listed in increasing or decreasing numerical order.
5. True or False
Compared to the frequency distribution, the stem-and-leaf display provides
more detail since it can describe the individual data values as well as show how
many are in each group, or stem.
ANS: T PTS: 1 OBJ: Section 2.3
6. True or False
The following stem-and-leaf output has been generated by Minitab. This data
has a negative mode.
7. True or False
When some of the variables represent categories, we can apply a useful
summarization method called tabulation, where we simply count how many
people or items are in each category or combination of categories.
8. True or False
To more concisely communicate the information contained, raw data can be
visually represented and expressed in terms of statistical summary measures.
9. True or False
A histogram describes a frequency distribution by using a series of adjacent
rectangles, each of which has a length that is proportional to the frequency of
observations within the range of values it represents.
MULTIPLE CHOICE
2. The midway point between the upper and lower class limits for a frequency
distribution is known as the:
a. class interval.
b. class width.
c. class mark.
d. class frequency.
e. class average.
ANS: C PTS: 1 OBJ: Section 2.2
3. The __________ consists of line segments connecting the points formed by the
intersections of the class marks with the class frequencies.
a. frequency polygon
b. histogram
c. stem-and-leaf diagram
d. scatter plot
e. bar graph
ANS: A PTS: 1 OBJ: Section 2.2
7. The largest value in a set of data is 160, and the smallest value is 70. If the
resulting frequency distribution is to have six classes of equal width, what will be
the class interval?
a. 15
b. 6
c. 12
d. 5
ANS: A PTS: 1 OBJ: Section 2.2
15. A stem-and-leaf display describes two-digit integers between 20 and 80. For
one of the classes displayed, the row appears as 5|246. What numerical values
are being described?
a. 25, 45, and 65
b. 60, 50, 40, and 20
c. 52, 54, and 56
d. 46 and 52
ANS: C PTS: 1 OBJ: Section 2.3
18. In cross-tabulation:
a. data are represented in a table showing how many people or items are in
combinations of categories.
b. we can identify possible relationships between the variables.
c. data are represented as a circular display divided into sections based on the
number of observations within the segments.
d. data are represented as a pair of known values of two variables, one being
referred to as independent and the other, dependent.
e. Both A and B are correct.
ANS: E PTS: 1 OBJ: Section 2.6
NUMERIC RESPONSE
ANS: 4
ANS: 12.5%
COMPLETION
ANS: below
ANS: histogram
ANS: cross-tabulation
SHORT ANSWER
ANS:
a. Raw data
b.
Cholesterol Number of Readings
200-209 2
210-219 2
220-229 3
230-239 2
240-249 1
PTS: 1 OBJ: Section 2.2
2.
Price of a Home Number of Homes Sold
Under $100,000 35
$100,000 – under $120,000 45
$120,000 – under $140,000 60
$140,000 – under $160,000 75
$160,000 – under $180,000 55
$180,000 – under $200,000 30
Develop a frequency distribution and proportion distribution for the data. What
does the data suggest about the strength of the political parties in Michigan?
ANS:
ANS:
a. 10
b. 5
c. 6
d. 8
e. 7
Construct frequency and relative frequency distributions for the data using
seven class intervals.
ANS:
60 79 75 84 85 74
81 95 89 58 66 98
99 99 62 86 85 99
79 82 98 72 72 72
75 88 86 81 96 86
78 91 83 85 92 68
ANS:
5|8
6|0268
7|222455899
8|1123455566689
9|125688999
60 79 75 84 85 74
81 95 89 58 66 98
99 99 62 86 85 99
79 82 98 72 72 72
75 88 86 81 96 86
78 91 83 85 92 68
ANS:
Draw a scatter plot and find the least squares regression line.
____________________
ANS:
= -1.115 + 1.846x
ANS:
= 8.63 + 1.0817x
ANS:
a)
Stem-and-leaf of Age N = 25
Leaf Unit = 1.0
3 2 144
6 2 688
12 3 001244
(4) 3 5578
9 4 013
6 4 5557
2 5 34
b)
11. Voters participating in a recent election exit poll in Michigan were asked to
state their political party affiliation. Coding the data 1 for Republican, 2 for
Democrat and 3 for Independent, the data collected were as follows: 3, 2, 3, 2,
1, 1, 3, 2, 1, 3, 1, 2, 3, 1, 3, 3, 3, 3, 2, 1, 1, 3, 2, 3, 1
ANS:
12. A grocery store’s monthly sales (in thousands of dollars) for the last year
was as follows:
Month 1 2 3 4 5 6 7 8 9 10 11 12
Sales 78 74 83 87 85 93 100 105 103 89 78 94
ANS:
47 21 37 53 28 40 30 32 34
34 24 24 35 45 38 35 28 43
30 45 31 41 54 26 45
Use Excel or Minitab to draw a histogram with five classes.
ANS:
ESSAY
He asks for your feedback. What advice would you give him?
ANS:
Your friend needs to revise the distribution. The classes are not mutually
exclusive (they overlap). Also, some classes are open-ended (“40 and below,”
“90 and above”) and not of equal width.
ANS:
The frequency distribution is a table that divides the data values into classes
and shows the number of observed values that fall into each class. The
histogram describes the frequency distribution by using a series of adjacent
rectangles, each of which has a length proportional to the frequency of the
observations within the range of values or class it represents.
ANS:
Choosing a symbol in a pictogram is an important consideration because the
right or wrong symbol can lend nonverbal or emotional content to the display
which can have an impact on how the data is interpreted.
ANS:
The guidelines for constructing a frequency distribution are: a. The set of classes
must be mutually exclusive, with no overlaps. b. Set of classes must be
exhaustive. c. If possible the classes should have equal widths. d. In general,
about 5 to 15 classes will be suitable. e. When possible, class widths should be
round numbers. f. Avoid using open-end classes.
5. A recent Wall Street Journal poll asked a sample of professional, white collar,
and blue-collar workers whether they felt legal immigration had a positive effect
on the United States. The responses were graphed as follows:
Identify the type of chart shown, and describe the information it gives.
ANS:
This is a pie chart. It shows that 39% of professionals, 35% of white-collar
workers, and 26% of blue-collar workers think legal immigration is positive for
the U.S.
6. What is the difference between a histogram and a bar chart? For what type of
data would each be appropriate?
ANS:
A histogram graphically displays class intervals as well as class frequencies. A
bar chart displays the frequencies for a set of categories or classes. Histograms
are appropriate for quantitative data, while bar charts are better for qualitative
data. Adjacent rectangles in the histogram share a common side while those in
the bar chart have a gap between them.
7. What is a scatter diagram, and for what kind of data is it a useful descriptive
device?
ANS:
A scatter diagram is a plot in which each point represents a pair of known or
observed values of x and y. The variables are represented on the horizontal (x)
and vertical (y) axes. The scatter diagram is useful in examining whether the
variables might be related to each other in some way.
ANS:
In a positive linear relationship, y tends to increase linearly with increases in x.
In a negative linear relationship, y tends to decrease linearly as x increases.
Draw a scatter diagram for these data with the income on the vertical axis.
Describe the relationship between income and education.
ANS:
10. The general manager of a chain of furniture stores believes that experience
is the most important factor in determining the level of success of a
salesperson. To examine this belief she records last month’s sales (in $1,000s)
and the years of experience of 10 randomly selected salespeople. These data
are listed below.
ANS:
For each additional year of experience, monthly sales of a salesperson increase
on average by $1,081.70.
11. When variables are used as the basis for a cross-tabulation, what scale of
measurement must they represent?
ANS:
The classificatory variables that represent the rows and columns in cross-
tabulation (contingency tables) will be in the nominal scale of measurement.
The variables described within the table can be nominal (categories) when
counts are being expressed. They can be interval or ratio when measures such
as the mean or median are being expressed.
ANS:
Simple tabulation involves just one variable. Example: we may express a count
of how many students in a class are males and how many are females.
ANS:
In cross-tabulation, we express a count of how many people or items are in
combinations of categories. Example: we may express a count of how many
students in a class are males majoring in accounting and how many are females
majoring in finance.
14. A recent Wall Street Journal survey found that 523 Democrats and 765
Republicans and believe legal immigration has a positive effect on the Untied
States. Display this data in a bar chart.
ANS:
Chapter 3. Statistical
Description of Data
Test Bank
TRUE/FALSE
1. True or False
Characteristics of the population are referred to as parameters, while
characteristics of the sample are referred to as statistics.
2. True or False
When a distribution has more values to the left and tails to the right, we say it is
skewed negatively.
3. True or False
When a distribution has more values to the right and tails to the left, we say it is
skewed positively.
5. True or False
The mode is defined as the value that has just as many values above it as it has
below it.
6. True or False
Measures of dispersion describe typical values in the data.
7. True or False
The midrange is the average of the two middle data values.
8. True or False
Percentiles divide the values into 100 parts of equal size, each comprising 1% of
the observations.
9. True or False
The mean of a set of values describes the 50th percentile.
MULTIPLE CHOICE
8. When a distribution is bell-shaped with the left half identical to the right half,
it is:
a. skewed positively
b. symmetrical
c. equilateral
d. skewed negatively
e. not applicable for Chebyshev’s theorem
ANS: B PTS: 1 OBJ: Section 3.2
9. The value in the data set that occurs with the greatest frequency is known as
the:
a. mode.
b. median.
c. mean.
d. weighted mean.
e. variance.
ANS: A PTS: 1 OBJ: Section 3.2
10. The average score for a class of 25 students was 75. If the 15 female
students in the class averaged 70, then the male students in the class averaged:
a. 85.0
b. 82.5
c. 77.5
d. 75.0
e. 70.0
ANS: B PTS: 1 OBJ: Section 3.2
11. Which of the following statements are true for the following data values: 9,
7, 8, 6, 9, 10, and 14?
a. The mean, median and mode are all equal
b. Only the mean and median are equal
c. Only the mean and mode are equal
d. Only the median and mode are equal
ANS: A PTS: 1 OBJ: Section 3.2
12. Which of the measures of central tendency would best represent the data
below?
a. Range
b. Mean
c. Median
d. Mode
e. Standard deviation
ANS: C PTS: 1 OBJ: Section 3.2
16. The difference between the largest and smallest values in a data set is
called the:
a. standard deviation.
b. variance.
c. coefficient of variation.
d. range.
e. mean.
ANS: D PTS: 1 OBJ: Section 3.3
17. Deciles:
a. are an array of values in 10 parts of equal size.
b. indicate the median with the fifth decile.
c. divide the values into four parts of equal size.
d. indicate the median with the second decile.
e. both A and B are correct.
ANS: E PTS: 1 OBJ: Section 3.3
24. The Empirical Rule states that the percentage of observations in a data set
(providing that the data set has a bell-shaped and symmetric distribution) that
fall within one standard deviation of their mean is approximately:
a. 68%
b. 75%
c. 95%
d. 99%
ANS: A PTS: 1 OBJ: Section 3.4
25. A study was conducted of the 12-month earnings per share for two
categories of airline companies.
Based on the above data, which of the following statements is correct?
a. The large airlines have a coefficient of variation of 2.44 while the small
airlines have a coefficient of variation of 3.64.
b. Because the large airlines have mean earnings per share of 3.40 and the
small airlines have mean earnings per share of 1.11, the large airlines have the
largest coefficient of variation.
c. Because the large airlines have mean earnings per share of 3.54 and the
small airlines have mean earnings per share of 1.71, the large airlines have the
largest coefficient of variation.
d. The large airlines have a coefficient of variation of 0.85 and the small airlines
have a coefficient of variation of 2.72.
e. The large airlines have a coefficient of variation of 0.81 and the small airlines
have a coefficient of variation of 1.17.
ANS: D PTS: 1 OBJ: Section 3.4
26. The length of the box in the box-and-whisker plot portrays the:
a. mean
b. median
c. range
d. interquartile range
ANS: D PTS: 1 OBJ: Section 3.4
27. For a set of data, the mean is 50 and the standard deviation is 10. In the raw
data, one of the observations is equal to 74. After the data are standardized,
what will be the standardized value of this observation?
a. +7.4
b. -7.4
c. +2.4
d. -2.4
ANS: C PTS: 1 OBJ: Section 3.4
28. When the standard deviation is expressed as a percentage of the mean, the
result is the:
a. coefficient of correlation.
b. coefficient of determination.
c. coefficient of variation.
d. coefficient of dispersion.
ANS: C PTS: 1 OBJ: Section 3.4
30. If the coefficient of correlation , then the best-fit linear equation will include:
a. 75% of the data points.
b. 85% of the data points.
c. 95% of the data points.
d. all of the data points.
ANS: D PTS: 1 OBJ: Section 3.6
34. A perfect straight line sloping upward would produce a correlation coefficient
equal to:
a. +1.
b. -1.
c. +2.
d. -2.
ANS: A PTS: 1 OBJ: Section 3.6
35. The difference between the mean of a data set and the individual values of
that set is known as the:
a. variance
b. standard deviation
c. residual
d. mean absolute deviation
e. quartile
ANS: C PTS: 1 OBJ: Section 3.3
NUMERIC RESPONSE
1. Heidi has been keeping track of what she spends to eat out. The last week’s
expenditures (in dollars) for meals eaten out were 3.69, 3.95, 4.19, 8.91, 5.49,
12.53, and 5.66. Calculate the mean amount she spent on meals last week.
ANS: $6.35
ANS: $30.70
3. For each of the past 10 years, the U.S. unemployment rate has been recorded
as follows: 6.7, 5.5, 5.7, 5.2, 4.5, 3.8, 3.8, 3.6, 3.5, and 4.9. Find the mean
unemployment rate for this period.
ANS: 4.72
4. A sample of interest rates on unpaid credit card balances yields the following
data: 13.25, 13.67, 13.60, 13.29, 12.97, 12.90, 12.76, 12.66, 12.58, 12.77, and
12.91. Find the median interest rate.
ANS: 12.91
5. You are interested in beginning a small business enterprise and one aspect of
your research is to look at the loan record for stockholders of small business
corporations. A14-year record of net loans from stockholders of small business
corporations (in billions of dollars) are as follows: 1.6, -0.2, 1.0, 1.4, -0.1, 0.6,
1.5, 0.9, 2.2, 1.3, 1.9, 1.9, 2.8, and 5.0.
Calculate the mean.
____________________ billion
ANS: 1.557
6. You are interested in beginning a small business enterprise and one aspect of
your research is to look at the loan record for stockholders of small business
corporations. A14-year record of net loans from stockholders of small business
corporations (in billions of dollars) are as follows: 1.6, -0.2, 1.0, 1.4, -0.1, 0.6,
1.5, 0.9, 2.2, 1.3, 1.9, 1.9, 2.8, and 5.0.
____________________ billion
ANS: 1.45
7. You are interested in beginning a small business enterprise and one aspect of
your research is to look at the loan record for stockholders of small business
corporations. A14-year record of net loans from stockholders of small business
corporations (in billions of dollars) are as follows: 1.6, -0.2, 1.0, 1.4, -0.1, 0.6,
1.5, 0.9, 2.2, 1.3, 1.9, 1.9, 2.8, and 5.0.
____________________ billion
ANS: 1.90
8. A student scores 89, 72, 93, and 84 on four exams during the semester and
96 on the final exam. If the final is weighted double and the four others
weighted equally, what would the student’s final average be?
ANS: 88.33
ANS: $7.62
10. Sample values of 13 monthly sales figures (in thousands of dollars) are: 6, 5,
6, 6, 7, 8, 9, 10, 10, 10, 9, 9, and 6.
____________________ thousand
ANS: 7.769
11. Sample values of 13 monthly sales figures (in thousands of dollars) are: 6, 5,
6, 6, 7, 8, 9, 10, 10, 10, 9, 9, and 6.
____________________ thousand
ANS: 8.00
12. Sample values of 13 monthly sales figures (in thousands of dollars) are: 6, 5,
6, 6, 7, 8, 9, 10, 10, 10, 9, 9, and 6.
____________________ thousand
ANS: 6.00
13. Consider the following sample data: 4, 14, 6, 9, 21, 3, 7, and 10.
Calculate the mean.
ANS: 9.25
14. Consider the following sample data: 4, 14, 6, 9, 21, 3, 7, and 10.
ANS: 8.00
15. Find the range for the following data: 9, 94, 48, 62, 11, 50, and 6.
ANS: 88
16. Find the midrange for the following data: 14, 99, 53, 67, 16, 55, and 11.
ANS: 55
17. A sample of 13 monthly sales figures (in thousands of dollars) are: 10, 6, 5,
10, 9, 7, 10, 9, 6, 8, 9, 6, and 6.
ANS: 1.633
18. A sample of 13 monthly sales figures (in thousands of dollars) are: 10, 6, 5,
10, 9, 7, 10, 9, 6, 8, 9, 6, and 6.
ANS: 1.83
ANS: 3.359
20. A sample of 13 monthly sales figures (in thousands of dollars) are: 10, 6, 5,
10, 9, 7, 10, 9, 6, 8, 9, 6, and 6.
ANS: 23.59
NARRBEGIN: TV Markets
The top twenty TV markets in the United States are:
NARREND
21. What is the median?
ANS: 1263875
ANS: 1036063
NARRBEGIN: Pizza
In preparation for a pizza party, the students at State University surveyed the
price of medium-sized pizzas for local restaurants:
NARREND
23. What is the range of these prices?
ANS: $2.05
ANS: $7.475
ANS: 18
26. Consider the following sample data: 6, 10, 4, 3, 7, 9, 21, and 14. What is the
mean absolute deviation?
ANS: 4.31
27. Consider the following sample data: 6, 10, 4, 3, 7, 9, 21, and 14. What is the
standard deviation?
ANS: 5.90
28. Consider the following sample data: 6, 10, 4, 3, 7, 9, 21, and 14. What is the
variance?
ANS: 34.79
ANS: 52.43
30. In a test of a lifelong brand of tires, a survey revealed that the mean life was
35,000 miles and the standard deviation was 7,083 miles. If the shape of the
population distribution is not known, approximately what percentage of the tires
last less than 20,834 miles?
ANS: 12.5%
31. The mean of fifty sales receipts is $58.35 and the standard deviation is
$10.38. Using Chebyshev’s theorem, determine how many sales receipts were
between $37.59 and $79.11.
ANS: 75%
32. Your final statistics grade will be based on your final exam score, a
homework score, and a final project, each weighted according to the following
table.
ANS: 90.6
NARREND
34. Calculate the average number of calls per day.
36. A company counted the number of their employees in each of the following
age classes. According to this distribution, what is the average age of the
employees in the company?
37. A survey asked 200 respondents how many children under the age of 18 live
in the household. The following table summarizes the results.
NARRBEGIN: Intervals
Let x be the class mark for each of the intervals below.
NARREND
38. Find the sample mean for these grouped data.
ANS: 6.88
39. Compute the sample standard deviation for the data above.
ANS: 3.1796
NARRBEGIN: Salaries
The salaries (in thousands of dollars) for a sample of 13 employees of a firm are:
26.5, 23.5, 29.7, 24.8, 21.1, 24.3, 20.4, 22.7, 27.2, 23.7, 24.1, 24.8, and 28.2.
NARREND
40. Compute the mean salary.
ANS: 24.692
ANS: 24.30
ANS: 7.097
ANS: 2.664
ANS: 9.3
ANS: 23.5
ANS: 26.5
ANS: 28.2
ANS: 936.75
COMPLETION
ANS: dispersion
ANS: association
5. The measure of dispersion which sums the absolute values of the differences
between the individual values and the mean, and then divides by the number of
individual values is known as the ____________________.
ANS: midrange
7. ____________________ divide the values into 100 parts of equal size, each
comprising 1% of the observations.
ANS: Percentiles
ANS: Deciles
ANS: midpoint
12. If the coefficient of correlation r is positive, the dependent variable y and the
independent variable x are said to be ____________________ related.
ANS: directly
ANS: inversely
SHORT ANSWER
1. Find the mean, median, mode, and range for these data values: 14, 12, 11,
11, 16, 18, 11, and 10.
Mean = ____________________
Median = ____________________
Mode = ____________________
Range = ____________________
ANS:
12.875; 11.5; 11; 8
2. Compute the mean, median, and mode from these data: 1.2, 1.6, 2.1, 1.2,
1.8, 2.2, 1.2, 1.6, 1.8, and 2.0.
Mean = ____________________
Median = ____________________
Mode = ____________________
ANS:
1.67; 1.70; 1.2
3. What are the relative magnitudes of the mean, median, and mode for a
unimodal distribution that is:
A) symmetrical?
Mean ____________________ median ____________________ mode.
B) negatively skewed?
C) positively skewed?
ANS:
=; =;
<, <;
>>
Q1 = ____________________
Q2 = ____________________
Q3 = ____________________
ANS:
4808.25; 5263.5; 5745.0
ANS:
(Q3 – Q1)/2 = 468.375
Year 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989
Spending 27.6 30.8 37.8 40.0 36.9 36.0 38.4 35.7 34.7 35.3
Year 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999
Spending 37.5 33.3 28.0 27.6 28.6 32.2 36.1 35.4 39.4 41.9
ANS:
31.15, 35.55, 37.725
ANS:
3.5; 12.5; 22.0
8. Find the range and midrange from these data: 4, 8, 2, 16, 12, 10, 6, and 22.
Range = ____________________
Midrange = ____________________
ANS:
20; 12
ANS:
At least 88.9%
Stockholder Percentage
A 0.7%
B 0.7%
C 17.8%
D 5.1%
E 4.3%
F 1.8%
G 2.1%
If the mean is 4.63 and the standard deviation is 5.594, what percentage of the
measurements would you expect to fall between -3.748 and 13.034?
ANS:
At least 55.6%
ANS:
At least 84%; at least 88.89%; at least 91.84%; at least 93.75%
ANS:
The asterisks represent outliers in the data.; the line represents the second
quartile (median).
14. The first leadoff hitter, the first second baseman, and the smallest player to
hit a home run over the 94 foot tall, double decked roof at Tiger Stadium is
Detroit’s Lou Whitaker. He stands 5′ 11″ and weighs 160 pounds. Others who
have accomplished this same feat are:
ANS:
Standard deviation = 24.0675 and the mean = 200.43
k = -1.68
Proportion of observations that fall within 1.68 standard deviations of the mean
is at least [1-1/(1.682] = .64569
Therefore a proportion of 1 – .646 or .354 of the data lies on either extreme of
the distribution and the half on the left is .177.
15. Using the grouped data set below, find the sample mean, variance, and
sample standard deviation:
= ____________________.
s2 = ____________________.
s = ____________________.
ANS:
0.55; 0.0499; 0.2233
16. Use the frequency distribution below to determine the mean and standard
deviation of this sample.
= ____________________.
s = ____________________.
ANS:
2.47; 1.23
17. Compute the sample mean and sample standard deviation from the
following set of grouped data:
= ____________________.
s = ____________________.
ANS:
1.625; 1.1252
= ____________________.
= ____________________.
ANS:
31.85; 8.21
NARRBEGIN: Variables
A sample of eight observations of variables x and y is shown below:
NARREND
19. Calculate the coefficient of correlation r and comment on the relationship
between x and y.
r = ____________________.
Comments:
ANS:
-0.991; There is a very strong, almost perfect, negative linear relationship
between x and y.
= ____________________.
ANS:
30.155-2.119x; 19.560
21. Use Minitab or Excel to produce the scatter diagram that includes the best-fit
linear equation.
ANS:
ANS:
Set B
23. A company counted the number of all of their employees in each of the
following age classes. According to this distribution, what is the standard
deviation for the age of the employees in the company?
24. The average order size for a home shopping company follows a bell-shaped
and symmetrical distribution has a mean equal to $75 and a standard deviation
equal to $10. What range of sales order size would we expect to see 95 percent
of the time?
ANS:
95 percent of the orders should fall between $55 and $95.
ANS:
Using Chebyshev’s Theorem, we would expect 75% of the cars to fall within
these limits.
26. The following data represents the years of service for 10 employees at a
particular company:
8 11 6 2 11 6 5 6 10 15
ANS:
Q1 = 5.5, Q2 = 7, Q3 = 11
ANS:
Set B
28. Find the range, median, mean, and variance for the following sample data:
Range = ____________________.
Median = ____________________.
= ____________________.
s2 = ____________________.
ANS:
20.93; 8.97; 11.22; 32.36
ANS:
28; 3.24
ESSAY
ANS:
When the average of the group is larger than most of the values in the group
(i.e., there is skewness in the data), if there are outliers in the data, or if the
group of values is small.
2. The chapter notes that the standard deviation will always be larger than the
mean absolute deviation. Explain why.
ANS:
When the extreme values are squared, they affect a greater influence.
4. In a given data sample, the mean is larger than the median, and both are
larger than the mode. Identify and illustrate the shape of the distribution.
ANS:
This is a positively skewed distribution
5. A data sample has a mean of 107, a median of 122, and a mode of 134. What
type of distribution does it have? Why?
ANS:
Negative skewed distribution; since mean < median < mode
ANS:
There is no modal value.
TRUE/FALSE
1. True or False
Statistics can be used as a descriptive tool for expressing information relevant to
business.
2. True or False
Statistics can be used as an inferential device for using sample findings to draw
conclusions about a population.
3. True or False
An economics student searches the Survey of Current Business for gross
domestic product data as a class report item. This is an example of using
primary data.
4. True or False
A recent graduate of mortuary school samples the death rate statistics for
counties in her state in hopes of finding a good location to start a mortuary. This
is an example of secondary data.
5. True or False
A manufacturer of semiconductors experiments with different designs and
records the data. This is an example of using primary data.
6. True or False
Consumers Union purchases twenty-five brands of lawnmowers and tests them
for various attributes. This is an example of secondary data.
8. True or False
Causal research helps us become familiar with the problem situation, identify
important variables, and use other variables to form hypotheses that can be
tested in subsequent research.
9. True or False
Predictive research attempts to forecast some situation or value that will occur
in the future.
MULTIPLE CHOICE
18. When the sample mean differs from the population mean simply be chance,
the error that results is known as:
a. sampling error.
b. response error.
c. nonresponse error.
d. bias.
e. nonsystematic error.
ANS: A PTS: 1 OBJ: Section 4.6
23. If the population for a study is defined as the student body in a particular
high school, and the sample is selected by randomly choosing classrooms at
11:00 AM and administering the survey to all the students in those classes, we
are relying on:
a. simple random sampling.
b. nonprobability sampling.
c. cluster sampling.
d. stratified sampling.
e. convenience sampling.
ANS: C PTS: 1 OBJ: Section 4.7
27. At the annual freshman mixer at your university, each student places a card
with his/her name in a large basket. A card is drawn for the evening’s door prize.
This is an example of:
a. a nonprobability sample.
b. a probability sample.
c. a simple random sample.
d. both A and C are correct.
e. both B and C are correct.
ANS: E PTS: 1 OBJ: Section 4.7
28. In order to obtain a sample of persons owning a telephone, you select every
24th name in the local phone book. This is an example of:
a. simple random sampling
b. systematic sampling
c. stratified sampling
d. probability sampling
e. both B and D are correct
ANS: E PTS: 1 OBJ: Section 4.7
29. When the population is divided into mutually exclusive sets, and then a
simple random sample is drawn from each set, this is called:
a. simple random sampling.
b. stratified sampling.
c. cluster sampling.
d. systematic sampling.
e. quota sampling.
ANS: B PTS: 1 OBJ: Section 4.7
30. When all the items in a population have some (nonzero) known or calculable
chance of being selected in the sample, this is called:
a. probability sampling.
b. nonprobability sampling.
c. purposive sampling.
d. convenience sampling.
e. judgment sampling.
ANS: A PTS: 1 OBJ: Section 4.7
31. Which of the following sampling plans use random methods of selections?
a. Simple random sampling
b. Stratified sampling
c. Cluster sampling
d. Systematic Sampling
e. All of these
ANS: E PTS: 1 OBJ: Section 4.7
33. When every possible sample of size n has the same chance of being
selected, this is called:
a. simple random sampling.
b. stratified sampling.
c. cluster sampling.
d. convenience sampling.
e. quota sampling.
ANS: A PTS: 1 OBJ: Section 4.7
COMPLETION
ANS: open-ended
ANS: dichotomous
ANS: sampling
ANS: External
10. Fill in the blank: Data that are collected by someone other than the
researcher, for purposes other than the problem or decision at hand, are
referred to as _________________________.
ANS: bias
ANS: stratified
16. The ____________________ sample involves dividing the population into groups,
then randomly selecting some of the groups and taking either a sample or a
census of their members.
ANS: cluster
17. ____________________ data are generated by the researcher for the problem or
situation at hand, while ____________________ data have been collected by
someone else for some other purpose.
SHORT ANSWER
What type of sample are they taking? How will they select the sample?
ANS:
Cluster; They will randomly select some of the areas and take either a sample or
census of each area.
ANS:
Purposive
ANS:
Stratified
ESSAY
ANS:
A type of research in which we communicate with a sample of individuals in
order to generalize on the characteristics of the population from which they
were drawn.
2. Comment on the appropriateness of the question, “Have you ever broken the
law and endangered the lives of others by speeding and tailgating?
ANS:
This question is biased and will most likely lead to many false negative answers.
ANS:
Data warehouse is a large collection of data from inside and outside the
company, put together for the express purpose of using sophisticated analytical
techniques to uncover patterns and discover interrelationships within the data.
ANS:
Data mining is a procedure by which sophisticated analytical techniques are
applied to uncover patterns and discover interrelationships within the data.
5. How can data mining be useful to a business or other organization? What are
some of the analytical tools it employs?
ANS:
Data mining is analogous to physically sifting through great quantities of ore in
the quest for gold. It utilizes information and techniques such as descriptive
statistics, cross-tabulations, regression, and correlation, but also includes many
advanced methods such as factor analysis, cluster analysis, discriminant
analysis, and multidimensional scaling, for the purpose of uncovering patterns
and discovering interrelationships within the data.
ANS:
This report is published every 10 years and presents voluminous data on such
variables as home ownership or rental, type of structure, number of residents,
market value, heating source, major appliance, and much more.
ANS:
The Census of Population is the research for which the Bureau of the Census is
probably best known. It is conducted every 10 years and provides detailed
information on the U.S. population, including age, education, employment,
gender, and a broad range of other demographic characteristics.
ANS:
The Encyclopedia of Associations identifies industry-related trade associations
that collect and disseminate data that are related to their industry. This
publication is also an excellent source for identifying private organizations
having special interests that may be related to the business researcher’s
information needs.
ANS:
In a simple random sample, every person or element in the population has an
equal chance of being included in the sample.
11. A politician wants to estimate the mean age of registered voters in her
district. Unfortunately, she does not have a complete list of households.
Describe a sampling plan that would be suitable for her purposes.
ANS:
The politician may use cluster sampling by letting each county in the district
represents a cluster.
12. The dean of a college of arts and sciences with six departments wants to
estimate the student-hours lost per month due to illness of professors. Describe
a sampling plan that would be suitable for estimating the college-wide loss and
for comparing departments.
ANS:
The dean can select stratified random samples where the strata are the six
departments. Simple random sampling can be conducted in each department.
13. What are the three major sources of error in survey research?
ANS:
Sampling error, response error, and nonresponse error
ANS:
Internal validity refers to whether the independent variable really made the
difference in the measurements obtained. External validity asks whether the
results of the experiment can be generalized to other people or settings.
PTS: 1 OBJ: Section 4.4
TRUE/FALSE
1. True or False
Uncertainty plays an important role in our daily lives and activities as well as in
business.
2. True or False
In the classical approach, probability is the proportion of times an event is
observed to occur in a very larger number of trials.
3. True or False
Sometimes it is useful to revise a probability on the basis of additional
information that we didn’t have before.
4. True or False
An experiment is an activity of measurement that results in an outcome.
5. True or False
In general, an event is one of the possible outcomes of an experiment.
7. If the probability of an event is x, with , then the odds in favor of the event
are x to (1 – x).
8. True or False
The union of events describes two or more events occurring at the same time.
9. True or False
When the events within a set are both mutually exclusive and exhaustive, the
sum of their probabilities is 1.0.
MULTIPLE CHOICE
7. If Events A and B are not mutually exclusive, then the probability that one of
the events will occur is represented by
a. P(A or B) = P(A) + P(B) – P(A and B)
b. P(A or B) = P(A) + P(B)
c. P(A and B) = P(A) + P(B) – P(A or B)
d. P(A and B) = P(A) + P(B)
e. P(A or B) = P(A) + P(B) + P(A and B)
ANS: A PTS: 1 OBJ: Section 5.3
8. Which of the following statements is not correct?
a. Two events A and B are mutually exclusive if event A occurs and event B
cannot occur.
b. If events A and B occur at the same time, then A and B intersect.
c. If event A does not occur, then its complement A’ will also not occur.
d. A union of events occurs when at least one event in a group occurs, e.g., (A or
B or C).
e. If all possible outcomes of an experiment are represented in a set, the set is
considered exhaustive.
ANS: C PTS: 1 OBJ: Section 5.3
10. The table below gives the probabilities of combinations of religion and
political parties in a major U.S. city.
12. The table below gives the probabilities of combinations of religion and
political parties in a major U.S. city.
13. The table below gives the probabilities of combinations of religion and
political parties in a major U.S. city.
14. The table below gives the probabilities of combinations of religion and
political parties in a major U.S. city.
What is the probability that a randomly selected person would be a Democrat
who was not Jewish?
a. 0.93
b. 0.50
c. 0.47
d. 0.07
e. None of these
ANS: C PTS: 1 OBJ: Section 5.3
15. The table below gives the probabilities of combinations of religion and
political parties in a major U.S. city.
17. If P(A) = 0.84, P(B) = 0.76 and P(A or B) = 0.90, then P(A and B) is:
a. 0.06.
b. 0.14.
c. 0.70.
d. 0.83.
ANS: C PTS: 1 OBJ: Section 5.4
18. If P(A) = 0.25 and P(B) = 0.60, then P(A and B) is:
a. 0.15.
b. 0.35.
c. 0.85.
d. Cannot be determined from the information given.
ANS: D PTS: 1 OBJ: Section 5.4
26. Assume that A and B are independent events with P(A) = 0.40 and P(B) =
0.30. The probability that both events will occur simultaneously is:
a. 0.10.
b. 0.12.
c. 0.70.
d. 0.75.
e. 1.0.
ANS: B PTS: 1 OBJ: Section 5.5
28. If A and B are independent events with P(A) = 0.25 and P(B) = 0.60, then
P(A/B) is:
a. 0.25.
b. 0.60.
c. 0.35.
d. 0.85.
e. 0.15
ANS: A PTS: 1 OBJ: Section 5.5
29. If the probability of drawing an Ace from a deck of cards is 0.077 and the
probability of rolling a “2” using a fair die is 0.167, then the probability of
drawing an Ace and rolling a “2” is
a. 0.244
b. 0.090
c. 0.454
d. 0.013
e. 0.333
ANS: D PTS: 1 OBJ: Section 5.5
31. A survey revealed that 21.5% of the households had no checking account,
66.9% had regular checking accounts, and 11.6% had NOW accounts. Of those
households with no checking account 40% had savings accounts. Of the
households with regular checking accounts 71.6% had a savings account. Of the
households with NOW accounts 79.3% had savings accounts.
The probability that a randomly selected household has a savings account is:
a. 0.609.
b. 1.000.
c. 0.227.
d. 0.657.
e. 0.537.
ANS: D PTS: 1 OBJ: Section 5.8
32. A survey revealed that 21.5% of the households had no checking account,
66.9% had regular checking accounts, and 11.6% had NOW accounts. Of those
households with no checking account 40% had savings accounts. Of the
households with regular checking accounts 71.6% had a savings account. Of the
households with NOW accounts 79.3% had savings accounts.
The probability that a randomly selected household has no checking account
and no savings account is:
a. 0.0860.
b. 0.1290.
c. 0.5330.
d. 0.1309.
e. 0.3430.
ANS: B PTS: 1 OBJ: Section 5.8
33. A survey revealed that 21.5% of the households had no checking account,
66.9% had regular checking accounts, and 11.6% had NOW accounts. Of those
households with no checking account 40% had savings accounts. Of the
households with regular checking accounts 71.6% had a savings account. Of the
households with NOW accounts 79.3% had savings accounts.
The probability that a randomly selected household with a savings account has
no checking account is:
a. 0.1309.
b. 0.1290.
c. 0.1437.
d. 0.2150.
e. 0.4000.
ANS: A PTS: 1 OBJ: Section 5.8
34. Which of the following statements is correct given that the events A and B
have nonzero probabilities?
a. A and B cannot be both independent and mutually exclusive
b. A and B can be both independent and mutually exclusive
c. A and B are always independent
d. A and B are always mutually exclusive
ANS: A PTS: 1 OBJ: Section 5.8
NUMERIC RESPONSE
1. The odds of the Dallas Cowboys winning this year’s Super Bowl are 3 to 1.
Compute the probability of the Cowboys winning.
ANS: 0.75
NARRBEGIN: College
The table below indicates the number of majors found in a college of business:
NARREND
2. What is the probability that a randomly selected student is an accounting
major?
ANS: 0.129
ANS: 0.259
ANS: 0
ANS: 0.72
ANS: 0.37
7. Let X be the number of service calls during the warranty period for a
dishwasher. What is the probability that there will be exactly one service call?
ANS: 0.38
8. Let X be the number of service calls during the warranty period for a
dishwasher. What is the probability that there will be at least one service call?
ANS: 0.75
9. Let X be the number of service calls during the warranty period for a
dishwasher. What is the probability that there will be at most one service call?
ANS: 0.63
ANS: 0.128
ANS: 0.277
ANS: 0.425
NARRBEGIN: Odds
The odds in favor of an event are the number of successes divided by the
number of failures. The probability of this event occurring is the number of
successes divided by the sum of the number of successes and the number of
failures. The number of successes is five and the number of failures is four.
NARREND
13. Find the probability of failure.
ANS: 0.4444
ANS: 0.4917
PTS: 1 OBJ: Section 5.2
15. In the board game called TRIVIAL PURSUIT, a single die is used to determine
the number of spaces a player is allowed to move. A player is currently
positioned 3 spaces from a space where a correct answer will earn a “pie” and 4
spaces from a space where “roll again” is the option. Assuming a fair die, what is
the probability the player will get to answer a question for a “pie” or get to “roll
again”?
ANS: 0.333
NARRBEGIN: Event
NARREND
16. Find P (C1 and D1).
ANS: 0.075
ANS: 0.60
ANS: 0.525
ANS: 1.00
NARRBEGIN: Candidate
Six candidates for a new position of vice-president for academic affairs have
been selected. Three of the candidates are female. The candidates’ years of
experience are as follows:
ANS: 0.167
ANS: 0.500
ANS: 0.333
ANS: 0.667
NARRBEGIN: Manufacturers
A large men’s store in a mall purchases suits from four different manufacturers
in short, regular, and long. Their current selections are given in the contingency
table below:
Let the abbreviations and letters represent the events. For example Event S = a
randomly selected suit is short, event HMS = a randomly selected suit is
manufactured by HMS.
NARREND
24. Compute P(S).
ANS: 0.20
ANS: 0.20
ANS: 0.03
ANS: 0.15
ANS: 0.15
ANS: 0.492
ANS: 0.467
ANS: 0.111
32. What is the probability that a randomly selected respondent was a Democrat
or not in favor of the proposed law?
ANS: 0.617
33. Given that the respondent was in favor of the proposed law, what is the
probability that they were a Republican?
ANS: 0.554
Number of Bedrooms
Location 3 Bedroom 4 Bedroom Total
Beach Front 17 24 41
Non Beach Front 6 210 216
Total 23 234 257
NARREND
34. What is the probability of selecting a beachfront site?
ANS: 0.1595
35. What is the probability of selecting a non beach front site with 3 bedrooms?
ANS: 0.023
36. What is the probability of selecting a non beach front site given that the
facility had 4 bedrooms?
ANS: 0.897
NARRBEGIN: Gender
The physical science degrees conferred by a school between 1992 and 1995
were broken down as follows:
ANS: 0.500
ANS: 0.125
ANS: 0.47
41. A study of Brand D washing machines that failed within three years
(warranty period) was 23.2%. Of those that failed, 82.9% lasted over two years.
What is the probability that a washer, which lasts at least two years, will still fail
before the end of the warranty period?
ANS: 0.171
42. A firm produces three qualities of bed linens — good, better, best. Production
is scheduled to result in 50% good, 35% better, and 15% best. A quality control
person checks the finished product and classified them as acceptable or
imperfect. The percentages acceptable are good, 92%; better, 88%; best, 82%.
Find the posterior probability that a sheet, which is “acceptable”, will be
acceptable.
ANS: 0.1380
ANS: 0.045
ANS: 0.035
ANS: 0.08
ANS: 0.5625
ANS: 0.4375
NARRBEGIN: Event 1
The prior probabilities of events A1, A2, and A3 are P(A1) = .50, P(A2) = .20 and
P(A3) = .30. The conditional probabilities of event B given A1, A2, and A3 are
P(B / A1) = .40, P(B / A2) = .50 and P(B / A3) = .30.
NARREND
48. Compute P(B and A1).
ANS: 0.20
ANS: 0.10
ANS: 0.09
ANS: 0.39
52. Apply Bayes’ theorem to compute the posterior probability P(A1 / B).
ANS: 0.5128
53. Apply Bayes’ theorem to compute the posterior probability P(A2 / B).
ANS: 0.2564
54. Apply Bayes’ theorem to compute the posterior probability P(A3 / B).
ANS: 0.2308
56. How many different combinations are possible if 8 substitute workers are
available to fill 4 openings created by employees planning to take vacation
leave next week?
ANS: 70
57. Ten students from an economics class have formed a study group. Each may
or may not attend a study session. Assuming that the members will be making
independent decisions on whether or not to attend, how many different
possibilities exist for the composition of the study session?
ANS: 1024
58. A state’s license plate has 9 positions, each of which has 37 possibilities (26
letters, 10 integers, or blank). If the purchaser of a vanity plate wants his first
seven positions to be Stanley, in how many ways can his plate appear?
ANS: 1369
ANS: 0.35
ANS: 0.75
61. Determine the probability that a randomly selected bond will have a rate of
interest of at least 14 percent, but less than 15 percent.
ANS: 0.15
62. Determine the probability that a randomly selected bond will have a rate of
interest of less than 13 percent.
ANS: 0.75
63. Determine the probability that a randomly selected bond will have a rate of
interest of 11 percent or more.
ANS: 0.75
ANS: 0.25
ANS: 0.00
ANS: 0.414
67. What is the probability that an item selected at random is a Mother burger?
ANS: 0.218
68. What is the probability of selecting a menu item from a high profit store and
it is nachos?
ANS: 0.174
69. What is the probability of tacos being selected at a low profit store?
ANS: 0.197
NARREND
70. For a randomly selected consumer, find the following probability: P(D).
ANS: 0.6
71. For a randomly selected consumer, find the following probability: P(B and D).
ANS: 0.2
72. For a randomly selected consumer, find the following probability: P(A and D).
ANS: 0.3
73. For a randomly selected consumer, find the following probability: P(A or D).
ANS: 0.7
74. For a randomly selected consumer, find the following probability: P(A / D).
ANS: 0.5
75. For a randomly selected consumer, find the following probability: P(E / B).
ANS: 0.33
NARRBEGIN: Dormitory
A dormitory has two activating devices in each fire detector. One is smoke
activated and has a probability of .98 of sounding an alarm when it should. The
second is a heat sensitive activator and has a probability of .95 of operating
when it should. Each activator operates independently of the other. Presume a
fire starts near a detector.
NARREND
76. What is the probability that the detector will sound an alarm?
ANS: 0.999
77. What is the probability that both types of activators will work properly?
ANS: 0.931
78. What is the probability that only the smoke activator functions?
ANS: 0.049
COMPLETION
ANS: Classical
ANS: a priori
ANS:
zero; one; chance
0, 1, chance
PTS: 1 OBJ: Section 5.2
5. You have applied for a job and are told by the interviewer that the probability
of your being hired is 0.70. Calculate the odds that you will be hired.
ANS:
7: 3
seven; three
6. The probability of a tornado hitting in your city has been estimated at 0.001.
What are the corresponding odds?
ANS: 1; 999
7. In a state lottery, which pays at least one million dollars, six numbers from 1
to 50 are randomly selected. If five have been selected and your ticket matches
the first five, what are the odds that the sixth number will make you a
millionaire?
ANS: 1; 44
8. The grade distribution for a particular statistics exam was recorded as follows:
ANS: exhaustive
11. The ____________________ of events describes the case where two or more
events occur at the same time.
ANS: intersection
12. When we wish to determine the probability that one or more of several
events will occur in an experiment, we would use ____________________ rules.
ANS: addition
13. When events are ____________________ the occurrence of one means that
none of the others can occur.
ANS: independent
15. Events are ____________________ when the occurrence of one event changes
the probability that another will occur.
ANS: dependent
NARRBEGIN: Gender
The physical science degrees conferred by a school between 1992 and 1995
were broken down as follows:
ANS: P(P)
ANS: Permutations
PTS: 1 OBJ: Section 5.7
ANS: Combinations
NARRBEGIN: Odds
The odds in favor of an event are the number of successes divided by the
number of failures. The probability of this event occurring is the number of
successes divided by the sum of the number of successes and the number of
failures. The number of successes is five and the number of failures is four.
NARREND
19. Find the odds in favor of success.
ANS:
5; 4
five, four
SHORT ANSWER
1. If the odds in favor of an event happening are A:B, what is the probability
being expressed?
ANS:
2. You think you have a 95% chance of passing your next business statistics
exam. This is an example of which approach to probability?
ANS:
Subjective
ANS:
ANS:
Yes; 0.2308
5. According to an old song lyric, “love and marriage go together like a horse
and carriage.” Let love be event A and marriage be event B. Are events A and B
mutually exclusive?
ANS:
No
6. One card is randomly selected from a deck of 52 playing cards. Define the
following events:
ANS:
1 – [(4/52) + (4/52) + (4/52)] = 0.7692
NARRBEGIN: Candidate
Six candidates for a new position of vice-president for academic affairs have
been selected. Three of the candidates are female. The candidates’ years of
experience are as follows:
ANS:
No; P(A/B) is not equal to P(A).
Is the event of selecting a beach front site independent of the event of selecting
a 3 bedroom facility? Explain.
ANS:
No; P(beach front) = .1595; P(3 bedroom) = .0895 P(beach front/3 bedroom) = .
7391 Since 0.1595 is not equal to 0.7391 or 0.0895 is not equal to 0.7391, these
events are not independent events.
NARRBEGIN: Gender
The physical science degrees conferred by a school between 1992 and 1995
were broken down as follows:
ANS:
No
ANS:
P(C/F) = 0.47 and P(C) = 0.5, so P(C/F) is not equal to P(C)
NARRBEGIN: Probabilities
The prior probabilities for two events A1 and A2 are P(A1) = .30 and P(A2) = .70.
It is also known that P(A1 and A2) = 0. Suppose P(B / A1) = .15 and P(B / A2) = .
05.
NARREND
11. Are A1 and A2 mutually exclusive? Explain.
ANS:
Yes; Yes, since P(A1 and A2) = 0.
NARRBEGIN: Machines
Machine A produces 3% defectives, machine B produces 5% defectives, and
machine C produces 10% defectives. Of the total output from these machines,
60% of the items are from machine A, 30% from B, and 10% from C. One item is
selected at random from a day’s production.
NARREND
12. What is the prior probability that the item came from machine A?
Machine B?
Machine C?
ANS:
P(A) = 0.60; P(B) = 0.30; P(C) = 0.10
14. If inspection finds the item to be defective, what is the revised probability
that the item came from machine C?
ANS:
15. If inspection finds the item to be defective, what is the revised probability
that the item came from machine B?
ANS:
16. If inspection finds the item to be defective, what is the revised probability
that the item came from machine A?
ANS:
17. I would like to select 3 paperback books from a list of 12 books to take on
vacation. How many different sets of 3 books can I choose?
ANS:
12 C 3 = 220
18. At a track meet, eight runners are competing in the 100-yard dash. First,
second, and third trophies will be awarded. In how many ways can the trophies
be awarded?
ANS:
(8!/5!) = 336
PTS: 1 OBJ: Section 5.7
ANS:
A) P(M5)
B) P(R3)
D) P(M2 / R2)
E) P(R1 / M4)
ANS:
Probability of tacos being ordered.; Probability of a store being a Low Profit
Store.; Probability that a Father Burger is ordered at a Medium Profit Store.;
Probability of a Mother Burger being ordered knowing it was at a Medium Profit
Store.; Probability that the store is High Profit given Nachos are ordered.
ANS:
Father Burger; 27.1%
23. What item was selected the least at any type store?
ANS:
Tacos; 15.2%
24. What item has the highest probability of being selected for any type store?
ANS:
Father Burger; 27.6%
A) P(M5)
B) P(R3)
D) P(M2 / R2)
E) P(R1 / M4)
ANS:
0.152; 0.254; 0.090; 0.234; 0.410
Portfolio Composition
Gender Bonds Stocks Balanced
Male 0.18 0.20 0.25
Female 0.12 0.05 0.20
One client is selected at random, and two events A and B are defined as follows:
A) A or B
B) A and B
C) A and
D) or
ANS:
The client selected either is male or has a balanced portfolio or both.; The client
selected is male and has a balanced portfolio.; The client selected is male and
has an unbalanced portfolio.; The client selected either is female or has an
unbalanced portfolio or both.
A) P(A)
B) P(B)
ANS:
0.63; 0.45
A) P(A or B)
B) P(A and B)
ANS:
0.83; 0.25
A) P(A/B)
B) P(B/A)
ANS:
The probability that the employee selected is male, given that the employee has
a balanced portfolio.; The probability that the employee selected has a balanced
portfolio, given that the employee is male.
A) P(A/B)
B) P(B/A)
ANS:
0.555; 0.397
PTS: 1 OBJ: Section 5.8
ANS:
No; P(A / B) = 0.555 is not equal to P(A) = 0.63
ANS:
No. Since it’s possible to be both a male and have a balanced fund, P(A and B) =
0.25 which is not equal to zero.
33. There are 12 players on a basketball team and 5 of those players start the
game. How many different starting lineups are possible?
ANS:
12 C 5 = 792
ESSAY
ANS:
The degree to which one happens to believe that an event will or will not
happen.
ANS:
A relative frequency distribution indicates the decimal fraction that each class
contains of the total number of pieces of data. Let one of the classes be defined
as the random variable X. Then the probability of X will be the relative frequency
of the class representing X.
ANS:
Venn diagram illustrates the intersection of two events, when two or more
events occur at the same time.
ANS:
Cross-classification array taking into account all the different possibilities in the
array.
ANS:
Marginal probability is the probability that a given event will occur, with no other
events taken into consideration. In a contingency table, marginal probabilities
are computed by dividing the row and column totals by the sum of the
frequencies.
ANS:
The probability that an event will occur, given that another event has already
occurred.
ANS:
The probability of two events is unaltered whether or not one of them has
happened.
ANS:
P(A / B) = P(A) or P(B /A) = P(B).
TRUE/FALSE
1. True or False
The number of life insurance policy holders is an example of a discrete random
variable.
2. True or False
The driving time between Big Rapids and Grand Rapids is an example of a
discrete random variable.
4. True or False
One characteristic of a discrete probability distribution is that values of x are
exhaustive.
5. True or False
One characteristic of a discrete probability distribution is that the sum of the
probabilities for all the possible values for x must be between 0 and 1.
6. True or False
A characteristic of a Bernoulli process is that the trials are statistically
dependent; that is, the outcome in any trial is affected by the outcomes of
earlier trials, and it does affect the outcomes of later trials.
7. True or False
A characteristic of a Bernoulli process is that the probability of success remains
the same from one trial to the next.
8. True or False
The hypergeometric distribution is similar to the binomial distribution in that
successive trials are considered to be independent of one another.
9. True or False
The binomial distribution requires that the probability of a success for each trial
remains constant.
ANS: T PTS: 1 OBJ: Section 6.2
MULTIPLE CHOICE
4. A table, formula, or graph that shows all possible values a random variable
can assume, together with their associated probabilities is referred to as:
a. Discrete probability distribution
b. Continuous probability distribution
c. Mixed probability distribution
d. Contaminated probability distribution
ANS: A PTS: 1 OBJ: Section 6.1
5. A new car salesperson knows that he sells cars to one customer out of 20 who
enter the showroom. The probability that he will sell a car to exactly two of the
next three customers is:
a. 0.9939
b. 0.0075
c. 0.0071
d. 0.0851
e. 0.1354
ANS: C PTS: 1 OBJ: Section 6.2
8. A barbershop requires appointments for perms and hair cuts. Ten percent of
those making appointments cancel or simply fail to show up. Next week’s
appointment calendar has 64 appointments. Let x be the number of missed
appointments out of the 64. The probability that more than 3 cancellations
and/or no-shows will occur during the next week is:
a. 0.3784
b. 0.8937
c. 0.7795
d. 0.9611
e. 0.1063
ANS: B PTS: 1 OBJ: Section 6.2
11. Brendalee and Mohammed go to a coffee shop during their lunch break and
toss a coin to see who will pay. The probability that Brendalee will pay three
days in a row is:
a. 0.2500
b. 0.5000
c. 0.1250
d. 0.0150
e. none of these.
ANS: C PTS: 1 OBJ: Section 6.2
12. Which of the following is not a characteristic of the hypergeometric
distribution?
a. The hypergeometric distribution is similar to the binomial distribution in
focusing on the number of successes in a given number of consecutive trials.
b. The hypergeometric distribution is similar to the binomial distribution in the
fact that the probability of success remains constant from one trail to the rest.
c. The hypergeometric distribution relies on sampling without replacement.
d. The hypergeometric distribution, the consecutive trials are not independent.
e. All of the above are characteristics of the hypergeometric distribution.
ANS: B PTS: 1 OBJ: Section 6.3
17. The likelihood that someone who logs onto a particular site in a “shopping
mall” on the Internet will purchase an item is 0.05 and follows the binomial
distribution. If the site has 35 people accessing it in the next hour, what is the
expected number of successes during this hour?
a. 1.75
b. 2.5
c. 0.05
d. 12
e. 35
ANS: A PTS: 1 OBJ: Section 6.2
19. Tornadoes for January in Kansas average 3.2 per month. The probability that
in the next January Kansas will experience exactly 2 tornadoes is:
a. 0.7913.
b. 0.4076.
c. 0.2087.
d. 0.1304.
e. 0.2226.
ANS: C PTS: 1 OBJ: Section 6.4
20. Tornadoes for January in Kansas average 3.2 per month. The probability that
in the next January there will be between 1 and 3 tornadoes (inclusive) is:
a. 0.2087.
b. 0.5617.
c. 0.4383.
d. 0.4076.
e. 0.5924.
ANS: B PTS: 1 OBJ: Section 6.4
21. Tornadoes for January in Kansas average 3.2 per month. The probability that
in the next January at least 2 tornadoes will occur is:
a. 0.9592.
b. 0.0408.
c. 0.8288.
d. 0.1712.
e. none of these.
ANS: C PTS: 1 OBJ: Section 6.4
NARRBEGIN: Police
A local police station receives an average 8.1 emergency telephone calls per
hour. These calls are Poisson distributed.
NARREND
22. The probability that the station will get nine calls in an hour is:
a. 0.1256.
b. 0.8744.
c. 0.0408.
d. 0.6918.
e. 0.3082.
ANS: A PTS: 1 OBJ: Section 6.4
23. The probability that the station will get five to seven calls in an hour is:
a. 0.5924.
b. 0.5617.
c. 0.4383.
d. 0.4076.
e. 0.3451.
ANS: E PTS: 1 OBJ: Section 6.4
24. The probability that the station will get at least 4 calls in an hour is:
a. 0.0397.
b. 0.0941.
c. 0.9059.
d. 0.9603.
e. 0.9630.
ANS: D PTS: 1 OBJ: Section 6.4
25. A cashier at Carter’s cafeteria can total an average of 2.2 trays per minute.
The probability that the cashier will total exactly four trays in a minute is:
a. 0.9275.
b. 0.8918.
c. 0.1082.
d. 0.0992.
e. 0.1169.
ANS: C PTS: 1 OBJ: Section 6.4
26. A cashier at Carter’s cafeteria can total an average of 2.2 trays per minute.
The probability that the cashier will total between four and ten trays in a minute
is:
a. 0.9281.
b. 0.8193.
c. 0.1807.
d. 0.3772.
e. 0.0724.
ANS: E PTS: 1 OBJ: Section 6.4
27. A cashier at Carter’s cafeteria can total an average of 2.2 trays per minute.
The probability that the cashier will total no more than three trays in a minute is:
a. 0.1966.
b. 0.6227.
c. 0.9275.
d. 0.8193.
ANS: D PTS: 1 OBJ: Section 6.4
28. As a rule of thumb, a binomial distribution for which the number of trials n is
large can well be approximated by a Poisson distribution when the probability of
success, , is:
a. larger than 0.95.
b. larger than 0.50.
c. between 0.50 and 0.95.
d. smaller than .05.
ANS: D PTS: 1 OBJ: Section 6.4
NARRBEGIN: Chocolates
A box of chocolates contains 18 pieces, 10 of which have nuts while the
remaining 8 do not. Four pieces are chosen at random.
NARREND
31. What is the probability that exactly 2 pieces have nuts?
a. 0.412
b. 0.564
c. 0.681
d. 0.750
e. 0.833
ANS: A PTS: 1 OBJ: Section 6.3
32. What is the probability that no more than one piece has nuts?
a. 0.116
b. 0.206
c. 0.289
d. 0.444
e. 0.517
ANS: B PTS: 1 OBJ: Section 6.3
NUMERIC RESPONSE
ANS: 3.5851
NARRBEGIN: Egyptian
Let x represent the number of children in an Egyptian household. The probability
distribution of x is as follows:
NARREND
2. What is the probability that a randomly selected Egyptian household will have
more than 2 children?
ANS: 0.42
3. What is the probability that a randomly selected Egyptian household will have
between 2 and 4 children?
ANS: 0.65
4. What is the probability that a randomly selected Egyptian household will have
fewer than 4 children?
ANS: 0.75
ANS: 2.52
ANS: 1.2844
7. A high school play has five roles to fill with students. Seven girls and nine
boys have expressed an interest in the play. What is the probability that exactly
3 girls will be selected?
ANS: 0.288
PTS: 1 OBJ: Section 6.3
8. A high school play has five roles to fill with students. Seven girls and nine
boys have expressed an interest in the play. What is the probability that exactly
0 girls will be selected?
ANS: 0.029
ANS: 0.176
ANS: 0.169
ANS: 0.9216
12. An urn contains 5 red marbles and 5 green ones. Two marbles are selected
without replacement. What is the probability that both marbles are red?
ANS: 0.222
PTS: 1 OBJ: Section 6.3
13. An urn contains 5 red marbles and 5 green ones. Two marbles are selected
without replacement. What is the probability that at least one marble is red?
ANS: 0.778
14. A business office has purchased five new electric pencil sharpeners. The
manufacturer of these pencil sharpeners states that twenty in one hundred of
their sharpeners will require some maintenance or repair work during the first
year’s use. Find the probability that at least 3 of these sharpeners will need
repair during the first year.
ANS: 0.0579
ANS: 0.0486
ANS: 0.008
ANS: 0.1178
ANS: 0.7636
ANS: 0.1209
ANS: 0.007125
ANS: 0.5679
ANS: 0.0110
ANS: 0.4321
ANS: 0.8971
26. Find the probability that no one says they favor the plan.
ANS: 0.0256
ANS: 0.0019
28. What is the probability that at least one T.V. home in the sample is served by
cable?
ANS: 0.3895
ANS: 0.0313
30. What is the probability that only one of the five canoes will capsize?
ANS: 0.1562
31. What is the probability that two of the five canoes will capsize?
ANS: 0.3125
32. What is the probability that three of the five canoes will capsize?
ANS: 0.3125
33. What is the probability that four of the five canoes will capsize?
ANS: 0.1562
34. What is the probability that all five canoes will capsize?
ANS: 0.0313
35. What is the probability that less than two of the five canoes will capsize?
ANS: 0.1875
ANS: 0.046
37. What is the probability that none of these suitcases has a prize?
ANS: 0.596
NARRBEGIN: Fans
Fans arriving on a given Saturday night for a non-conference home basketball
game average 0.25 per second. Let x represent the number of fans arriving at a
specified time period.
NARREND
38. What is the probability that from 10 to 12 fans arrive between 7:15 and 7:16
P.M.?
ANS: 0.1978
39. What is the probability that from 13 to 15 fans arrive between 7:15 and 7:16
P.M.?
ANS: 0.3004
40. What is the probability that thirty or more fans arrive between 7:15 and 7:16
P.M.?
ANS: 0.0004
ANS: 0.6131
42. An EMS service averages 5.2 calls per 8-hour shift. During a shift in which 8
or more calls occur, the EMS crew receives one extra hour pay. If the hourly
wage is $15 for each 8-hour day, and taking into account the extra-effort days,
what is the average wage for an 8-hour day?
ANS: $122.33
ANS: 0.1607
ANS: 0.4628
ANS: 0.0035
NARRBEGIN: Fishing
During the peak fishing months, a marina at a large man-made reservoir noted
that boats were arriving at the average of 3.2 boats every twenty minutes to use
the boat ramp. Let x be the number of boats arriving during the next twenty
minutes.
NARREND
46. What is the probability that during the next twenty minutes the number of
boats arriving to load or unload is exactly three?
ANS: 0.2226
47. What is the probability that during the next twenty minutes the number of
boats arriving to load or unload is at least one?
ANS: 0.9592
48. What is the probability that during the next twenty minutes the number of
boats arriving to load or unload is from two to four?
ANS: 0.6094
NARRBEGIN: Students
Students use the university mainframe computer log-on during their designated
time periods. Assume that the number of students logging on each five-minute
interval is 12.5, and these log-on attempts are Poisson distributed.
NARREND
49. Find the probability that exactly 9 students will log-in during the next 5-
minute interval.
ANS: 0.077
50. Find the probability that fewer than 9 students will log-in during the next 5-
minute interval.
ANS: 0.125
51. Find the probability that more than 6 students will log-in during the next 5-
minute interval.
ANS: 0.965
52. Find the probability that between 8 and 10 (inclusive) students will log-in
during the next 5-minute interval.
ANS: 0.227
ANS: 0.0498
54. What is the probability that more than five sheets are sub-standard?
ANS: 0.0840
55. What is the probability that fewer than three sheets are sub-standard?
ANS: 0.4232
ANS: 0.0821
57. Use the Poisson distribution to compute the binomial probability that one or
more checks are returned.
ANS: 0.9179
58. Use the Poisson distribution to compute the binomial probability that less
than three checks are returned.
ANS: 0.5438
COMPLETION
1. In general, we can define a ____________________ as the relative frequency
distribution that should theoretically occur for observations from a given
population.
2. A random variable that can take on only certain values along an interval, with
the possible values having gaps between them, is referred to as a
____________________.
3. A random variable that can take on a value at any point along an interval is
referred to as a ____________________.
4. Since any value of x in a discrete probability distribution can occur only once
for a given experiment, the values of x are said to be ____________________.
ANS:
consecutive; two
consecutive, 2
ANS: hypergeometric
ANS: binomial
SHORT ANSWER
NARRBEGIN: Distribution
The probability distribution of a discrete random variable x is shown below.
NARREND
1. Find the expected value, the variance, and the standard deviation of x.
E(x) = ____________________.
Variance = ____________________.
SD = ____________________.
ANS:
1.25; 0.9875; 0.9937
ANS:
0.35, 0.85, 0.60, 0.00, 0.75
Mean = ____________________
SD = ____________________
ANS:
1; 0.9899
Mean = ____________________
SD = ____________________
ANS:
2.5; 1.5
ANS:
Binomial. This is a Bernouilli process with two possible outcomes in consecutive
and mutually exclusive events.
n = __________________________________________
x = __________________________________________
= __________________________________________
ANS:
number of trials; number of successes in n trials; the probability of success in
any given trial
ANS:
The parameters are: n; the number of trials, and the probability of success in
any given trial.
Mean = ____________________
SD = ____________________
ANS:
0.54; 0.7010
ANS:
0.84; 0.8146
ANS:
x P(x)
0 0.0313
1 0.1562
2 0.3125
3 0.3125
4 0.1562
5 0.0313
PTS: 1 OBJ: Section 6.2
ANS:
12. What is the standard deviation of the number of canoes that will capsize?
ANS:
= 0.4, find the following probabilities using the binomial table.13. Given a
binomial random variable with n = 15 and
12)a. P(x
10)b. P(x
c. P(x = 8)
xd. P(9 < 12)
e. P(6 < x < 10)
ANS:
0.9997, 0.0338, 0.1181, 0.0931, 0.3564
14. A recent survey in Michigan revealed that 60% of the vehicles traveling on
highways, where speed limits are posted at 70 miles per hour, were exceeding
the limit. Suppose you randomly record the speeds of ten vehicles traveling on
US 131 where the speed limit is 70 miles per hour. Let x denote the number
vehicles that were exceeding the limit. Find the following probabilities:
a. P(x = 10)
b. P(4 < x < 9)
xc. P(3
ANS:
0.006, 0.787, 0.988
15. Compute the following probabilities (to 4 decimal places) using the Poisson
formula:
16. Let x be a Poisson random variable with = 6. Use the table of Poisson
probabilities to find:
A) P(x 8)
B) P(x = 8)
C) P(x 5)
D) P(6 x 10)
ANS:
0.8472; 0.1033; 0.7149; 0.5117
17. Let x be a Poisson random variable with = 8. Use the table of Poisson
probabilities to find:
A) P(x 6)
B) P(x = 4)
C) P(x 3)
D) P(9 x 14)
ANS:
0.3134; 0.0573; 0.9862; 0.3902
18. Phone calls arrive at the rate of 30 per hour at the reservation desk for a
hotel.
A) Find the probability that the executive will receive 12 calls between 2 and 4
pm on a particular afternoon.
B) Find the probability that the executive will receive eight calls between 2 and 3
pm on a particular afternoon.
C) Find the probability that the executive will receive at least six calls between 2
and 4 pm on a particular afternoon.
ANS:
0.0948; 0.0653; 0.9329
20. Stabbings of inmates by other inmates at a state prison unit have averaged
2.6 per month over the past year. Let x be the number of inmate stabbings per
month. Assume a Poisson distribution. Find the mean and standard deviation of
x.
Mean = ____________________
SD = ____________________
ANS:
2.6; 1.61
ESSAY
1. Identify and complete this equation: ______________________________ = E(x)
ANS:
= E(x); Mean of a discrete probability distribution
ANS:
(1) There are two or more consecutive trials; (2) in each trial there are just two
possible outcomes, usually denoted as success or failure; (3) The trials are
statistically independent; (4) The probability of a success remains the same from
one trial to the next.
ANS:
The Poisson distribution is a discrete probability distribution for an event
occurring over time with a low probability of success.
4. Assume that you used Minitab statistical package to generate the individual
binomial probabilities when n = 100 and = .042, and then generate the
individual Poisson probabilities for = 4.2. Do you expect the probabilities
obtained in both cases to be similar or different?
Explain.
ANS:
It is expected that the respective probabilities obtained in both cases to be
slightly different, but close to each other. In general, for a given value of , the
Poisson approximation to the binomial distribution becomes closer for longer
values of n.
Chapter 7. Continuous
Probability Distributions
Test Bank
TRUE/FALSE
1. True or False
In any normal distribution, the mean, median, mode, and standard deviation are
all at the same position on the horizontal axis.
2. True or False
In the normal distribution, the curve is asymptotic but never intercepts the
horizontal axis either to the left or right.
3. True or False
In the normal distribution, the total area beneath the curve represents the
probability for all possible outcomes for a given event.
4. True or False
Under certain conditions, the normal distribution can be used to approximate
the binomial distribution.
5. True or False
There exists a different normal curve for every possible pair of and .
ANS: T PTS: 1 OBJ: Section 7.3
6. True or False
The standard deviation of the binomial distribution with n = 25 and = .50 is
6.25.
7. True or False
The mean of the exponential distribution is the inverse of that of the normal
distribution.
8. True or False
If Excel and Minitab packages are both used to generate its own sample of
10,000 observations from a theoretical continuous distribution, the two sets of
observations are expected to be identical.
9. True or False
According to the law of large numbers, frequency distributions generated by
statistical packages will more closely approach the theoretical distribution from
which they were drawn whenever the number of observations becomes larger.
MULTIPLE CHOICE
2. Which of the following are not correct concerning the probability distribution
for any continuous random variable?
a. The vertical coordinate is the probability density function.
b. The range of the random variable is found on the y-axis.
c. The total area represented under the curve will be equal to 1.00.
d. The probability that x will take on a value between a and b will be the area
under the curve between points a and b.
e. The area under the curve represents the sum of probabilities for all possible
outcomes.
ANS: B PTS: 1 OBJ: Section 7.1
4. Which of the following statements are true regarding the areas beneath any
normal curve?
a. About 68.3% of the area is in the interval to .
b. About 95.5% of the area is in the interval to .
c. Nearly all of the area (about 99.7%) is in the interval to
d. All of these are true.
e. None of these is true.
ANS: D PTS: 1 OBJ: Section 7.2
5. Which of the following is not a characteristic for a normal distribution?
a. It is symmetrical distribution
b. The mean is always zero
c. The mean, median, and mode are all equal
d. It is a bell-shaped distribution
e. The area under the curve always equals 1.0
ANS: B PTS: 1 OBJ: Section 7.2
7. Using the standard normal table, the total area between z = -0.75 and z =
1.21 is:
a. 0.2734
b. 0.3869
c. 0.3397
d. 0.2266
e. 0.6603
ANS: E PTS: 1 OBJ: Section 7.3
8. Let z1 be a z score that is unknown but identifiable by position and area. If the
area to the right of z1 is 0.8413, then the value of z1 must be:
a. 1.00.
b. -1.00.
c. 0.00.
d. 0.41.
e. -0.41.
ANS: B PTS: 1 OBJ: Section 7.3
9. Using the standard normal table, the total area between z = -1.82 and z =
-1.38 is:
a. 0.0494
b. 0.4656
c. 0.1554
d. 0.4162
e. 0.1005
ANS: A PTS: 1 OBJ: Section 7.3
10. If the area to the right of a positive z1 is 0.0869, then the value of z1 must
be:
a. -1.36.
b. 1.36.
c. 1.71.
d. 1.80.
e. 0.22.
ANS: B PTS: 1 OBJ: Section 7.3
11. The z-score representing the 10th percentile of the standard normal
distribution is:
a. 0.255.
b. -0.255.
c. -1.645.
d. 1.28.
e. -1.28.
ANS: E PTS: 1 OBJ: Section 7.3
12. The z-score representing the 75th percentile of the standard normal
distribution is:
a. 0.67.
b. -0.67.
c. 1.28.
d. -1.28.
e. 1.645.
ANS: A PTS: 1 OBJ: Section 7.3
13. The z-score representing the 90th percentile of the standard normal
distribution is:
a. 1.28.
b. -1.28.
c. 0.67.
d. -0.67.
e. 1.645.
ANS: A PTS: 1 OBJ: Section 7.3
14. For a normal curve, if the mean is 20 minutes and the standard deviation is
5 minutes, then the area between 22 and 25 minutes is:
a. 0.3413.
b. 0.1554.
c. 0.4967.
d. 0.1859.
e. 0.2248.
ANS: D PTS: 1 OBJ: Section 7.3
15. For a normal curve, if the mean is 20 minutes and the standard deviation is
5 minutes, then the area between 11 and 19 minutes is:
a. 0.4641.
b. 0.0359.
c. 0.3848.
d. 0.8848.
e. 0.9641.
ANS: C PTS: 1 OBJ: Section 7.3
16. A bakery firm finds that its average weight of the most popular package of
cookies is 32.06 ounces with a standard deviation of 0.1 ounces. What portion of
cookie packages will weigh less than 31.9 ounces or more than 32.3 ounces?
a. 0.8000
b. 0.4452
c. 0.4918
d. 0.9370
e. 0.0630
ANS: E PTS: 1 OBJ: Section 7.3
17. A retailer finds that the demand for a very popular board game averages
100 per week with a standard deviation of 20. If the seller wishes to have
adequate stock 95% of the time, how many of the games must she keep on
hand?
a. 139.2
b. 100.0
c. 132.9
d. 195.0
e. 125.6
ANS: C PTS: 1 OBJ: Section 7.3
18. A manufacturer of tow chains finds that the average breaking point is at
3,500 pounds and the standard deviation is 250 pounds. If you pull a weight of
at least 4200 pounds with this tow chain, what percentage of the time would
you expect the chain to break?
a. 2.8%
b. 0.26%
c. 49.74%
d. 99.74%
e. 50%
ANS: B PTS: 1 OBJ: Section 7.3
19. The average labor time to sew a pair of denim jeans is 4.2 hours with a
standard deviation of 30 minutes. If the distribution is normal, then the
probability of a worker finishing a pair of jeans in less than 3.5 hours is:
a. 0.0808.
b. 0.4192.
c. 0.5808.
d. 0.9192.
e. 0.9808.
ANS: A PTS: 1 OBJ: Section 7.3
20. A salesman who uses his car extensively finds that his gasoline bills average
$125.32 per month with a standard deviation of $49.51. The probability that his
bill will be less than $50 a month or more than $150 for a single month is:
a. 0.4357.
b. 0.1915.
c. 0.6272.
d. 0.3728.
e. 0.6915.
ANS: D PTS: 1 OBJ: Section 7.3
21. Given that z is a standard normal random variable and that the area to the
right of z is 0.1949, then the value of z must be:
a. 0.51.
b. -0.51.
c. 0.86.
d. -0.86.
ANS: C PTS: 1 OBJ: Section 7.3
22. Using the standard normal table, the total area between z = 0.45 and z =
1.05 is:
a. 0.1736
b. 0.3531
c. 0.3264
d. 0.6469
e. 0.1795
ANS: E PTS: 1 OBJ: Section 7.3
24. Given that z is a standard normal random variable, and that the area to the
right of z is 0.9066, then the value of z must be:
a. 1.32.
b. -1.32.
c. 0.66.
d. -0.66.
ANS: B PTS: 1 OBJ: Section 7.3
25. The proportion of the data from a standard normal distribution that falls
within two standard deviations from the mean is:
a. 0.3413.
b. 0.4772.
c. 0.6826.
d. 0.9544
ANS: D PTS: 1 OBJ: Section 7.3
26. If the z-value for a given value x of the random variable x is z = 1.96, and
the distribution of x is normally distributed with a mean of 60 and a standard
deviation of 6, then the x-value that this z-value corresponds to is:
a. 71.76.
b. 67.96.
c. 61.96.
d. 48.24.
ANS: A PTS: 1 OBJ: Section 7.3
27. Your high school graduating class had 564 members. Thirty-three percent of
these are expected to attend college. If the number of students who attend
college follows the normal distribution, the probability that less than 160 will
attend college is:
a. 0.3610.
b. 0.4900.
c. 0.0090.
d. 0.0832.
e. 0.2500
ANS: C PTS: 1 OBJ: Section 7.4
28. A large mail house which mails such items as catalogues, magazines, and
other bulk mailings guarantees that there will be no more than a 3% error rate
on its mailing labels. A customer who contracted a mailing to 190,000
individuals experienced a return of 5,900 items, which had incorrect addresses.
Which of the following statements is true?
a. There is a 3% chance of an incorrect return.
b. There is a 0.0054 probability that a return of 5900 or more incorrect
addresses could occur if the true error rate is 3%.
c. There is a 0.4964 probability that a return of 5900 or more incorrect
addresses could occur if the true error rate is 3%.
d. There is a 2.69 percent possibility that a return of 5900 incorrect addresses
could occur if the true error rate is 3%.
e. None of these statements are correct.
ANS: B PTS: 1 OBJ: Section 7.4
29. In your college campus, 30% of the students use tobacco in some form. In
your statistics class of 40 students, the probability of finding at least 15 students
who use tobacco is:
a. 0.3051.
b. 0.1949.
c. 0.8051.
d. 0.1515.
e. 0.3485.
ANS: B PTS: 1 OBJ: Section 7.4
30. A zipper manufacturer has found that 1.5% of its zippers are defective. The
company averages 4982 zippers per day. The probability of finding between 90
and 100 defects in a given day is:
a. 0.4981.
b. 0.0310.
c. 0.0360.
d. 0.5310.
e. 0.4671.
ANS: B PTS: 1 OBJ: Section 7.4
32. A local radio station maintains a very popular phone service, which provides
callers with current weather reports. The incoming calls have a Poisson
distribution with an average of 5 calls for each 15 minute-period. If x = time
between calls, then the probability of receiving four or fewer calls in the next 15
minutes is:
a. 0.2341.
b. 0.7659.
c. 0.2636.
d. 0.7364.
e. 0.9502.
ANS: D PTS: 1 OBJ: Section 7.5
33. A dispatcher for an airport shuttle averages sending a van to the airport 2
times every 60 minutes. The distribution is Poisson, and the driver must take a
15-minute lunch break. The probability that he can finish lunch before getting a
call is:
a. 0.1350.
b. 0.6065.
c. 0.3935.
d. 1.6490.
e. 0.8650.
ANS: C PTS: 1 OBJ: Section 7.5
34. A very large logging operation has serious problems keeping their skidders
operating properly. The equipment fails at the rate of 3 breakdowns every 48
hours. Assume that x is time between breakdowns and is exponentially
distributed. The probability of two or less breakdowns in the next 48-hour period
is:
a. 0.9672.
b. 0.0307.
c. 0.2231.
d. 0.7769.
ANS: B PTS: 1 OBJ: Section 7.5
38. Which of the following is not true for an exponential distribution with
parameter ?
a. Mean = .
b. Standard deviation = .
c. The distribution is completely determined once the value of is known.
d. The distribution is a two-parameter distribution since the mean and standard
deviation are equal.
ANS: D PTS: 1 OBJ: Section 7.5
39. Like the normal distribution, the exponential density function f(x):
a. is bell-shaped.
b. is symmetrical.
c. approaches infinity as x approaches zero.
d. approaches zero as x approaches infinity.
ANS: D PTS: 1 OBJ: Section 7.5
40. Which of the following distributions is suitable to model the length of time
that elapses before the first telephone call is received by a switchboard?
a. Exponential
b. Normal
c. Poisson
d. Binomial
e. Hypergeometric
ANS: A PTS: 1 OBJ: Section 7.5
41. The mean of the exponential distribution equals the mean of the Poisson
distribution only when the former distribution has a mean equals:
a. 1.00.
b. 0.50.
c. 0.25.
d. 2.00.
ANS: A PTS: 1 OBJ: Section 7.5
43. Which of the following statements are true regarding the normal distribution
curve?
a. It is symmetrical.
b. It is bell-shaped.
c. It is asymptotic in that each end approaches the horizontal axis, but never
reaches it.
d. Its mean, median, and mode are located at the same position.
e. All of these statements are true.
ANS: E PTS: 1 OBJ: Section 7.7
NUMERIC RESPONSE
ANS: 0.50
ANS: 0.683
PTS: 1 OBJ: Section 7.2
ANS: 0.0225
ANS: 0
ANS: 0.955
ANS: 0.8415
7. In 2000, the average charge of tax preparation was $95. Assuming a normal
distribution and a standard deviation of $10, use the approximate areas beneath
the normal curve, as discussed in this section, to answer: What proportion of tax
preparation fees were more than $95?
ANS: 0.50
PTS: 1 OBJ: Section 7.2
8. In 2000, the average charge of tax preparation was $95. Assuming a normal
distribution and a standard deviation of $10, use the approximate areas beneath
the normal curve, as discussed in this section, to answer: What proportion of tax
preparation fees were between $75 and $115?
ANS: 0.955
9. In 2000, the average charge of tax preparation was $95. Assuming a normal
distribution and a standard deviation of $10, use the approximate areas beneath
the normal curve, as discussed in this section, to answer: What proportion of tax
preparation fees were between $85 and $105?
ANS: 0.683
10. In 2000, the average charge of tax preparation was $95. Assuming a normal
distribution and a standard deviation of $10, use the approximate areas beneath
the normal curve, as discussed in this section, to answer: What proportion of tax
preparation fees were more than $115?
ANS: 0.0225
11. In 2000, the average charge of tax preparation was $95. Assuming a normal
distribution and a standard deviation of $10, use the approximate areas beneath
the normal curve, as discussed in this section, to answer: What proportion of tax
preparation fees were between $80 and $90?
ANS: 0.2417
12. Using the standard normal curve, find the area to the left of z = -2.26.
ANS: 0.0119
PTS: 1 OBJ: Section 7.3
13. Using the standard normal curve, find the area between z = 0 and z = 3.3.
____________________.
ANS: 0.50
ANS: 0.0571
15. For the standard normal curve, determine the area to the right of z = 0.34.
ANS: 0.3669
ANS: 0.2266
ANS: 0.9474
ANS: 0.4929
19. Assume the standard normal curve and determine the probability or area
between z = -1.28 and z = 1.28.
ANS: 0.7994
PTS: 1 OBJ: Section 7.3
ANS: 1.36
21. Find the z-score that determines that the area to the left of z is 0.9292.
ANS: 1.47
22. Find the z-score that determines that the area to the right of z is 0.8264.
ANS: -0.94
ANS: -0.47
ANS: -1.50
ANS: 0.70
ANS: 0.0228
27. For a normal curve, if the mean is 20 minutes and the standard deviation is
5 minutes, find the area to the right of 12 minutes.
ANS: 0.9452
28. The age of customers for a particular retail store follows a normal
distribution with a mean of 37.5 years and a standard deviation of 7.6 years.
What is the probability that the next customer who enters the store will be more
than 31 years old?
ANS: 0.8051
29. The age of customers for a particular retail store follows a normal
distribution with a mean of 37.5 years and a standard deviation of 7.6 years.
What is the probability that the next customer who enters the store will be less
than 42 years old?
ANS: 0.7224
30. The age of customers for a particular retail store follows a normal
distribution with a mean of 37.5 years and a standard deviation of 7.6 years.
What is the probability that the next customer who enters the store will be
between 40 and 45 years old?
ANS: 0.2096
ANS: 0.0838
32. The average waiting time at the checkout counter for a large grocery chain
is 2.45 minutes with a standard deviation of 24 seconds (0.40 minutes). Assume
that the distribution of waiting time is normal. What proportion of the customers
are served in between 1 minute and 2.5 minutes?
ANS: 0.5517
33. Suppose the monthly demand for automobile tires at a tire dealer is
normally distributed with a mean of 250 tires and a standard deviation of 50
tires. How many tires must the store have in inventory at the beginning of each
month in order to meet demand 95 percent of the time?
ANS: 332
ANS: 0.0038
ANS: 0.3446
36. The Independent Bank surveyed the status of student accounts and found
that the average overdraft was $21.22 with a standard deviation of $5.49. If the
distribution is normal, find the probability of a student being overdrawn by more
than $18.75.
ANS: 0.6736
37. A circus performer who gets shot from a cannon is supposed to land in a
safety net positioned at the other end of the arena. The distance he travels is
normally distributed with a mean of 175 feet and a standard deviation of 15
feet. His landing net is 50 feet long and the mid-point of the net is positioned
175 feet from the cannon. What is the probability that the performer will hit the
net on a given night?
ANS: 0.9050
38. A circus performer who gets shot from a cannon is supposed to land in a
safety net positioned at the other end of the arena. The distance he travels is
normally distributed with a mean of 175 feet and a standard deviation of 15
feet. His landing net is 50 feet long and the mid-point of the net is positioned
175 feet from the cannon. What is the probability that the performer will miss
the net on a given night?
ANS: 0.0950
ANS: 0.4246
40. The speed of cars passing through a checkpoint follows a normal distribution
with a mean of 62.6 miles per hour and a standard deviation of 3.7 miles per
hour. What is the probability that the next car passing by will be exceeding 65.5
miles per hour?
ANS: 0.2177
41. The speed of cars passing through a checkpoint follows a normal distribution
with a mean of 62.6 miles per hour and a standard deviation of 3.7 miles per
hour. What is the probability that the next car passing by will be exceeding 58.1
miles per hour?
ANS: 0.8880
42. The speed of cars passing through a checkpoint follows a normal distribution
with a mean of 62.6 miles per hour and a standard deviation of 3.7 miles per
hour. What is the probability that the next car passing by will be traveling
between 61 and 70 miles per hour?
ANS: 0.6436
43. The recent average starting salary for new college graduates in computer
information systems is $47,500. Assume salaries are normally distributed with a
standard deviation of $4,500. What is the probability of a new graduate
receiving a salary between $45,000 and $50,000?
ANS: 0.4246
ANS: 2.46
45. An elementary school teacher learned that 40 percent of school age children
have at least 3 cavities. The teacher has 30 students in his class. How many
students would he expect in his class to have at least three cavities?
ANS: 12
46. An elementary school teacher learned that 40 percent of school age children
have at least 3 cavities. The teacher has 30 students in his class. What is the
standard deviation for the number of school age children that have at least 3
cavities?
ANS: 2.683
47. The selling price of various homes in a community follows the normal
distribution with a mean of $176,000 and a standard deviation of $22,300. What
is the probability that the next house will sell for less than $190,000?
ANS: 0.7357
ANS: 0.2090
49. The selling price of various homes in a community follows the normal
distribution with a mean of $176,000 and a standard deviation of $22,300. What
is the probability that the next house will sell for between $150,000 and
$168,000?
ANS: 0.2384
50. An elementary school teacher learned that 40 percent of school age children
have at least 3 cavities. The teacher has 30 students in his class. What is the
probability that 10 students from this class have at least 3 cavities?
ANS: 0.1115
51. Suppose that x is a binomial random variable with n = 100 and = 0.18.
Employ the normal approximation to find P(15 x 21).
ANS: 0.6976
52. Let x be a binomial random variable. Find P(x 40) if n = 100 and = 0.32.
ANS: 0.0537
53. Women make up 58% of the U. S. civilian workforce of 124 million. The U. S.
Department of Commerce randomly selects 100 workers for a conference on
national health care. What is the probability that more than 20 of these workers
are female?
54. Women make up 58% of the U. S. civilian workforce of 124 million. The U. S.
Department of Commerce randomly selects 100 workers for a conference on
national health care. If there are fewer than 45 or more than 65 females invited,
severe political ramifications are possible. What is the probability that the
conference planners will be criticized for the representation by women at the
conference?
ANS: 0.0674
55. A coin is flipped 14 times. Using the normal distribution, calculate the
probability of observing either 4, 5, or 6 heads.
ANS: 0.3629
56. An efficiency expert makes periodic checks for weighting errors for a long
distance shipping firm. The expert inspects for errors in weighing, in recording
weights, and errors in processing the bills of lading. Based on past records, the
number of weekly errors for all shipments averages 5.30 with a standard
deviation of 1.23, and the frequency histogram approximates a normal
distribution. Suppose x is the number of weighing errors that will occur next
week. Compute the approximate probability for (x = 6).
ANS: 0.2729
58. An efficiency expert makes periodic checks for weighting errors for a long
distance shipping firm. The expert inspects for errors in weighing, in recording
weights, and errors in processing the bills of lading. Based on past records, the
number of weekly errors for all shipments averages 5.30 with a standard
deviation of 1.23, and the frequency histogram approximates a normal
distribution. Suppose x is the number of weighing errors that will occur next
week.
ANS: 0.8912
59. An efficiency expert makes periodic checks for weighting errors for a long
distance shipping firm. The expert inspects for errors in weighing, in recording
weights, and errors in processing the bills of lading. Based on past records, the
number of weekly errors for all shipments averages 5.30 with a standard
deviation of 1.23, and the frequency histogram approximates a normal
distribution. Suppose x is the number of weighing errors that will occur next
week.
ANS: 0.0113
60. In a paper mill, which produces newsprint from Southern Pine, the
production manager finds a flaw for each 1000 feet of newsprint. He knows that
the outcome is a Poisson process with a continuous distribution. If x = feet
between flaws, what is the probability of finding 600 or more feet between the
next two flaws?
ANS: 0.5488
PTS: 1 OBJ: Section 7.5
61. In a paper mill, which produces newsprint from Southern Pine, the
production manager finds a flaw for each 1000 feet of newsprint. He knows that
the outcome is a Poisson process with a continuous distribution. A group of
visitors is touring the plant. After arriving at the final stage of production, if they
remain long enough to observe 330 feet of newsprint produced, what is the
probability that they will miss seeing the next defect?
ANS: 0.2811
62. In 1997, private planes had 1.5 fatal crashes per 100 thousand flying hours.
For a continuous random variable with an exponential distribution: What is the
probability that the time between the next two crashes will fall between 50 and
70 thousands hours, i.e. P(50 x 70); x = thousands of flying hours between fatal
crashes.
ANS: 0.1224
63. In 1997, private planes had 1.5 fatal crashes per 100 thousand flying hours.
Assume a continuous random variable with a exponential distribution. A national
association of private plane owners has 560 members who fly 1.5 million miles
each year. The mileage crash rate has been estimated to be 20 per 100 million
miles of air travel. What is the probability that the next crash by a member of
the association will not occur until more than one year from now?
ANS: 0.2592
64. Grades for a statistics exam for a certain class follow the normal probability
distribution with a mean of 82 and a standard deviation of 12. What percentage
of the students in the class had a grade 81 or more?
ANS: 0.5319
PTS: 1 OBJ: Section 7.3
65. Grades for a statistics exam for a certain class follow the normal probability
distribution with a mean of 82 and a standard deviation of 12. What percentage
of the students in the class earned a C grade (70-79)?
ANS: 0.2426
66. Grades for a statistics exam for a certain class follow the normal probability
distribution with a mean of 82 and a standard deviation of 12. What percentage
of the students in the class earned a B grade (80-89)?
ANS: 0.2865
ANS: 0.4866
ANS: 0.1455
____________________ minutes
ANS: 5.25
____________________ minutes
ANS: 225
71. The total area beneath the curve of any continuous distribution is
____________________.
ANS: 1.0
72. The probability that any continuous random variable x will take on any
specific value along an interval is ____________________.
ANS: 0.0
COMPLETION
1. In the normal distribution, the total area under the curve is equal to
____________________.
ANS:
one
1
PTS: 1 OBJ: Section 7.2
2. In the normal distribution, the right half of the curve is a mirror image of the
_________________________, since the distribution is ____________________.
ANS: standardize
ANS: exponential
SHORT ANSWER
1. In 2000, the average charge of tax preparation was $95. Assuming a normal
distribution and a standard deviation of $10, use the approximate areas beneath
the normal curve, as discussed in this section, to answer: What proportion of tax
preparation fees were exactly $100?
ANS:
P(x = 100) = 0
PTS: 1 OBJ: Section 7.2
ANS:
0.76; 0.41; -0.91; 1.24
A) P(x 128)
B) P(x 113)
C) P(87 x 98)
ANS:
0.0026; 0.9032; 0.3239
= 10. Sketch the approximate box-and-whisker plot for the resulting data. =
50 and standard deviation 4. A computer statistical package has simulated
1000 random observations from a normal distribution with mean
ANS:
ANS:
2.28%; No. We would expect 11.4 of the 500 observations (2.28%) to have a
value of 200 or more. The actual number would not be equal to the expected
number. However, the more observations we select, the closer we will tend to
come to what we expect.
A) What is the probability that a randomly selected student will have a score of
80 or higher?
B) If there were 97,680 students with scores higher than 91, how many students
took the test?
ANS:
0.9332; 925000
ANS:
0f(x) = (1/15) e-x/15, x
8. What are the mean and standard deviation of a normally distributed random
variable, which has been “standardized”?
Mean = ____________________
SD = ____________________
ANS:
0.0; 1
9. A certain brand of flood lamps has a lifetime that is normally distributed with
a mean of 3,750 hours and a standard deviation of 300 hours.
A). What proportion of these lamps will last for more than 4,000 hours?
B). What lifetime should the manufacturer advertise for these lamps in order
that only 2% of the lamps will burn out before the advertised lifetime?
ANS:
0.2033; 3132
ANS:
); n(1-n
11. Complete the following table indicating which procedure to use in calculating
binomial probabilities:
Proceduren
Large 0.2
Large 0.0
Large 0.9
Small 0.0
ANS:
Proceduren
Use the normal approximationLarge 0.2
Use the Poisson approximationLarge 0.0
Use the Poisson approximationLarge 0.9
Use the binomial probability formulaSmall 0.0
PTS: 1 OBJ: Section 7.4
12. Although the binomial distribution is discrete and the normal distribution is
continuous, the normal distribution is a good approximation to the binomial
whenever both ____________________ and ____________________ are 5, where n =
number of trials, and = the probability of success in any given trial, are the
parameters of the binomial distribution.
ANS:
; n(1-n
13. Regardless o is the standard deviation.is the mean and , respectively,
where f the shape of a particular normal curve, about what percentage of
the area is within
ANS:
95.5% and 99.7% respectively
ESSAY
ANS:
Continuous probability distributions describe probabilities associated with
random variables that are able to assume any of an infinite number of values
along an interval.
ANS:
The probability that a continuous random variable takes on any specific value is
equal to zero because there is an infinite number of possible values.
3. Explain why the total area beneath a probability density function is equal to
1.0.
ANS:
The area beneath the probability density function represents the probability of
the random variable, x, being between . Since x must be between (this is a
certain event), the area must be equal to 1.0.
ANS:
ANS:
A random variable x is standardized when each value of x has (the mean of x)
subtracted from it, and the difference is divided by (the standard deviation of x.)
ANS:
If the probabilities for the outcomes of the random variable are approximately
equal to the areas under the normal curve, its distribution is approximately
normal.
7. Express in your own words the procedure for finding the area under the
standard normal curve between z = 0 and z = 1.53.
ANS:
Use the standard normal table in text. Trace down the left hand column to “1.5”
and go across the row until reaching the column headed at the top with “.03”.
The value is 0.4370, which is the area between z = 0 and z = 1.53.
8. Describe the method of finding the area under the standard normal curve
between z = 0 and z =- 1.33.
ANS:
Recall the property of symmetry. The area to the left of 0 to – 1.33 is the same
as for the area to the right of 0 to 1.33. Go down the left most column of the
standard normal table to the row with “1.3” and right across the rows until in the
column headed at the top with “.03”. The value is 0.4082, which is the area
between z = 0 and z = -1.33.
9. Use the properties of the standard normal curve to describe the method of
finding the area to the right of z = 1.42.
ANS:
Since the standard normal table in text gives areas between 0 and positive
values of z, to find the area to the right of a z value requires the use of the
symmetry property which places .5 of the whole area to the right of z = 0. Trace
down the left most column to the value “1.4” and across the row to the column
headed by the value “.02”. The value is 0.4222, which is the area between z = 0
and z = 1.42. Subtract this area from 0.5 to obtain the desired area of 0.0778.
ANS:
By symmetry the area to the right of 0 is the same as the area to the left of 0 for
values of z. In finding the area to the left of a negative value of z use the
symmetry property to reason that .5 of the area lies to the left of 0. Go down the
left most column to the value “1.5” and across the row until reaching the column
headed by “9”. The value is 0.4441, which is the area between z = 0 and z =
-1.59. Subtract this area from 0.5 to obtain the desired area of 0.0559.
11. Why is it important to use the correction for continuity when approximating
binomial probabilities with the normal distribution?
ANS:
Because the binomial distribution has gaps between possible values of x (since
it is discrete), while the normal distribution is continuous, the normal
approximation to the binomial involves a correction for continuity. The correction
consists of expanding each possible value of the discrete random variable x by
0.5 in each direction.
f(x) = 2e-2x , x 0
ANS:
Exponential distribution with = 2
TRUE/FALSE
1. True or False
Sample data are used as the basis from which to make probability statements
about the true (but unknown) value of the population mean or proportion.
2. True or False
The sampling distribution of the mean will be approximately normal regardless
of the distribution of the original population or the sample size.
3. True or False
The standard error for the sampling distribution of the mean is equal to .
4. True or False
The number of all possible simple random samples of size 2 that can be taken
from a population of 5 elements is 6.
5. True or False
The sampling distribution of the mean will have the same mean as the original
population from which the samples were drawn.
6. True or False
The sampling distribution of the mean will have the same standard deviation as
the original population from which the samples were drawn.
ANS: F PTS: 1 OBJ: Section 8.2
7. True or False
The sampling distribution of the mean will be normal if the original population
from which the samples were drawn is normally distributed.
8. True or False
The sample proportion can be considered a random variable that will take on
different values as the procedure leading to the sample proportion is repeated a
great many times.
9. True or False
If a population is a Poisson distribution with a mean of 4, then the sampling
distribution of the mean for 200 Excel or Minitab simulated samples approaches
the normal distribution as the sample sizes are 5 or more.
MULTIPLE CHOICE
1. In a given year, the average annual salary of a NFL football player was
$189,000 with a standard deviation of $20,500. If a sample of 50 players was
taken, then probability that the sample mean will be $192,000 or more is:
a. 0.6970.
b. 0.3485.
c. 0.1515.
d. 0.0398.
e. 0.4602.
ANS: C PTS: 1 OBJ: Section 8.3
4. If all possible samples of size n are drawn from an infinite population with a
mean of and a standard deviation of , then the standard error of the sample
mean is inversely proportional to:
a. .
b. .
c. .
d. n.
ANS: B PTS: 1 OBJ: Section 8.3
5. The concept that allows us to draw conclusions about the population based
strictly on sample data without having any knowledge about the distribution of
the underlying population is
a. the central limit theorem.
b. the standard error of the mean.
c. the normal distribution.
d. the standard deviation.
e. the variance.
ANS: A PTS: 1 OBJ: Section 8.3
6. The average number of customers that visited a particular store during the
past year is 160 per day and a standard deviation of 25 customers per day. A
random sample of 64 days is chosen from this population. The standard
deviation of the sample mean equals:
a. 12.649.
b. 25.0.
c. 2.56.
d. 3.125.
e. 8.0
ANS: D PTS: 1 OBJ: Section 8.3
7. The average sales per customer at a home improvement store during the past
year is $75 with a standard deviation of $12. The probability that the average
sales per customer from a sample of 36 customers, taken at random from this
population, exceeds $78 is:
a. 0.4332.
b. 0.0668.
c. 0.0987.
d. 0.9013.
e. 0.5000
ANS: B PTS: 1 OBJ: Section 8.3
8. The average speed of cars passing a checkpoint is 60 miles per hour with a
standard deviation of 8 miles per hour. Fifty (50) passing cars are clocked at
random from this checkpoint. The probability that the sample mean will be
between 57 and 62 miles per hour is:
a. 0.9576.
b. 0.9960.
c. 0.2467.
d. 0.3520.
e. 0.4659
ANS: A PTS: 1 OBJ: Section 8.3
9. If all possible samples of size n are drawn from an infinite population with a
mean of 15 and a standard deviation of 5, then the standard error of the sample
mean equals 1.0 only for samples of size:
a. 5.
b. 15.
c. 25.
d. 75.
ANS: C PTS: 1 OBJ: Section 8.3
10. Each of the following are characteristics of the sampling distribution of the
mean except:
a. The sampling distribution of the mean has the same mean as the original
population.
b. The standard deviation of the sampling distribution of the mean is referred to
as the standard error.
c. If the original population is normally distributed, the sampling distribution of
the mean will also be normal.
d. If the original population is not normally distributed, the sampling distribution
of the mean will also be approximately normal for large sample sizes.
e. All of the above are characteristics of the sampling distribution of the mean.
ANS: E PTS: 1 OBJ: Section 8.2
12. The Coca-Cola Company has 40% of the cola market. The probability that a
sample proportion for n = 30 falls within 0.10 of the true population proportion
of 0.40, which represents the proportion of cola drinkers who prefer a Coca-Cola
drink is:
a. 0.7372.
b. 0.3686.
c. 0.4599.
d. 0.9198.
e. not enough information given to make calculation.
ANS: A PTS: 1 OBJ: Section 8.4
13. Fifty percent of all college students attend schools within 50 miles of their
homes. In a sample of 500 college students, the probability that the sample
proportion will be between 0.45 and 0.55 is:
a. 0.4875.
b. 0.9818.
c. 0.4909.
d. 0.9750.
e. 0.5000.
ANS: D PTS: 1 OBJ: Section 8.4
14. Suppose an actual census showed that 18.4% of the households in California
have incomes in excess of $50,000. The probability that the sample proportion
will be 0.22 or greater for a random sample of 750 households is:
a. 0.0055.
b. 0.9945.
c. 0.9949.
d. 0.0041.
e. not enough information given for estimation.
ANS: A PTS: 1 OBJ: Section 8.4
15. In a simple random sample of 1,000 automobile owners between ages 25-
35, there were 142 owners of a Ford Explorer. If the expected number of owners
of this vehicle was 125 out of 1000, the values of p, and n are, respectively:
a. 125,142,1000.
b. 142,125,1000.
c. 0.142, 0.125, 1000.
d. 0.125, 0.142, 1000.
e. 0.142, 0.125, 1350.
ANS: D PTS: 1 OBJ: Section 8.4
17. Sixty percent of a company’s customers are females. What is the probability
that a random sample of 200 customers is greater than 64 percent female?
a. 0.1229
b. 0.1556
c. 0.1898
d. .02158
e. 0.2847
ANS: A PTS: 1 OBJ: Section 8.4
18. Historically, a company that mails its monthly catalog to potential customers
receives orders from 8 percent of the addresses. If 500 addresses are selected
randomly from the last mailing, what is the probability that between 35 and 50
orders were received from this sample?
a. 0.0496
b. 0.2049
c. 0.7444
d. 0.7951
e. 0.9504
ANS: C PTS: 1 OBJ: Section 8.4
19. The Coca-Cola Company has 40% of the cola market. Determine the
probability that a sample proportion for n = 30 is within 0.10 of the true
proportion of 0.40, which is the proportion of cola drinkers who prefer a Coca-
Cola drink. In this particular market, the population size of those who drink cola
is 50.
a. 0.7372
b. 0.3686
c. 0.4599
d. 0.9198
e. Not enough information given to make calculation.
ANS: D PTS: 1 OBJ: Section 8.5
20. Fifty percent of all college students attend schools within 50 miles of their
homes. In a sample of 500 college students from a city with 5000 college
students, what is the probability that the sample proportion will be between 0.45
and 0.55?
a. 0.4875
b. 0.9818
c. 0.4909
d. 0.9750
e. 0.5000
ANS: B PTS: 1 OBJ: Section 8.5
21. Suppose an actual census showed that 18.4% of the households in California
have incomes in excess of $50,000. What is the probability that the sample
proportion will be 0.22 or greater for a random sample of 750 households from a
city population which totals 10,000?
a. 0.0055
b. 0.9945
c. 0.9949
d. 0.0041
e. Not enough information given for estimation.
ANS: D PTS: 1 OBJ: Section 8.5
23. A normally distributed population with 200 elements has a mean of 60 and a
standard deviation of 10. The probability that the mean of a sample of 25
elements taken from this population will be smaller than 56 is:
a. 0.0166.
b. 0.0228.
c. 0.3708.
d. 0.0394.
ANS: A PTS: 1 OBJ: Section 8.5
NUMERIC RESPONSE
1. An auditor knows from past history that the average accounts receivable for
ABC Company is $441.27 with a standard deviation of $555.48. If the auditor
takes a sample of 100 accounts, what is the probability that the mean of the
sample will be within $100 of the population mean?
ANS: 0.9282
ANS: 0.0018
3. The average weight of a 4 year old male is 40 lbs with a standard deviation of
6 lbs. What would be the probability that the average weight from a random
sample of 50 boys at this age would have be greater than 41 lbs?
ANS: 0.119
ANS: 0.0188
5. The average cost of renting a compact size car in a major metropolitan area
of the U.S. is $51.74 with a standard deviation of $7.48. Assume a normal
distribution. If a sample of 20 cities is taken, what is the probability that the
sample mean will fall between 47.74 and 55.74?
ANS: 0.9832
6. Your company is located in New York City and your job requires that you travel
by air to many other major cities such as Boston, Chicago, Los Angeles, Detroit,
Houston, Dallas, Atlanta, Denver, and Washington, DC. Your average round-trip
airfare for the past year has been $495 with a standard deviation of $240. If for
your next year’s planning budget you estimate your average air ticket between
$395 and $550, what is the probability that the mean, , of your next 36 flights
will fall in this range?
ANS: 0.9100
ANS: 0.1469
ANS: 0.9429
ANS: 0.5550
10. A manufacturer of printers claims that their color cartridge will print an
average of 200 4-by-6 photos with a standard deviation of 60 photos. A random
sample of 40 customers counted the number of photos printed with one
cartridge. If the manufacturer’s claim is correct, what is the probability that the
average customer printed less than 195 pages with one cartridge?
ANS: 0.2981
11. A trucking firm has trucks that average a payload of 22.5 tons and a
standard deviation of 1.5 tons. An accounting firm is auditing the shipping
orders for the past year for billing errors. It chooses a random sample of 64
shipping orders. What is the probability that the sample mean weight will be
within 1 ton of the true population mean of 22.5 tons?
ANS: 1
12. The heights of men in the USA are normally distributed with a mean of 68
inches and a standard of 4 inches. What is the probability that a randomly
selected man is taller than 70 inches?
ANS: 0.3085
13. The heights of men in the USA are normally distributed with a mean of 68
inches and a standard of 4 inches. A random sample of five men is selected.
What is the probability that the sample mean is greater than 70 inches?
ANS: 0.1314
14. The heights of men in the USA are normally distributed with a mean of 68
inches and a standard of 4 inches. What is the probability that the mean height
of a random sample of 35 men is greater than 70 inches?
ANS: 0.0015
15. The amount of time spent by American adults playing sports per day is
normally distributed with a mean of 4 hours and standard deviation of 1.25
hours. Find the probability that a randomly selected American adult plays sports
for more than 5 hours per day.
ANS: 0.2119
16. The amount of time spent by American adults playing sports per day is
normally distributed with a mean of 4 hours and standard deviation of 1.25
hours. Find the probability that if four American adults are randomly selected,
their average number of hours spent playing sports is more than 5 hours per
day.
ANS: 0.0548
17. If young children watch an average of 105 minutes of TV per day with a
standard deviation of 30 minutes per day, what is the probability that a random
sample of 60 children will watch more than 110 minutes of TV per day?
ANS: 0.0985
ANS: 0.075
19. In an effort to identify the true proportion of shoplifters who are under age
18, a random sample of 300 shoplifting cases was taken. One hundred and
twenty of those cases involved persons under age 18. What value would be used
to estimate the true proportion of under age 18 shoplifters?
ANS: 0.40
PTS: 1 OBJ: Section 8.4
20. A videocassette rental store wants to know what proportion of its stock
should include the Beta format VCR. A random sample of the owners of VCRs is
suggested. Five hundred VCR owners are asked if they own the Beta or VHS
format. Out of 500 interviewed, 125 were owners of the Beta format. Presume
that the true population proportion of Beta format VCRs is = 0.32. What is the
probability that the sample proportion p will be within 0.04 of the true proportion
of Beta format ownership?
ANS: 0.9438
21. A producer of a juice drink advertises that it contains 10% real fruit juices. A
sample of 75 bottles of the drink is analyzed and the percent of real fruit juices
is found to be 6.5%. If the true proportion is actually 0.10, what is the probability
that the sample percent will be 6.5% or less?
ANS: 0.1562
22. The Sunbelt City Peewee baseball organization has ordered 48 dozen balls
from the Liteflite Company for use in its summer program. The company has
guaranteed a 10% discount on the total cost if 32 or more of the balls are
defective. Experience in the past has shown 5% defects. A sample of 50 balls
will be taken, and the discount will be given if the sample proportion of defective
balls is at least 32 in 48 dozen. What is the probability that the organization will
get the 10% discount?
ANS: 0.4286
ANS: 150
ANS: 0.9664
ANS: 0.1587
26. The publisher of a daily newspaper claims that 90% of its subscribers are
under the age of 30. Suppose that a sample of 300 subscribers is selected at
random. Assuming the claim is correct, approximate the probability of finding at
least 240 subscribers in the sample under the age of 30.
ANS: 1
ANS: 0.4566
ANS: 0.5588
29. The average SAT score for the 300 students accepted at Tiny College this
year is 1050 with a standard deviation of 275. If a random sample of 40
students is selected from this class, what is the probability that the average SAT
score will be greater than 1000?
ANS: 0.8907
30. The average SAT score for the 300 students accepted at Tiny College this
year is 1050 with a standard deviation of 275. If a random sample of 40
students is selected from this class, what is the probability that the average SAT
score will be greater than 1070?
ANS: 0.3121
31. The average SAT score for the 300 students accepted at Tiny College this
year is 1050 with a standard deviation of 275. If a random sample of 40
students is selected from this class, what is the probability that the average SAT
score will be between 1060 and 1120?
ANS: 0.3595
32. In order to estimate the mean salary for a population of 500 employees, the
president of a certain company selected at random a sample of 40 employees.
What is the probability that the sample mean salary of the employees will be
within $200 of the population mean salary?
ANS: 0.901
ANS: 1.037
ANS: 2
NARRBEGIN: Vehicles
Four families live on a cul-de-sac represent a very small population. The number
of vehicles owned by each family is shown below:
NARREND
35. Calculate the population mean.
ANS: 1.75
ANS: 0.8292
37. Assume that all possible simple random samples of size two are taken from
this population, and that the original population is normally distributed. Apply
the Central Limit Theorem to find the mean of the sampling distribution of
sample means.
ANS: 1.75
38. Assume that all possible simple random samples of size two are taken from
this population, and that the original population is normally distributed. Apply
the Central Limit Theorem to find the standard error of the sampling distribution
of sample means.
ANS: 0.4787
39. List all possible simple random samples of size two that can be taken from
this population and determine the mean number of vehicles for each sample.
ANS:
ANS: 1.75
ANS: 0.4787
3. If the original population from which the samples were drawn is not normally
distributed, then the sampling distribution of the mean will be
____________________ for large sample sizes.
4. When a great many simple random samples of size n are drawn from a
population that is normally distributed, the sampling distribution of the sample
means will be ____________________ regardless of sample size n. As n increases,
the ____________________ of the sample distribution will become smaller.
SHORT ANSWER
NARRBEGIN: Pets
NARREND
1. Determine the probability distribution for the discrete random variable, x =
number of pets owned.
ANS:
x1234
P(x) .2 .4 .2 .2
PTS: 1 OBJ: Section 8.2
Mean = ____________________
SD = ____________________
ANS:
2.4; 1.0198
3. List all possible samples of size n = 2 from the five individuals and determine
the mean for each sample.
ANS:
4. Find the mean and standard deviation of the sampling distribution of the
mean.
Mean = ____________________
SD = ____________________
ANS:
5. A sample of size 400 is drawn from a population whose mean and variance
are 5,000 and 10,000, respectively. Find the following probabilities:
A) ____________________
B) ____________________
C) ____________________
ANS:
0.0228; 0.8185; 0.00
A) What does the central limit theorem say about the sampling distribution of
the sample mean? Why?
B) Find the mean and standard error of the sampling distribution of the sample
mean.
Mean = ____________________
SD = ____________________
C)
D)
ANS:
is normally distributed because the parent population is normal.; 75; 0.8485;
0.9909; 0.1190
ANS:
0.0188; 0.1492; 0.9164
Answer 1: ____________________
Answer 2: ____________________
ANS:
Number of trials; x
9. A firm claims that only 10% of its accounts receivables are over 30 days past
due. The bank that supplies working capital or interim loans to the firm is
suspect of the figure and accordingly takes a random sample of 100 accounts of
the firm. Find the probability that the sample proportion, p, will be:
ANS:
0.1293; 0.4972
Distribution is:
Mean:
SD:
ANS:
Approximately normal; 0.32; 0.0209
11. For each of the following combinations of n and , calculate the mean of the
sampling distribution of the sample proportion p.
A) n = 100; = 0.38
B) n = 200; = 0.92
C) n = 2000; = 0.02
ANS:
0.38; 0.92; 0.02
12. For each of the following combinations of n and , calculate the standard error
of the sampling distribution of the sample proportion p.
A) n = 100; = 0.30
B) n = 200; = 0.65
C) n = 500; = 0.25
D) n = 1000; = 0.70
ANS:
0.0458; 0.0337; 0.0194; 0.0145
13. For each of the following combinations of n and , recall the rule of thumb
used to indicate when the normal distribution is a good approximation of the
sampling distribution for the sample proportion p, and determine whether the
rule is satisfied.
A) n = 50; = 0.05
B) n = 50; = 0.10
C) n = 100; = 0.05
D) n = 100; = 0.99
ANS:
Rule is not satisfied; Rule is satisfied; Rule is satisfied; Rule is not satisfied
C) What is the probability that the sample proportion p will be within 0.02 of the
population proportion ?
ANS:
0.6000; 0.0283; 0.5222
ANS:
Approximately normal, since n*pi = 35 and n(1- pi) = 15 are both greater than
5.
E(p):
SE:
ANS:
0.70; 0.0648
17. Determine which of the following population size and sample size
combinations would require the use of the finite population correction factor in
calculating the standard error.
A) N = 150; n = 25
B) N = 1500; n = 300
C) N = 2500; n = 75
D) N = 15,000; n = 1000
ANS:
Yes; Yes; No; Yes
ANS:
Yes; n/N = 0.08 > 0.05; therefore, the finite population correction factor is
necessary.
19. In order to estimate the mean salary for a population of 500 employees, the
president of a certain company selected at random a sample of 40 employees. If
the population standard deviation is $800, compute the standard error both with
and without using the finite population correction factor.
ANS:
121.448; 126.491
20. According to the central limit theorem, samples of size 36 from a Poisson
population with a mean of 9 should have a mean ____________________ and a
standard error of ____________________.
ANS:
9; 0.50
NARRBEGIN: Vehicles
Four families live on a cul-de-sac represent a very small population. The number
of vehicles owned by each family is shown below:
NARREND
21. Describe the sampling distribution of sample means.
Distribution is:
Mean:
SD:
ANS:
Normal; 1.75; 0.4787
22. Assume that all possible simple random samples of size two are taken from
this population, and the mean number of vehicles for each sample is calculated.
Describe the probability distribution of the means.
ANS:
P( )
1.0 1/6
1.5 2/6
2.0 2/6
2.5 1/6
PTS: 1 OBJ: Section 8.7
NARRBEGIN: TV Sets
The following data give the number of TV sets owned for a population of 4
families:
NARREND
23. Find the mean and the standard deviation for the population.
Mean:
SD:
ANS:
2.5; 1.118;
ANS:
25. List all possible samples of 2 families that can be selected without
replacement from this population.
ANS:
AB, AC, AD, BC, BD, CD
26. Determine the sampling distribution of for all possible samples of 2 families
that can be selected without replacement from this population and calculate the
mean and standard deviation of .
ANS:
ESSAY
ANS:
The sampling distribution of a statistic is a probability distribution of the sample
statistic (such as or p) for all possible samples of that particular size. Therefore,
a sampling distribution is a probability distribution. It is for a statistic (which is a
random variable) with a sample size specified.
ANS:
For large, simple random samples from a population that is not normally
distributed, the sampling distribution of the mean will be approximately normal,
with the mean , and the standard deviation . As the sample size (n) is increased,
the sampling distribution of the mean will more closely approach the normal
distribution.
4. Why is the central limit theorem basic to the concept of statistical inference?
ANS:
Because it permits us to draw conclusions about the population based strictly on
sample data, and without having any knowledge about the distribution of the
underlying population.
TRUE/FALSE
1. True or False
A confidence interval will allow us to say with complete certainty that our
sample result will actually include the population mean.
3. True or False
An estimator is unbiased if the expected value of the sample statistic is the
same as the actual value of the population parameter it is intended to estimate.
4. True or False
If the population mean is unknown, one can use sample data as the basis from
which to make probability statements about the true (but unknown) value of .
5. True or False
Whenever sample data are used for estimating a population mean or proportion,
sampling error will not be present, since the observed sample statistic ( or p) will
not differ from the actual value of the population parameter ( or ).
6. True or False
The sample statistic s is an unbiased estimator of the population parameter
when (n – 1) is used as a divisor when we calculate the standard deviation of the
sample.
7. True or False
A 0.95 confidence coefficient is equivalent to a 5% confidence level.
9. True or False
To use the t distribution for a small sample size when deriving a confidence
interval requires that the population be normally distributed.
MULTIPLE CHOICE
5. Thirty-six months were randomly sampled and the discount rate on new
issues of 91-day Treasury Bills was collected. The sample mean is 4.76% and the
standard deviation is 171.21. What is the unbiased estimate for the mean of the
population?
a. 29312.86
b. 13.0847
c. 0.0476
d. 4.76%
e. 8.1496
ANS: D PTS: 1 OBJ: Section 9.2
6. A federal auditor for nationally chartered banks from a random sample of 100
accounts found that the average demand deposit balance at the First National
Bank of small town was $549.82. If the auditor needed a point estimate for the
population mean for all accounts at this bank, what would she use?
a. The average of $54.98 for this sample.
b. The average of $549.82 for this sample.
c. There is no acceptable value available.
d. She would survey the total of all accounts and determine the mean.
e. Not enough information is given to answer this question.
ANS: B PTS: 1 OBJ: Section 9.2
15. An auditor for a large oil corporation has taken a random sample of monthly
credit card charges for 32 individual (non-corporate) accounts. She found a
mean of $81.76. Assume that the population standard deviation is 48.2. A 98%
confidence interval estimate for the population mean of individual credit card
charges would be:
a. 81.76 19.89.
b. 81.76 16.72.
c. 81.76 112.94.
d. 81.76 22.0.
e. 81.76 2.33.
ANS: A PTS: 1 OBJ: Section 9.4
16. Delivery time by courier services is important to stock brokers. One such
firm wishes to estimate the average time taken by couriers delivering negotiable
securities in New York City. Specifically, the firm randomly samples 36 courier
pickup and delivery times. The collected data, in minutes, represent the time
between pickup and delivery. Suppose the population standard deviation is 0.3
minutes and the sample mean is 9.975. Find the sample mean and compute. A
90-percent confidence interval for the true mean delivery time, is:
a. 9.852 to 10.05.
b. 119.60 to 119.80.
c. 119.62 to 119.78.
d. 119.26 to 119.87.
e. 9.893 to 10.057.
ANS: E PTS: 1 OBJ: Section 9.4
19. The national sales manager of Fast Grow Corp is working on next year’s
budget projections. He knows that the population standard deviation is $17.66.
To estimate the daily per diem cost for outside sales people, he has taken a
random sample of 36 individual expense accounts and found a mean of $90.29.
The 91.08 percent confidence interval for the per diem expense is:
a. 86.45 to 94.13.
b. 60.27 to 120.37.
c. 85.29 to 95.29.
d. 74.43 to 107.9.
e. 84.32 to 96.26.
ANS: C PTS: 1 OBJ: Section 9.4
20. The security department of a state university is planning its budget for the
next year. In estimating the man-hours for security during university sponsored
music concerts, the average length of music concerts is needed. A random
sample from thirty-six security departments at universities was taken and the
sample mean length of concerts was 160 minutes. Suppose the population
standard deviation is 45 minutes. A 95 percent confidence interval for the true
mean duration of music concerts is:
a. 157.55 to 162.45.
b. 147.63 to 172.37.
c. 71.8 to 248.2.
d. 145.3 to 174.7.
e. 160 1.96(45 / 36).
ANS: D PTS: 1 OBJ: Section 9.4
22. The larger the level of confidence used in constructing a confidence interval
estimate of the population mean the:
a. smaller the probability that the confidence interval will contain the population
mean.
b. smaller the value of .
c. wider the confidence interval.
d. narrower the confidence interval.
ANS: C PTS: 1 OBJ: Section 9.4
27. A sample of 200 light bulbs was tested and it was found that 11 were
defective. Calculate a 95% confidence interval around this sample proportion.
a.
b.
c.
d.
e.
ANS: C PTS: 1 OBJ: Section 9.6
29. A total of 150 households were asked if they had Internet access, of which
57 responded “Yes”. Which of the following would represent a 98% confidence
interval around this sample proportion?
a.
b.
c.
d.
e.
ANS: B PTS: 1 OBJ: Section 9.6
30. In order to estimate the average daily down time, a manufacturer randomly
samples 40 days of production records and found a mean of 55.576 minutes and
standard deviation of 77.9 minutes. An 80% confidence interval is:
a. 55.675 1.28(12.32).
b. 55.675 1.304(12.32).
c. 55.675 1.28(77.9).
d. 55.675 1.304(77.9).
ANS: B PTS: 1 OBJ: Section 9.5
31. For a sample of size 20 taken from a normally distributed population with
standard deviation equal to 5, a 90% confidence interval for the population
mean would require the use of:
a. t = 1.328.
b. t = 1.729.
c. z = 1.96.
d. z = 1.645.
ANS: D PTS: 1 OBJ: Section 9.5
33. The lower limit of a confidence interval at the 95% level of confidence for
the population proportion if a sample of size 200 had 40 successes is:
a. 0.2554.
b. 0.1446.
c. 0.2465.
d. 0.1535.
ANS: B PTS: 1 OBJ: Section 9.6
34. You are interested in determining the average cost of a 3-minute telephone
call to locations within the continental U.S. What sample size must you take to
be 96% confident that the results will be within $.75 of the true mean cost per
call? From the phone company, you have gotten an estimate of = 7.71.
a. 185
b. 445
c. 406
d. 574
e. 22
ANS: B PTS: 1 OBJ: Section 9.7
36. The banking industry has suffered in recent years because of a high default
rate on energy-related loans. The standard deviation is $10,000. How large a
sample must you take from banking records to be 98% confident that your
estimate for the amount of these defaults is within $1000 of the true mean loan
amount?
a. 567
b. 233
c. 240
d. 543
e. 890
ANS: D PTS: 1 OBJ: Section 9.7
37. The Arkansas State Police wish to estimate the average mph being traveled
on the Interstate Highways, which cross the state. If the estimate is to be within
8 mpg of the true mean with 98% confidence and the estimated standard
deviation is 22 mph, how large a sample size must be taken?
a. 42
b. 15
c. 329
d. 14
e. 41
ANS: A PTS: 1 OBJ: Section 9.7
39. Your state is considering raising the legal age for consumption of alcoholic
beverages to 21 years old. How large a sample size must be taken to make an
estimate of the true proportion of citizens who favor this move. The error should
be no more than 0.01 with 99% confidence.
a. 167
b. 16641
c. 6450
d. 65
e. 1670
ANS: B PTS: 1 OBJ: Section 9.7
40. After calculating the sample size needed to estimate a population proportion
to within .04, your statistics professor told you the maximum allowable error
must be reduced to just .01. If the original calculation led to a sample size of
800, the sample size will now have to be:
a. 800.
b. 3200.
c. 12,800.
d. 6400.
ANS: C PTS: 1 OBJ: Section 9.7
41. Which of the following is not a part of the formula for constructing a
confidence interval estimate of the population mean?
a. A point estimate of the population mean
b. The standard error of the sampling distribution of the sample mean
c. The confidence level
d. The value of the population mean
e. The sample size
ANS: D PTS: 1 OBJ: Section 9.9
NUMERIC RESPONSE
1. A fund raiser for United Way wants to know the average annual income of
workers in a very large vegetable processing plant. She is able to obtain a
sample of 30 annual wage levels (in thousands). She finds that the summation
of the wages for the 30 is 370.7 and the standard deviation of 3.3. What
estimate may be used for ?
ANS: 12.36
ANS: 2.9
ANS: 4.767
ANS: 2.183
ANS: 0.573
The sample proportion would not differ from the actual population proportion by
more than:
ANS: 0.025
PTS: 1 OBJ: Section 9.3
ANS: $6.75
ANS: 5973
ANS: 304
ANS: 2789
ANS: 292
12. A medical records manager would like to review hospital records in order to
determine the average hospital stay for surgery patients during the past year.
What sample size must be taken in order to be 95% confident that the estimate
is within 1 day of the true mean? She has past studies, which indicate that is
20.25 days.
ANS: 1576
13. U.S. News and World Report each year conducts a survey of starting salaries
in various career fields. If you were to devise a plan to replicate this study, how
large a sample would you need to take to be 98% certain that your estimate is
within $500 of the true mean starting salary? Assume that is $3500.
ANS: 266
14. The economics department secretary wants to estimate the average number
of bottled drinks sold each day out of the vending machine. Similar vending
machine sales have a standard deviation of 9 bottles per day. She wants to be
95-percent confident that the estimate, , obtained is within 3 bottles of the true
mean . How large a sample is needed?
ANS: 35
ANS: 68
16. A proponent of private aid to assist those who are suffering from hunger has
decided to take a random sample to estimate the average annual contribution in
her community to world hunger relief. A similar study in a comparable
community reported the standard deviation of giving as $15. Our proponent
believes that her estimate, , should be within $3 of the true average and with
92-percent confidence. If the community population is 1000, determine the
appropriate sample size.
ANS: 72
ANS: 47
18. A proponent of private aid to assist those who are suffering from hunger has
decided to take a random sample to estimate the average annual contribution in
her community to world hunger relief. A similar study in a comparable
community reported the standard deviation of giving as $15. Our proponent
believes that her estimate, , should be within $3 of the true average and with
92-percent confidence. What size sample is appropriate?
ANS: 77
ANS: 97
ANS: 93
21. Texas passed a law requiring drivers and passengers in the front seat to
wear seat belts. In order to estimate the compliance with the law an unmarked
car will operate in a selected county and the officer will observe whether or not
seat belts are in use. The proportion complying, , in the county is to be
estimated by randomly selecting the cars to examine. A confidence coefficient
of 0.98 is desired with the estimate, p, to be within 0.05 of the true proportion.
How many cars are needed in this sample?
ANS: 542
ANS: 40
ANS: 150
24. As marketing manager for a very large discount building supply, you wish to
know what proportion of your city’s population conducted do-it-yourself home
improvement projects during the past year. The national percentage is 62%;
therefore, you decide to use this value in estimating sample size for a study to
achieve 90% confidence with the estimate no more than 5% from the true
proportion. What is the appropriate sample size?
ANS: 256
25. As marketing manager for a very large discount building supply, you wish to
know what proportion of your city’s population conducted do-it-yourself home
improvement projects during the past year. The national percentage is 62%;
therefore, you decide to use this value in estimating sample size for a study to
achieve 90% confidence with the estimate no more than 5% from the true
proportion. If you know that the total city population is 30,000, what is the
appropriate sample size?
ANS: 253
26. The economics department secretary wants to estimate the average number
of bottled drinks sold each day out of the vending machine. Similar vending
machine sales have a standard deviation of 9 bottles per day. He wants to be
95-percent confident that the estimate, , obtained is within 3 bottles of the true
mean . How large a sample is needed if the limit of the Coke machine for one
day is 120?
ANS: 27
27. A proponent of private aid to assist those who are suffering from hunger has
decided to take a random sample to estimate the average annual contribution in
her community to world hunger relief. A similar study in a comparable
community reported the standard deviation of giving as $15. Our proponent
believes that her estimate should be within $3 of the true average and with 92-
percent confidence. If 1222 in this community contribute to world hunger relief,
what size sample is appropriate?
ANS: 73
28. Texas passed a law requiring drivers and passengers in the front seat to
wear seat belts. In order to estimate the compliance with the law an unmarked
car will operate in a selected county and the officer will observe whether or not
seat belts are in use. The proportion complying, , in the county is to be
estimated by randomly selecting the cars to examine. A confidence coefficient
of 0.98 is desired with the estimate, p, to be within 0.05 of the true proportion. If
an average number of cars passing the observation point per day is 12,000, how
many cars are needed in this sample?
ANS: 518
ANS: 83
ANS: 93
COMPLETION
4. Interval limits refer to the upper and lower values of the ____________________.
ANS:
thirty
30
SHORT ANSWER
ANS:
;
Mean = ____________________
Variance = ____________________
ANS:
2489.067; 5575.61
ANS:
0.312 inches, 0.332 inches. Since this interval does not include the 0.30 inches
in the specifications, the process is not performing satisfactorily.
A) Population: Not given; Sample size: 50; Population standard deviation: Given
B) Population: Normal; Sample size: 16; Population standard deviation: Not given
C) Population: Not given; Sample size: 40; Population standard deviation: Given
ANS:
Standard normal; t-distribution; Standard normal; Standard normal
____________________ to ____________________.
ANS:
0.548; 0.598
Confidence level:
Confidence coefficient:
ANS:
95%; .95
7. Determine the limits of the 88% confidence interval for , given that n = 72; =
18; and = 6.
____________________ to ____________________.
ANS:
16.9; 19.1
8. Determine the limits of the 90% confidence interval for , given that n = 144;
= 7; and = 2.
____________________ to ____________________
ANS:
6.73; 7.27
9. Determine the limits of the 68.26% confidence interval for , given that n = 64;
= 118; and = 8.
____________________ to ____________________
ANS:
117; 119
10. Determine the limits of the 75% confidence interval for , given that n = 90;
= 50; and = 12.
____________________ to ____________________
ANS:
48.5; 51.5
____________________ to ____________________
ANS:
$201.25; $214.75
PTS: 1 OBJ: Section 9.4
12. There are 300 students enrolled in Business Statistics. Historically, exam
scores are normally distributed with a standard deviation of 30.9. Your instructor
randomly selected a sample of 30 examinations and finds a mean of 74.2.
Determine a 90% confidence interval for the mean score for all students taking
the course.
____________________ to ____________________
ANS:
64.92; 83.48
13. Suppose that the amount of time teenagers spend on the internet is
normally distributed with a standard deviation of 1.5 hours. A sample of 100
teenagers is selected at random, and the sample mean computed as 6.5 hours.
____________________ to ____________________
ANS:
6.206; 6.794; If we repeatedly draw samples of size 100 from the population of
teenagers, 95% of the values of sample means will be such that the population
mean amount of time teenagers spend on the internet ( ) would be somewhere
between 6.206 hours and 6.794, and 5% of the values of will produce intervals
that would not include .
14. A filling machine is designed to fill soda bottles with 16 ounces of soda. The
distribution for the weight of the bottles is normal. Twenty bottles are selected
and weighed. The sample mean is 15.3 ounces and sample standard deviation is
1.5 ounces. Develop a 90% confidence interval for this sample. What can we
conclude about this filling process?
ANS:
14.72 ounces, 15.88 ounces. Since this interval does not contain 16 ounces, the
process is not working properly.
15. A college basketball player has been keeping track of his free throw shooting
practice over the summer. Each day that he practices, he shoots 150 free throws
and counts the number he makes. Today he made 117 free throws. What is the
95% confidence interval around this sample?
ANS:
16. From a sample of 400 people who were asked if they were in favor of a
proposition to ban smoking in restaurants, 320 responded yes. Develop a 98%
confidence interval using this sample.
ANS:
0.7534, 0.8466
____________________ to ____________________
ANS:
40.97; 55.84
18. For a selected month, the average kilowatt hours used for 49 residential
customers showed that is 1160 and s is 1085. Compute a 98% confidence
interval estimate for the true mean.
____________________ to ____________________
ANS:
786.91; 1533.08
19. Cement for building a dam is delivered by truck. The dispatcher wants to
estimate the average driver time required to load, drive to the dam, unload, and
drive back to the plant. The time is assumed to be normally distributed. Total
time required is recorded for a random sample of 18 with a mean of 543.71 and
a standard deviation of 69.83. Use these data to obtain an 80-percent
confidence interval for the average driver time.
____________________ to ____________________
ANS:
521.77; 565.65
20. An economics professor is taping 60-minute lectures for replay over the
university radio station. To make sure that the brand of tapes that will be used
records for at least sixty minutes, one dozen are purchased and examined to
determine if they do record for sixty minutes. The results of the recording test in
minutes show a mean of 61.28 and standard deviation of 3.868. The professor
assumes that the playing time is normally distributed. Construct an 80-percent
confidence interval for the mean recording time employing these data.
____________________ to ____________________
ANS:
59.758; 62.802
____________________ to ____________________
ANS:
1.30; 3.18
22. Compute a 95 percent confidence interval for given a sample mean of 25, a
sample size of 16, a sample standard deviation of 3.6, and a normal population.
____________________ to ____________________
ANS:
23.08; 26.92
____________________ to ____________________
ANS:
6.595; 12.005
24. Raising in-state and out-of-state tuition is supposed to reduce the number of
students in state supported universities. The registrar of a university wants to
estimate the proportion, , of students who are paying out-of-state tuition on the
installment plan (to later be compared with in-state installment plan payers). A
random sample of 80 students who live out-of-state is taken and 50 of them pay
tuition on the installment plan. Find a 99-percent confidence interval for based
on these data.
____________________ to ____________________
ANS:
0.486; 0.764
25. A mother who is interested in the true proportion of R-rated movies shown
on pay TV by a cable system randomly selects 98 listings and finds 14 of them
are R-rated movies. In her report to the subcommittee she wants to be 98%
confident that the true proportion will be in an interval which she states. She has
asked you to assist her by preparing a 98% confidence interval based on the
data she collected. What should she report?
____________________ to ____________________
ANS:
0.06; 0.22
26. Closed caption movies allow the hearing impaired to enjoy the dialogue as
well as the acting. A local organization for the hearing impaired people of the
community takes a random sample of 100 movie listings offered by the cable
television company in order to estimate the proportion of closed caption movies
offered. Fourteen movies were closed caption. The cable television company
says at least 5% of the movies shown are closed captioned. Prepare a 90%
confidence interval for the true proportion, .
____________________ to ____________________
ANS:
0.08; 0.20; The cable television company is correct in saying that at least 5% of
the movies it shows are closed captioned.
PTS: 1 OBJ: Section 9.6
27. Determine the limits of the 85% confidence interval for , given that n = 50;
and p = 0.20.
____________________ to ____________________
ANS:
0.12; 0.28
28. Determine the limits of the 90% specified confidence interval for , given that
n = 400; and p = 0.10.
____________________ to ____________________
ANS:
0.0753; 0.1247
29. A survey based on a random sample of 3000 U.S. women in a Virginia Slims
Poll showed that 1950 were combining marriage, career, and children. Develop a
98% confidence interval estimate of the true proportion of women in the U.S.
who fit into this category.
____________________ to ____________________
ANS:
0.63; 0.67
____________________ to ____________________
ANS:
0.637; 0.653
31. Thirty-eight percent of a sample of 100 indicated that they used a company
credit union because of lower interest rates for loans. Determine the 99%
confidence interval estimate for the portion that use company credit unions due
to lower loan rates.
____________________ to ____________________
ANS:
0.2551; 0.5049
____________________ to ____________________
ANS:
0.5245; 0.5955
33. The owner of a small manufacturing firm is concerned about the high cost of
health insurance for his employees. A random sample was taken from the total
of 120 employees to determine whether they would support the owner’s
proposal for participation in a preferred-provider plan. Of the 50 employees
sampled, 28 strongly supported the plan. Construct a 95% confidence for the
true proportion.
____________________ to ____________________
Do you feel that the proposed plan would get majority support from the
employees if put to a vote? Explain your answer.
ANS:
0.4546; 0.6654; Because the confidence interval is extremely wide, it would be
risky to definitely predict the outcome.
34. From a community of 80 homes, the EPA took a random sample and found
the presence of lead in the tap water. In the 10 homes sampled, the average
lead content was 12 parts per billion with s = 5 pts/billion. Construct a 99%
confidence interval estimate for the population mean.
____________________ to ____________________
Do you feel that the homes exceed the EPA recommended maximum of 15
pts/billion?
ANS:
7.1575; 16.8425; Yes; Because 15 is contained in the confidence interval, there
is a possibility that the 15 pts/billion is exceeded.
ANS:
7.897, 8.007
36. The temperature readings for 20 winter days in Big Rapids, Michigan are
normally distributed with a mean 5.5 degrees and a standard deviation of 1.5.
Determine the 90% confidence interval estimate for the winter mean
temperature.
____________________ to ____________________
ANS:
4.948; 6.052
37. The office manager for an ophthalmologist wants to estimate the average
time that the doctor spends with each patient in order to better schedule
appointments. A random sample of 49 is taken and the sample mean is 20.3
minutes. Assume that the office manager knows from past experience that the
standard deviation is 14 minutes. She finds that a 95% confidence interval is
between 18.3 and 22.3 minutes.
ANS:
20.3; 0.95
ESSAY
ANS:
While the point estimate is a single number (e.g., or p) that estimates the exact
value of a population parameter of interest (e.g., or ) an interval estimate
includes a range of possible values that are likely to include the actual (but
unknown) population parameter.
ANS:
A confidence interval is an interval estimate for which there is a specified degree
of certainty that the actual value of the population parameter will fall within the
interval.
ANS:
For a confidence interval, the proportion of such intervals that would include the
population parameter if the process leading to the interval were repeated a
great many times.
ANS:
Accuracy is the difference between the observed sample statistic and the actual
value of the population parameter being estimated. This may also be referred to
as estimation error or sampling error.
6. Explain how precision for a confidence interval is related to the width of the
confidence interval.
ANS:
Precision refers to the distance between the lower limit and the upper limit of a
confidence interval. The wider the interval the less the precision.
PTS: 1 OBJ: Section 9.4
7. How can one reduce the width of a confidence interval for a given level of
confidence?
ANS:
Take a larger sample size.
ANS:
Any observation, which is outside this interval, should be audited.
ANS:
We have 90% confidence that the population mean is included in this interval.
ANS:
The approximation is satisfactory whenever np and n(1 – p) are both 5.
However, the approximation is better for large values of n and whenever p is
closer to .50.
TRUE/FALSE
1. True or False
A Type I error is the probability of failing to reject a false null hypothesis.
2. True or False
A Type II error is the probability of rejecting a true null hypothesis.
3. True or False
The probability of Type I error is referred to as the significance level of the test.
ANS: T PTS: 1 OBJ: Section 10.1
4. True or False
A directional claim or assertion states that a population parameter is greater
than (>), or less than (), no more than(), at least,(<), some quantity.
5. True or False
A non-directional claim or assertion states that a population parameter is equal
to some quantity.
6. True or False
Directional assertions lead to what is called a two-tail test.
7. True or False
A non-directional assertion involves what are called one-tail tests.
8. True or False
For a two-tail hypothesis test, the level of significance is the sum of both tail
areas.
9. True or False
In conducting a hypothesis test, we can choose any significance level we desire.
In practice, however, levels of 0.10, 0.05, and 0.01 tend to be most common.
The decision is: Because the test statistic t = -2.81 falls in the rejection region,
reject H0. At the 0.01 level, there is enough statistical evidence to conclude that
the alternative hypothesis is true and that the average P/E ratio for the stocks is
less than 20.
The decision is: Because the test statistic t = -2.83 is not less than -2.33, fail to
reject H0. At the 0.01 level, the sample data suggest that the average P/E ratio
for the stocks is not less than 20.
MULTIPLE CHOICE
2. In a criminal trial where the null hypothesis states that the defendant is
innocent, a Type I error is made when:
a. a guilty defendant is found not guilty.
b. an innocent person is found guilty.
c. a guilty defendant is found guilty.
d. an innocent person is found not guilty.
ANS: B PTS: 1 OBJ: Section 10.1
3. A Type II error is defined as:
a. rejecting a true null hypothesis.
b. rejecting a false null hypothesis.
c. failing to reject a true null hypothesis.
d. failing to reject a false null hypothesis.
ANS: D PTS: 1 OBJ: Section 10.1
6. In a criminal trial where the null hypothesis states that the defendant is
innocent, a Type II error is made when:
a. a guilty defendant is found not guilty.
b. an innocent person is found guilty.
c. a guilty defendant is found guilty.
d. an innocent person is found not guilty.
ANS: A PTS: 1 OBJ: Section 10.1
7. A professor of math refutes the claim that the average student spends 3
hours studying for the midterm exam. Which hypothesis is used to test the
claim?
a. H0: 3, H1: > 3
b. H0: = 3, H1: 3
c. H0: 3, H1: = 3
d. H0: = 3, H1: < 3
ANS: B PTS: 1 OBJ: Section 10.1
8. In a two-tail test for the population mean, if the null hypothesis is rejected
when the alternative hypothesis is true then:
a. a Type I error is committed.
b. a Type II error is committed.
c. a correct decision is made.
d. a one-tail test should be used instead of a two-tail test.
ANS: C PTS: 1 OBJ: Section 10.1
9. In a one-tail test for the population mean, if the null hypothesis is not rejected
when the alternative hypothesis is true then:
a. a Type I error is committed.
b. a Type II error is committed.
c. a one-tail test should be used instead of a two-tail test.
d. a two-tail test should be used instead of a one-tail test.
ANS: B PTS: 1 OBJ: Section 10.1
10. A spouse stated that the average amount of money spent on Christmas gifts
for immediate family members is above $1200. The correct set of hypotheses is:
a. H0: = 200, H1: < 1200
b. H0: > 1200, H1: = 1200
c. H0: = 1200, H1: > 1200
d. H0: < 1200, H1: = 1200
ANS: C PTS: 1 OBJ: Section 10.1
14. Consider testing the hypothesis H0: = 800 vs. H1: 800.
If the value of the test statistic z equals 1.75, then the p-value is:
a. 0.0401.
b. 0.0802.
c. 0.4599.
d. 0.9198.
ANS: B PTS: 1 OBJ: Section 10.3
15. In a one-tail test, the p-value is found to be equal to .068. If the test had
been two-tail, the p-value would have been:
a. 0.932.
b. 0.466.
c. 0.034.
d. 0.136.
ANS: D PTS: 1 OBJ: Section 10.3
16. In order to determine the p-value, which of the following is not needed?
a. The level of significance
b. Whether the test is one or two tail
c. The value of the test statistic
d. All of these are needed.
ANS: A PTS: 1 OBJ: Section 10.3
If n = 64, = 53.5, and = 10, then the value of the test statistic is:
a. z = 1.96.
b. t = 1.64.
c. z = 2.80.
d. t = 1.96.
e. z = 1.64
ANS: C PTS: 1 OBJ: Section 10.3
18. Which of the following p-values will lead us to reject the null hypothesis if =
0.05?
a. 0.15
b. 0.10
c. 0.05
d. 0.025
ANS: D PTS: 1 OBJ: Section 10.3
19. Suppose that we reject a null hypothesis at the .05 level of significance.
Then for which of the following values do we also reject the null hypothesis?
a. 0.06
b. 0.04
c. 0.03
d. 0.02
ANS: A PTS: 1 OBJ: Section 10.3
20. Using the confidence interval when conducting a two-tail test for the
population mean we do not reject the null hypothesis if the hypothesized value
for :
a. is to the left of the lower confidence limit.
b. is to the right of the upper confidence limit.
c. falls between the lower and upper confidence limits.
d. falls in the rejection region.
ANS: C PTS: 1 OBJ: Section 10.4
21. Based on sample data, the 90% confidence interval limits for the population
mean are 170.86 and 195.42. If the 10% level of significance was used in testing
the hypothesesH0: = 201, H1: 201, the null hypothesis:
a. would be rejected.
b. would not be rejected.
c. would have to be revised.
d. none of these.
ANS: A PTS: 1 OBJ: Section 10.4
23. In testing the hypotheses H0: = 0.40, H1: > 0.40 at the 5% significance
level, if the sample proportion is .45, and the standard error of the sample
proportion is .035, the appropriate conclusion would be:
a. to reject H0.
b. not to reject H0.
c. to reject H1.
d. to reject both H0 and H1.
ANS: B PTS: 1 OBJ: Section 10.6
NUMERIC RESPONSE
Find
ANS: 2.575
NARRBEGIN: Sermons
My daughter and I have argued about the average length of our preacher’s
sermons on Sunday morning. Despite my arguments, she thinks that the
sermons are more than twenty minutes (over twenty minutes is important to her
especially when soccer was playing on TV). For one year she has randomly
selected 12 Sundays and found the average time of 26.42 minutes with a
standard deviation of 6.69 minutes. Assuming that the population is normally
distributed and using a 0.05 level of significance, is my daughter correct in
thinking that the average length of sermons is more than 20 minutes?
NARREND
2. What is the value of test statistic t?
ANS: 3.32
NARRBEGIN: Bank
A bank holds the mortgage on a motel which the motel owner says has an
average daily revenue of at least $2048. In the event the average is not at least
$2048, the bank will ask the motel owner for additional collateral for the loan as
protection against loan default. A loan officer is aware that the daily average
revenue is normally distributed and randomly selects 20 days during the last six
months. The sample has a mean of $2003 and standard deviation of $144.00. At
the 0.01 level, is the bank loan financially sound (should the bank ask for
additional collateral)?
NARREND
3. What is the value of test statistic t?
ANS: -1.3975
ANS: 2.33
ANS: 0.0141
6. For the critical z, identify the sample proportion that this z-value represents.
p = ____________________
ANS: 0.0428
7. Calculate the probability of a Type II error for the following test of hypothesis
H0: = 50 versus H1: > 50 given that = 55, =.05, = 10, and n = 16.
ANS: 0.3594
8. A long-haul truck driver is eligible for a year-end bonus if the average miles
driven per day are more than 375 miles per eight-hour day. To determine
whether or not a driver is to receive a year-end bonus, a random sample of 36
days from the driver’s logbook is taken and the sample mean is computed. The
hypotheses are: H0: 375 miles (no bonus) vs. H1: > 375 miles (eligible for
bonus). If the population standard deviation = 70 miles and the true mean is
405, what is the power of the test at = 0.05?
ANS: 0.8238
9. Determine the power for the following test of hypothesis, H0: = 950 vs. H1:
950, given that = 1000, = 0.10, = 200, and n = 25.
ANS: 0.3465
COMPLETION
3. The ____________________ will bound rejection and non-rejection regions for the
null hypothesis.
ANS:
critical value
critical values
ANS: t-test
SHORT ANSWER
NARREND
1. Using the above graph, choose the answers that indicate a right-tail test:
Area 1: ____________________
Area 2: ____________________
Area 3: ____________________
ANS:
Do not reject H0; Do not reject H0; Reject H0
2. Using the above graph, choose the answers that indicate a left-tail test:
Area 1: ____________________
Area 2: ____________________
Area 3: ____________________
ANS:
Reject H0; Do not reject H0; Do not reject H0
3. Using the above graph, choose the answers that indicate a two-tail test:
Area 1: ____________________
Area 2: ____________________
Area 3: ____________________
ANS:
Reject H0; Do not reject H0; Reject H0
4. A researcher wants to carry out a hypothesis test involving the mean for a
sample of size n = 15. He does not know the true value of the population
standard deviation, but is reasonably sure that the underlying population is
approximately normally distributed. Should he use a z-test or a t-test in carrying
out the analysis? Why?
ANS:
t-test; He should use a t-test to carry out the analysis since the population
standard deviation is unknown, but he is reasonably sure the population is
approximately normally distributed.
5. It has been claimed that no more than 3% of the units coming off an
assembly line are defective. Formulate a null hypothesis and an alternative
hypothesis for this situation.
H0: ____________________
H1: ____________________
ANS:
0.03; > 0.03
6. It has been claimed that no more than 3% of the units coming off an
assembly line are defective. Will this test be a one-tail – left, a one tail – right, or
a two-tail? Explain your answer.
ANS:
Right-tail; This is a one-tail test since this is a directional claim (no more than
3%). It is a right-tail test since the alternative hypothesis has a greater-than
sign. The rejection region is located in the right tail of the standard normal
curve.
NARRBEGIN: Merit
In order to receive a merit raise a computer clerk must average less than 15
minutes per order form processed. An across-the-board raise was given if the
average is 15 minutes or more. The supervisor randomly selects 36 order forms
and carefully times the length of time required to enter all of the data on each
form. The significance level = 0.02. The average time to enter the 36 order
forms is 14.36 minutes, and the population standard deviation is 3.03 minutes
with a normal distribution. The test question is: Should a merit raise be given?
NARREND
7. State the null and alternative hypotheses.
ANS:
15 15, H1: H0:
8. State the decision rule in terms of the critical value as well as the p-value.
ANS:
Reject H0 if z < -2.05 or if p-value < 0.02
PTS: 1 OBJ: Section 10.3
Conclusion: ____________________
ANS:
z-test; -1.267; Do not reject H0
ANS:
ANS:
, then reject H0z orthen fail to reject H0. If z z If -1.64
ANS:
Since z = 2.96, we reject H0.
ANS:
ANS:
, then reject H0then fail to reject H0. If z If z -1.64
ANS:
Since z = -0.735, we fail to reject H0.
NARRBEGIN: Dimensions
The dimensions of a two by four board are not actually two inches by four
inches. Rather the actual dimensions are supposed to be 1.50 inches by 3.50
inches. A planer finishes the boards to the proper dimensions. To maintain
quality control, periodic checks are made to see that the planer is set to the
correct dimension. When the planer is adjusted properly, the average width is =
3.5 inches. If the average width is not 3.5 inches, the planer must be adjusted.
Thus the null hypothesis is = 3.5 inches and the alternate hypothesis is 3.5
inches. For control purposes, a random sample of 49 boards is taken twice daily.
The standard deviation of the process is = 0.25 inches. To minimize the
unnecessary adjustment of the planer, the risk of a Type I error, , is set at 0.05.
NARREND
17. Determine the lower and upper critical values for .
ANS:
3.43; 3.57
ANS:
If 3.43 inches 3.57 inches, then fail to reject H0. If > 3.5784 inches, then reject
H0.
ANS:
ANS:
, then reject H0then fail to reject H0. If z If z
ANS:
Since z = 2.09, we reject H0.
ANS:
Since the p-value equals 0.0183, which is less than = 0.02, we reject H0.
ANS:
ANS:
Since z = 1.41 and the p-value is 0.9207, we fail to reject the null hypothesis.
There is not enough evidence to conclude that at least 80% of their customers
rate the establishment “Excellent”.
ANS:
If z 1.645, reject H0. If z > 1.645, fail to reject H0.
ANS:
If z -1.28 fail to reject H0. If z < -1.28, then reject H0
ANS:
If z 2.575 fail to reject H0. If z > 2.575 reject H0.
NARRBEGIN: Restaurant
A restaurant claims that the average gratuity per table is $15.00. To test this
claim, a sample of 18 tables was selected which had an average gratuity of
$13.75 with a standard deviation of $2.60. Assume the population of gratuities
is normally distributed and use = 0.10.
NARREND
28. State the null and alternative hypotheses.
ANS:
ANS:
Since t = -2.040, we reject the null hypothesis. The claim that the average
gratuity is $15.00 is not supported by this sample.
ANS:
ANS:
1.645, then reject H0then fail to reject H0. If z 1.645If z
ANS:
Since z = 2.26 and the p-value is 0.0119 we reject the null hypothesis, and
conclude that the candidate will receive more than 50% of the votes.
H0: ____________________
H1: ____________________
ANS:
4500; > 4500
ANS:
Reject H0 if z > 2.33, fail to reject H0 if z 2.33.
ANS:
z-test; 2.504
ANS:
Reject H0; there is enough statistical evidence to conclude that the alternative
hypothesis is true and that the average per capita retail sales for the city is
greater than $4500
38. Determine the p-value associated with each of the following values of the
standardized test statistic z.
ANS:
0.1336; 0.1469; 0.0082
ANS:
H1: H0:
ANS:
Reject H0 if z < -1.645– 1.645; fail to reject H0 if z
41. During the last energy crisis, a government official claimed that the average
car owner refilled the tank when there was more than 3 gallons left. To check the
claim, 10 cars were surveyed as they entered a gas station. The amount of gas
was measured and recorded as follows: 3, 5, 3, 2, 3, 3, 2, 6, 4, and 1. If we
assume that the amount of gas remaining in tanks is normally distributed with a
standard deviation of one gallon, can we conclude at the 10% significance level
that the official was correct?
Conclusion: _________________________
Interpretation: _____________________________________________
ANS:
z-test; 0.63; Do not reject H0; Official was not correct
42. Use the given information to calculate the p-value and interpret the result:
p-value = ____________________
Interpretation: ____________________
ANS:
0.05; Do not reject H0
43. Use the given information to calculate the p-value and interpret the result:
p-value = ____________________
Interpretation: ____________________
ANS:
0.0228; Reject H0
44. Use the given information to calculate the p-value and interpret the result:
p-value = ____________________
Interpretation: ____________________
ANS:
0.1736; Do not reject H0
45. Use the given information to calculate the p-value and interpret the result:
p-value = ____________________
Interpretation: ____________________
ANS:
0.0094; Reject H0
46. Use the given information to calculate the p-value and interpret the result:
p-value = ____________________
Interpretation: ____________________
ANS:
0.0344; Reject H0
47. A filling machine is designed to fill soda bottles with 16 ounces of soda. The
distribution for the weight of the bottles is normal. Twenty bottles are selected
and weighed. The sample mean is 15.3 ounces and sample standard deviation is
1.5 ounces. Develop a 90% confidence interval for this sample. What can we
conclude about this filling process with = 0.10?
ANS:
The 90% confidence interval is 14.72, 15.88. Since this interval does not include
the expected population mean of 16.0 ounces, we can conclude the process is
not performing properly.
48. Suppose that 9 observations are drawn from a normal population whose
standard deviation is 2. The observations are: 15, 9, 13, 11, 8, 12, 11, 7, and 10.
At 95% confidence, you want to determine whether the mean of the population
from which this sample was taken is significantly different from 10.
H0: ____________________
H1: ____________________
ANS:
= 10; 10; 1.0; Reject H0 if |z| > 1.96; Do not reject H0
H0: ____________________
H1: ____________________
ANS:
= 1300; > 1300; 3.0; The mean GRE score at MSU is higher than the national
average.
50. A social scientist claims that the average adult watches less than 26 hours of
television per week. He collects data on 25 individuals’ television viewing habits
and finds that the mean number of hours that the 25 people spent watching
television was 22.4 hours. If the population standard deviation is known to be
eight hours, can we conclude at the 1% significance level that he is right?
Conclusion: ______________________________
ANS:
z-test; -2.25; The scientist is wrong
NARRBEGIN: Table 1
Based on the sample data, a confidence interval has been constructed such that
we have 95% confidence that the population mean is between 75 and 105.
NARREND
51. What conclusion would be reached at the = .05 level in testing H0: = 85
versus H1: 85?
ANS:
Do not reject H0
52. What conclusion would be reached at the = .05 level in testing H0: = 125
versus H1: 125?
ANS:
Reject H0
53. What conclusion would be reached at the = .05 level in testing H0: = 100
versus H1: 100?
ANS:
Do not reject H0
54. What conclusion would be reached at the = .05 level in testing H0: = 65
versus H1: 65?
ANS:
Reject H0
55. For a sample of 36 items from a population for which the standard deviation
= 12, the sample mean is 350. Construct a 95% confidence interval for the
population mean.
____________________ to ____________________
ANS:
346.08; 353.92
56. For a sample of 36 items from a population for which the standard deviation
the sample mean is 350. Use the 95% confidence interval to test the hypothesis
H0: = 400 versus H1: 400 at the = .05 level of significance.
ANS:
= 0.05 = 400 falls outside the confidence limits (346.08 and 353.92), we
reject H0 at Since the hypothesized value of
ANS:
Reject H0 if the calculated t > 1.677.
Conclusion: __________________________
ANS:
t-test; 1.922; Reject H0
PTS: 1 OBJ: Section 10.5
ANS:
At the 0.05 level of significance, there is enough statistical evidence to conclude
that the alternative hypothesis is true and that the average annual rainfall in the
50 cities is greater than 31 inches.
60. A parents group in New York City was concerned with the alcohol content in
an over-the-counter cough syrup. They felt that the content was greater than
the 2 cc/bottle stated on the label. A sample of 30 bottles showed an average
alcohol content of 3.1 cc/bottle with a standard deviation of 2 cc./bottle. Using a
0.05 level of significance and assuming a normal distribution, does the sample
indicate that the alcoholic content is above 2 cc/bottle?
Conclusion: ____________________
Interpretation: _________________________________________________________
ANS:
t-test; 3.01; Reject H0; Parents are correct
61. A production engineer is checking the volume in cans of paint as the cans
leave the filling line. The machine is set to fill the cans with 4 liters. However,
the engineer wishes to determine if the cans are being filled with less than 4
liters. A sample of 36 is taken. The sample shows a mean of 3.97 and a standard
deviation of 0.14. Conduct the appropriate test at the 0.01level of significance.
Interpretation: ____________________________________________________
ANS:
t-test; -1.286; Do not reject H0; The cans are filled properly
NARRBEGIN: Sermons
My daughter and I have argued about the average length of our preacher’s
sermons on Sunday morning. Despite my arguments, she thinks that the
sermons are more than twenty minutes (over twenty minutes is important to her
especially when soccer was playing on TV). For one year she has randomly
selected 12 Sundays and found the average time of 26.42 minutes with a
standard deviation of 6.69 minutes. Assuming that the population is normally
distributed and using a 0.05 level of significance, is my daughter correct in
thinking that the average length of sermons is more than 20 minutes?
NARREND
62. State the null and alternative hypotheses.
H0: ____________________
H1: ____________________
ANS:
20; > 20
ANS:
Reject H0 if t > 1.796
NARRBEGIN: Bank
A bank holds the mortgage on a motel which the motel owner says has an
average daily revenue of at least $2048. In the event the average is not at least
$2048, the bank will ask the motel owner for additional collateral for the loan as
protection against loan default. A loan officer is aware that the daily average
revenue is normally distributed and randomly selects 20 days during the last six
months. The sample has a mean of $2003 and standard deviation of $144.00. At
the 0.01 level, is the bank loan financially sound (should the bank ask for
additional collateral)?
NARREND
65. State the null and alternative hypotheses.
ANS:
H1: H0:
ANS:
Reject H0 if t < -2.539
ANS:
Do not reject H0. The bank should not ask for additional collateral.
ANS:
H0:
H1:
ANS:
2.093, fail to reject H0. If tIf -2.093 t < -2.093 or t > 2.093, then reject H0.
ANS:
Since t = 3.05, we reject the null hypothesis and conclude that the average life
span of male mosquitos is not 15 days.
NARRBEGIN: Golf
A round of golf on your typical course requires at least 4.5 hours. A particular
golf club claims that the average length of time to complete a round of golf on
their course is less than 4.5 hours. To test this claim, the club sampled 15 golfers
and recorded their time to finish their round. The sample average was 4.4 hours
with a standard deviation of 0.8 hours. Assume the time to complete a round of
golf is normally distributed and use = 0.01.
NARREND
71. State the null and alternative hypotheses.
ANS:
ANS:
, then reject H0.then fail to reject H0. If t If t
ANS:
Since t = -0.484, we fail to reject the null hypothesis. There does not appear to
be a basis for the claim that the average round of golf can be completed in less
than 4.5 hours.
74. A random sample of 10 observations was drawn from a large population. The
data values were: 22, 17, 27, 20, 23, 19, 24, 18, 19, and 24.
____________________ to ____________________
H0: ____________________
H1: ____________________
Conclusion: ______________________
C) What is the required condition of the techniques used in parts (a) and (b)?
D) What graphical device can you use to check to see if that required condition
is satisfied?
ANS:
19.446; 23.154; = 20; 20; t-test; 1.285; Do not reject H0; The population is
approximately normally distributed.; Histogram
75. A random sample of 10 observations was drawn from a large population. The
data values were: 6, 4, 4, 7, 5, 5, 4, 5, 6, and 4 Test to determine if we can infer
at the 5% significance level that the population mean is less than 6.
ANS:
6, Rejection region: t = 6, H1: H0: < -t.05, 9 = -1.833; Test statistic: t =
-3.0; Reject H0, that is, enough statistical evidence to suggest that the
population mean is less than 6
Test the following hypotheses: H0: = 8, H1: > 8, at the 1% significance level.
Appropriate test:____________________
Conclusion: ____________________
ANS:
t-test; 3.403; Reject H0
77. Use the following information to test at a 0.01 level of significance. n = 400;
p = 0.03;
H0: 0.05, H1: < 0.05.
ANS:
z-test; -1.835
78. Use the following information to test at a 0.01 level of significance. n = 400;
p = 0.03;
H0: 0.05, H1: < 0.05.
ANS:
Do not reject H0; there is not enough statistical evidence to conclude that the
alternative hypothesis is true
ANS:
z-test; 1.50
ANS:
Do not reject H0; there is not enough statistical evidence to conclude that the
alternative hypothesis is true
ANS:
H1: H0:
82. For the critical z, identify the sample proportion that this z-value represents.
p = ____________________
Use the value of p determined above to state a decision rule for the test.
ANS:
0.0428; Reject H0 if p > 0.0428
83. What is the probability that the manager will fail to reject a false H0 if the
actual population proportion of defects is 0.03?
Z = ____________________
= ____________________
ANS:
0.91; 0.8186
ANS:
= 0.24, H1: H0: < 0.24; Rejection region: z < -z.01=-2.33; Test statistic: z =
-1.99; Fail to reject H0;
ANS:
rejecting a false null hypothesis.
86. A hypothesis test has been set up and is to be conducted at the =.025 level
of significance. If the sample is doubled, what will be the effect on ? On ?
ANS:
None; Decrease
H0: ____________________
H1: ____________________
ANS:
10; > 10
ANS:
Reject H0 if > 11.974
89. What is the probability of making a Type II error if the true mean is 14?
ANS:
= 0.0455
ANS:
= 0.9545Power = 1 –
___________________________________________a.
___________________________________________b.
___________________________________________c. 1 –
d. H0 ___________________________________________
e. H1 ___________________________________________
ANS:
a. Alpha, represents the probability of making a Type I error.
b. Beta, represents the probability of making a Type II error.
c. The power of a test (the probability that the test will correctly reject a fall null
hypothesis)
d. Null hypothesis
e. Alternative hypothesis
ESSAY
ANS:
A statement about the value of a population parameter; it is put up for testing in
the face of numerical evidence.
ANS:
An alternative hypothesis is an assertion that holds if the null hypothesis is false.
ANS:
A one-tail test is one in which a null hypothesis can be rejected by an extreme
result in one direction only.
ANS:
A two-tail test is one in which a null hypothesis can be rejected by an extreme
result occurring in either direction.
ANS:
Apparently a minimum number of cars passing by the location is needed to be
profitable. Since the average number of cars in the sample was not enough to
make a profitable outlet in the judgment of the executive, he rejected the
hypothesis of profitability. When another company built a store on the same
location , which was profitable, it showed the mistake, which had been made. In
statistical terms, the executive committed a Type I error (erroneously rejecting a
true null hypothesis).
ANS:
Equation for calculating test statistic, z-test for sample proportion.
ANS:
The power of a test is the probability that the test will respond correctly by
rejecting a false null hypothesis. By calculating the power of the test (1 – ) for
several assumed values for the population mean and plotting the power (on the
vertical axis) versus the population mean (on the horizontal axis), we arrive at
the power curve. By looking at the power curve, we can get an idea of how
powerful the hypothesis test is for different possible values of the population
mean.
ANS:
The operating characteristic (OC) curve plots the probability that the hypothesis
test will not reject the null hypothesis for assumed values for the population
mean. The OC curve is the complement of the power curve. It is found by
plotting (on the vertical axis) versus the population mean (on the horizontal
axis).
TRUE/FALSE
1. True or False
When comparing the mean of two dependent samples, we are interested in only
one variable: the difference between measurements for each person or object.
2. True or False
A typical example of dependent samples occurs when we have before-and-after
measures of the same individuals.
3. True or False
An important factor in choosing between the pooled-variances t-test and the
unequal-variances t-test is whether we can assume the population means might
be equal.
ANS: F PTS: 1 OBJ: Section 11.1
4. True or False
Although it is often associated only with small-sample tests, the t distribution is
appropriate when the population standard deviations are unknown, regardless of
how large or small the samples happen to be.
5. True or False
When the population standard deviations are unknown and not assumed to be
equal, pooling the sample standard deviations into a single estimate of their
common population value is no longer applicable.
6. True or False
When the population standard deviations are unknown and are not assumed to
be equal, s1 and s2 must be used to estimate their respective population
standard deviations, 1 and 2.
7. True or False
The number of degrees of freedom associated with the unequal-variances t-test
for comparing the means of two independent samples is n1 + n2 – 2.
8. True or False
When either test can be applied to the same data, the z-test for comparing the
means of two independent samples is preferable to the unequal-variances t-
tests, especially when doing the test with computer assistance.
9. True or False
The z-test requires no assumptions about the shape of the population
distributions as long as both n1 and n2 are 30.
ANS: T PTS: 1 OBJ: Section 11.4
MULTIPLE CHOICE
1. A political analyst in Texas surveys a random sample of registered Democrats
and compares the results with those obtained from a random sample of
registered Republicans. This would be an example of:
a. Independent samples.
b. dependent samples.
c. Independent samples only if the sample sizes are equal.
d. Dependent samples only if the sample sizes are equal.
ANS: A PTS: 1 OBJ: Section 11.1
3. In testing whether the means of two normal populations are equal, summary
statistics computed for two independent samples are as follows:
Assume that the population variances are equal. Then, the standard error of the
sampling distribution of the sample mean difference is equal to:
a. 0.1017.
b. 1.2713.
c. 0.3189.
d. 1.1275.
ANS: C PTS: 1 OBJ: Section 11.2
4. Two samples of sizes 25 and 35 are independently drawn from two normal
populations, where the unknown population variances are assumed to be equal.
The number of degrees of freedom of the equal-variances t-test statistic is:
a. 60.
b. 59.
c. 58.
d. 35.
ANS: C PTS: 1 OBJ: Section 11.2
7. When the necessary conditions are met, a two-tail test is being conducted to
test the difference between two population means, but your statistical software
provides only a one-tail area of 0.036 as part of its output. The p-value for this
two-tail test will be:
a. 0.018.
b. 0.009.
c. 0.072.
d. 0.964.
ANS: C PTS: 1 OBJ: Section 11.4
8. The number of degrees of freedom associated with the t test, when the data
are gathered from a matched pairs experiment with 13 pairs, is:
a. 13.
b. 26.
c. 12.
d. 24.
ANS: C PTS: 1 OBJ: Section 11.5
9. Diet and health have become big business, but many people are not following
good health practices. In 1980, a survey of 150 indicated that 30 percent said
that at least one time during that year they had five or more alcoholic drinks in
one sitting. In 2000, a similar survey of 150 showed that 38 percent had five
drinks in one sitting at least one time during the year. Does this represent a
significant increase? Use 0.01 level of significance.
a. The test statistic of -1.46 is not less than -1.96; therefore, the sample
evidence indicates that there has been no significant change in the proportion of
those who drink heavily at least one day during the year.
b. The test statistic of -1.46 is not less than -2.58; therefore, the sample
evidence indicates that there has been no significant change in the proportion of
those who drink heavily at least one day during the year.
c. The test statistic of -1.46 is not less than -2.33; therefore, the sample
evidence indicates that there has not been significant change in the proportion
of those who drink heavily at least one day during the year.
d. The test statistic of -2.56 is less than -2.33; therefore, the sample evidence
indicates that there has been significant change in the proportion of those who
drink heavily at least one day during the year.
e. The test statistic of -2.56 is less than -2.58; therefore, the sample evidence
indicates that there has been significant change in the proportion of those who
drink heavily at least one day during the year.
ANS: C PTS: 1 OBJ: Section 11.6
10. When the necessary conditions are met, a two-tail test is being conducted to
test the difference between two population proportions. The two sample
proportions are p1 = 0.25 and p2 = 0.20, and the standard error of the sampling
distribution of p1 – p2 is 0.04. The calculated value of the test statistic will be:
a. z = 0.25.
b. z = 1.25.
c. t = 0.25.
d. t = 0.80.
ANS: B PTS: 1 OBJ: Section 11.6
11. A sample of size 100 selected from one population has 60 successes, and a
sample of size 150 selected from a second population has 95 successes. The
test statistic for testing the equality of the population proportions equal to:
a. -0.5319.
b. 0.7293.
c. -0.419.
d. 0.2702.
ANS: A PTS: 1 OBJ: Section 11.6
12. When the necessary conditions are met, a two-tail test is being conducted to
test the difference between two population proportions. If the value of the test
statistic z is 2.05, then the p-value is:
a. 0.4798.
b. 0.0404.
c. 0.2399.
d. 0.0202.
ANS: B PTS: 1 OBJ: Section 11.6
13. In testing for the equality of two population variances, when the populations
are normally distributed, the 10% level of significance has been used. To
determine the rejection region, it will be necessary to refer to the F table
corresponding to an upper-tail area of:
a. 0.90.
b. 0.05.
c. 0.20.
d. 0.10.
ANS: B PTS: 1 OBJ: Section 11.7
14. Which of the following statements are true?
a. The pooled-variances t-test is used whenever the population standard
deviations are assumed to be equal regardless of the sample size.
b. The unequal-variances t-test is used whenever the population standard
deviations are unknown and cannot be assumed to be equal.
c. The z-test can be used as a close approximation to the unequal-variances t-
test when the population standard deviations are not assumed to be equal, but
samples are large (each n 30).
d. All of these statements are true.
e. None of these statements is true.
ANS: D PTS: 1 OBJ: Section 11.8
NUMERIC RESPONSE
ANS: 2.48
ANS: 4.776
H1: A random sample of 35 sale homes is taken from the Northeast and 41 from
those in the Southwest using the multiple listing services. A Minitab summary of
the results of the two samples expressed in thousands follows:
NARREND
3. What is the standard error of the difference between the two means?
ANS: 10.410
z = ____________________
ANS: 2.526
H1: (mean age is different) A random sample of males and females involved in
rear-end accidents results in the following Minitab summary:
NARREND
5. What is the standard error of the difference between the two means?
ANS: 2.314
ANS: 0.1646
ANS: 0.3557
NARRBEGIN: Medicare
An economist wishes to test whether there is any difference in the proportion of
households receiving Medicare for white households with less than $5,000
income per year and white households with $15,000 or more income per year. A
random sample from households with less than $5,000 income per year is taken,
and a random sample of households with $15,000 or more income is taken. The
results of the samples are:
NARREND
8. Use a 0.01 significance level to determine if there is sufficient evidence to
conclude that a difference in proportions exists. Calculate the standard error of
estimate.
ANS: 0.039
PTS: 1 OBJ: Section 11.6
ANS: 0.0094
ANS: 0.0375
COMPLETION
1. ____________________ samples are those for which the selection process for one
is not related to the selection process for the other.
ANS: Independent
2. We say samples are ____________________ when the selection process for one is
related to the selection process for the other.
ANS: dependent
PTS: 1 OBJ: Section 11.1
3. The pooled-variances t-test requires the assumption that the two population
___________________________________ are equal, and that the populations are at
least ___________________________________ distributed.
ANS: z-test
6. A test that we can use to determine whether two population variances are
equal is the ____________________.
ANS: F-test
ANS: t-test
SHORT ANSWER
NARRBEGIN: Price earnings
A comparison of the price-earnings (P/E) ratio for the top and bottom 100
companies in valuation is being prepared. A financial advisor randomly sampled
each group to determine whether there is any difference in P/E ratios of the two
groups of companies. Let 1 = a top 100 company and 2 = a bottom 100
company. Assume equal population variances and that the populations are
normally distributed. The advisor is to use a 0.01 significance level. The data
were randomly selected and are summarized below:
NARREND
1. State the null and alternative hypotheses.
H0: ____________________
H1: ____________________
ANS:
;
ANS:
t-test; 1.71
ANS:
Reject H0 if t < -3.169 or if t > 3.169.
H0: ____________________
H1: ____________________
ANS:
(no effect on sales
; (some effect on sales)
ANS:
t-test; -0.446
ANS:
Do not reject H0
NARRBEGIN: Battery
A new brand of battery for use in calculators and cameras is said to last
significantly longer than another brand. A camera manufacturer is to test this
brand (Y) with brand (X) to see if brand Y has a longer life. If brand Y does last
longer, the camera manufacturer will equip their new cameras with them. If not,
they will equip them with brand X that is less costly. Twenty cameras are
equipped as follows: ten with brand Y batteries and ten with brand X batteries
and the life of the batteries measured. Is brand Y superior (does it last longer) to
brand X? To answer this question, the camera manufacturer uses a 0.01
significance level and knows that the populations are normally distributed with
equal variances. Sample data are:
NARREND
9. State the null and alternative hypotheses.
H0: ____________________
H1: ____________________
ANS:
ANS:
t-test; -15.95
ANS:
Reject H0 if t < -2.552; Fail to reject H0 if t
ANS:
Reject H0 The evidence points to Brand Y lasting longer than Brand X, so the
camera manufacturer should equip its cameras with Brand Y
Variable Q3 Maximum
Supplier 1 241.50 246.00
Supplier 2 225.25 230.00
ANS:
Difference = mu (Supplier 1) – mu (Supplier 2)
Estimate for difference: 19.0000
95% CI for difference: (13.9271, 24.0729)
T-Test of difference = 0 (vs not =): T-Value = 7.98 P-Value = 0.000 DF = 15
Since the P-value < 0.05, we reject H0, there is enough statistical evidence to
conclude that the true mean number of good pigeons from supplier 1 is higher
than those from supplier 2.
Long-Lasting Battery
(Population 1) Regular Battery
(Population 2)
51 42
44 29
58 51
36 38
48 39
53 44
57 35
40 40
49 48
44 45
60
50
NARREND
15. Assume unequal variances. State the null and alternative hypothesis.
ANS:
16. Assume unequal variances. How many degrees of freedom are associated
with this test?
ANS:
d.f. = 19.92, rounded to 20.
17. Assume unequal variances. Using = 0.01, state the decision rule in terms of
the critical value.
ANS:
Reject if t > 2.528. Fail to reject if t 2.528.
ANS:
Total
Variable Count Mean SE Mean TrMean StDev Minimum Q1 Median Q3
Top ten 10 7.796 0.322 7.823 1.019 6.420 6.625 7.935 8.800
Bottom ten 10 6.476 0.753 7.015 2.382 0.000 6.345 7.130 7.540
Variable Maximum
Top ten 8.960
Bottom ten 8.640
20. Use Minitab or Excel to conduct a two-sample t-test to test the hypotheses:
H0: H1: at the 5% significance level.
ANS:
Two-sample T for Top ten vs Bottom ten
H0: ____________________
H1: ____________________
Conclusion: ____________________
Interpretation: _________________________________________________
ANS:
; ; t-test; 2.766; Reject H0; Government employees do take longer breaks
22. Estimate with 95% confidence the difference in coffee breaks mean time
between the two groups.
____________________ to ____________________
ANS:
4.309 = (1.371, 10.029)5.7
ANS:
We estimate that government employees, on average, take between 1.371 and
10.029 minutes longer for coffee breaks than private sector workers do.
NARRBEGIN: Realtor
A new realtor in a large community is attempting to determine differences in the
selling prices of houses in two sections of the community. Population 1 is the
Northeast and Population 2 is the Southwest. The realtor is going to perform the
hypothesis test: H0:
H1: A random sample of 35 sale homes is taken from the Northeast and 41 from
those in the Southwest using the multiple listing services. A Minitab summary of
the results of the two samples expressed in thousands follows:
NARREND
24. What is the decision rule at the 0.01 level of significance?
ANS:
or if z 2.575; Reject H0 if z z Fail to reject H0 if -2.575
ANS:
Do not reject H0
H1: (mean age is different) A random sample of males and females involved in
rear-end accidents results in the following Minitab summary:
NARREND
26. What is the value of the test statistic?
ANS:
z-test; -2.14
ANS:
if z or 2.24; Reject H0 if z z Fail to reject H0 if -2.24
ANS:
Do not reject H0 There is not enough statistical evidence to say that there is a
difference in the average age of female and male drivers involved in rear-end
accidents.
H0: ____________________
H1: ____________________
ANS:
;
30. The test question is: At the 0.05 level of significance, is the gluing of seams
better than sewing? What is the value of the test statistic?
ANS:
z-test; -3.86
31. The test question is: At the 0.05 level of significance, is the gluing of seams
better than sewing? What is the decision rule?
ANS:
-1.645; Reject H0 if z Fail to reject H0 if z
32. The test question is: At the 0.05 level of significance, is the gluing of seams
better than sewing? What is the conclusion?
ANS:
Reject H0 Conclude that the new gluing procedure produces stronger seams
than the conventional sewing.
____________________ to ____________________
ANS:
-1.49; 7.89
If a 98% confidence interval for the true difference in means is (-1.49, 7.89)
what can you conclude about a hypothesis test for H0: at the 0.02 level?
ANS:
Do not reject H0 because 0 is included in the confidence interval. We conclude
that the difference between the two population means could be zero.
____________________ to ____________________
ANS:
-0.2015; 3.2015
____________________ to ____________________
ANS:
-6.00; 3.96
NARRBEGIN: Portfolio
An analyst is looking at two portfolios of common stocks in terms of the average
price-earnings ratio. He wishes to determine if there is a difference between the
two portfolios.
NARREND
37. State the null and alternative hypotheses.
H0: ____________________
H1: ____________________
ANS:
;
H0: ____________________
H1: ____________________
ANS:
;
ANS:
z-test; 2.40
PTS: 1 OBJ: Section 11.4
ANS:
to reject H0; If zfailzIf -1.96 < -1.96 or if z > 1.96, reject H0;
ANS:
Reject H0 and conclude that the true mean rank of the two soft drinks are not
equal
H0: ____________________
H1: ____________________
ANS:
;
44. A test for the equality of average number of tickets per driver for the two
groups is desired. Use = 0.01. What is the value of the test statistic?
ANS:
z-test; 1.39
45. A test for the equality of average number of tickets per driver for the two
groups is desired. Use = 0.01. What is the conclusion?
ANS:
Do not reject H0
H0: ____________________
H1: ____________________
ANS:
;
ANS:
z-test; 0.373
PTS: 1 OBJ: Section 11.4
ANS:
Do not reject H0 There is no sufficient evidence to conclude that Toyota compact
version gets much better gasoline mileage for in-city driving than any U.S. made
car.
H0:
H1: t
wo random samples from two normal populations produced the following
statistics: n1 = 50, = 35, s1 = 28, n2 = 50, = 30, s2 = 10. What conclusion can
we draw at the 5% significance level?
Conclusion: ____________________________
ANS:
There is no reason to believe that the population variances are equal. Since both
sample sizes are larger than 30, we can use the z-test approximation. Rejection
region: |z| > 1.96
Test statistic: z = 1.189; Do not reject H0 and conclude that there is no
statistical evidence to show a significant difference between the population
means.
NARREND
50. Use = 0.05. State the null and alternative hypothesis.
ANS:
d 0H0:
dH1: < 0
51. Use = 0.05. State the decision rule in terms of the critical value.
ANS:
Reject if t < -1.86. Fail to reject if t -1.86.
ANS:
Since t = -1.789, we fail to reject H0 and conclude that the new golf ball does
not increase driving distance when compared to the current ball.
PTS: 1 OBJ: Section 11.5
H0: ____________________
H1: ____________________
Conclusion: ________________________________________
ANS:
; ; Rejection region t > 1.943 Test statistic: t = 4.111; Reject H0 and infer that at
the 5% significance level, sales increase during the campaign.
ANS:
0.387, 1.527)0.957
55. Briefly explain what the 95% interval estimate tells you.
ANS:
We estimate that the daily sales during the campaign increase, on average,
between .387 and $1.527.
PTS: 1 OBJ: Section 11.5
ANS:
Dependent; In this case, we do not really have two different samples of persons,
but rather before and after measurement for the same individuals.
57. A university vice president for academic affairs randomly selects 15 tenured
faculty from the College of Education and 15 tenured faculty from the College of
Business. Each faculty member is then asked to rate his or her job satisfaction
on a scale of 1(very dissatisfied) to 10(very satisfied). Would this be an example
of independent samples or dependent samples? Explain.
ANS:
Independent; We have two different samples of tenured faculty from two
different colleges.
NARREND
58. State the null and alternative hypotheses.
ANS:
d 0H0:
dH1: > 0
59. Use = 0.05. State the decision rule in terms of the critical value.
ANS:
If t 1.812, fail to reject H0. If t > 1.812, reject H0.
ANS:
Since t = 2.141, we reject H0 and conclude that the new diet program does
reduce weight after 7 days.
NARRBEGIN: SAT
A new SAT preparation program claims that students will improve their verbal
scores with a practice test after one month on the program. To validate their
claim, they recorded the starting and practice verbal SAT scores of 12
individuals after a one-month period. The results are shown in the following
table.
NARREND
61. State the null and alternative hypotheses.
ANS:
d 0H0:
dH1: < 0
62. Use = 0.01. State the decision rule in terms of the critical value.
ANS:
Reject if t < -2.718. Fail to reject if t -2.718.
ANS:
Since t = -1.483, we fail to reject H0 and conclude that the new preparation
course does not improve verbal SAT scores after one month.
NARRBEGIN: Population
In an effort to test the hypothesis that the proportion of home ownership in the
state of Delaware is different than the national proportion, the following data
was collected.
NARREND
64. State the null and alternative hypotheses.
ANS:
Setting = the proportion of home ownership in Delaware and = the proportion of
home ownership in the nation,,
65. Use = 0.01. State the decision rule in terms of the critical value.
ANS:
to reject H0; If zfail2.57zIf -2.57 < -2.57 or if z > 2.57, reject H0;
66. Use = 0.01. State your conclusions in terms of the test statistic.
ANS:
Since z = -1.61, we fail to reject the null hypothesis and conclude that the
proportion of home ownership in Delaware is not different from that of the
nation.
ANS:
Since the p-value = 0.107 and = 0.01, we fail to reject the null hypothesis and
conclude that the proportion of home ownership in Delaware is not different
from that of the nation.
ANS:
Setting = the proportion of male smokers and = the proportion of female
smokers,,
NARRBEGIN: Smoking
In an effort to test the hypothesis that the proportion of males who smoke
cigarettes regularly is greater than the proportion of females who smoke
cigarettes regularly, the following data was collected.
NARREND
69. Use = 0.05. State the decision rule in terms of the critical value.
ANS:
Reject if z > 1.64. Fail to reject if z 1.64.
70. Use = 0.05. State your conclusions in terms of the test statistic.
ANS:
Since z = 1.72, we reject the null hypothesis and conclude that the proportion of
male smokers is greater than the proportion of female smokers.
ANS:
Since p-value = 0.042 and = 0.05, we reject the null hypothesis and conclude
that the proportion of male smokers is greater than the proportion of female
smokers.
PTS: 1 OBJ: Section 11.6
NARRBEGIN: Medicare
An economist wishes to test whether there is any difference in the proportion of
households receiving Medicare for white households with less than $5,000
income per year and white households with $15,000 or more income per year. A
random sample from households with less than $5,000 income per year is taken,
and a random sample of households with $15,000 or more income is taken. The
results of the samples are:
NARREND
72. Use a 0.01 significance level to determine if there is sufficient evidence to
conclude that a difference in proportions exists. What is the random variable?
ANS:
p1 – p2; the difference between two sample proportions.
H0: ____________________
H1: ____________________
ANS:
;
ANS:
There is sufficient evidence for the economist to conclude that there is a
significant difference between the two proportions
NARRBEGIN: VCR’s
An advertising executive suspects that there is a higher proportion of
households which own VCR’s residing in places with cable systems than there
are for households residing in places without cable systems. The basis of this
suspicion is the broader offering of premium programming available for taping in
places with a cable system. The executive commissions a survey, which results
in random samples of 165 households located in a cable system and 150
households, located outside any cable system. Of the households in the cable
system, 54 own VCR’s; outside the cable system, 33 households own VCR’s. Let
be the proportion of households owning a VCR in a cable system and be the
proportion of households owning a VCR, but not in a cable system. The
executive chooses 0.05 as the significance level. Can the executive reasonably
conclude that the ownership of VCR’s is higher in cable systems?
NARREND
76. What are the null and alternative hypotheses?
H0: ________________
H1: ________________
ANS:
;
ANS:
z-test; 2.13
ANS:
Fail to reject H0 if z 1.645; Reject H0 if z > 1.645.
ANS:
Reject H0 The sample evidence suggests that ownership of VCR’s is higher for
households on a cable system.
NARRBEGIN: Textbook
A large textbook publisher is concerned about the sale of complimentary desk
copies of textbooks to wholesale book buyers. The publisher randomly samples
the faculty from two universities, A and B. Information provided by salespeople
suggests that the faculty at University A had copies of the publisher’s books that
had been shipped, whereas the faculty at University B seldomly had them. Let
be the proportion of faculty at University A who sell this publisher’s books and
be the proportion of faculty at University B who sell this publisher’s books. The
samples provide these data:
H0: ____________________
H1: ____________________
ANS:
;
ANS:
z test; z = -2.472
ANS:
Fail to reject H0 if z -1.645; Reject H0 if z < -1.645.
ANS:
Reject H0. There is sufficient evidence to conclude that there is a higher
proportion of faculty at University B who sell the publishers books
H0: ____________________
H1: ____________________
ANS:
;
ANS:
Fail to reject H0 if z -1.645; Reject H0 if z < -1.645.
ANS:
z-test; -1.136
ANS:
Do not reject H0 There has not been a significant increase in the proportion of
flextime.
H0:____________________
H1: ____________________
ANS:
;
ANS:
z-test; 2.35
ANS:
Reject H0 There is enough evidence to conclude that there is a significant
improvement in the quality of the steel.
NARRBEGIN: Politician
In an effort to test the hypothesis that the proportion of voters in the younger
than 40 year old age bracket who will vote for a particular politician is different
than the proportion voters in the above 40 age bracket, the following data was
collected.
NARREND
91. Use = 0.10. State your conclusions in terms of the test statistic.
ANS:
Since z = 1.87, we reject the null hypothesis and conclude that the proportion of
voters in the younger than 40 year old age bracket who will vote for a particular
politician is different than the proportion voters in the above 40 age bracket.
p-value = 0.0614
Use = 0.10. State the decision rule in terms of the critical value.
ANS:
Setting = the proportion of home ownership in Delaware and = the proportion of
home ownership in the nation,,
to reject H0; If zfail1.645zIf -1.645 < -1.645 or if z > 1.645, reject H0;
Conclusion: ________________
Interpretation: __________________________________________________
ANS:
21 – 0, H1: 2 1 – H0: > 0,
Rejection region: z > 1.645, Test statistic z = 0.993; Do not reject H0; There is
insufficient evidence to conclude that Production Line 1 is doing better than
Production Line 2.
ANS:
0.0915 = (-0.0473, 0.1357)0.0442
95. The A-1 Video Rental has run two specials in the local newspaper. One was a
two-for-one rental for Tuesday through Thursday. The second was for $1 off on
Friday and Saturday. A computer output is shown below:
Determine at the 0.05 level if the standard deviations of the populations are
equal.
Conclusion: ____________________
ANS:
The test statistic of F = 1.178 is less than the critical value of 3.05; Do not reject
H0 and conclude that the population standard deviations could be equal.
Hypotheses: ____________________
Conclusion: ____________________
ANS:
22, 12 22, H1: 12 = H0:
Rejection region: F> F(0.05, 13, 15) = 2.48; Test statistic: F = 2.145; Do not
reject H0 and conclude that there is no sufficient statistical evidence to infer
that the two population variances differ.
ANS:
ESSAY
1. When testing dependent samples, we test for only one variable. What is it?
ANS:
d = Difference between measurements
2. There are three different methods for comparing the means of two
independent samples. What are they?
ANS:
The three methods are: the pooled-variances t-test, the unequal-variances t-
test, and the z-test.
PTS: 1 OBJ: Section 11.1
TRUE/FALSE
1. True or False
Analysis of variance (ANOVA) is a set of techniques that allow us to compare two
or more sample means at the same time.
2. True or False
In ANOVA, the variation between groups represents random error from the
sampling process.
3. True or False
Comparing the variation between the groups to the variation within the groups
is the basis of ANOVA.
4. True or False
The ANOVA model requires that the sample size for each treatment be equal.
5. True or False
An assumption of the ANOVA model is that the population distributions are
normal.
7. True or False
When there are two treatment groups, one-way ANOVA is equivalent to the
pooled-variances t-test.
8. True or False
Three racquetball players, a beginner, an experienced, a professional, have
been randomly selected from the membership of a large city racquetball club.
Using the same ball, each person hits three serves, one with each of five racquet
models, with the five racquet models selected randomly. Each serve is clocked
with a radar gun and the result recorded. Among ANOVA models, this setup is
most like the two-way ANOVA.
9. True or False
In the randomized block design ANOVA, test units are first arranged in similar
groups after the treatments are assigned.
MULTIPLE CHOICE
2. Given the significance level 0.01, the F-value for the degrees of freedom, df =
(7,3) is:
a. 8.45.
b. 27.67.
c. 5.89.
d. 14.62.
e. 38.22.
ANS: B PTS: 1 OBJ: Section 12.3
3. The variation that reflects the effect of the factor levels is known as the
a. variation within the groups.
b. variation between the groups.
c. error mean square.
d. sampling error.
e. interaction effect.
ANS: B PTS: 1 OBJ: Section 12.2
4. The variation that reflects the random error from the sampling process is
known as the
a. variation within the groups.
b. variation between the groups.
c. treatment mean square.
d. interaction effect.
e. treatment sum of squares.
ANS: A PTS: 1 OBJ: Section 12.2
6. In using the ANOVA models, the assumptions made about the data are:
a. the samples are independent.
b. the population variances are equal.
c. the population distributions are normal.
d. A, B, and C are all assumptions made about the data.
e. the samples are dependent.
ANS: D PTS: 1 OBJ: Section 12.3
7. Two samples of ten each from the male and female workers of a large
company have been taken. The data involved the wage rate of each worker. To
test whether there is any difference in the average wage rate between male and
female workers a pooled-variances t-test will be considered. Another test option
to consider is ANOVA. The most likely ANOVA to fit this test situation is the:
a. one-way analysis of variance.
b. randomized block design.
c. two-way analysis of variance.
d. simple regression model.
e. chi-square test for independence.
ANS: A PTS: 1 OBJ: Section 12.3
11. One-way ANOVA is applied to three independent samples having means 10,
13, and 18, respectively. If each observation in the third sample were increased
by 30, the value of the F-statistic would:
a. increase.
b. decrease.
c. remain unchanged.
d. increase by 30.
ANS: A PTS: 1 OBJ: Section 12.3
15. The distribution of the test statistic for analysis of variance is the:
a. normal distribution.
b. student t-distribution.
c. F-distribution.
d. Poisson distribution.
ANS: C PTS: 1 OBJ: Section 12.3
17. In one-way ANOVA, suppose that there are five treatments with n1 = n2 =
n3 = 5, and n4 = n5 = 7. Then the mean square for error, MSE, equals:
a. SSE / 4.
b. SSE / 29.
c. SSE / 24.
d. SSE / 5.
ANS: C PTS: 1 OBJ: Section 12.3
23. The number of degrees of freedom for the denominator in a one-way ANOVA
test for 4 population means with 15 observations sampled from each population
is:
a. 60.
b. 19.
c. 56.
d. 45.
ANS: C PTS: 1 OBJ: Section 12.3
24. In employing the randomized block design, the primary interest lies in:
a. increasing the treatment sum of squares (SSTR).
b. reducing the block sum of squares (SSB).
c. improving the significance of any differences among treatment groups.
d. reducing the total sum of squares (SST).
e. increasing the total sum of squares (SST).
ANS: C PTS: 1 OBJ: Section 12.4
25. In employing the randomized block design, the expected effect is to:
a. increase the treatment sum of squares (SSTR).
b. reduce the error sum of squares (SSE).
c. improve the significance of any differences within the treatment groups.
d. reduce the total sum of squares (SST).
e. increase the error sum of squares (SSE).
ANS: B PTS: 1 OBJ: Section 12.4
26. A randomized block design ANOVA has two treatments. The test to be
performed in this procedure is equivalent to:
a. dependent samples z-test.
b. dependent samples t-test.
c. independent samples z-test.
d. independent samples t-test.
e. Spearman rank correlation test.
ANS: B PTS: 1 OBJ: Section 12.4
27. A randomized block design ANOVA has five treatments and four blocks. The
computed test statistic (value of F) is 4.35. With a 0.05 significance level, the
appropriate table value and conclusion will be:
a. 14.37: accept the null hypothesis.
b. 3.11: accept the null hypothesis.
c. 3.26: accept the null hypothesis.
d. 2.87: reject the null hypothesis.
e. 3.26: reject the null hypothesis.
ANS: E PTS: 1 OBJ: Section 12.4
28. A randomized block experiment having five treatments and six blocks
produced the following values: SSTR = 287, SST = 1,446, SSE = 180. The value
of SSB must be:
a. 30.
b. 979.
c. 467.
d. 20.
e. 107.
ANS: B PTS: 1 OBJ: Section 12.4
29. The randomized block design with two treatments is equivalent to a non-
directional:
a. independent samples z-test.
b. independent samples t-test.
c. dependent samples z-test.
d. dependent samples t-test.
e. chi-square test for independence.
ANS: D PTS: 1 OBJ: Section 12.4
30. The F-test of the randomized block design of the analysis of variance
requires that the random variable of interest must be normally distributed and
the population variances must be equal. When the random variable is not
normally distributed, we can use:
a. one-way ANOVA.
b. two-way ANOVA.
c. Friedman test.
d. all of these.
ANS: C PTS: 1 OBJ: Section 12.4
31. Three tennis players, a beginner, an intermediate, and advanced, have been
randomly selected from the membership of a racquet facility club in a large city.
Using the same tennis ball, each player hits ten serves, one with each of three
racquet models, with the three racquet models selected randomly. The speed of
each serve is measured with a machine and the result recorded. Among the
ANOVA models listed below, the most likely model to fit this situation is the:
a. one-way ANOVA.
b. two-way ANOVA.
c. randomized block design.
d. none of these.
ANS: C PTS: 1 OBJ: Section 12.4
32. A randomized block design with 4 treatments and 5 blocks produced the
following sum of squares values: SST = 1951, SSTR = 349, SSE = 188. The value
of SSB must be:
a. 1414.
b. 537.
c. 1763.
d. 1602.
ANS: A PTS: 1 OBJ: Section 12.4
33. In a two-way ANOVA, there are 3 levels for factor A and 4 levels for factor B,
and three observations within each cell. The number of treatments in this
experiment will be:
a. 36.
b. 9.
c. 12.
d. 18.
e. 6.
ANS: C PTS: 1 OBJ: Section 12.5
34. In a two-way ANOVA, there are 5 levels for factor A and 4 levels for factor B,
and three observations within each cell. The number of treatments in this
experiment will be:
a. 20.
b. 9.
c. 12.
d. 18.
e. 6.
ANS: A PTS: 1 OBJ: Section 12.5
35. When the effect of a level for one factor depends on which level of the other
factor is present, use:
a. one-way analysis of variance.
b. randomized block design.
c. two-way analysis of variance.
d. simple regression model.
e. paired sample model.
ANS: C PTS: 1 OBJ: Section 12.5
36. The equation: SST = SSA + SSB + SSAB + SSE, applies to which ANOVA
model?
a. One-way ANOVA
b. Two-way ANOVA
c. Randomized block design
d. All of these
ANS: B PTS: 1 OBJ: Section 12.5
37. In a two-way ANOVA, there are 4 levels for factor A, 5 levels for factor B, and
3 observations for each combination of factor A and factor B levels. The number
of treatments in this experiment equals:
a. 60.
b. 25.
c. 20.
d. 16.
ANS: C PTS: 1 OBJ: Section 12.5
38. Which of the following statements are not true regarding the randomized
block design?
a. In this design, persons or test units are first arranged into similar groups, or
blocks, then the treatments are randomly assigned.
b. Blocking helps reduce error variation by ensuring that the treatment groups
are comparable in terms of the blocking variable.
c. This design assumes that interactions exist between the blocks and the
treatments.
d. Although the blocking variable can be viewed as representing a second
independent variable, it is introduced solely for the purpose of controlling error
variation.
e. All of these statements are true.
ANS: C PTS: 1 OBJ: Section 12.6
NUMERIC RESPONSE
NARRBEGIN: Annuity
A large annuity company holds many industry group stocks. Among the
industries are banks, business services and construction. Seven companies from
each industry group are randomly sampled to test the hypothesis that the mean
price per share is the same among industries. The data are:
NARREND
1. Compute the mean square for the between-group variation (MSTR).
ANS: 520.33
2. Compute the appropriate table value to use as the critical value at the 2.5%
significance level.
ANS: 4.56
ANS: 151.67
ANS: 15
ANS: 2
ANS: 1636.0
ANS: 6608.5
ANS: 4972.5
ANS: 331.5
10. Compute the appropriate table value to use as the critical value at the 5%
significance level.
ANS: 3.68
ANS: 70
PTS: 1 OBJ: Section 12.3
ANS: 1091.6
ANS: 1021.6
ANS: 12
ANS: 2
ANS: 85.13
17. Compute the appropriate table value to use as the critical value at the 5%
significance level.
ANS: 3.89
ANS: 361.583
ANS: 74
ANS: 12.667
ANS: 448.250
ANS: 120.528
ANS: 37
NARRBEGIN: Music
A study to examine the effect of music type and assembly method on the
productivity of workers on a production line for electronic circuit boards
produced the following results:
NARREND
25. What are the degrees of freedom for the error sum of squares?
ANS: 4
26. What is the computed value for Factor A (assembly method) sum of squares
(SSA)?
ANS: 50
27. What is the computed value for Factor B (music) sum of squares (SSB)?
ANS: 338
28. What is the computed value for the error sum of squares (SSE)?
ANS: 26
29. What is the computed value for the total sum of squares (SST)?
ANS: 416
30. What is the computed value for the sum of squares for the interaction
between Factor A and Factor B (SSAB)?
ANS: 2
31. What is the computed value for Factor A (assembly method) mean square
(MSA)?
ANS: 50
32. What is the computed value for Factor B (music) mean square (MSB)?
ANS: 338
33. What is the computed value for the mean square for the interaction between
Factor A and Factor B (MSAB)?
ANS: 2
34. What is the computed value for the error mean square (MSE)?
ANS: 6.5
NARRBEGIN: Lighting
A study to examine the effect of lighting type and assembly method on the
productivity of workers on a production line for electronic circuit boards
produced the following results:
NARREND
35. What are the total degrees of freedom for this experiment?
ANS: 7
ANS: 72
37. What is the computed value for Factor B (lighting) sum of squares (SSB)?
ANS: 32
38. What is the computed value for the error sum of squares (SSE)?
ANS: 258
39. What is the computed value for the total sum of squares (SST)?
ANS: 364
40. What is the computed value for the sum of squares for the interaction
between Factor A and Factor B (SSAB)?
ANS: 2
NARREND
41. Compute the test statistic for the main effect for Factor A.
ANS: 13.58
42. Compute the test statistic for the main effect for Factor B.
ANS: 4.52
PTS: 1 OBJ: Section 12.5
43. Compute the test statistic for the interaction effect between Factor A and
Factor B.
ANS: 2.83
44. What is the critical F for the 0.05 level for the main effects test, Factor A?
ANS: 3.40
45. What is the critical F for the 0.05 level for the main effects test, Factor B?
ANS: 3.01
46. What is the critical F for the 0.05 level for the interaction effects between
Factor A and Factor B.
ANS: 2.51
COMPLETION
ANS: dependent
ANS: independent
ANS:
one-way analysis of variance
ANOVA
ANS:
Two-way analysis of variance
ANOVA
5. The starting point for ANOVA is often an experiment in which the goal is to
determine whether various levels of an independent variable, or factor, might be
exerting different effects on the dependent variable, or measurement. When
there is only one factor in an experiment, each factor level can be referred to as
a ____________________.
ANS: treatment
6. In the one-way ANOVA, there is just one factor, and the null hypothesis is that
the population means are equal for the respective treatments, or factor levels.
Treatments are randomly assigned to the persons or test units in the
experiment, so this method is also referred to as the
___________________________________ ANOVA.
ANS: pooled-variances
8. When there are just two treatments, the randomized block design is
equivalent to the nondirectional ______________________________ t-test.
ANS: dependent-samples
ANS:
two; two
2, 2
ANS: treatment
MATCHING
SHORT ANSWER
1. A university vice president for academic affairs (VPAA) collects data showing
the number of papers published over the past academic year for a random
sample of professors in the College of Arts and Sciences. He does the same for a
random sample of 8 professors in the College of Education, and for a random
sample of 10 professors in the College of Business. Does this represent a
designed experiment? Explain.
ANS:
No; This is not a random experience since the university VPAA has no control
over which professors are in a particular college. The professors were not
randomly assigned to the colleges.
ANS:
Yes; This experiment represents a designed experiment since the two
treatments (methods of teaching) are randomly assigned to the students.
NARRBEGIN: Annuity
A large annuity company holds many industry group stocks. Among the
industries are banks, business services and construction. Seven companies from
each industry group are randomly sampled to test the hypothesis that the mean
price per share is the same among industries. The data are:
NARREND
3. State the null and alternative hypothesis.
H0: ____________________
H1: ___________________________________
ANS:
; The population means are not equal
ANS:
F; 3.431
ANS:
Do not reject H0
H0: ____________________
H1: ______________________________________________
ANS:
; At least one of the population means is not equal
ANS:
F; 2.468
ANS:
Do not reject H0
H1: Not all of the mean mpg is the same for the three companies An = 0.05 is
used by the consumer research organization. Is there any difference in mean
mpg?
Conclusion: ____________________
Interpretation: ________________________________________________________
ANS:
SST = 63.340, SSTR = 28.948, SSE = 34.392; Critical value:F(2,12,0.05) = 3.89
Test statistic: F= 5.05; Reject H0; Mean mpg is NOT equal in all three companies
NARRBEGIN: Model
A consumer group is testing the gas mileage of 3 different models of cars.
Several cars were driven 1000 miles and the gas mileage was recorded in the
following table.
NARREND
10. State the null and alternative hypotheses.
ANS:
H0:
H1: The population means are not all equal
PTS: 1 OBJ: Section 12.3
11. Calculate the treatment mean square and the error mean square.
ANS:
MSTR = 14.71
MSE = 3.32
F = 4.430
12. Using = 0.05, state the decision rule in terms of the critical value.
ANS:
If F > 3.59, we reject the null hypothesis. If F 3.59, we fail to reject the null
hypothesis.
ANS:
Since F > 3.59, we reject the null hypothesis and conclude that the gas mileage
differs between the 3 models.
H0: ____________________
H1: _________________________________________
ANS:
; The population means are not equal
ANS:
F; 0.411
ANS:
Do not reject H0
17. Fill in the blanks (identified by asterisks and numbers) in the following partial
ANOVA table:
Source of Variation SS df MS F
Treatments (*1) (*2) 195 (*3)
Error 625 (*4) (*5)
Total 1600 25
ANS:
*1 = 975; *2 = 5; *3 = 6.24; *4 = 20, *5 = 31.25
Conclusion: ____________________
Interpretation: ______________________________________________
ANS:
Source of variation SS df MS F P-value F critical
Treatments 36.4 2 18.2 5.2 .0236 3.885
Error 42.0 12 3.5
Total 78.4 14
NARRBEGIN: Courses
A father and his two sons played golf at four different courses with the following
scores:
NARREND
19. State the null and alternative hypotheses to test for a difference in golf score
means for each person.
ANS:
H0:
H1: The golf score means are not equal
20. Use a randomized block design to test for the difference of golf score means
using = 0.05 and using the courses as the blocking variable.
ANS:
SSB = 56.33
21. Use a randomized block design to test for the difference of golf score means
using = 0.05 and using the courses as the blocking variable.
Calculate the error sum of squares and the error mean square.
ANS:
SSE = 41.17
MSE = 6.86
22. Use a randomized block design to test for the difference of golf score means
using = 0.05 and using the courses as the blocking variable.
Calculate the treatment sum of squares and the treatment mean square.
ANS:
SSTR = 100.17
MSTR = 50.08
23. Use a randomized block design to test for the difference of golf score means
using = 0.05 and using the courses as the blocking variable.
Calculate the test statistic.
ANS:
F = 7.300
24. Use a randomized block design to test for the difference of golf score means
using = 0.05 and using the courses as the blocking variable.
ANS:
If F > 5.14, we reject the null hypothesis. If F 5.14, we fail to reject the null
hypothesis.
25. Use a randomized block design to test for the difference of golf score means
using = 0.05 and using the courses as the blocking variable.
ANS:
Since F > 5.14, we reject the null hypothesis and conclude that the average golf
scores are not equal.
NARRBEGIN: Stores
A vice president would like to know whether there is a difference between the
average number of customers per day between four different stores. The
following data was collected over the same 5-day period for each store.
ANS:
H0:
H1: The mean numbers of customers per day for each store are not all equal
27. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
ANS:
SSB = 353.7
28. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
Calculate the error sum of squares and the error mean square.
ANS:
SSE = 948.7
MSE = 79.06
29. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
Calculate the treatment sum of squares and the treatment mean square.
ANS:
SSTR = 420.55
MSTR = 140.18
30. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
ANS:
F = 1.773
31. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
ANS:
If F > 3.49, we reject the null hypothesis. If F 3.49, we fail to reject the null
hypothesis.
Since F 3.49, we fail to reject the null hypothesis and conclude that we do not
have enough evidence to claim that the average number of customers per day
between four different stores are not equal.
NARREND
32. State the null and alternative hypotheses to test for a difference in average
number of customers per day for each store.
ANS:
H0:
H1: The average weights of grass clippings for each fertilizer are not all equal
33. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
ANS:
SSB = 1.14
34. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
Calculate the error sum of squares and the error mean square.
ANS:
SSE = 16.76
MSE = 1.68
PTS: 1 OBJ: Section 12.4
35. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
Calculate the treatment sum of squares and the treatment mean square.
ANS:
SSTR = 10.71
MSTR = 5.36
36. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
ANS:
F = 3.197
37. Develop a randomized block design to test for the difference between the
average number of customers per day between four different stores using =
0.05 and using the days as the blocking variable.
ANS:
If F > 4.10, we reject the null hypothesis. If F 4.10, we fail to reject the null
hypothesis.
Since F 4.10, we fail to reject the null hypothesis and conclude that we do not
have enough evidence to claim that the average weight of the grass clippings
for each type of fertilizer are not equal.
PTS: 1 OBJ: Section 12.4
Conclusion: ________________________
Interpretation: ______________________________________
ANS:
Critical value: F(0.05, 2, 6) = 5.14; Test statistic: F = 17.527; Reject H0; There is
a difference in the mean sales for the shelf heights.
Conclusion: ________________________
Interpretation: _______________________________________
ANS:
Critical value: F(0.05, 3, 6) = 4.76; Test statistic: F = 57.092. Reject H0; There is
a difference in mean sales for the stores.
Conclusion: ____________________
Interpretation: _______________________________________
Conclusion: ____________________
Interpretation: ________________________________________
ANS:
Source of variation SS df MS F P-value F critical
Treatments 29.733 2 14.867 6.511 .02097 4.459
Blocks 336.933 4 84.233 36.891 0.00335 3.838
Error 18.267 8 2.283
Total 384.933 14
NARRBEGIN: Music
A study to examine the effect of music type and assembly method on the
productivity of workers on a production line for electronic circuit boards
produced the following results:
NARREND
41. Test for the main effects of factor A at the 5% significance level.
ANS:
i = 0 (No level of factor A has an effect)H0:
0 (At least one level of factor A has an effect)i H1:
42. Test for the main effects of factor B at the 5% significance level.
ANS:
i = 0 (No level of factor B has an effect)H0:
0 (At least one level of factor B has an effect)i H1:
43. Test for the interaction effects between levels of factors A and B at the 5%
significance level.
ANS:
ij = 0 (There is no interaction effect)H0: (
0 (At least one combination of i and j has an effect)ij H1: (
Test statistic:= F= 0.308;Critical value: F(0.05, 1, 4) = 7.71; Do not reject H0;
There are no interaction effects between levels of factors A and B.
NARRBEGIN: Lighting
A study to examine the effect of lighting type and assembly method on the
productivity of workers on a production line for electronic circuit boards
produced the following results:
NARREND
44. Test for the main effects of factor A at the 5% significance level.
ANS:
i = 0 (No level of factor A has an effect)H0:
0 (At least one level of factor A has an effect)i H1:
NARREND
45. What decisions can be made about the null hypothesis for the main effects
and the interaction effects at the 0.05 significance level?
ANS:
All null hypotheses will be rejected at alpha = 0.05.
46. In a two-way analysis of variance, there are “a” levels for factor A, “b” levels
for factor B, and “r” replications within each of the factor-level combinations, or
cells. What are the degrees of freedom for error?
ANS:
ab(r – 1)
ANS:
(a – 1)(b – 1)
48. In a two-way analysis of variance, there are “a” levels for factor A, “b” levels
for factor B, and “r” replications within each of the factor-level combinations, or
cells. What are the total degrees of freedom?
ANS:
abr – 1
Is there sufficient evidence at the 10% significance level to infer that the time it
takes a student to master software and the gender of the student interact?
ANS:
Source of variation SS df MS F P-value F critical
Software 34.778 2 17.389 0.978 0.4041 2.807
Gender 53.389 1 53.389 3.003 0.1087 3.177
Interaction 26.778 2 13.389 0.753 0.4919 2.807
Error 213.333 12 17.778
Total 328.278 17
H0: Software type and gender do not interact; H1: Software and gender do
interact
F= 0.753;Critical value = 2.807; Do not reject H0; Software and gender do NOT
interact
ANS:
12; A treatment consists of a combination of one level of factor A and one level
of factor B.
ESSAY
1. What is an experiment?
ANS:
A study or investigation designed for the purpose of examining the effect that
one variable has on the value of another variable.
ANS:
A designed experiment is an experiment in which treatments are randomly
assigned to the participants or test units.
3. What is a factor?
ANS:
A factor is an independent variable.
ANS:
In ANOVA, we compare between-sample variation to within-sample variation.
The basic role of the two variations is that their ratio provides the test statistic.
ANS:
The purpose of the randomize block design is to reduce the amount of error
variation by ensuring that the treatment groups will be comparable in terms of
the blocking variable.
ANS:
Blocking allows for a better comparison of the effect of the treatments by
reducing the amount of error variation.
ANS:
a. The one observation in each treatment-block combination has been randomly
selected from a normally distributed population. b. The variances are equal for
the values in the respective populations. c. There is no interaction between the
blocks and treatments.
8. How does the randomized block design differ from the one-way, completely
randomized design?
ANS:
In the one-way ANOVA, the treatments are randomly assigned to the test units.
In the randomized block design, the test units are first arranged into similar
groups and then the treatments are randomly assigned.
ANS:
The purpose two-way ANOVA is to simultaneously examine the effects of two
factors on a dependent variable and the effect of interactions between the
different levels of these two factors.
ANS:
Replications refers to the number of persons or test units within each cell. Within
each combination of levels, there will be k = 1 through r observations or
replications.
ANS:
We assume the r observations in each cell have been drawn from normally
distributed populations with equal variances.
12. Why are there more sets of null and alternative hypotheses that can be
tested in two-way ANOVA compared to the one-way and the randomized block
design?
ANS:
In the two-way analysis of variance we are interested in the effect of two factors
and their interaction on the dependent variable. Therefore, we test for each
factor effect and for interaction. In the one-way analysis of variance and the
randomized block designs, we are interested in only one factor effect.
13. How is two-way ANOVA similar to the randomized block design? How does it
differ?
ANS:
The two-way analysis of variance and the randomized block design are similar
since they both consider two factors in the model. However, in the randomized
block design, the blocking variable is only used for the purpose of exerting
improved control over the examination of the single factor of interest. In two-
way analysis of variance, we are interested in the effects of two factors and their
interaction on the dependent variable.
14. In the two-way ANOVA, what is meant by main effects and interactive
effects?
ANS:
The main effect is an effect caused by one of the factors in the experiment. The
interactive effect is the effect that results from the combination of a level of one
factor with a level of another factor.
TRUE/FALSE
1. True or False
The chi-square distribution can be used in constructing confidence intervals and
carrying out hypothesis tests regarding the value of a population variance.
ANS: T PTS: 1 OBJ: Section 13.1
2. True or False
The normal distribution can be used to examine whether two nominal variables
could be independent of each other.
3. True or False
Like the critical values for the normal distribution and the t-distribution, the
critical values for the chi-square distribution can be either positive or negative.
4. True or False
The chi-square distribution is skewed to the left (negatively skewed), but as
degrees of freedom increase, it approaches the shape of the binomial
distribution.
5. True or False
The goodness-of-fit chi-square test is a two-tailed test.
6. True or False
In a goodness-of-fit chi-square test, a large amount of discrepancy between the
frequencies that are observed and those that are expected tend to cause us to
fail to reject the null hypothesis.
7. True or False
In goodness-of-fit tests, the chi-square analysis is applied for the purpose of
examining whether two nominal-scale variables are related to each other.
9. True or False
The purpose of the chi-square analysis to test the independence of two variables
is to determine the exact nature of the relationship between two ratio variables.
MULTIPLE CHOICE
1. With larger and larger numbers of categories in chi-square tests, the chi-
square distribution takes on the shape of the:
a. Poisson distribution.
b. t-distribution.
c. normal distribution.
d. F-distribution.
e. binomial distribution.
ANS: C PTS: 1 OBJ: Section 13.2
3. The area to the right of a chi-square value is 0.025. For 5 degrees of freedom,
the table value is:
a. 12.833.
b. 15.086.
c. 11.07.
d. 11.143.
e. 16.182.
ANS: A PTS: 1 OBJ: Section 13.2
4. When the number of degrees of freedom is small, the shape of the chi-square
distribution is
a. symmetrical.
b. uniform.
c. positively skewed.
d. negatively skewed.
e. normal.
ANS: C PTS: 1 OBJ: Section 13.2
6. Which of the following tests will determine whether sample data could have
been drawn from a population having a specified probability distribution?
a. independence of two variables
b. comparing proportions from two independent samples
c. estimating the population variance
d. goodness-of-fit test
e. confidence interval around a sample mean
ANS: D PTS: 1 OBJ: Section 13.3
9. To use the chi-square distribution table to select a table value requires the
following:
a. numerator degrees of freedom.
b. denominator degrees of freedom.
c. level of significance.
d. degrees of freedom.
e. C and D above.
ANS: E PTS: 1 OBJ: Section 13.2
10. The middle 0.95 portion of the chi-square distribution with 10 degrees of
freedom has table values of:
a. 2.558, 3.247, respectively.
b. 3.247, 18.307, respectively.
c. 3.247, 20.483, respectively.
d. 3.940, 20.483, respectively.
e. 4.366, 23.843, respectively.
ANS: C PTS: 1 OBJ: Section 13.6
11. In the goodness-of-fit chi-square test, under which assumption are the
expected frequencies constructed?
a. The population follows the chi-square distribution.
b. The null hypothesis is true.
c. The alternative hypothesis is true.
d. The null hypothesis is false.
e. None of the above assumptions are necessary to construct the expected
frequencies.
ANS: B PTS: 1 OBJ: Section 13.3
12. A left tail area in the chi-square distribution equals 0.95. For df = 10, the
table value equals:
a. 18.307.
b. 15.987.
c. 3.940.
d. 20.483.
ANS: A PTS: 1 OBJ: Section 13.6
14. Which of the following tests do not use the chi-square distribution?
a. Test of a contingency table
b. Goodness-of-fit test
c. Difference between two population means test
d. All of these tests use the chi-squared distribution
ANS: C PTS: 1 OBJ: Section 13.2
15. In the goodness-of-fit chi-square test, the rule of thumb that the expected
frequency for each category should be at least 5 exists because:
a. the chi-square distribution is not symmetrical when the expected frequency of
a category is less than 5.
b. the test statistic for the chi-square distribution is based on the assumption
that the expected frequency of each category is greater than 5.
c. the chi-square distribution is negatively skewed when the expected frequency
of a category is less than 5.
d. the chi-square distribution is continuous while the counts on which the test
statistic are based are discrete.
e. the chi-square distribution is positively skewed when the expected frequency
of a category is less than 5.
ANS: D PTS: 1 OBJ: Section 13.3
17. Which of the following would be the recommended action if the expected
frequency for a particular category is less than 5 in the goodness-of-fit chi-
square test?
a. Decrease the significance level so that each category will have an expected
frequency greater than 5.
b. Combine adjacent cells so that the result will have an expected frequency
greater than 5.
c. Increase the sample size so that each category will have an expected
frequency greater than 5.
d. Either b or c could be used to ensure that each category will have an
expected frequency greater than 5.
e. Either a, b or c could be used to ensure that each category will have an
expected frequency greater than 5.
ANS: D PTS: 1 OBJ: Section 13.3
18. In a goodness-of-fit test, the null hypothesis states that the data came from
a normally distributed population. The researcher estimated the population
mean and population standard deviation from a sample of 500 observations. In
addition, the researcher used 6 standardized intervals to test for normality.
Using a 5% level of significance, the critical value for this test is:
a. 11.143.
b. 9.348.
c. 7.815.
d. 9.488.
ANS: C PTS: 1 OBJ: Section 13.3
20. In a goodness-of-fit chi-square test, if the null hypothesis states “The sample
was drawn from a population that follows the normal distribution” and the test
has 7 categories that are mutually exclusive and exhaustive, the number of
degrees of freedom will be
a. 4
b. 5
c. 6
d. 7
e. 8
ANS: A PTS: 1 OBJ: Section 13.3
21. In chi-square tests, the conventional and conservative rule, known as the
Rule of Five, is to require that:
a. observed frequency for each cell be at least five.
b. degrees of freedom for the test be at least five.
c. expected frequency for each cell be at least five.
d. difference between the observed and expected frequency for each cell be at
least five.
ANS: C PTS: 1 OBJ: Section 13.3
22. Of the values for a chi-squared test statistic listed below, which one is likely
to lead to rejecting the null hypothesis in a goodness-of-fit test?
a. 30
b. 3
c. 2
d. 1
ANS: A PTS: 1 OBJ: Section 13.3
23. To determine whether a single coin is fair, the coin was tossed 100 times,
and head was observed 60 times. The value of the test statistic is:
a. 4.
b. 40.
c. 6.
d. 60.
ANS: A PTS: 1 OBJ: Section 13.3
24. To determine whether data were drawn from a specific distribution, we use
a:
a. chi-square goodness-of-fit test.
b. chi-square test of a contingency table.
c. chi-square test for normality.
d. chi-square test for a population variance.
ANS: A PTS: 1 OBJ: Section 13.3
25. In a goodness-of-fit chi-square test, if the null hypothesis states “The sample
was drawn from a population that follows the binomial distribution” and the test
has 6 categories that are mutually exclusive and exhaustive, the number of
degrees of freedom will be
a. 3.
b. 4.
c. 5.
d. 6.
e. 7.
ANS: B PTS: 1 OBJ: Section 13.3
2 = 12.678, and the critical value at = 0.025 was 14.4494. Thus:26. In a chi-
squared test of a contingency table, the value of the test statistic was
a. we fail to reject the null hypothesis at = 0.025.
b. we reject the null hypothesis at = 0.025.
c. we don’t have enough evidence to accept or reject the null hypothesis at =
0.025.
d. we should decrease the level of significance in order to reject the null
hypothesis.
ANS: A PTS: 1 OBJ: Section 13.4
29. In the chi-square test for equality of proportions, the procedure is identical
test for independence of two variables with the exception that in the test for
equality of proportions:
a. each column consists of observations for an independent sample.
b. each table will always have 2 rows.
c. the number of degrees of freedom will be the number of rows times the
number of columns.
d. both a and b.
e. all of the above.
ANS: D PTS: 1 OBJ: Section 13.5
30. In testing a population variance or constructing a confidence interval for the
population variance, an essential assumption is that:
a. sample size exceeds 30.
b. expected frequencies equal or exceed 5.
c. the population is uniformly distributed.
d. the population is normally distributed.
e. the population is binomially distributed.
ANS: D PTS: 1 OBJ: Section 13.6
31. In a hypothesis test for the population variance, the alternate hypothesis is
the population variance does not equal 17.0. The significance level to be used is
0.05 and the sample size to be taken is 25. The table value(s) to use from the
chi-square distribution is (are):
a. 39.364.
b. 40.647.
c. 12.401 and 39.364.
d. 13.120 and 40.647.
e. 12.401.
ANS: C PTS: 1 OBJ: Section 13.6
34. A researcher is studying cancer mortality rates for men in five southern
states. In the five states, the sample standard deviation is 6.76 (per 100,000). A
98 percent confidence interval estimate of the population variance is
represented by:
a. 5(6.76) / 13.277 and 5(6.76) / 0.297.
b. 5(6.762) / 13.277 and 5(6.762) / 0.297.
c. 4(6.76) / 13.277 and 4(6.76) / 0.297.
d. 4(6.762) / 13.277 and 4(6.762) / 0.297.
e. 4(6.762) / 0.297 and 4(6.762) / 13.277.
ANS: D PTS: 1 OBJ: Section 13.6
NUMERIC RESPONSE
1. The null hypothesis states that the sample data came from a normally
distributed population. The researcher calculates the sample mean and the
sample standard deviation from the data. The data arrangement consisted of six
categories. Using a 0.01 significance level, what is the appropriate critical value
for this chi-square test?
ANS: 11.345
PTS: 1 OBJ: Section 13.3
ANS: 4
NARRBEGIN: Stocks
One thousand stocks watched by a financial advisory company are advertised as
having an average return-on-equity (ROE) of 11.4 percent and a standard
deviation of 3 percent. An inquiry is made by a prospective client who finds that
the ROE is supposed to be normally distributed for these stocks. The prospective
client obtains a list of the stocks and randomly samples 300 stocks and records
the ROE. The results are:
NARREND
3. What are the appropriate degrees of freedom based on the above data?
ANS: 3
4. What is the appropriate table value for the .10 significance level?
ANS: 6.251
NARRBEGIN: Accommodations
Business travelers stay in a variety of accommodations. A set of national
percentages for types of accommodations shows their choices below. During a
sluggish economy there may be changes in business traveler accommodation
patterns so a travel consultant surveyed 300 business travelers and asked about
their accommodations. These data will be used to test whether or not the
present accommodation distribution is the same as the national one.
NARREND
6. What are the appropriate degrees of freedom?
ANS: 3
ANS: 192
ANS: 6.851
ANS: 14.01
PTS: 1 OBJ: Section 13.3
ANS: 1.97
11. A test for independence is applied to a contingency table with 4 rows and 5
columns for two nominal variables. Calculate the degrees of freedom for this
test.
ANS: 12
ANS: 11.143
13. What is the expected frequency of the $50,000 and over and No categories?
ANS: 10.29
ANS: 13.77
ANS: 15.56
NARRBEGIN: Video
The owner of a video store is interested in whether or not there is a difference in
the proportions of VCR movies rented by members and non-members among
movie titles. Three movies are selected and each person who rents them is
identified as a member or non-member. A total of 350 people rent these movies
and the responses are:
NARREND
16. What are the degrees of freedom for the test of equal proportions?
ANS: 2
17. What is the expected frequency of the cell Rambo and Non-member?
ANS: 70
ANS: 35.89
ANS: 9.21
PTS: 1 OBJ: Section 13.5
NARRBEGIN: Employee
Four employees are monitored to determine whether there is any difference in
the proportions of acceptable parts produced by the employees. The sample of
parts produced is given below.
NARREND
20. These data provide the information for testing the hypothesis that the
proportion of unacceptable is the same for the four employees. What is the
expected frequency for Acceptable and employee 3?
ANS: 237.50
21. These data provide the information for testing the hypothesis that the
proportion of unacceptable is the same for the four employees. What will be the
degrees of freedom for this test?
ANS: 3
22. These data provide the information for testing the hypothesis that the
proportion of unacceptable is the same for the four employees. At a significance
level of 0.01, what will be he critical value for this chi-square test?
ANS: 11.345
23. These data provide the information for testing the hypothesis that the
proportion of unacceptable is the same for the four employees. Calculate the
test statistic.
ANS: 24.02
ANS: 29.455
NARRBEGIN: Survey
In a survey of 100 males and 100 females at a regional university, 26 percent of
the males and 18 percent of the females responded that they were offended by
frequently aired television. The test question is: Could these results indicate that
the same proportions of the two samples are offended?
NARREND
25. What is the calculated value of the test statistic?
ANS: 1.864
NARRBEGIN: Biology
A biology professor claimed that the proportions of grades in his classes are the
same. A sample of 100 students showed the following frequencies:
NARREND
26. Compute the value of the test statistics.
ANS: 7.90
ANS: 12.401
28. In a hypothesis test for the population variance, the alternate hypothesis is
the population variance is greater than 17.0. The significance level to be used is
0.025 and the sample size to be taken is 25. What table value should we use
from the chi-square distribution?
ANS: 39.364
NARRBEGIN: Hypotheses
In testing the hypotheses:
H0:
H1:
ANS: 1.664
COMPLETION
2. Under the assumption that the null hypothesis is true, we construct a table of
____________________ that are based on the probability distribution from which the
sample is assumed to have been drawn.
3. The starting point for the chi-square test of variable independence is the
____________________.
ANS: chi-square
SHORT ANSWER
1. When samples of size n are drawn from a normal population, the chi-square
distribution is the sampling distribution of = ____________________, where s2 and
are the sample and population variances, respectively.
ANS:
2. Find the chi-square value for each of the right-tail areas below, given that the
degrees of freedom are 7:
A) 0.95 ____________________
B) 0.01 ____________________
C) 0.025 ____________________
D) 0.05 ____________________
ANS:
2.167; 18.475; 16.013; 14.067
3. Find the chi-square value for each of the four degrees of freedom below, given
that the area to the left of a chi-square value is 0.05.
A) 2 ____________________
B) 6 ____________________
C) 10 ____________________
D) 26 ____________________
ANS:
0.103; 1.635; 3.940; 15.379
H0: _______________________________________________________________
H1: _______________________________________________________________
ANS:
H0: The sample was drawn from a population that is Poisson distributed with a
mean equal to 1.90.;
H1: The sample was not drawn from such population.
5. She tests the hypothesis that returns per day are Poisson distributed with a
population mean equal to 1.90. Her significance level is 0.05. Identify the critical
value.
ANS:
2(0.05, 5) = 11.07Critical Value
6. She tests the hypothesis that returns per day are Poisson distributed with a
population mean equal to 1.90. Her significance level is 0.05. Calculate the
expected frequencies.
ANS:
7. She tests the hypothesis that returns per day are Poisson distributed with a
population mean equal to 1.90. Her significance level is 0.05. Determine the
calculated value of the chi-square test statistic.
ANS:
2 = 3.172Calculated Value
PTS: 1 OBJ: Section 13.3
8. She tests the hypothesis that returns per day are Poisson distributed with a
population mean equal to 1.90. Her significance level is 0.05. State the
appropriate conclusion at the 0.05 significance level.
ANS:
Do not reject H0 No statistical evidence that these data do not come from a
Poisson distribution with a mean equal to 1.90
NARRBEGIN: Stocks
One thousand stocks watched by a financial advisory company are advertised as
having an average return-on-equity (ROE) of 11.4 percent and a standard
deviation of 3 percent. An inquiry is made by a prospective client who finds that
the ROE is supposed to be normally distributed for these stocks. The prospective
client obtains a list of the stocks and randomly samples 300 stocks and records
the ROE. The results are:
NARREND
9. Employing a 0.10 significance level, what is the appropriate conclusion based
on the calculations from the above data?
ANS:
Do not reject H0
NARRBEGIN: Accommodations
Business travelers stay in a variety of accommodations. A set of national
percentages for types of accommodations shows their choices below. During a
sluggish economy there may be changes in business traveler accommodation
patterns so a travel consultant surveyed 300 business travelers and asked about
their accommodations. These data will be used to test whether or not the
present accommodation distribution is the same as the national one.
NARREND
10. At the 0.05 significance level, what is the appropriate table value?
ANS:
2(0.05, 3) = 7.815Critical Value
ANS:
Do not reject H0; there is no evidence to suggest the present accommodation is
different from the national one.
12. Assuming that there is only one category with an expected frequency less
than five, what is the appropriate table value for the 0.05 significance level?
ANS:
2(0.05, 4) = 9.488Critical Value
ANS:
x Oj p(x) Ej
0 10 .0313 6.26
1 38 .1562 31.24
2 69 .3125 62.50
3 63 .3125 62.50
4 18 .1562 31.24
5 2 .0313 6.26
TOTAL 200 1.00 200
PTS: 1 OBJ: Section 13.3
Hypotheses: ____________________
Conclusion: ____________________________
ANS:
H0: The sample is from a binomial distribution with = 0.50
H1: The sample is not from a binomial distribution with = 0.50
2Rejection region: reject H0 if calculated > 2 = 12.888, we reject H0; conclude
that there is statistical evidence to show that the sample is not from a binomial
distribution with = 0.5011.07 Since calculated
ANS:
H0: _________________________________________________________________
H1: _________________________________________________________________
ANS:
H0: Level of income and the decision of the voters on the bond issue are
independent.;
H1: Level of income and the decision of the voters on the bond issue are
dependent.
ANS:
Reject H0 and conclude that the level of income impacts the decision of voters
on the bond issue.
H0: ___________________________________________________________
H1: ___________________________________________________________
Conclusion: ____________________________________
ANS:
Gender and level of job satisfaction are independent.;
Gender and level of job satisfaction are dependent.;
2Rejection region reject H0 if calculated > 2 = 4.708;2(0.10) = 4.605
Calculated
Reject H0 and conclude that the level of job satisfaction is related to gender.
PTS: 1 OBJ: Section 13.4
19. A large carpet store wishes to determine if the brand of carpet purchased is
related to the purchaser’s family income. As a sampling frame, they mailed a
survey to people who have a store credit card. Five hundred customers returned
the survey and the results follow:
At the 5% level of significance, can you conclude that the brand of carpet
purchased is related to the purchaser’s family income?
H0: ____________________________________________________________
H1: ____________________________________________________________
Conclusion: ____________________________
ANS:
H0: Family income and brand of carpet are independent.;
H1: Family income and brand of carpet are dependent.;
2Rejection region: reject H0 if calculated > 2(.05, 4) = 9.488
2 = 27.372; we reject H0 and conclude that the brand of carpet purchased is
related to the purchaser’s family income.Calculated
20. A sport preference poll showed the following data for men and women:
H0: _______________________________________________________________
H1: _______________________________________________________________
ANS:
H0: Gender and favorite sport are independent.;
H1: Gender and favorite sport are dependent.;
2Rejection region: Reject H0 if calculated >2(.05,4) = 9.488;
2 = 3.30; Do not reject H0; there is not enough statistical evidence to conclude
that sport preferences depend on gender.Calculated
NARRBEGIN: Gifts
Three choices of free gifts are offered to 270 persons who subscribed to a new
magazine. The number choosing each gift is shown in the table below. Assume
the null hypothesis that the gifts are equally attractive to the subscribers. Use a
0.01 significance level.
NARREND
21. State the null and alternative hypotheses.
H0: __________________________________________________________
H1: __________________________________________________________
ANS:
The gifts are equally attractive to the subscribers).;H0:
values differs from the others. (Gifts are not equally attractive to the
subscribers).H1: At least one of the
ANS:
Reject H0 if calculated chi-square > critical value = 9.21.
NARRBEGIN: Video
The owner of a video store is interested in whether or not there is a difference in
the proportions of VCR movies rented by members and non-members among
movie titles. Three movies are selected and each person who rents them is
identified as a member or non-member. A total of 350 people rent these movies
and the responses are:
NARREND
24. What is the conclusion?
ANS:
Reject H0 and conclude there is a difference in the proportion of movies rented
by members and non-members at the movie store.
NARRBEGIN: Employee
Four employees are monitored to determine whether there is any difference in
the proportions of acceptable parts produced by the employees. The sample of
parts produced is given below.
NARREND
25. These data provide the information for testing the hypothesis that the
proportion of unacceptable is the same for the four employees. What is the
conclusion?
ANS:
Reject H0 and conclude that there is a difference in the proportions of the
production of acceptable parts for at least one employee.
1994: ____________________
1997: ____________________
ANS:
40; 4
27. What are the expected frequencies of the 1994 and 1997 breakdowns?
1994: ____________________
1997: ____________________
ANS:
22; 22
ANS:
Reject H0 and conclude that at least one of the proportions of breakdowns is not
the same as the other years.
NARRBEGIN: Survey
In a survey of 100 males and 100 females at a regional university, 26 percent of
the males and 18 percent of the females responded that they were offended by
frequently aired television. The test question is: Could these results indicate that
the same proportions of the two samples are offended?
NARREND
29. What are the observed frequencies of the males and females who are
offended by the ads?
Males: ____________________
Females: ____________________
ANS:
26; 18
30. Calculate the expected frequencies of the males and females who are
offended by the ads.
Males: ____________________
Females: ____________________
ANS:
22; 22
The proportions of males and females who are offended by the ads are:
____________________
ANS:
equal
NARRBEGIN: Biology
A biology professor claimed that the proportions of grades in his classes are the
same. A sample of 100 students showed the following frequencies:
NARREND
32. State the null and alternative hypotheses to be tested.
H0: ____________________________________________________________
H1: ____________________________________________________________
ANS:
;
At least one proportion differs from its specified value.
33. State the critical value which determines the rejection region at the 5%
significance level.
ANS:
2Reject H0 if > 9.488
34. Do the data provide enough evidence to support the professor’s claim?
ANS:
Since 7.90 < 9.488, we fail to reject the null hypothesis. The data supports the
professor’s claim.
35. In 2000, the student body of a state university in Michigan consists of 30%
freshmen, 25% sophomores, 27% juniors, and 18% seniors. A sample of 400
students taken from the 2000 student body showed that there are 138
freshmen, 88 sophomores, 94 juniors, and 80 seniors. Test with 5% significance
level to determine whether the student body proportions have changed.
H0: _______________________________________________________
H1: _______________________________________________________
Conclusion: ___________________________
ANS:
H0: ;
H1: At least one proportion differs from its specified value.;
2Rejection region: reject H0 if calculated > 2(0.05, 3) = 7.815.
2 = 6.844, do not reject H0 and conclude that there is no statistical evidence to
suggest that the student body proportions have changed.Since calculated
NARRBEGIN: TV Model
Three hundred individuals were asked to view the picture quality of four different
TVs and asked to choose the one they preferred. The data is shown in the
following table:
NARREND
36. Test the equality of proportions at the 0.05 level.
ANS:
H0:
H1: At least one population proportion differs from the others.
37. Test the equality of proportions at the 0.05 level the hypotheses.
38. Test the equality of proportions at the 0.05 level the hypotheses.
ANS:
Reject the null hypothesis if > 7.815.
A B C D F Total
Male 124 192 212 87 15 630
Female 195 238 240 120 27 820
Total 319 430 452 207 42 1450
NARREND
39. Compute the expected counts for each category.
ANS:
A B C D F Total
Male 138.6 186.8 196.4 89.9 18.2 630
Female 180.4 243.2 255.6 117.1 23.8 820
Total 319 430 452 207 42 1450
PTS: 1 OBJ: Section 13.5
41. Using = 0.05, state the decision rule in terms of the critical value.
ANS:
Reject the null hypothesis if > 9.488.
ANS:
Since < 9.488, we fail to reject the null hypothesis. Therefore there is no
evidence to suggest that male students have a different grade distribution than
female students.
43. The realtor for an exclusive, planned housing development says that house
prices average $125 thousand and uniformity of prices within the development
is indicated by the standard deviation of $ 8 thousand. A prospective buyer
wants to check on the uniformity of prices and randomly samples 12 houses and
determines the selling price. These data are: 113.0, 126.2, 134.4, 129.4, 106.6,
117.4, 123.5, 121.0, 109.6, 119.4, 126.6, and 119.6. Use these data to test the
hypothesis:
The buyer assumes selling prices are normally distributed and employs a 0.05
significance level. Is the statement by the realtor correct?
Conclusion: _____________________
Interpretation: _____________________________________
ANS:
Calculated value = 11.426; Critical value = 19.675; Do not reject H0; The
realtor’s statement is supported by this data.
44. In preparing a confidence level for a population variance, the size of the
sample is 12, and the level of confidence is 0.90. What are the appropriate table
values for this interval?
ANS:
4.575; 19.675
45. In preparing a confidence level for a population variance, the size of the
sample is 13, and the level of confidence is 0.90. What are the appropriate table
values for this interval?
ANS:
5.226; 21.026
46. In preparing a confidence level for a population variance, the size of the
sample is 13 and the level of confidence is 0.95. What are the appropriate table
values for this interval?
ANS:
4.404; 23.337
H0:
H1:
ANS:
Reject H0 if calculated chi-square < 2.7 or if calculated chi-square > 19.023
ANS:
Reject H0 We can infer that the population variance differs significantly from
0.01
49. A sample of size 20 is used to obtain the following statistics: and Can we
conclude at the 1% significance level that the population variance is less than
100?
Hypotheses:
Conclusion: _________________________
Interpretation: __________________________________________________
ANS:
1002 H0:
2H1: < 100
2Rejection region: if < 2 = 13.916; Do not reject H0; There is not enough
evidence to conclude that the population variance is less than 100.7.633
Calculated
50. A sample of size 20 is used to obtain the following statistics: and Estimate
the population variance with 98% confidence.
ANS:
38.450; 182.307
H0:
H1:
Conclusion: ____________________
Interpretation: _________________________________________________
ANS:
2Rejection region: Reject H0 if > 2 = 4.90; Do not reject H0; There is not
enough evidence to conclude that the population variance is greater than
20.7.779; Calculated
ANS:
10.329; 137.888
53. In order to examine whether a sample could have come from a given type of
population distribution (e.g., the normal distribution), chi-square analysis
involves the comparison of a table of ______________________________ Oij with a
table of ______________________________ Eij that has been constructed under the
assumption that the null hypothesis is true.
ANS:
observed frequencies; expected frequencies
ESSAY
1. Using the chi-square distribution, along with sample data and frequency
counts, what will we be able to examine?
ANS:
We will be able to examine: 1) whether a sample could have come from a given
type of population distribution. 2) whether two nominal variables could be
independent of each other. 3) whether two or more independent samples could
have the same population proportion.
3. In carrying out a chi-square goodness-of-fit test, what are the “k” and “m”
terms in the “df = k – 1 – m” expression and why is each term present?
ANS:
Given k is the number of categories or groups in the, and m is the number of
population parameters that must be estimated from sample data in order to
carry out the test. “k-1” cells to determine the count in the cell. In addition, each
population parameter that must be estimated also reduces our degrees of
freedom.
ANS:
The rationale for df = (r-1) (k-1) is that this quantity represents the number of
cell frequencies that are free to vary. Given that we know the sample size (the
total number of observations in the table), we need only know the contents of (r-
1) rows and (k-1) columns in order to completely fill in the cells within the r x k
table.
TRUE/FALSE
1. True or False
A one-sample t-test is the parametric counterpart to the Wilcoxon signed rank
test for one sample.
2. True or False
A Nonparametric test is one that assumes that the population from which a
sample is drawn is normally distributed or approximately so.
3. True or False
Nonparametric tests are useful when data are of the nominal or ordinal scale of
measurement.
4. True or False
Compared to a parametric test, Nonparametric tests use the information in the
data more efficiently.
5. True or False
The Spearman coefficient of rank correlation measures both the strength and
direction of paired observations that are at least a nominal measurement scale.
6. True or False
The Wilcoxon signed rank test for comparing paired samples assumes the data
are continuous and are of the interval or ratio scale of measurement.
7. True or False
The Wilcoxon signed rank test is applied to a single sample in testing whether
the population proportion could be a hypothesized value.
ANS: F PTS: 1 OBJ: Section 14.8
8. True or False
The Kruskal-Wallis test compares the medians of more than two independent
samples.
9. True or False
The Kruskal-Wallis test requires that the scale of measurement of the data must
be at least interval or ratio, and the samples are assumed to be randomly
selected from their respective populations.
MULTIPLE CHOICE
3. An NBA executive says that players who substitute as the sixth, seventh and
eighth positions for the games starting five players have a median playing time
of 24 minutes. A random sample of 25 substitutes (6, 7 or 8) produces 20 non-
zero differences in a Wilcoxon signed rank test and a W (R+) equal to 146. The
alternate hypothesis is m 24 minutes. Assume the population is symmetrically
distributed. The computed test statistic in this situation equals:
a. 146.
b. 24.
c. 25.
d. 1.53.
e. 41.
ANS: D PTS: 1 OBJ: Section 14.2
5. An NHL executive says that players who incur penalty time during a game
have a median time of 1.2 minutes in the penalty box. A random sample of 28
hockey players with penalty time produced 24 non-zero differences when a
Wilcoxon signed rank test was used. The W (R+) equaled 156.5. The alternate
hypothesis is m 1.2 minutes. Assume the population is symmetrically
distributed. The computed test statistic in this instance equals:
a. 156.5.
b. 0.186.
c. 6.5.
d. 35.
e. 143.5.
ANS: B PTS: 1 OBJ: Section 14.2
7. The significance level for a Wilcoxon signed rank test for matched pairs is
0.05. The alternative hypothesis is H1: md > 0. The appropriate critical value for
n = 20 nonzero differences is:
a. 158
b. 149
c. 61
d. 52
e. 112
ANS: B PTS: 1 OBJ: Section 14.3
8. In a Wilcoxon signed rank test for matched pairs with n = 35, the rank sums
of the positive and negative differences are 380 and 225, respectively. The value
of the standardized test statistic (that is, the value of z approximation test
statistic) is:
a. 1.689.
b. 1.065.
c. 1.206.
d. 1.400.
e. 1.960.
ANS: B PTS: 1 OBJ: Section 14.3
9. A nonparametric method to compare two populations, when the samples are
matched pairs and the data are quantitative where the normality requirement
necessary to perform the parametric test is unsatisfied, is the:
a. the Wilcoxon rank sum test.
b. the Friedman test.
c. the Kruskal-Wallis test.
d. the Matched pairs t-test.
e. the Wilcoxon signed rank test.
ANS: E PTS: 1 OBJ: Section 14.3
10. In a Wilcoxon signed rank test for matched pairs, the test statistic is
calculated as W = 48. The alternative hypothesis is H1: md > 0. If there are n =
10 nonzero differences, and the 5% significance level is used, then the:
a. the null hypothesis will be rejected.
b. the null hypothesis will not be rejected.
c. the test results are inconclusive.
d. the t-test for matched pairs must be used.
ANS: A PTS: 1 OBJ: Section 14.3
12. In a Wilcoxon rank sum test n1 = 12, n2 = 15 and the sum of R1 = 108. The
value of the test statistic for the normal approximation is:
a. -1.23.
b. -2.93.
c. -108.
d. 108.
e. 15.
ANS: B PTS: 1 OBJ: Section 14.4
13. In a Wilcoxon rank sum test n1 = 14, n2 = 16 and the sum of R1 = 260. The
value of the test statistic for the normal approximation is:
a. 1.79.
b. -2.93.
c. -260.
d. 260.
e. 16.
ANS: A PTS: 1 OBJ: Section 14.4
14. The Wilcoxon rank sum test assumes that the measurement scale of the
data be at least:
a. nominal.
b. ordinal.
c. interval.
d. ratio.
e. Any measurement scale can be used for the Wilcoxon rank sum test.
ANS: B PTS: 1 OBJ: Section 14.4
15. In a normal approximation to the Wilcoxon rank sum test, the standardized
test statistic value is calculated as z = 1.80. For a two- tail test, the p-value is:
a. 0.0359.
b. 0.4641.
c. 0.2321.
d. 0.0718.
e. 0.1077.
ANS: D PTS: 1 OBJ: Section 14.4
16. A Wilcoxon rank sum test for comparing two populations involves two
independent samples of sizes 15 and 20. The un-standardized test statistic (that
is the rank sum) is W = 210. The value of the standardized test statistic (that is,
the value of z-approximation test statistic) is:
a. 14.
b. 10.5.
c. 6.
d. -1.4.
e. -8.
ANS: D PTS: 1 OBJ: Section 14.4
The value of the test statistic for a left-tail Wilcoxon rank sum test is:
a. 6.
b. 20.
c. 35.
d. 55.
e. 9.
ANS: B PTS: 1 OBJ: Section 14.4
18. The Wilcoxon rank sum test statistic W is approximately normally distributed
whenever both sample sizes are at least:
a. 10.
b. 15.
c. 20.
d. 25.
e. 30.
ANS: A PTS: 1 OBJ: Section 14.4
20. A nonparametric method to compare more than two populations, when the
samples are independent and the data are either ranked or quantitative but not
normal, is:
a. Kruskal-Wallis test.
b. Friedman test.
c. Wilcoxon rank sum test.
d. Wilcoxon signed rank test.
e. Sign test for comparing paired samples.
ANS: A PTS: 1 OBJ: Section 14.5
22. In a Kruskal-Wallis test at the 5% significance level, there are four samples
and the value of the test statistic is calculated as H = 8.79. The most accurate
statement that can be made about the p-value is that it is:
a. greater than 0.10.
b. greater than 0.05 but smaller than 0.10.
c. greater than 0.05.
d. greater than 0.025 but smaller than 0.05.
e. greater than 0.10 but smaller than 0.15.
ANS: D PTS: 1 OBJ: Section 14.5
26. The sign test for comparing paired samples assumes that the measurement
scale of the sample data is:
a. nominal.
b. ordinal.
c. interval.
d. ratio.
e. Any measurement scale can be used for the sign test for comparing paired
samples.
ANS: B PTS: 1 OBJ: Section 14.7
29. A random sample of ten basketball players from the SLC revealed 3 (+
differences) who weighed more than 245 pounds. Test the null hypothesis that
md = 0 versus the alternative hypothesis that md < 0. The p-value for the
computed test statistic is:
a. 0.3438.
b. 0.9453.
c. 0.0547.
d. 0.1094.
e. 0.1719.
ANS: E PTS: 1 OBJ: Section 14.7
NUMERIC RESPONSE
1. The significance level for a Wilcoxon signed rank test for one sample is 0.025.
The alternative hypothesis is m < m0, and the sample size is 14. What is the
appropriate critical value?
ANS: 22
ANS: 83
NARRBEGIN: Dataset
Consider the following data set:
NARREND
3. What is the rank assigned to the four observations of value 22?
ANS: 13.5
ANS: 8
NARRBEGIN: IRS
As a guideline, a local Internal Revenue Service office recommends that auditors
try to spend 16 minutes or less with taxpayers who are called for audit
interviews. In a recent random sample of 22 interviews, the interview times
were as shown below:
NARREND
5. Using the appropriate statistical table, what is the most accurate statement
we can make about the p-value test?
p-value = ____________________
ANS: 0.017
NARRBEGIN: Writers
Eight professional food writers have been asked to rate the taste characteristics
of a meat dish prepared first by flame grilling and then by charcoal grilling. After
each of the dishes is prepared it is rated on a scale from 1 to 5 with 1 being best
and 5 being worst. The results of the rating were:
NARREND
6. The Wilcoxon signed rank test for paired samples is used to test the
hypotheses H0: md = 0 vs. H1: md 0. What will be the rank for writer A?
ANS: 7
7. The Wilcoxon signed rank test for paired samples is used to test the
hypotheses H0: md = 0 vs. H1: md 0. What is the calculated test statistic, W?
ANS: 20.5
ANS: -2.312
ANS: 0.208
NARRBEGIN: Apartments
Residents of two different high rise apartment units may or may not have the
same consumer indebtedness. A sample from each results in the following
outstanding debt amounts:
NARREND
10. The test question is: Do residents of the two apartment units have similar
debt medians? What is the rank for 12832 debt amount?
ANS: 6
11. The test question is: Do residents of the two apartment units have similar
debt medians? What is the calculated test statistic W?
ANS: 25
ANS: 11
13. The test question is: Do the data indicate that the two programs are equally
effective? What is the calculated test statistic W?
ANS: 32.5
ANS: 3
15. The test question is: Do the different age groups require the same median
time to name the Dwarfs? In testing the hypotheses, what is the appropriate
critical value for the 0.05 significance level?
ANS: 5.991
16. The test question is: Do the different age groups require the same median
time to name the Dwarfs? What is the calculated test statistic H?
ANS: 6.795
NARRBEGIN: FAA
The FAA recorded the number of complaints per month for three airlines. The
data were:
NARREND
17. The test question is: Do the three airlines have the same median number of
complaints? What is the rank of 1544 from Airline C?
ANS: 2
18. The test question is: Do the three airlines have the same median number of
complaints? In testing the hypotheses, what is the appropriate critical value for
the 0.025 significance level?
ANS: 7.378
ANS: 8.24
NARRBEGIN: Treatments
ANS: 3
ANS: 7.583
ANS: 2
ANS: 5.991
ANS: 7.93
ANS: 7.378
ANS: 5.2
ANS: 1.47
ANS: 4.2
ANS: 0.90
COMPLETION
1. For one sample, the Wilcoxon signed rank method tests whether the sample
could have been drawn from a population having a hypothesized value as its
____________________.
ANS: median
2. The z-test approximation to the Wilcoxon signed rank test is used whenever
the number of nonzero differences is at least ____________________.
ANS:
20
twenty
3. The critical value of the Kruskal-Wallis test for comparing more than two
independent samples is obtained from the ____________________ distribution
whenever each sample size is at least ____________________.
ANS:
chi-square; 5
chi-square, five
ANS: Friedman
6. Unlike the Wilcoxon signed rank test, the ____________________ test assumes
data are ordinal.
ANS: sign
8. Like the randomized block ANOVA, the ____________________ test compares the
effectiveness of treatments, but it is concerned with their medians rather than
their mean.
ANS: Friedman
ANS: sign
10. The ____________________ test for normality can be used for samples of any
size, including those that are too small.
ANS: Kolmogorov-Smirnov
11. The z-test approximation to the Wilcoxon rank sum test for comparing two
independent samples is used whenever both sample sizes are at least
____________________.
ANS:
10
ten
12. In applying the Friedman test, either the number of blocks or the number of
treatments should be at least ____________________.
ANS:
5
five
SHORT ANSWER
1. The significance level for a Wilcoxon signed rank test for one sample is 0.02.
The alternative hypothesis is m m0, and the sample size is 14. What is/are the
appropriate critical value(s)?
2. The significance level for a Wilcoxon signed rank test for one sample is 0.01.
The alternative hypothesis is m m0, and the sample size is 14. What is/are the
appropriate critical value(s)?
ANS:
13, 92
NARRBEGIN: Wilcoxon
In a Wilcoxon signed rank test the following information describes the results:
Number of nonzero differences = 12; R+ = 22; R- = 56. The hypothesis set is:
H0: m 33
H1: m > 33
NARREND
3. Assume the population is symmetrically distributed. At a 0.05 significance
level, what is the appropriate decision rule?
ANS:
Reject the null hypothesis if R+ > 60
Conclusion: ____________________________
ANS:
22; 60; Do not reject H0
PTS: 1 OBJ: Section 14.2
NARRBEGIN: Nonzero
Number of nonzero differences = 14; R+ = 16; R- = 89.
ANS:
Reject the null hypothesis if R+ < 26
Conclusion: _______________________
ANS:
16; 26; Reject H0
NARRBEGIN: IRS
As a guideline, a local Internal Revenue Service office recommends that auditors
try to spend 16 minutes or less with taxpayers who are called for audit
interviews. In a recent random sample of 22 interviews, the interview times
were as shown below:
NARREND
7. Using the 0.025 level, do these data suggest that the median interview time
for auditors from this office exceeds the recommend guideline?
H0: ____________________
H1: ____________________
Test statistic = ____________________
Conclusion: __________________________________
Interpretation: _______________________________________________________
ANS:
m 16 minutes; m > 16 minutes; 2.12; 1.96; Reject H0; The median interview
time exceeds the recommended guideline
8. Explain how to use the p-value to reach a conclusion when testing at the
0.025 level of significance.
ANS:
Since p-value = .017 < = 0.25, we reject H0
NARRBEGIN: Writers
Eight professional food writers have been asked to rate the taste characteristics
of a meat dish prepared first by flame grilling and then by charcoal grilling. After
each of the dishes is prepared it is rated on a scale from 1 to 5 with 1 being best
and 5 being worst. The results of the rating were:
NARREND
9. The Wilcoxon signed rank test for paired samples is used to test the
hypotheses H0: md = 0 vs. H1: md 0. At the 0.05 significance level, what are the
critical values?
ANS:
3, 25
ANS:
Do not reject H0
11. The following data represent the test scores of eight students on a statistics
test before and after attending extra help sessions for the test:
Use the Wilcoxon signed rank test to determine at the 5% significance level
whether the extra help sessions have been effective.
ANS:
0, H1: mdH0: md < 0;
Test statistic WRejection region: W
Fail to reject the null hypothesis; It does not appear that the help sessions have
been effective.
12. In testing the hypotheses H0: md = 0, H1: md 0, data are drawn from a
matched pairs experiment, and the following statistics are calculated: n =
number of nonzero differences = 40; R+ = 238; R- = 582. Assume that the 5%
level of significance is used. Which test is used for testing the hypotheses
above?
ANS:
The Wilcoxon signed rank test.
13. In testing the hypotheses H0: md = 0, H1: md 0, data are drawn from a
matched pairs experiment, and the following statistics are calculated: n =
number of nonzero differences = 40; R+ = 238; R- = 582. Assume that the 5%
level of significance is used. What is the conclusion?
ANS:
Reject H0 and conclude that the population median of differences is not zero.
14. The following data are drawn from a matched pairs experiment:
Use the 5% significance level to test the hypotheses H0: md 0, H1: md < 0
Conclusion: ___________________________
Interpretation: __________________________________________________________
ANS:
4; 4; Reject H0; The population median of differences is smaller than zero
15. A Wilcoxon rank sum test for comparing two independent samples involves
two sample sizes: 8 and 6. The alternate hypothesis is m1 m2. Use a 0.05
significance level. What are the appropriate table values?
ANS:
29, 61
H0: ____________________
H1: ____________________
ANS:
m2H0: m1 = m2, H1: m1
17. The test question is: Do residents of the two apartment units have similar
debt medians? In testing the hypotheses, the significance level is 0.05. What are
the appropriate critical values?
ANS:
19, 41
18. The test question is: Do residents of the two apartment units have similar
debt medians? State the appropriate conclusion employing a 0.05 significance
level.
ANS:
Do not reject H0
H0: ____________________
H1: ____________________
ANS:
m2H0: m1 = m2, H1: m1
20. The test question is: Do the data indicate that the two programs are equally
effective? In testing the hypotheses, the significance level is 0.10. What are the
appropriate critical values?
ANS:
20, 40
21. The test question is: Do the data indicate that the two programs are equally
effective? State the appropriate conclusion employing a 0.05 significance level.
ANS:
Do not reject H0
22. Use the Wilcoxon rank sum test on the data below to determine at the 10%
significance level whether the two population medians differ.
H0: ____________________
H1: ____________________
Conclusion: _____________________________
Interpretation: _______________________________________________________
ANS:
m1 = m2; m1 m2;
W = 119.5;
83,or WW > 127;
Do not reject H0; Not enough evidence to state that the population medians are
different
23. The following statistics are drawn from two independent samples: R1 = 800,
n1 = 25, R2 = 1100 and n2 = 28. Test at the 5% significance level to determine
whether the two population medians differ.
ANS:
m2; m2 , H1: m1 H0: m1
Test statistic zRejection region: |z|
Reject the null hypothesis; conclude that the population medians differ.
24. A Wilcoxon rank sum test for comparing two independent samples involves
two sample sizes: 8 and 6. The alternate hypothesis is m1 m2. Use a 0.10
significance level. What are the appropriate table values?
ANS:
32, 58
Conclusion: _____________________________________
Interpretation: _____________________________________________________
ANS:
H = 2.54; Rejection region > 7.378; Do not reject H0; Not enough evidence to
show that the median total expenditure by households is different among these
states.
H0: _______________________
H1: ________________________
ANS:
m1 = m2 = m3; At least one mi differs from the others.
ANS:
Reject H0 and conclude there is a difference in the median total times to name
the dwarfs recorded by age group.
NARRBEGIN: FAA
The FAA recorded the number of complaints per month for three airlines. The
data were:
NARREND
28. The test question is: Do the three airlines have the same median number of
complaints? State the null and alternative hypotheses.
H0: _____________________________
H1: _____________________________
ANS:
m1 = m2 = m3; At least one mi differs from the others.
29. The test question is: Do the three airlines have the same median number of
complaints? State the appropriate conclusion employing a 0.025 significance
level.
ANS:
Reject H0 and conclude that the three airlines do not have the same median
number of complaints
30. Apply the Kruskal-Wallis test to determine if there is enough evidence at the
5% significance level to infer that at least one of the population medians differs
from the others.
H0: ______________________________________
H1: ______________________________________
Conclusion: ________________________________
Interpretation: _________________________________________________________
ANS:
m1 = m2 = m3; At least one median differs from the others;
H = 0.38; Rejection region H > 5.991; Do not reject H0; Not enough evidence to
conclude that the population medians are different
31. The following data are drawn from four independent samples:
H0: __________________________
H1: __________________________
Conclusion: _____________________________
Interpretation: ______________________________________________________
ANS:
m1 = m2 = m3 = m4; At least one median differs from the others;
H = 10.2601; Rejection region H > 11.345; Do not reject H0; Not enough
evidence to conclude that the population medians are different
H0: _____________________________
H1: _____________________________
ANS:
m1 = m2 = m3; At least one median differs from the others.
PTS: 1
ANS:
Reject H0 and conclude that the population medians of the treatments are not
all equal.
H0: ____________________________
H1: ____________________________
ANS:
m1 = m2 = m3; At least one median differs from the others.
ANS:
Reject H0 and conclude that at least one of the population treatment medians is
different from the others.
PTS: 1 OBJ: Section 14.6
NARRBEGIN: Airline
Ten business people who fly frequently were asked to rank four airlines in terms
of the quality of service. The people assigned scores using a 5-point scale
where: 1 = bad, 2 = poor, 3 = average, 4 = good, and 5 = excellent. The results
are shown below:
NARREND
36. Which test is appropriate if you want to compare the quality of service of the
four airlines?
ANS:
The Friedman test
37. Can we conclude at the 5% significance level that there are differences in
service quality among the four airlines?
H0: __________________________
H1: __________________________
Conclusion: ________________________________
Interpretation: ________________________________________________________
ANS:
m1 = m2 = m3 = m4; At least one of the population medians is different;
Fr = 5.64; Rejection region Fr > 7.815;
Do not reject H0; There is not enough evidence to show that the population
medians are different.
ANS:
p-value > .10
39. The test question is: Can we conclude at the 5% significance level that there
are differences in service quality among the four airlines? Using Excel or Minitab,
what is the p-value for the above test?
ANS:
p-value = 0.1305
40. Because of the cost of producing television shows and the profits associated
with successful shows, television network executives are keenly interested in
public opinion. A network has recently developed three comedy series. The pilot
of each series is shown in to 10 randomly selected people who evaluate each
show on a 9-point scale where 1 = terrible and 9 = excellent. From the results
shown below, can we infer at the 5% significance level that differences exist
among the show’s evaluations?
H0: ____________________
H1: ____________________
Conclusion: ___________________________
Interpretation: _________________________________________________________
ANS:
m1 = m2 = m3 ; At least one of the population medians is different;
Fr = 7.40; Rejection region Fr > 5.991;
Reject H0; There is enough evidence to show that at least one of the population
medians of the show’s evaluations is different.
41. The value of the test statistic in a Kolmogorov-Smirnov test for normality is D
= 0.199, and it is based on a sample of 20 observations. What conclusion can
you draw at the 5% significance level?
Hypotheses: _______________________
Conclusion: ___________________________
ANS:
H0: The data are normally distributed
H1: The data are not normally distributed
Test statistic = 0.199; Rejection region: D > 0.190;
Reject H0; There is enough evidence to show that the data are not normally
distributed.
ANS:
The sign test
Hypotheses: _________________________
Conclusion: ______________________
ANS:
H0: md = 0 H1: md
Test statistic: z = -2.53; Rejection region: |z| > 1.96;
Reject H0 and conclude that the median quiz scores from the 2 commercials
differ
Can we infer at the 5% significance level that the distribution is not normal?
Hypotheses: ________________________________
Conclusion: _____________________________
ANS:
H0: The data are normally distributed H1: the data is not normally distributed
Test statistic: D = 0.1434; Rejection region: D > 0.258;
Fail to reject H0 and conclude that the data may be normally distributed
45. A husband and wife ranked 8 houses that they visited that were for sale. The
rankings are shown in the following table.
Calculate and interpret the Spearman coefficient of rank correlation and use the
0.05 level of significance in testing whether its true value could be something
other than 0.
ANS:
= 0.5
Critical values are -0.643 and +0.643
We fail to reject that the population value for the Spearman coefficient of rank
correlation is 0.
46. A class of 12 statistics students completed the course with the following
exam averages and homework averages.
Calculate and interpret the Spearman coefficient of rank correlation and use the
0.05 level of significance in testing whether its true value could be something
other than 0.
ANS:
= 0.6923
Critical values are -0.497 and +0.497
We reject the possibility that the population value for the Spearman coefficient
of rank correlation could be 0.
47. John took a test that required him to rank order the importance of 15 items
to help him survive in the desert. The following table shows his results along
with the correct order.
Calculate and interpret the Spearman coefficient of rank correlation and use the
0.01 level of significance in testing whether its true value could be something
other than 0.
ANS:
= 0.6679
Critical values are -0.623 and +0.623
We reject that the population value for the Spearman coefficient of rank
correlation is 0.
ESSAY
ANS:
A nonparametric test is one that makes no assumptions about the specific shape
of the population from which a sample is drawn.
2. The chapter lists four major advantages to nonparametric testing. What are
they?
ANS:
a) Fewer assumptions about the population. Most important, the population
need not be normally distributed or approximately so. Nonparametric tests do
not assume the population has any specific distribution. b) The techniques can
be applied when samples are small. c) Samples with data of the nominal and
ordinal scales of measurement can be tested. d) Calculations by hand or by
pocket calculator tend to be simpler. This is primarily due to the small sample
sizes typically involved.
3. The chapter lists two major disadvantages to nonparametric testing. What are
they?
ANS:
a) Compared to parametric test, information in the data is used less efficiently,
and the power of the test will be lower. For this reason, a parametric test is
preferable whenever its assumptions have been met. b) Nonparametric testing
places greater reliance on statistical tables, if a computer statistical package or
spreadsheet is not being used.
ANS:
A nonparametric test should be used instead of its parametric counterpart
whenever: (a) data are of the nominal or ordinal scales of measurement, or (b)
data are of the interval or ratio scales of measurement but one or more other
assumptions, such as the normality of the underlying population distribution,
have not been met.
ANS:
Nonparametric tests make no assumptions about the specific shape of the
population from which a sample was drawn; most of the tests seen in previous
chapters assumed normally distributed populations.
ANS:
The parametric counterpart to the Wilcoxon signed rank test is the one-sample t-
test. The one sample t-test assumes that the underlying population is normally
distributed and that the data are continuous on the ratio or interval scale. The
Wilcoxon signed rank test assumes that the underlying population is
approximately symmetric and that the data are continuous on the ratio or
interval scale.
7. What is the parametric counterpart to the Wilcoxon signed rank test for paired
samples? Compare the assumptions involved in using the respective tests.
ANS:
The parametric counterpart to the Wilcoxon signed rank test for paired samples
is the paired samples t-test. Both tests assume continuous data on the interval
or ratio scale, and the data come in pairs. The t-test assumes that both
underlying populations are normally distributed, whereas the Wilcoxon test
assumes only that both distributions are fairly symmetric in shape.
8. Differentiate between the Wilcoxon signed rank test and the Wilcoxon rank
sum test, and explain the circumstances under which each should be applied.
ANS:
The Wilcoxon rank sum test compares two independent samples, while the
Wilcoxon signed rank test can be used to compare two dependent samples. The
Wilcoxon rank sum test should be used with independent samples, and the
signed rank test should be used when dealing with pairs.
9. What is the parametric counterpart to the Wilcoxon rank sum test? Compare
the assumptions involved in using the respective tests.
ANS:
The parametric counterpart to the Wilcoxon rank sum test is the two-sample
pooled-variances t-test for independent samples. The t-test assumes data on the
interval or ratio scale, independent random samples, normally-distributed
populations, and equal variances. The Wilcoxon rank sum test assumes data on
the ordinal, interval, or ratio scale and that the samples are independent
random samples from populations with similar shapes.
10. Differentiate between the Kruskal-Wallis test and the one-way analysis of
variance in terms of their assumptions and the circumstances under which each
should be applied.
ANS:
The one-way analysis of variance test assumes that samples have been drawn
from normally distributed populations with equal variances. In addition, data
must be of the interval or ratio scale. The Kruskal-Wallis test requires neither
normal populations nor equal variances. The data must be at least ordinal, and
the samples are assumed to be randomly selected. The one-way ANOVA test is
preferred whenever the necessary assumptions are met.
11. Compare the Friedman test with the randomized block ANOVA in terms of:
(a) their respective null and alternative hypotheses, and (b) the assumptions
required in order to use each test.
ANS:
a) The null and alternative hypotheses for the Friedman test and the randomized
block ANOVA are almost identical except that the Friedman test deals with
medians and the ANOVA test deals with means. b) Randomized block ANOVA
requires that observations be from normal populations with equal variances and
that the observations be measured on interval or ratio scale. The Friedman test
makes no assumptions about the underlying populations, and the observations
may be measured on the ordinal, interval, or ratio scale.
PTS: 1 OBJ: Section 14.6
12. What role does the binomial distribution play in the sign test, and when is it
permissible to use the normal approximation in carrying out the test?
ANS:
The sign test relies on the fact that, if the true population (of differences)
median is equal to zero, there is a 0.50 probability of obtaining a positive
difference. Thus, this is a binomial experiment and a difference will be either
positive or negative (ties are ignored). When the number of observations (non-
zero differences) is at least ten, the normal approximation may be used.
13. Whether testing a single sample or comparing paired samples, how should
one go ahead deciding whether to use the sign test versus the Wilcoxon signed
rank test?
ANS:
The sign test does not use all of the information contained in the data; the data
are merely reduced to (+) and (-) signs, and magnitude is ignored. If the original
data are interval in nature, the Wilcoxon signed rank test will provide a more
powerful test.
PTS: 1
TRUE/FALSE
1. True or False
In the sample regression line the term b0 is the y-intercept; this is the value of y
where the line intersects the y-axis whenever x = 0.
3. True or False
The simple linear regression model assumes that for any given value of x, the
population of residuals will be normally distributed with a mean of zero and a
standard deviation of 1.
4. True or False
When the predicted values of y and the actual values of y are the same, the
standard error of estimate will be 0.0.
5. True or False
The simple linear regression model assumes that regardless of the value for x,
the standard deviation of the distribution of y values about the regression line is
the same.
6. True or False
The simple linear regression model assumes that the y values are statistically
independent of each other but the residuals are statistically dependent of each
other.
7. True or False
The least-squares criterion requires that the sum of the squared deviations
between the y values in the scatter diagram and the y values predicted by the
equation be minimized.
8. True or False
The value of the variation explained by the regression line can never be larger
than 1.0.
9. True or False
In a simple linear regression model, the residual is the horizontal distance from
the regression line to an observed data point.
MULTIPLE CHOICE
4. A regression analysis between sales (in $1000) and advertising (in $) resulted
in the following least squares line: = 80,000 + 5x. This implies that an:
a. increase of $1 in advertising is expected to result in an increase of $5 in sales.
b. increase $5 in advertising is expected to result in an increase of $5,000 in
sales.
c. increase of $1 in advertising is expected to result in an increase of $80,005 in
sales.
d. increase of $1 in advertising is expected to result in an increase of $5,000 in
sales.
ANS: D PTS: 1 OBJ: Section 15.2
5. A regression analysis between weight (y in pounds) and height (x in inches)
resulted in the following least squares line: = 120 + 5x. This implies that if the
height is increased by 1 inch, the weight is expected to:
a. increase by 1 pound.
b. decrease by 1 pound.
c. increase by 5 pounds.
d. decrease by 24 pounds.
ANS: C PTS: 1 OBJ: Section 15.2
6. A regression analysis between sales (in $1000) and advertising (in $100)
resulted in the following least squares line: = 75 +6x. This implies that if
advertising is $800, then the predicted amount of sales (in dollars) is:
a. $4875.
b. $123,000.
c. $487,500.
d. $12,300.
ANS: B PTS: 1 OBJ: Section 15.2
10. The vertical spread of the data points about the regression line is measured
by the:
a. regression coefficient.
b. standard error of estimate.
c. y-intercept.
d. homoscedasticity coefficient.
e. t-ratio.
ANS: B PTS: 1 OBJ: Section 15.3
11. In order to estimate with 95% confidence the expected value of y in a simple
linear regression problem, a random sample of 10 observations is taken. Which
of the following t-table values listed below would be used?
a. 2.228
b. 2.306
c. 1.860
d. 1.812
ANS: B PTS: 1 OBJ: Section 15.3
12. If the standard error of estimate = 18 and n = 10, then the error sum of
squares, SSE, is:
a. 2916.
b. 2592.
c. 1800.
d. 3240.
ANS: B PTS: 1 OBJ: Section 15.3
13. An indication of no linear relationship between two variables would be:
a. a coefficient of determination equal to 1.
b. a coefficient of determination equal to -1.
c. a coefficient of correlation of 0.
d. a coefficient of correlation equal to -1.
e. Both “A” and “B” are correct.
ANS: C PTS: 1 OBJ: Section 15.4
14. In publishing the results of some research work, the following values of the
coefficient of determination were listed. Which one would appear to be
incorrect?
a. 0.91
b. 0.06
c. 0.47
d. -0.64
e. 0.00
ANS: D PTS: 1 OBJ: Section 15.4
15. Given the least squares regression line = -2.88 + 1.77x, and a coefficient of
determination of 0.81, the coefficient of correlation is:
a. -0.88.
b. +0.88.
c. +0.90.
d. -0.90.
ANS: C PTS: 1 OBJ: Section 15.4
16. In a regression problem the following pairs of (x, y) are given: (2,1), (2,-1),
(2,0), (2,-2) and (2,2). That indicates that the:
a. coefficient of correlation is -1.
b. coefficient of correlation is 0.
c. coefficient of correlation is 1.
d. coefficient of determination is between -1 and 1.
ANS: B PTS: 1 OBJ: Section 15.4
17. The number of degrees of freedom associated with the standard error of
estimate is
a. n-1 since only the slope is estimated from sample data.
b. n-1 since only the intercept is estimated from sample data.
c. n-1 since only the predicted value of y is estimated from sample data.
d. n-2 since the slope and the intercept are estimated from sample data.
ANS: D PTS: 1 OBJ: Section 15.3
20. The regression line = 3 + 2x has been fitted to the data points (4,8), (2,5),
and (1,2). The residual sum of squares will be:
a. 10.
b. 15.
c. 13.
d. 22.
ANS: D PTS: 1 OBJ: Section 15.4
21. The following values are listed as coefficients of correlation (r). The one that
indicates an inverse relationship between the two variables x and y is:
a. 0.0.
b. -0.8.
c. 0.9.
d. 1.3.
e. -1.4.
ANS: B PTS: 1 OBJ: Section 15.4
22. For a given value of x, the estimation interval for an individual y observation
is called the:
a. confidence interval.
b. residual.
c. prediction interval.
d. least-squares interval.
e. standard error of estimate.
ANS: C PTS: 1 OBJ: Section 15.4
23. The value for SSE equals zero. This means that the coefficient of
determination (r 2 ) must equal:
a. 0.0.
b. -1.0.
c. 2.3.
d. -2.3.
e. 1.0.
ANS: E PTS: 1 OBJ: Section 15.4
24. All of the values of an independent variable equal the same number.
Regressing a dependent variable on this independent variable will result in a
coefficient of determination (r 2 ) of:
a. 0.0.
b. -1.0.
c. 2.3.
d. -2.3.
e. 1.0.
ANS: A PTS: 1 OBJ: Section 15.4
26. For the values of the coefficient of determination listed below, which one
implies the greatest value of SSR (regression sum of squares) given that SST =
500?
a. 0.95
b. -1.00
c. 0.28
d. 0.00
e. -0.88
ANS: A PTS: 1 OBJ: Section 15.4
27. In simple linear regression, the coefficient of correlation r and the least
squares estimate of the population slope :
a. must have the same numerical value.
b. must have opposite signs.
c. must have the same sign.
d. may have opposite signs or the same sign.
ANS: C PTS: 1 OBJ: Section 15.4
28. If all the points in a scatter diagram lie on the least squares regression line,
then the coefficient of correlation:
a. must be 1.0.
b. must be -1.0.
c. must be either 1.0 or -1.0.
d. must be 0.
ANS: C PTS: 1 OBJ: Section 15.4
29. If the sum of squares due to regression (SSR) is 60, which of the following
must be true?
a. The coefficient of correlation is 0.9.
b. The total sum of squares (SST) is at least 60.
c. The y-intercept is positive.
d. The slope, b, is positive.
e. The coefficient of determination is 0.81.
ANS: B PTS: 1 OBJ: Section 15.6
NUMERIC RESPONSE
ANS: 0.90
ANS: 81%
ANS: 75%
COMPLETION
ANS: x, y
ANS: homoscedasticity
ANS: -1; 1
ANS:
0; 1
zero, one
ANS: -1; +1
12. If the coefficient of correlation r is positive, then the dependent variable and
the independent variable x are said to be ____________________ related. However,
if r is negative, then x and y are said to be ____________________ related.
SHORT ANSWER
ANS:
As x increases, y decreases, and vice versa.; The variables x and y increase or
decrease together.; A straight line is best for describing this relationship.; A
nonlinear curve best describes this relationship; There will be at least one x term
in which the exponent is some value other than 1.
ANS:
a) The assumption of equal population variance of the residuals
b) Random error; the difference between the actual and predicted value from
the regression line.
ANS:
4. What does the following scatter diagram tell you about the relationship
between x and y?
ANS:
The relationship between x and y appears to be linear and positive.
Mileage Price
22,600 $15,200
32,100 $12,500
43,800 $13,900
51,800 $9500
59,400 $11,600
65,500 $11,700
74,700 $8900
NARREND
5. Identify the dependent and independent variable.
ANS:
Independent variable: mileage
Dependent variable: price
7. Using the least-squares regression line, predict the selling price of a car with
55,000 miles.
ANS:
, when x = 55,000,
x 5 4 3 6 9 8 10
y 7 8 10 5 2 3 1
Use Minitab or Excel to construct a scatter diagram of the data points and plot
the least squares regression line on it.
ANS:
x 5 4 3 6 9 8 10
y 7 8 10 5 2 3 1
What does the scatter diagram tell you about the relationship between x and y?
ANS:
The relationship between x and y appears to be linear and negative.
NARREND
10. Identify the dependent and independent variables.
ANS:
Dependent variable: Salary
Independent variable: Years of Experience
ANS:
12. Using the least-squares regression line, predict the salary of an employee
with 11 years of experience.
ANS:
x = 11
13. The divisor of the standard error of estimate in simple linear regression is:
ANS:
n–2
NARREND
14. Determine the least-squares regression line.
ANS:
.
ANS:
16. Construct a 95% confidence interval for the mean GPA when GMAT = 6.5.
ANS:
When x = 6.5, . Also t.025,5 = 2.571. Therefore, the 95% confidence interval for
the mean of y when x = 6.5 is
17. Construct a 95% prediction interval for an individual y value when x = 6.5.
ANS:
When x = 6.5, . Also t.025,5 = 2.571. Therefore, the 95% prediction interval for
the mean of y when x = 6.5 is
18. A simple linear regression problem produced the following sum of squares:
ANS:
1 – 75% = 25%
NARRBEGIN: Teacher
A statistics teacher collected the following data to determine if the number of
hours a student studied during the semester could be used to predict the final
grade for the course.
NARREND
19. In testing the hypotheses vs. , what is the value of the test statistic?
ANS:
t = 2.797
ANS:
|t| > 2.447
21. In testing the hypotheses vs. , what is the conclusion at the 0.05 significance
level?
ANS:
Since t = 2.797 > 2.447, we reject the null hypothesis and conclude there is a
relationship between number of hours studying and the final grade.
NARRBEGIN: X and Y
Consider the following data values of variables x and y
x 5 4 3 6 9 8 10
y 7 8 10 5 2 3 1
NARREND
22. Given that when a simple linear regression model applied to the data, it
produced the following residuals:
Apply the Lilliefors test to the residuals at = 0.05. What is your conclusion?
ANS:
The residuals are normally distributed.
The residuals are not normally distributed.
The rejection region is D >; 0.30. Since D = 0.126, we fail to reject the null
hypothesis and conclude that the residuals could have come from a normally
distributed population
ANS:
24. The simple linear regression equation is . what does the normal probability
plot of the residuals (shown below) tell you?:
ANS:
The points in the normal probability plot are approximately in a straight line,
which indicates that the residuals could have come from a normally distributed
population.
x 97 113 81 68 90 79
y 103 103 105 115 127 104
ANS:
The regression equation is
y = 125 – 0.173 x
Predictor Coef SE Coef T P
Constant 124.76 26.38 4.73 0.009
x -0.1734 0.2959 -0.59 0.589
S = 10.4190 R-Sq = 7.9% R-Sq(adj) = 0.0%
Analysis of Variance
Source DF SS MS F P
Regression 1 37.3 37.3 0.34 0.589
Residual Error 4 434.2 108.6
Total 5 471.5
x 97 113 81 68 90 79
y 103 103 105 115 127 104
NARREND
26. Use Excel or Minitab to construct a scatter diagram of the data points.
ANS:
27.
What does the scatter diagram tell you about the relationship between x and y?
ANS:
The relationship between x and y is not linear
ANS:
NARRBEGIN: Household
The following data was collected by a particular company to determine if a
relationship exists between the number of people in a household and weekly
food expenditures.
NARREND
29. Identify the independent and dependent variables.
ANS:
Independent variable: Number in Household
ANS:
ANS:
32. Construct a 95% confidence interval for the mean weekly food expense
when the number in household equals 3.
ANS:
When x = 3, . Also t.025,5 = 2.571. Therefore, the 95% confidence interval for
the mean of y when x = 3 is
33. Construct a 95% prediction interval for the mean weekly food expense when
the number in household equals 3.
ANS:
When x = 3, . Also t.025,5 = 2.571. Therefore, the 95% prediction interval for
the mean of y when x = 3 is
ANS:
r = +0.713. Since the coefficient is positive, the number of people in the
household and the weekly food expenses are directly related. Since the
coefficient is large, the relationship is relatively strong.
ANS:
. This means that 50.8% of the total variation in weekly food expenses is
explained by the regression equation.
36. At the 0.05 level of significance, test the null hypothesis that the population
coefficient of correlation is really zero.
ANS:
If |t| > 2.571, we reject the null hypothesis. Since t = 2.274, we fail to reject the
null hypothesis. We do not have enough evidence to conclude that the
population coefficient of correlation is not zero.
NARREND
37. Determine the least-squares regression line.
ANS:
ANS:
39. In testing the hypotheses vs. , what is the value of the test statistic?
ANS:
t = 3.684
40. In testing the hypotheses vs. , what is the rejection region at the 0.02
significance level?
ANS:
We will reject the null hypothesis if |t| > 3.365. Since t = 3.684, we reject the
null hypothesis and conclude there is a relationship between age and
maintenance costs for this type of car.
ANS:
.
This means that 73.1% of the total variation in maintenance costs is explained
by the age of the car.
PTS: 1 OBJ: Section 15.7
NARRBEGIN: Unemployed
The data below are for the number of unemployed persons (in millions) and the
federal unemployment insurance payments (in billions of dollars) for the years
1978 – 1985. Some economists state that these two variables are positively
related.
Year 1978 1979 1980 1981 1982 1983 1984 1985
Federal Unemployment Insurance Payments 11.8 10.7 18.0 19.7 23.7 31.5 18.4
16.8
Number of Unemployed Persons 6.2 6.1 7.6 8.3 10.7 10.7 8.5 8.3
NARREND
42. Use these data to develop an estimated regression equation.
= ____________________
ANS:
43. Assume that a simple linear regression model is appropriate for these data.
Identify the dependent and independent variables.
ANS:
Federal unemployment insurance payments; Number of unemployed persons
44. Use these data to develop a scatter diagram for these data.
ANS:
45. What does a scatter diagram of these data indicate about the relationship
between these two variables?
ANS:
The number of unemployed persons and the federal unemployment insurance
payments appear to be positively linearly related.
r = ____________________
ANS:
r = 0.93
ANS:
48. Construct a 95% confidence interval for the slope of the population
regression line.
ANS:
49. A 95% confidence interval for the slope of the population regression line is
(2.168, 4.897) Explain how to use the confidence interval to test the hypothesis
that the population slope is zero using a 0.05 level of significance.
ANS:
is not within the 95% confidence interval, we reject the null hypothesis at the
5% level of significance. We can conclude that a linear relationship might exist
between the number of unemployed persons and the federal unemployment
insurance payments.Since
ANS:
= 0 (There is no linear relationship)H0:
0 (A linear relationship exists)H1:
Rejection region: | t | > 2.447. Test statistic: t = 6.3348. Since t = 6.3348 >
2.447, we reject the null hypothesis. A linear relationship exists between these
two variables.
51. At the 0.05 level of significance, test the null hypothesis that the population
slope is zero.
ANS:
= 0 (There is no linear relationship)H0:
0 (A linear relationship exists)H1:
Rejection region: F > 5.99. Test statistic: F = 40.13. We reject the null hypothesis
and infer that a linear relationship exists between these two variables.
52. The general manager of a chain of furniture stores believes that experience
is the most important factor in determining the level of success of a
salesperson. To examine this belief she records last month’s sales (in $1,000s)
and the years of experience of 10 randomly selected salespeople. These data
are listed below.
Salesperson Years of Experience Sales
107
229
3 10 20
4 3 15
5 8 18
6 5 14
7 12 20
8 7 17
9 20 30
10 15 25
ANS:
NARRBEGIN: Salesperson
The general manager of a chain of furniture stores believes that experience is
the most important factor in determining the level of success of a salesperson.
To examine this belief she records last month’s sales (in $1,000s) and the years
of experience of 10 randomly selected salespeople. These data are listed below.
NARREND
53. Determine the least squares regression line.
= _____________________________
ANS:
= 1.0817031Yrs_Exper+8.6300345
ANS:
For each additional year of experience, monthly sales of a salesperson increase
on average by $1,081.70.
= ____________________
Describe what this statistic tells you about the regression line.
ANS:
1.5724; The model’s fit is good.
56. Determine the coefficient of determination and discuss what its value tells
you about the two variables.
= ____________________
Discuss
ANS:
0.9536; This means that 95.36% of the total variation in sales is explained by
the years of experience in the regression model.
r = ____________________
ANS:
0.9765; It has a positive sign since the slope of the regression line is positive.
ANS:
= 0 (There is no linear relationship)H0:
0 (A linear relationship exists)H1:
Rejection region: | t | > 2.306 Test statistic: t = 12.8258. Since t = 12.8258 >
2.306, we reject the null hypothesis. A linear relationship exists between years
of experience and sales.
ANS:
= 0 (There is no linear relationship)H0:
0 (A linear relationship exists)H1:
Rejection region: | t | > 2.306 Test statistic: t = 12.8258. Since t = 12.8258 >
2.306, we reject the null hypothesis. A linear relationship exists between years
of experience and sales.
60. Predict with 95% confidence the monthly sales of a salesperson with 10
years of experience.
95% CI = ________________
ANS:
(15640,23253)
61. Predict with 95% confidence the average monthly sales of all salespersons
with 10 years of experience.
95% CI10 yrs + = ________________
ANS:
(18290,20604)
ANS:
The values are: 8.630, 10.793, 19.447, 11.875, 17.284, 14.039, 21.610, 16.202,
30.264, 24.856
ANS:
The residuals are: -1.630, -1.793, 0.553, 3.125, 0.716, -0.039, -1.610, 0.798,
-0.264, 0.144
64. If the regression equation is , plot the residuals against the predicted values
of y. Does the variance appear to be constant?
ANS:
ANS:
The standardized residuals are: -1.100, -1.210, 0.373, 2.108, 0.483, -0.026,
-1.086, 0.538, -0.178, 0.097. No outliers exist.
PTS: 1 OBJ: Section 15.7
ESSAY
x32545
y 8 6 12 10 14
Using the least squares method, which of these regression models provide the
better fit to the data? Why?
ANS:
SSE = 4.95 and 593.25 for models 1 and 2, respectively. Therefore, model 1 is
better than model 2.
2. Which assumptions are required in using the simple linear regression model ?
ANS:
a) For any given value of x, the y values are normally distributed with a mean
that is on the regression line. b) Regardless of the value of x, the standard
deviation of the distribution of y values about the regression line is the same;
this is the assumption of homoscedasticity. c) The y values are statistically
independent of each other.
3. In the sample regression line, what does the cap (^) over the y indicate?
ANS:
The (^) indicates that is the estimated value of the dependent variable y for a
given value x; that is, is an estimate of the (unknown) “true” value of y.
ANS:
The least squares criterion is one method used to determine the “best fitting”
line. The criterion requires that the sum of the squared deviations between the
actual y values and values predicted by the regression line be minimized.
(x = 2, y = 5),
(x = 4, y = 12),
(x = 6, y = 20),
(x = 8, y = 28), and
(x = 10, y = 30).
(1) = -.6 + 3x
(2) = -1 + 5 x
Using the least squares criterion, which of these regression lines is the better fit
to the data? Why?
ANS:
The residual sum of squares, for the two regression proposed lines (1) and (2)
are 28.8 and 628, respectively. Using the least squares criterion, the first line fits
the data much better.
ANS:
A confidence interval provides a range of possible values for the mean value of y
given a specific value of x. The point estimate for the mean will fall on the
regression line. A prediction interval provides a range of possible values for an
individual observation of y given a value of x.
7. Is it possible for the standard error of the estimate to be equal to zero? If so,
under what circumstances?
ANS:
The standard error of the estimate will be zero only when every data point falls
on the regression line, that is, when the sum of the squared residual terms is
also zero.
8. What is the standard error of estimate, and what role does it play in simple
linear regression and correlation analysis?
ANS:
The standard error of the estimate is a measure which describes the dispersion
of the data points above and below the regression line. The standard error may
be used as a measure of how well the regression line fits the data. In addition,
the standard error is used in calculating confidence intervals for the mean value
of y given a specific value of x, and in calculating the prediction interval for an
individual y observation.
ANS:
The prediction interval for y gets wider as the x value on which the interval
estimate is based gets farther away from the mean of x because there is less
error in making interval estimates based on x values that are closer to the
mean. This can be seen in the formula; the numerator of the fraction under the
square root includes This value, of course, increases as the x value moves away
from the mean.
10. For a given value of x, which will be wider, a 90% confidence interval or a
90% prediction interval? Explain
ANS:
For a given value of s, the prediction interval will always be wider than the
confidence interval. The confidence interval is concerned with the mean value of
y only; the prediction interval is concerned with all of the possible values of y.
ANS:
The coefficient of correlation (r) describes both the direction and the strength of
the linear relationship between two variables. The coefficient of determination
expresses the proportion of the variation in the dependent variable (y) that is
explained by the regression line, but it does not indicate the direction of the
relationship.
ANS:
The coefficient of determination is 0.64 (or the coefficient of correlation
squared). This means that 64% of the variation in y and is explained by x.
13. The values of y and x are inversely related, and 84.64% of the variation in y
is explained by the regression equation. What is the coefficient of correlation?
ANS:
Since 84.64% of the variation in y is explained by x .8464 is the coefficient of
determination. The square root of the coefficient of determination, with the
appropriate sign, is equal to the coefficient of correlation. Therefore, the
coefficient of correlation is -.92. We know this is negative because y and x are
inversely related.
14. For a set of data, the total variation or sum of squares for y is SST =180, and
the error sum of squares SSE =54. What proportion of the variation in y is
explained by the regression equation?
ANS:
The coefficient of determination = = 1- 0.30 = 0.70. This means that 70% of the
variation in y is explained by x.
15. In regression and correlation analysis, what are the two components of the
total amount of variation in the y values? Define the coefficient of determination
in terms of these components.
ANS:
Variation explained by the regression line (SSR) and variation not explained by
the regression line (SSE) are the two components of the total variation y (SST).
The coefficient of determination can be expressed as:
ANS:
(There is no linear relationship)
17. For n = 16 data points, At the 0.05 level of significance, can we conclude
that the true coefficient of correlation could be zero?
ANS:
This calls for the following hypothesis test: (there is no linear relationship) and
(there is a linear relationship). This is a two-tailed t-test with and d.f.=16-2=14.
The critical values of t are -2.145 and +2.145. The calculated value of the test
statistic is:
= 7.726.
Since the calculated test statistic falls outside the critical values, we reject There
is evidence to suggest that a linear relationship exists between x and y.
18. Testing the null hypothesis that the slope of the true regression line equals
zero is equivalent to testing whether the true coefficient of correlation could be
zero. Why?
ANS:
If the slope of the true regression line is zero, then the true coefficient of
correlation is also zero. Both of these cases imply no linear relationship between
the dependent variable (y) and the independent variable (x).
19. Based on sample data, the 90% confidence interval for the slope of the
population regression line is found to be from -2 to 1. Based on this information,
what is the most accurate statement than can be made about the p-value in
testing versus
ANS:
At the 0.10 level of significance, we would not reject the null hypothesis that the
true slope is equal to zero, since the 90% confidence interval contains 0. The p-
value must be greater than 1-0.90; that is, it must be greater than 0.10.
20. For n = 37 data points, r2 = 0.49. At the 0.02 level of significance, can we
conclude that the true coefficient of correlation could be zero?
ANS:
This calls for the following hypothesis test:
(there is no linear relationship) and (there is a linear relationship).
This is a two-tail t-test with and d.f.=37-2-35. The critical values of t are -2.438
and +2.438. The calculated value of the test statistic is:
Since the calculated test statistic falls outside the critical values, we reject There
is no evidence to suggest that a linear relationship exists between x and y.
ANS:
Residual analysis refers to a broad class of tests which can be performed using
the error terms from a regression line to test the assumptions of regression
analysis. Residual analysis can test the assumption of normal distribution of the
error terms, the assumption of equal standard deviations of the error terms, and
the assumption that the error terms are independent from one another. In
residual analysis, we may also “flag” error values that are especially large.
ANS:
Spurious correlation occurs whenever two variables are highly correlated but
there is little evidence to suggest any causation. For instance, the closing of the
Dow Jones Industrial Average (DJIA) could be highly correlated with the winner of
the Super Bowl. (When the NFC wins it has historically tended to close higher for
the year, and when the AFC wins it has tended to close lower for the year).
However, there is no evidence to suggest that the Super Bowl winning
conference causes the DJIA to move up or down. Likewise, there is no evidence
to suggest that the movement in the DJIA causes one conference or the other to
win the Super Bowl.
23. What is the Lilliefors test, and how can it contribute to regression and
correlation analysis?
ANS:
The Lilliefors test, introduced in Chapter 14, is used to examine whether the
residuals could have come from a normally distributed population. Recall that
one of the assumptions of the regression model is whether the residuals are
from a normal population with a mean of zero.
24. A firm finds the coefficient of correlation between y = annual sales and x =
annual expenditure on research and development to r = +.095. Comment on
the likely direction of causation, if any, between these variables.
ANS:
Correlation does not necessarily imply causation of any kind. Perhaps sales have
increased as a result of better products which came from more money invested
in research and development. On the other hand, perhaps the R&D budget is set
on past or anticipated sales. It is also very possible that both of these variables
have simply increased over time.
TRUE/FALSE
1. True or False
Multiple regression analysis examines the linear relationship between a
dependent variable (y) and two or more independent variables (x1, x2, and so
on).
2. True or False
Multicollinearity is a situation in which two or more of the independent variables
are highly correlated with each other.
3. True or False
For each y term in the multiple regression equation, the corresponding is
referred to as the partial regression coefficient.
4. True or False
In interpreting the multiple regression equation, it can be a mistake to conclude
that one independent variable is more important than another just because its
partial regression coefficient happens to be large.
5. True or False
It is a good idea to make point estimates based on x values that lie beyond the
range of the underlying data.
6. True or False
The purpose of the multiple correlation analysis is to measure the strength of
the relationship between the dependent (y) and the set of independent (x)
variables.
ANS: T PTS: 1 OBJ: Section 16.4
7. True or False
A dummy variable is used to incorporate qualitative data into the analysis.
MULTIPLE CHOICE
7. For the multiple regression model = 3 – 4×1 + 5×2 + 2×3, a unit increase in
x1, holding x2 and x3 constant, results in:
a. an increase of 4 units in the value of y.
b. a decrease of 4 units in the value of y.
c. a decrease of 4 units, on average, in the value of y.
d. an increase of 6 units in the value of y.
ANS: C PTS: 1 OBJ: Section 16.2
11. In a multiple regression model, the following statistics are given: SSE = 100,
R2 = 0.995, k = 5, n = 15. The multiple coefficient of determination adjusted for
degrees of freedom is:
a. 0.955.
b. 0.992.
c. 0.900.
d. 0.855.
ANS: B PTS: 1 OBJ: Section 16.4
18. In testing the significance of a multiple regression model in which there are
three independent variables, the null hypothesis is:
a. H0: .
b. H0: .
c. H0: .
d. H0: .
ANS: C PTS: 1 OBJ: Section 16.5
NUMERIC RESPONSE
ANS: 43.46
ANS: 25.05
PTS: 1 OBJ: Section 16.2
ANS: 19.05
ANS: 1410
SST = ____________________
ANS: 1258.5
6. For a multiple regression model the following statistics are given: SSE = 40,
SST = 200, k = 4, n = 20. Calculate the coefficient of determination adjusted for
degrees of freedom.
ANS: 0.747
= 60 + 2.8×1 + 1.2×2 – x3. For this model, SST = 600 and SSE = 150. Calculate
the value of the F statistic for testing the significance of this model.
F = ____________________
ANS: 21.0
NARRBEGIN: States
Concern over the number of car thefts grew into a project to determine the
relationship between car thefts by state and these variables:
Data from 13 states were collected. The MINITAB regression results are:
NARREND
8. Compute the multiple standard error of estimate (se) from the regression
results.
ANS: 10.14889
ANS: 0.9727
ANS: 5.305
11. Use the values in the analysis of variance table to compute R2 using the
values for SST and SSE or SSR.
ANS: 0.982
COMPLETION
ANS: hyperplane
ANS:
one; zero
1, 0
ANS:
The weight loss of a female compared to a male is 6 more pounds.
ANS:
yi = a value of the dependent variable, y
= a constant
xi… = values of the independent variables
…. = partial regression coefficients for independent variables
= random error, or residual
y-intercept: ____________________
x1: ____________________
x2: ____________________
x3: ____________________
ANS:
80; 15; -5; 100
ANS:
Increase; 100
ANS:
NARRBEGIN: Equation
The regression equation, = 4 + 1.5×1 + 2.5×2 has been fitted to 25 data
points. The means of x1 and x2 are 30 and 46, respectively. The sum of the
squared differences between observed and predicted values of y has been
calculated as SSE = 175, and the sum of the squared differences between y
values and mean of y is SST = 525.
NARREND
6. Determine the multiple standard error of estimate.
ANS:
7. What is the approximate 95% confidence interval for the mean of y whenever
x1 = 20 and x2 = 25.
ANS:
ANS:
ANS:
NARREND
10. Calculate the residual sum of squares.
ANS:
SSE = 0.148.
ANS:
SST = 0.634
ANS:
NARRBEGIN: Grade
A statistics teacher collected the following data to determine if the number of
hours a student studied during the semester and the number of classes missed
could be used to predict the final grade for the course.The following table shows
the results of the model being applied to 8 students.
ANS:
ANS:
SSE = 139
ANS:
SST = 322.9
ANS:
NARRBEGIN: Salary
Data was collected from 40 employees to develop a regression model to predict
the employee’s annual salary using their years with the company (Years), their
starting salary (Starting), and their Gender (Male = 0, Female = 1). The results
from Excel regression analysis are shown below:
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.718714957
R Square 0.516551189
Adjusted R Square 0.476263788
Standard Error 10615.63461
Observations 40
ANOVA
df SS MS F Significance F
Regression 3 4334682510 1444894170 12.82165585 7.48476E-06
Residual 36 4056901131 112691698.1
Total 39 8391583641
NARREND
17. What is the regression equation?
ANS:
18. In testing the null hypothesis that the regression equation is not significant
at the 0.05 level, what is the appropriate conclusion?
ANS:
Reject H0
ANS:
Since the p-value equals 0.00038, which is less than 0.05, we reject Ho. The
Years variable is statistically significant.
ANS:
Since the p-value equals 0.0417, which is less than 0.05, we reject Ho. The
Starting variable is statistically significant.
ANS:
Since the p-value equals 0.562, which is more than 0.05, we fail to reject Ho.
The Gender variable is not statistically significant.
22. For a male employee with 5 years of experience and a starting salary of
$30,000, what is the approximate 95% confidence interval for his annual salary?
ANS:
23. The computer output for the multiple regression model is shown below.
However, because of a printer malfunction some of the results are not shown.
These are indicated by the boldface letters a to i. Fill in the missing results (up to
three decimal places).
a = ____________________
b = ____________________
c = ____________________
d = ____________________
e = ____________________
f = ____________________
g = ____________________
h = ____________________
i = ____________________
ANS:
25.277; 2.808; -2.367; 3.897; 0.423; 562; 206; 15.189; 13.562
NARRBEGIN: States
Concern over the number of car thefts grew into a project to determine the
relationship between car thefts by state and these variables:
Data from 13 states were collected. The MINITAB regression results are:
NARREND
24. Do the partial regression coefficients have the algebraic sign you might
expect?
ANS:
The sign for b1 is questionable since it implies the more police per 10,000
persons, the more car thefts taking place. The more spent for police protection
coefficient, b2, is questionable since it is positive; thus implying the more spent
the more car thefts. The sign for b3 is expected since the more new cars the
more to steal.
25. Perform a test for each partial regression coefficient using a 0.05
significance level. Results:
Conclusion: _________________________________________________________
ANS:
Coef t-ratio p-values
1.2831 1.38 0.200
0.018827 2.23 0.053
0.09686 2.74 0.023
ANS:
Multicollinearity is correlation between independent variables. Expenditures for
police and registration are highly correlated, indicating a multicollinearity
problem.
27. Test the significance of the regression equation at the 0.01 level of
significance.
Conclusion: ____________________
ANS:
F = 107.14; F critical = 6.99; Reject H0
NARRBEGIN: Nutritionist
A nutritionist is analyzing the cost of an 8 oz. serving of pasta. The nutritionist
anticipates that cost is related to:
x1 = Grams of protein/8 oz.
x2 = Grams of carbohydrates/8 oz.
x3 = Grams of fat/8 oz.
ANS:
(1.299,1.363)
29. From these regression results, compute a 95% prediction interval for y when
x1 = 4, x2 = 5, and x3 = 3.
ANS:
(1.220,1.442)
3.2, and 1, 30. From these regression results, compute a 95% confidence
interval for
ANS:
(0.00259,0.03303); (-0.02954,-0.02211); (-0.00691,0.005907)
Conclusion: ____________________
ANS:
F = 104.67; F critical = 7.59; Reject H0
PTS: 1 OBJ: Section 16.6
b0 = ____________________
b1 = ____________________
b3 = ____________________
ANS:
50.168; 4.3609; -0.0682
ANS:
0.85; It means that 85% of the variation in knowledge about compact disc
players is explained by the variation in education, age and income variables.
ANS:
= 78.301
MSR = ____________________
MSE = ____________________
ANS:
10874.66; 28.14285
NARRBEGIN: Professor
A statistics professor investigated some of the factors that affect an individual
student’s final grade in his course. He proposed the multiple regression model
where:
The professor recorded the data for 50 randomly selected students. The
computer output is shown below.
NARREND
36. Do these data provide enough evidence to conclude at the 5% significance
level that the model is useful in predicting the final mark?
Conclusion: ____________________
ANS:
F = 6.558; F critical = 2.84; Reject H0
Conclusion: ____________________
ANS:
t = -1.916; t critical = -2.013; Do not reject H0
ANS:
t = -1.035; t critical = -1.679; Do not reject H0
Conclusion: ____________________
ANS:
t = 4.846; t critical = 2.410; Reject H0
ANS:
0.30; This means that 30% of the variation in the student’s final grade in
statistics is explained by the three variables: number of lectures skipped,
number of late assignments, and mid-term test grade, while 70% remains
unexplained.
b1 = ____________________
Interpretation: _____________________________________________________
b3 = ____________________
Interpretation: _____________________________________________________
ANS:
b1 = -3.18; This tells us that for each additional lecture skipped, the student’s
final score on average decreases by 3.18 points (assuming that the other
independent variables in the model are held constant).;
b2 = 0.63; This tells us that for each additional mid-term test score point, the
student’s final score on average increases by .63 points (assuming that the
other independent variables in the model are held constant).
ESSAY
ANS:
The inclusion of more than one independent variable.
ANS:
Multiple regression analysis is used in determining and interpreting the linear
relationship between a single y and two or more x’s, while correlation analysis
measures the strength of the relationship.
ANS:
Multiple regression analysis is preferred whenever two or more independent
variables have impact upon the single dependent variable.
ANS:
Many variables could affect the annual household expenditure for appliances
maintenance and repair: the number of appliances owned, the age(s) of the
appliance(s), the make(s) of the appliance(s). These are just a few of the many
variables that could have a notable effect.
ANS:
In terms of the residual component of the model, the assumptions underlying
multiple regression are: a) For any given set of values for the independent
variables, the population of residuals will be normally distributed with a mean of
zero and a standard deviation of . b) The standard deviation of the error terms is
the same regardless of the combination of values taken on by the independent
variables. c) The error terms are statistically independent from each other.
ANS:
When there are two independent variables, the regression equation can be
thought of in terms of a geometric plane. When there are three or more
independent variables, the regression equation becomes a mathematical entity
called a hyperplane. It is impossible to visually summarize a regression with
three or more independent variables because it will be in four or more
dimensions.
ANS:
The numerical value of se reflects the amount of scatter, or dispersion, of data
points about the plane (or hyperplane) represented by the multiple regression
equation.
ANS:
The multiple standard error of the estimate is an indication of the amount of
scatter of the data points about the regression surface. It is analogous to the
standard error of the estimate for simple regression.
ANS:
The coefficient of multiple determination (R2) is analogous to the coefficient of
determination in simple linear regression. It is the proportion of variation in y
that is explained by the multiple regression equation.
10. Some computer packages report an adjusted R2 along with the unadjusted
R2. In general, why is the adjusted R2 smaller, and under what conditions will it
approach the size of the unadjusted R2?
ANS:
The R2 has been adjusted for the degrees of freedom. If the number of variables
is large relative to the number of observations, R2 will exaggerate the strength
of the linear relationship. The adjusted R2 will get closer to R2 as sample sizes
become larger relative to the number of variables.
PTS: 1 OBJ: Section 16.4
11. Among the results of a multiple regression analysis are the following sum-of-
squares terms: SST, SSR, and SSE. What does each term represent, and how do
the terms contribute to our understanding of the relationship between y and the
set of independent variables.
ANS:
SST is the total variation in the y values, SSR is the variation in the y values that
is explained by the regression, and SSE is the variation in the y values that is not
explained by the regression. The coefficient of multiple determination is equal to
1 – (SSE/SST), or SSR/SST. If SSE is small compared to SST, SSR will be large
compared to SST, and the multiple regression equation will explain a large
portion of the variation in y. Recall that SST = SSR + SSE.
(A) the results of the hypothesis test examining whether b2 differs significantly
from zero at the 0.05 level and
ANS:
Both of these tests will reach the same conclusion. If the confidence interval for
does not include zero, the hypothesis test will reject the null hypothesis. On the
other hand, if the confidence interval for does contain zero, the hypothesis test
will not reject the null hypothesis.
ANS:
An ANOVA test is used to evaluate the overall significance of a multiple
regression equation. If the F calculated in the ANOVA exceeds the critical F, the
null hypothesis of no significance is rejected. Individual t-tests are used to
evaluate the significance of the individual partial regression coefficients.
14. What role does the normal probability plot play in examining whether the
multiple regression model is appropriate for a given set of data?
ANS:
The normal probability plot is used to examine whether the residuals could have
come from a normally distributed population. One of the assumptions underlying
multiple regression analysis is that the residuals are normally distributed with a
mean of zero.
15. Discuss residual analysis in terms of the assumptions required for using the
multiple regression model.
ANS:
Residual analysis can be used to examine the residuals with respect to the
assumptions underlying multiple regression analysis. We can do many things
with residual analysis, including: (1) constructing a histogram of the residuals as
a rough check to see if they are approximately normally distributed, (2)
constructing a normal probability plot or applying the Lilliefors test to examine
whether the residuals could have come from a normally-distributed population,
(3) plotting the residuals versus each of the independent variables to see if they
exhibit some cycle or pattern with respect to that variable, and (4) plotting the
residuals versus the order in which the observations were recorded to look for
auto-correlation.
16. What is a dummy variable, and how is it useful to multiple regression? Give
an example of three dummy variables that could be used in describing your
home town.
ANS:
A dummy variable is a variable that takes on a value of one or zero to indicate
the presence or absence of an attribute. Dummy variables can help explain
some of the variation in y due to the presence or absence of a characteristic.
Three dummy variables that can be used to describe one town versus another
are URBAN (1 if urban, 0 otherwise), MANUF (1 if durable goods manufacturing is
the major industry, 0 otherwise), and POPMIL (1 if the population is 1 million or
more, 0 otherwise). Other dummy variables could include the presence of a
major university, a major medical center, a major research institution, and many
more.
ANS:
A situation in which two or more of the independent variables are highly
correlated with each other.
18. A multiple regression equation has been developed for y = daily attendance
at a community swimming pool, x1 = temperature (degrees Fahrenheit), and x2
= weekend versus weekday, (x2 = 1 for Saturday and Sunday, and 0 for other
days of the week.) For the regression equation shown below, interpret each
partial regression coefficient: = 100 + 10×1 + 175×2.
ANS:
The partial regression coefficient for x1 implies that, holding the day of the week
constant, a one degree Fahrenheit increase in the temperature will result in an
increase of 10 in attendance. The partial regression coefficient for x2 implies
that the attendance increases by 175 people on Saturdays and Sundays
(assuming a constant temperature).
TRUE/FALSE
1. True or False
For the second-order polynomial model , the shape of the curve will be
determined by the relative sizes of and and on their signs.
2. True or False
The second-order polynomial model allows the plotted line to reverse the
direction in which it is curving.
3. True or False
A qualitative variable such as gender can be included in regression analysis and
is referred to as a dummy variable.
4. True or False
Interaction between two predictor variables is present when the effect on E(y) of
a 1-unit increase in does not depend on the value of , and vice versa.
5. True or False
The first-order polynomial model is the same as the simple linear regression
model.
6. True or False
The coefficient of determination R2 for the third-order polynomial model is
always larger than R2 for the second-order polynomial model.
8. True or False
The exponential model = b0b1x can be converted to a linear model by taking
the logarithms of both sides of the equation (either natural or common
logarithms can be used as long as we are consistent).
9. True or False
Data transformation may include converting x to its square root or inverse.
MULTIPLE CHOICE
1. Consider the model described by the equation . The slope of the relationship
between and
a. is constant for all values of .
b. depends on the value of .
c. depends on the value of .
d. is constant for all values of .
ANS: B PTS: 1 OBJ: Section 17.3
9. In a first-order model with two predictors x1 and x2, an interaction term may
be used when:
a. the effect of x1 on the dependent variable is influenced by x2.
b. the effect of x2 on the dependent variable is influenced by x1.
c. the relationship between the dependent variable and the independent
variables is linear.
d. Both A and B are correct.
ANS: D PTS: 1 OBJ: Section 17.3
10. Consider the first-order regression model = 15 + 6×1 + 5×2 + 4x1x2. A unit
increase in x1 increases the value of y on average by:
a. 5.
b. 30.
c. 26.
d. an amount that depends on the value of x2.
ANS: D PTS: 1 OBJ: Section 17.3
11. Consider the model described by the equation . Which of the following
coefficients determine the initial slope of the line for small positive values for x.
a.
b.
c.
d.
ANS: B PTS: 1 OBJ: Section 17.2
12. Consider the first-order model = 70 + 35×1 – 10×2 + 5x1x2. A unit increase
in x2, while holding x1 constant at 1, changes the value of E(y) by:
a. -10.
b. -5.
c. 10.
d. 5.
ANS: B PTS: 1 OBJ: Section 17.3
13. Consider the model described by the equation . Which term provides a linear
component to the model?
a.
b.
c.
d.
ANS: B PTS: 1 OBJ: Section 17.2
14. Consider the model described by the equation . Which term provides a
nonlinear component to the model allowing the line to curve?
a.
b.
c.
d.
ANS: C PTS: 1 OBJ: Section 17.2
15. Consider the model described by the equation . Which term provides a
nonlinear component to the model enabling the line to reverse the direction of
curvature?
a.
b.
c.
d.
ANS: D PTS: 1 OBJ: Section 17.2
19. In explaining the income earned by college graduates, which of the following
independent variables is best represented by a dummy variable?
a. Grade point average
b. Age
c. Number of years since graduating from high school
d. College major
ANS: D PTS: 1 OBJ: Section 17.4
21. In explaining the amount of money spent on children’s toys each year, which
of the following independent variables is best represented with a dummy
variable?
a. Age
b. Height
c. Gender
d. Weight
ANS: C PTS: 1 OBJ: Section 17.4
26. In forward selection stepwise regression with ten predictor variables, which
of the following statements is correct?
a. The procedure starts with 10 predictor variables and eliminate one predictor
at each iteration.
b. The procedure starts with 2 predictor variables and add one predictor at each
iteration.
c. The procedure starts with 1 predictor variable and add one predictor at each
iteration.
d. The procedure starts with no predictor variables and add one predictor at
each iteration.
ANS: D PTS: 1 OBJ: Section 17.7
28. Which of the following statements are important steps in selecting a model?
a. Specify the dependent variable and determine the goal for the analysis.
b. Specify possible predictor variables and collect the data.
c. Specify the candidate models, and for each candidate model decide which of
the available predictor variables will be included in the model.
d. Select a model, evaluate its suitability and test its reliability.
e. All of these.
ANS: E PTS: 1 OBJ: Section 17.8
NUMERIC RESPONSE
ANS: 4.60
____________________ years
ANS: 67.5
____________________ years
ANS: 69.5
COMPLETION
1. If the goal for the regression analysis is simply to predict the value of the
dependent variable based on a set of predictor variables, we need not worry
about ____________________, which means that some of the predictor variables
might be highly correlated with each other.
ANS: multicollinearity
ANS: logarithmic
ANS:
p
ANS:
It represents the intercept, which is the value of the predicted y when x is zero.
What would be the signs of the partial regression coefficient(s) in the model?
ANS:
Third-order; The estimation equation fit to the data would be A curvature such
as that described for the experimental lubricant could occur if b1 and b3 < 0,
and b2 > 0.
4. The regression model = 160 + 8×1 – 5×2 has been fitted to a set of data. If
x2 = 10, what will be the effect on if x1 increases by 1 unit?
ANS:
increase; 8
ANS:
increase; 8
6. The regression model = 160 + 8×1 – 5×2 has been fitted to a set of data.
Comment on whether interaction would appear to exist between x1 and x2?
Interaction: ____________________
Comments:
ANS:
Interaction does not appear to exist. There is no interaction between the two
predictor variables since the effect on by a one unit increase in x1 does not
depend on the value of x2.
NARRBEGIN: Economist
An economist is in the process of developing a model to predict the price of
gold. She believes that the two most important variables are the price of a barrel
of oil (x1) and the interest rate (x2). She proposes the first-order model with
interaction
Interpretation: ______________________________________________________
ANS:
F; 6.626; 3.24; Reject H0; The model is useful in predicting the price of gold
Conclusion: ____________________
Interpretation: _______________________________________________________
ANS:
t; 3.141; 2.921; Reject H0; The price of a barrel of oil and the price of gold are
linearly related
Conclusion: _____________________
Interpretation: _____________________________________________________
ANS:
t; 2.333; 2.921; Do not reject H0; The interest rate and the price of gold are NOT
linearly related
Conclusion: _______________________
Interpretation: __________________________________________________
ANS:
t; -2.615; 2.921; Do not reject H0; The interaction term should NOT be retained
b1 = ____________________
Interpretation:
ANS:
22.3; As the price of a barrel of oil increases by one unit (holding the interest
rate constant), the price of gold on average increases 22.3 units.
ANS:
The most suitable model would be a first-order model with two predictor
variables.
13. The regression model = 180 + 6×1 – 2×2 has been fitted to a set of data. If
x2 = 10, what will be the effect on if x1 increases by 1 unit?
14. The regression model = 180 + 6×1 – 2×2 has been fitted to a set of data. If
x2 = 20, what will be the effect on if x1 increases by 1 unit?
ANS:
increase; 6
15. The regression model = 180 + 6×1 – 2×2 has been fitted to a set of data.
Comment on whether interaction appears to exist between x1 and x2.
Interaction: _________________________________
Comments:
ANS:
Interaction does not appear to exist. Regardless of the value of x2, a 1-unit
increase in x1 will increase by 6 units. In this model, there is no interaction
between x1and x2.
16. The estimated regression equation for a sample of 500 college professors is
given by: = 275 – 3x – 2D, where y is retirement age, x is pre-retirement annual
income (in $1000s), and D is a dummy variable that takes the value of 0 for
female professors and 1 for male professors. Assume that there is a relationship
between y, x and D. How does the average age at retirement for male
professors change for each additional thousand dollars of pre-retirement
income?
= ____________________
ANS:
0.5623*1.4125x
18. Regression analysis results in the estimation equation log = 3.0 + 0.50 log
x1 + 0.35 log x2 following data transformation. Transform this equation to the
equivalent exponential model of the form = . (The base of the logarithm is 10.)
= ____________________
ANS:
1000*x10.50*x20.35
19. Three predictor variables are being considered for use in a linear regression
model. Given the correlation matrix below, does it appear that multicollinearity
could be a problem?
x1 x2 x3
x1 1.000 . .
x2 0.025 1.000 .
x3 0.968 0.897 1.000
Comment:
ANS:
Yes; It appears that multicollinearity could be a problem because x3 is highly
correlated with both x1 and x2.
20. The two largest values in a correlation matrix are the 0.89 correlation
between y and x3 and the 0.83 correlation between y and x7. During a stepwise
regression analysis is the first independent variable brought into the equation.
Will x7 necessarily be next? If not, why not?
ANS:
Predictor variable x7 will not necessarily be the next variable brought into the
equation. We do not know about the correlation between x3 and x7, so we
cannot determine whether x7 will explain the greatest amount of the remaining
variation in y.
ESSAY
ANS:
The scatter diagram indicates that a linear model is not a very good fit to the
data.
ANS:
. Due to its complexity, this model would not be very practical for use with most
business data.
ANS:
The second-order polynomial model , and the form of the estimation equation it
to the data would be . Assuming nonnegative values for both x and y, b1 < 0
and b2 < 0 would result in such a curvature.
ANS:
The second-order polynomial model and the form of the estimation equation fit
to the data would be Assuming nonnegative values for both x and y, b1 > 0 and
b2 < 0 would result in such a curvature.
ANS:
Although we have three types of credit cards (Visa, MasterCard, and other), this
information can be represented by only two qualitative variables. The equation
will have this form: Dollars = b0 + b1*Minutes + b2*Visa + b3*MasterCard. The
coding of the two qualitative variables: Visa=1 if customer used Visa card,
otherwise=0; and MasterCard=1 if customer used MasterCard, otherwise=0. For
example, the data for a customer who spent $50 during a 10-minute visit to the
site and used Master Card could be listed as 50, 10, 0, and 1. If the same
customer had used a card other than Visa or MasterCard, her data could have
been listed as 50, 10, 0, and 0.
6. What is a correlation matrix, and what role does it play in multiple regression?
ANS:
A correlation matrix is a matrix that shows the correlation of each variable in a
multiple regression analysis with each of the other variables. A high correlation
between two independent variables is an indication of multicollinearity.
7. What is multicollinearity?
ANS:
Multicollinearity is a condition which two or more of the independent variables
are highly correlated with each other.
ANS:
There are several clues to the presence of multicollinearity: (1) an independent
variable known to be an important predictor ends up having a partial regression
coefficient that is not significant; (2) a partial regression coefficient exhibits the
wrong sign; and/or, (3) when an independent variable is added or deleted, the
partial regression coefficients for the other variables change dramatically. A
more practical way to identify multicollinearity is through the examination of a
correlation matrix, which is a matrix that shows the correlation of each variable
with each of the other variables. A high correlation between two independent
variables is an indication of multicollinearity.
ANS:
Stepwise regression is a multiple regression estimation technique whereby
independent variables are added to the regression equation one at a time. The
first x variable to enter the regression is the one that explains the greatest
amount of variation in y. The second variable to enter is the one that explains
the greatest amount of the remaining variation. The use of stepwise regression
can reduce the possibility for multicollinearity since it is unlikely that two highly
correlated x variables will be included in a multiple regression that is estimated
using the stepwise technique. This technique is useful when there are a great
many independent variables.
10. In general, on what basis are independent variables selected for entry into
the equation during stepwise regression?
ANS:
Independent variables are selected for entry into the equation during stepwise
regression based upon the amount of the remaining variation in y (the variation
that has not already been explained by included variables) that a candidate
variable can explain.
ANS:
The model can be tested by applying it to new data as time goes by. Another
possibility is to build the model on the basis of a portion of the available data,
then test its performance on the “fresh” data that were initially withheld from
the model.
ANS:
One way to identify multicollinearity is to use a computer to generate a
correlation matrix that shows the correlation of each independent variable with
each of the other independent variables.
ANS:
Stepwise regression is a multiple regression estimation technique whereby the
independent variables are added to the regression equation one at a time. The
first x variable to enter is the one that explains the greatest amount of variation
in y. The second x variable to enter will be the one that explains the greatest
amount of the remaining variation in y, and so on.
ANS:
The use of stepwise regression can reduce the possibility of multicollinearity
since it is unlikely that two highly correlated x variables will be included in a
multiple regression that is estimated using the stepwise technique. This
technique is especially useful when there are a great many independent
variables.
ANS:
The general idea of stepwise regression is the balancing act of trying to (a)
explain the most possible variation in the dependent variable y, while (b) using
the fewest possible number of the independent variables x.
TRUE/FALSE
1. True or False
A time series may consist of weekly production output, monthly sales, annual
rainfall, or any other variable for which the value is observed or repeated at
regular intervals.
2. True or False
By analyzing a time series, we can identify patterns and tendencies that help
explain variation in past sales, shipments, rainfall, or any other variable of
interest.
3. True or False
A trend component in a time series model must be expressed as a linear
relationship over time using a linear trend equation.
5. True or False
In general, the most important component of most time series is the irregular,
which can be examined by using regression techniques.
6. True or False
The Consumer Price Index (CPI) describes the change in prices from one time
period to another for a fixed “market basket” of goods and services.
7. True or False
The trend component of a time series can be examined (1) by using regression
techniques to fit a trend line to the data, or (2) by using smoothing techniques
to moderate the peaks and valleys within the series.
8. True or False
The smoothing techniques, such as moving average or exponential smoothing,
function much like the shock absorbers of an automobile; damping the sudden
upward and downward “jolts” that occur over the series.
9. True or False
If you are not using a computer, it is desirable to select an even number of
periods for a centered moving average unless there is some compelling reason
to do otherwise.
MULTIPLE CHOICE
2. The classical time series model combines the various components in which of
the following ways?
a.
b.
c.
d.
e. None of these.
ANS: A PTS: 1 OBJ: Section 18.2
3. In general, the most important component of most time series is the
a. trend.
b. cyclical.
c. seasonal.
d. irregular.
ANS: A PTS: 1 OBJ: Section 18.2
4. The upsurge in school supply sales in the fall of each year is an example of
the:
a. irregular component.
b. trend component.
c. seasonal component.
d. cyclical component.
ANS: C PTS: 1 OBJ: Section 18.2
5. The effect that business recessions and prosperity have on time series values
is an example of the:
a. irregular component.
b. trend component.
c. seasonal component.
d. cyclical component.
ANS: D PTS: 1 OBJ: Section 18.2
8. The component of a time series that has repeating patterns that have a
period longer than one year is the:
a. irregular component.
b. trend component.
c. seasonal component.
d. cyclical component.
ANS: D PTS: 1 OBJ: Section 18.2
10. The time series component that reflects variability over short repetitive time
periods and has duration of less than one year is called:
a. trend.
b. cyclical.
c. seasonal.
d. random variation.
ANS: C PTS: 1 OBJ: Section 18.2
11. The time series component that reflects the irregular changes in a time
series that are not caused by any other component, and tends to hide the
existence of the other more predictable components, is called:
a. trend.
b. cyclical.
c. seasonal.
d. random variation.
ANS: D PTS: 1 OBJ: Section 18.2
12. The time series component that reflects a wavelike pattern describing a
long-term trend that is generally apparent over a number of years is called:
a. trend.
b. cyclical.
c. seasonal.
d. random variation.
ANS: B PTS: 1 OBJ: Section 18.2
13. Which of the four time series components is more likely to exhibit the steady
growth of the population of the United States from 1950 to 2000?
a. Trend
b. Cyclical
c. Seasonal
d. Random variation
ANS: A PTS: 1 OBJ: Section 18.2
14. Selecting a large the number of periods (N) for a centered moving average
a. results in greater dampening.
b. is less responsive to directional changes.
c. results in a more accurate forecast.
d. both A and B.
e. All of the above.
ANS: D PTS: 1 OBJ: Section 18.3
16. We calculate the three-period moving average for a time series for all time
periods except the:
a. first period.
b. last period.
c. first and last period.
d. first and last two periods.
ANS: C PTS: 1 OBJ: Section 18.3
19. A factor that inflates or deflates a trend value to compensate for the effect
of a pattern that repeats over a period of one year or less is known as a
a. smoothing factor.
b. dampening factor.
c. seasonal index.
d. residual factor.
ANS: C PTS: 1 OBJ: Section 18.4
20. An auto parts supplier forecasted 1998 annual sales of $800,000. Seasonal
indexes of sales for each quarter of the year were: I – 80, II – 112, III – 117, IV –
91. The sales forecast for quarter II was:
a. 300,000.
b. 200,000.
c. 224,000.
d. 228,000.
e. 284,000.
ANS: C PTS: 1 OBJ: Section 18.4
22. Which of the following will not be present in a de-seasonalized time series?
a. Trend effects
b. Cyclical effects
c. Seasonal effects
d. Random variation
ANS: C PTS: 1 OBJ: Section 18.4
23. In determining weekly seasonal indexes for gas consumption, the sum of the
52 means for gas consumption as a percentage of the moving average is 5050.
To get the seasonal indexes, each weekly mean is to be multiplied by:
a. 5200/5050.
b. (5200 + 5050)/52.
c. (5050 + 52)/5200.
d. 5050/5200.
ANS: A PTS: 1 OBJ: Section 18.4
24. In determining monthly seasonal indexes for gas consumption, the sum of
the 12 means for gas consumption as a percentage of the moving average is
1150. To get the seasonal indexes, each of the 12 monthly means is to be
multiplied by:
a. 1150/1200.
b. (1200 + 1150)/12.
c. (1150 + 12)/1200.
d. 1200/1150.
ANS: D PTS: 1 OBJ: Section 18.4
25. The results of a quadratic model fit to time series data were = 7.5 – 0.25t +
3.5t2, where t = 1 for 1995. The forecasted value for 2002 is:
a. -2.0.
b. 10.75.
c. 28.
d. 229.5.
ANS: D PTS: 1 OBJ: Section 18.5
26. The trend equation for quarter sales data (in millions of dollars) for 1996-
2000 is = 6.8 + 1.2t, where t = 1 for the first quarter of 1996. If the seasonal
index for the third quarter of 2001 is 1.25, then the forecasted sales for the third
quarter of 2001 is:
a. 34.4.
b. 27.52.
c. 43.00.
d. 35.65.
ANS: C PTS: 1 OBJ: Section 18.5
27. Five trend models applied to the same time series data are compared and
the MAD are given below. The model which is the best fit for the data is the one
whose MAD is:
a. 35.
b. 9.
c. 15.
d. 118.
e. 40.
ANS: B PTS: 1 OBJ: Section 18.6
28. The mean absolute deviation averages the absolute differences between the
actual values of the time series at time t and the forecast values at time:
a. t + 1.
b. t.
c. t – 1.
d. t – 2.
ANS: B PTS: 1 OBJ: Section 18.6
29. Which method would you recommend to your statistics professor in selecting
the appropriate forecasting model if avoiding large errors is extremely important
to him or her?
a. Mean absolute deviation (MAD)
b. Mean square error (MSE)
c. Either A or B
d. Neither A nor B
ANS: B PTS: 1 OBJ: Section 18.6
30. One of the assumptions required by the regression model is that the
residuals (errors) be independent of each other. When this assumption is not
met:
a. multicollinearity exists.
b. heteroscedasticity exists.
c. homoscedasticity exists.
d. serial correlation exists.
e. a normal distribution exists.
ANS: D PTS: 1 OBJ: Section 18.7
32. Which of the following statements is not true regarding the Consumer Price
Index (CPI)?
a. The CPI describes the change in prices from one period to the next for a fixed
“market basket” of goods and services.
b. The CPI is used as a measurement of inflation.
c. The CPI is used as a “cost of living” adjustment in labor contracts.
d. The CPI is used as a means of deflating current prices or wages to “real”
prices wages.
e. All of these statements are true.
ANS: E PTS: 1 OBJ: Section 18.9
NUMERIC RESPONSE
Month Minutes
1 74
2 86
3 70
4 96
5 111
6 102
7 115
8 126
NARREND
1. What is the linear trend estimate for Month 7?
ANS: 115.95
NARREND
2. What is the linear trend estimate for Susan’s GPA for Spring semester
Sophomore year?
ANS: 2.906
ANS: 1600
4. The following trend line was calculated from quarterly data for 1995 – 1999:
where t is 1 for the first quarter of 1995. Calculate the trend value for the
second quarter of the year 2000.
ANS: 0.81
ANS: 510.896
ANS: 513.536
ANS: 25
8. Using exponential smoothing, with = 0.30, what is the forecasted value for
time period 5?
ANS: 24.644
ANS: 21.25
10. The seasonal indexes for a convention center’s bookings are 85, 128, 104,
and 83 for quarters 1 through 4, respectively. What percentage of the center’s
annual bookings tend to occur during the second quarter?
ANS: 32%
NARRBEGIN: Inventory
The following values are the means for each month of the inventory as a
percentage of the moving average for the twelve months of the year.
NARREND
11. Compute the de-seasonalized value for February if the inventory in February
is $48,000.
ANS: $55581
ANS: 86.36
13. The trend equation for annual sales is = 32 + 3.1x where x = 0 in 1992.
Sales (y) are in millions of dollars. The monthly seasonal index for July is 98.
What is the July sales estimate for 2000?
ANS: 4.64
14. The monthly seasonal index for August is 101. The de-seasonalized August
output is 250,000 units. What is the annual output?
ANS: 2970297
15. If summer 2003 sales were $12,600 and the summer seasonal index was
1.20. What is the de-seasonalized 2003 summer sales value?
ANS: $10500
16. The following trend line was calculated from quarterly data for 1996 – 2000:
= 0.70 + 0.005t, where t = 1 for the first quarter of 1996. The seasonal indexes
computed from the trend line for the four quarters of the year 2001 were 0.85,
1.05, 1.15, and 0.80, respectively. What is the seasonalized forecast for the third
quarter of the year 2001?
ANS: 0.937
____________________ thousand
ANS: 193.6
18. Based on annual car rentals over years, a rental firm has developed the
following quadratic trend equation: = 425 + 18x + 6.5×2, with x = 1 for 1998.
Forecast y for 2005.
ANS: 985
19. Five trend models for the same time series data are compared and the MSE
values are 35, 9, 15, 118, and 25. The worst fitting model to the data is the one
whose MSE is ____________________.
ANS: 118
NARREND
20. What is the mean absolute deviation for this forecast?
ANS: 5.2
ANS: 43.6
NARRBEGIN: Table
NARREND
22. What is the computed value of the Durbin-Watson statistic?
ANS: 1.19
23. Daily sales volume from the Avalon Convenience Store is shown in the table
below.
ANS: $522.12
COMPLETION
1. In forecasting, we use data from the ____________________ in predicting the
____________________ value of the variable of interest, such as automobile sales.
4. Fill in the blank Interpreting the Durbin-Watson test statistic (d), weak or
nonexistent autocorrelation is indicated by a value near ____________________.
ANS:
2
two
6. In 2000 a company’s inventory costs were $4 million. Then the index number
for the 2000 base period is ____________________.
ANS: 100
8. When values in a time series are highly correlated with each other,
____________________ forecasting models can be useful.
ANS: autoregressive
9. When data are a sequence of observations over regular intervals of time, they
are referred to as a ____________________.
SHORT ANSWER
Month Minutes
1 74
2 86
3 70
4 96
5 111
6 102
7 115
8 126
NARREND
1. What is the linear trend equation?
ANS:
NARREND
2. What is the linear trend equation?
ANS:
3. For five years of data, compute a linear trend equation and forecast the value
for the sixth year. Code the years 0 through 4. The data values are: 27, 32, 28,
37, and 38.
= ____________________
ANS:
= 27+2.7x; 40.5
ANS:
The model is y = T•C•S•I. Therefore the quantity of water consumed is
500,000(1.25)(0.70)(1.05) = 459,375 gallons
NARREND
6. What is the linear trend equation?
ANS:
7. The occupancy rate for motel and hotel rooms in a small city in Michigan from
1993 to 2002 follow:
65.7 67.3 61.8 60.9 67.0 77.7 82.5 72.4 85.5 59.5
ANS:
ANS:
ANS:
ANS:
ANS:
There appears to be a gradual trend of increasing sales.
12. Gashol (a mixture of gasoline and methyl alcohol) sales in Iowa City have
been recorded over the past 10 months as shown below:
Month Jan Feb Mar Apr May June July Aug Sep Oct
Sales 75 72 81 92 90 105 112 107 110 93
ANS:
Month Jan Feb Mar Apr May June July Aug Sep Oct
Sales 75 72 81 92 90 105 112 107 110 93
Mov. Av – – 82.0 88.0 96.0 101.2 104.8 105.4 – –
PTS: 1 OBJ: Section 18.3
13. Gashol (a mixture of gasoline and methyl alcohol) sales in Iowa City have
been recorded over the past 10 months as shown below:
Month Jan Feb Mar Apr May June July Aug Sep Oct
Sales 75 72 81 92 90 105 112 107 110 93
ANS:
Month Jan Feb Mar Apr May June July Aug Sep Oct
Sales 75 72 81 92 90 105 112 107 110 93
= 0.4 75 73.8 76.68 82.81 85.69 93.41 100.85 103.31 105.99 100.79
= 0.8 75 72.6 79.32 89.46 89.89 101.98 110 107.6 109.52 96.3
PTS: 1 OBJ: Section 18.3
14. Gashol (a mixture of gasoline and methyl alcohol) sales in Iowa City have
been recorded over the past 10 months as shown below:
Month Jan Feb Mar Apr May June July Aug Sep Oct
Sales 75 72 81 92 90 105 112 107 110 93
ANS:
Month Jan Feb Mar Apr May June July Aug Sep Oct
Sales 75 72 81 92 90 105 112 107 110 93
Mov. Av – – 81.88 87.50 95.88 101.63 106.00 107.00 – –
PTS: 1 OBJ: Section 18.3
15. In determining monthly seasonal indexes for electrical consumption, the sum
of the 12 means for electrical consumption as a percentage of the moving
average is 1193. To get the seasonal indexes, each monthly mean is to be
multiplied by ________________________.
ANS:
1200/1193
16. In determining weekly seasonal indexes for electrical consumption, the sum
of the 52 means for electrical consumption as a percentage of the moving
average is 5193. To get the seasonal indexes, each monthly mean is to be
multiplied by ____________________________________.
ANS:
5200/5193
17. In determining seasonal indexes for the months of the year, a statistician
finds the total of the “unadjusted” seasonal indexes is 1150 instead of 1200.
What correction must be made to arrive at the adjusted seasonal indexes?
ANS:
The statistician must multiply each seasonal index by the adjustment factor
1200/1150
18. The means of the quarterly cost as a percentage of moving average are
151.5, 77.5, 67.3, and 110.6. Prepare the seasonal index for each quarter.
ANS:
NARRBEGIN: Overtime
The quarterly overtime hours (in 1000s) recorded in a large steel mill are shown
below.
Using the regression technique, the linear trend line = 21.675 + .854t was
computed.
NARREND
19. Calculate the seasonal indexes based on the regression trend line.
ANS:
Using Excel to calculate for each value of t and compute the ratio of y/ . The
ratios associated with each quarter, the averages, and the seasonal indexes are
shown below.
ANS:
The seasonal indexes tell us that, on average, the total overtime hours are
below the annual average in the second and third quarters, and above the
average in the first and fourth quarter. That is, we expect the total number of
overtime hours for the second and third quarters to be 30.16 % and 11.41%,
respectively, below the annual average. The total overtime hours for the first
and fourth quarters are expected to be 31.06% and 10.51%, respectively, above
the average.
21. The following seasonal indexes and linear trend were computed from five
year quarterly sales data.
Trend line is
ANS:
22. The regression output produced by MINITAB for times series set of data is
shown below:
Conclusion: ____________________
Interpretation: __________________________________________________
ANS:
1.94; 1.37; Do not reject H0; There is NO evidence of positive autocorrelation
NARRBEGIN: Table
NARREND
23. At the 0.01 significance level with n = 15, k = 2, test the hypothesis that
there is positive autocorrelation present. What is the appropriate conclusion?
ANS:
The test is inconclusive.
24. If a company’s inventory costs were $3.5 million in 1995 (base period) and
$5.95 million in 2002, what is the index number for 2002?
What does it mean?
ANS:
170; This means that if inventory costs in 1995 are defined as 100, 2002 costs
can be expressed as 170 or 70% higher.
25. In the classical time series model, the time series is made up of (T), (C), (S),
and (I) components What do these initials indicate?
ANS:
trend; cyclical; seasonal; irregular
ESSAY
ANS:
Time series are examined for two reasons: (a) to understand the past and (b) to
predict the future.
ANS:
Trend. (T); an overall upward or downward tendency.
Cyclical. (C); fluctuations that repeat over time, with a period being more than a
year from one peak to the next.
Seasonal. (S); Periodic fluctuations that repeat over a period of one year or less
Irregular. (I); Random or “noise” fluctuations that are the result of chance
events.
ANS:
Exponential smoothing of a time series
ANS:
When exponential smoothing is used to smooth a time series, the formula for
the current period includes the exponentially-smoothed value for the previous
period and the actual value for the current period. However, when exponential
smoothing is used to forecast, the formula for the next period includes the
forecasted value for the current period and the actual data for the current
period. Thus, in smoothing, the smoothed data is the combination of the
smoothed data for previous periods and actual data for current periods. In
forecasting, the forecast for the next period is the combination of the forecast
and the actual data for the current period.
6. How do the MAD and MSE criteria differ in their approach to evaluating the fit
of an estimation equation to a time series?
ANS:
MAD considers the mean absolute error from the actual data; the direction of the
error is not considered. In calculating MAD, the absolute values of the deviations
are used. MSE considers the mean squared error from the actual data. In
calculating MSE, the squared deviations from the actual data are used. The MSE
approach penalizes large errors more since the errors are squared.
7. According to the chapter, regression models based on time series data are
especially susceptible to autocorrelation. Why?
ANS:
a) Key independent (x) variables that have not been included in the model are
likely to be related to time. b) Omission of one or more of these independent
variables will tend to yield residuals (errors) that are also related to time.
8. What is the Durbin-Watson test for autocorrelation, and how can it be useful in
evaluating the relevance of a given regression model that has been fitted to a
set of time series data?
ANS:
The Durbin-Watson test is used to detect the presence of autocorrelation
between adjacent residuals. If the test detects the presence of autocorrelation,
there is the possibility in the given model that important variables may have
been omitted. The inclusion of these variables can reduce the presence of
autocorrelation and enhance the quality of confidence and prediction intervals.
9. When autocorrelation of the residuals is present, what effect can this have on
interval estimation and significance tests regarding the regression model
involved?
ANS:
When autocorrelation of the residuals is present, the interval estimations will
tend to be too narrow, and the respective independent variables will be
exaggerated in their significance.
ANS:
An index number is a percentage that expresses a measurement in a given
period in terms of the corresponding measurement for a selected base period.
ANS:
The Consumer Price Index (CPI) is an index which describes the change in prices
from one time period to another for a fixed “market basket” of goods and
services.
ANS:
The Producer Price Index (PPI) is an index which reflects the prices of
commodities produced or processed in the United States.
ANS:
The Employment Cost Index (ECI) is a quarterly measure of the total cost of
employing labor. It includes wages and salaries, plus employers’ costs for
employee benefits such as paid leave, health and life insurance, pension and
saving plans, Social Security, and unemployment insurance.
ANS:
Changing the base of an index can be desirable for two purposes: (a) to better
reflect the mix of public tastes or economic activities from which the index is
compiled, and (b) to convert the index to the same base period as another
series so that the two may be more easily compared.
TRUE/FALSE
1. True or False
In hypothesis testing, uncertainty is reflected in Type I but not Type II error.
2. True or False
Decisions can never be made without the benefit of knowledge gained from
sampling.
3. True or False
A decision situation can be expressed as either a payoff table or a decision tree
diagram.
4. True or False
Incremental analysis is a technique especially useful for inventory decisions.
5. True or False
In non-Bayesian decision making, the decision rule considers the respective
probability of the states of nature.
7. True or False
The maximax criterion specifies that we select the decision alternative having
the highest maximum payoff.
8. True or False
In using the minimax regret criterion, we must first construct a regret table.
9. True or False
Regrets refer to the differences between each payoff and the worst possible
payoff if that state of nature were to occur.
MULTIPLE CHOICE
1. The level of doubt regarding the decision situation where both the possible
states of nature and their exact probabilities of occurrence are known is which of
the following?
a. risk
b. uncertainty.
c. ignorance
d. expected payoff
e. state of nature
ANS: A PTS: 1 OBJ: Section 19.2
5. A person bets his remaining savings on a horse race based on the large payoff
if his horse wins. Which of the following decision making criteria does this typify?
a. Maximum
b. Maximax
c. Minimax
d. Minimax regret
e. Maximum expected payoff
ANS: B PTS: 1 OBJ: Section 19.3
NUMERIC RESPONSE
NARRBEGIN: Economy
An investor has constructed the following payoff table for his profit in thousands
of dollars.
NARREND
1. What is the most the investor should be willing to pay for perfect information
about the future state of the economy?
____________________ thousand
ANS: 12.8
2. A fruit stand manager must decide how many pounds of bananas to stock
weekly. From past records it appears that the demand for bananas is
approximately normal with a mean of 1,200 pounds and a standard deviation of
400 pounds. The cost of the bananas is $0.15 per pound and the usual selling
price is $0.45 per pound. At the end of each week a hog farmer comes by the
fruit stand and purchases any unsold, over-ripened bananas for $0.05 per
pound. Calculate the amount of bananas the manager needs to purchase each
week to maximize his profit.
____________________ pounds
ANS: 1468
____________________ T-shirts
ANS: 411
____________________ pancakes
ANS: 3194
5. The manager of a grocery store pays 60 cents for each rose and sells them for
$1.50 each. Roses left at the end of the day are discarded. The daily demand
can be approximated by a normal distribution having a mean of 250 and a
standard deviation of 40. To maximize his expected profit, how many roses
should the manager buy before the market opens each day?
____________________ roses
ANS: 260
6. A retailer can purchase units of a perishable item for 75 cents each. The
retailer sells the item for $2.00. Unsold items are worthless and will be
discarded. If demand for the item can be approximated by a normal distribution
having a mean of 150 and a standard deviation of 30, how many units should
the retailer stock to maximize expected profit?
____________________ items
ANS: 160
____________________ million
ANS: $26.1
8. What is the most the firm would be willing to pay for a research study
designed to reduce its uncertainty about market conditions?
____________________ million
ANS: $5.6
____________________ million
ANS: $9
____________________ thousand
ANS: $82.5
____________________ thousand
ANS: $26
12. Calculate the expected opportunity loss for Alternative A (in thousands of
dollars).
____________________ thousand
ANS: $46.5
COMPLETION
SHORT ANSWER
Columns:
ANS:
The rows of a payoff table depict the alternatives the decision maker has
available. In this case, the mango grower may choose to install the heating
system or not to install the system.; The columns of a payoff table represent the
possible states of nature that might occur; in this case, the winter will be cold
enough to require the system to save the crop or it will be mild enough not to
require the system.
2. When Jessica takes the direct route to work, she gets there in 25 minutes if no
train is at the railroad crossing. When a train is using the crossing, her travel
time is 10 minutes longer. Her alternative is a route that takes 32 minutes to
travel, but has no traffic signals or other delays. Assuming that shorter transit
times are more desirable, which route will Jessica select if she uses the maximin
criterion? Explain.
ANS:
Alternative route; Assuming that shorter transit times are more desirable,
Jessica’s reason using the maximin criterion will follow these steps: If she takes
the direct route, the greatest time it might take her to get to work is 35 minutes.
If she takes the longer route, the greatest possible time is 32 minutes.
Therefore, using the maximin criterion, Jessica will choose the longer route
because it has the better of the worst possible travel times.
3. When Jessica takes the direct route to work, she gets there in 25 minutes if no
train is at the railroad crossing. When a train is using the crossing, her travel
time is 10 minutes longer. Her alternative is a route that takes 32 minutes to
travel, but has no traffic signals or other delays. Assuming that shorter transit
times are more desirable, which route will Jessica select if she uses the maximax
criterion? Explain.
ANS:
Direct route; Using the maximax criterion, Jessica will assume that good things
will happen to her. So if she takes the direct route, she will get to work in 25
minutes, and, if she takes the longer route, it will take 32 minutes. Therefore,
Jessica will choose the direct route if she is using the maximax criterion.
4. When Jessica takes the direct route to work, she gets there in 25 minutes if no
train is at the railroad crossing. When a train is using the crossing, her travel
time is 10 minutes longer. Her alternative is a route that takes 32 minutes to
travel, but has no traffic signals or other delays. Assuming that shorter transit
times are more desirable, which route will Jessica select if she uses the minimax
regret criterion? Explain.
ANS:
Direct route; This criterion minimizes the maximum possible regret. Jessica’s
maximum possible regret if she takes the direct route is 3 minutes; if she takes
the longer route, it is 7 minutes. Therefore, using the minimax regret criterion,
Jessica will take the direct route, since her maximum possible route on that
route is only 3 minutes as compared to 7 minutes on the longer route.
5. For each of the following, indicate whether the behavior represents the
maximin criterion or the maximax criterion. Explain your reasoning.
A) A truck driver is behind schedule and knows he is approaching an area where
police radar is often encountered, but he continues driving at 20 mph over the
speed limit. Explain:
Competitor’s Price
ABC Price High Medium Low
High $60,000 $25,000 -$20,000
Medium $35,000 $46,000 -$4,000
Low $18,000 $12,000 $8,000
NARREND
6. What price should ABC choose using the maximax criterion?
ANS:
PAYOFFS Competitor Price Maximax
Alternatives High Medium Low Max Payoff Choice
High $60,000 $25,000 -$20,000 $60,000 Best
Medium $35,000 $46,000 -$4,000 $46,000
Low $18,000 $12,000 $8,000 $18,000
8. What price should ABC choose using the minimax regret criterion?
ANS:
Regrets Competitor Price Minimax
Alternatives High Medium Low Max Regret Regret
High $0 $21,000 $28,000 $28,000
Medium $25,000 $0 $12,000 $25,000 Best
Low $42,000 $34,000 $0 $42,000
9. If the probability of the competitor setting their price High, Medium, and Low
is 40%, 25%, and 35% respectively, which price provides ABC with the highest
expected payoff?
ANS:
PAYOFFS Competitor Price Expected
Alternatives High Medium Low Payoff Choice
High $60,000 $25,000 -$20,000 $23,250
Medium $35,000 $46,000 -$4,000 $24,100 Best
Low $18,000 $12,000 $8,000 $13,000
Probability 0.40 0.25 0.35
ANS:
PAYOFFS Competitor Price Expected
Alternatives High Medium Low Payoff Choice
High $60,000 $25,000 -$20,000 $23,250
Medium $35,000 $46,000 -$ 4,000 $24,100 Best
Low $18,000 $12,000 $ 8,000 $13,000
Probability 0.40 0.25 0.35
Best Outcome $60,000 $46,000 $ 8,000
NARRBEGIN: Economy
An investor has constructed the following payoff table for his profit in thousands
of dollars.
NARREND
11. Construct the regret matrix.
ANS:
ANS:
$14.8; $20.8; $12.8
13. Using the opportunity loss approach, which alternative will be selected, and
what is its expected opportunity loss?
ANS:
Company C; 12.8
PAYOFFS Demand
Alternatives 2000 3000 4000 5000
2000 $ 10,000 $10,000 $10,000 $10,000
3000 $ 3,000 $15,000 $15,000 $15,000
4000 -$ 4,000 $ 8,000 $20,000 $20,000
5000 -$11,000 $ 1,000 $13,000 $25,000
NARREND
14. How many shirts should be produced using the maximax criterion?
ANS:
PAYOFFS Demand Maximax
Alternatives 2000 3000 4000 5000 Max Payoff Choice
2000 $ 10,000 $10,000 $10,000 $10,000 $10,000
3000 $ 3,000 $15,000 $15,000 $15,000 $15,000
4000 -$ 4,000 $ 8,000 $20,000 $20,000 $20,000
5000 -$11,000 $ 1,000 $13,000 $25,000 $25,000 Best
15. How many shirts should be produced using the maximin criterion?
ANS:
PAYOFFS Demand Maximum
Alternatives 2000 3000 4000 5000 Min Payoff Choice
2000 $10,000 $10,000 $10,000 $10,000 $ 10,000 Best
3000 $ 3,000 $15,000 $15,000 $15,000 $ 3,000
4000 -$ 4,000 $ 8,000 $20,000 $20,000 -$ 4,000
5000 -$11,000 $ 1,000 $13,000 $25,000 -$11,000
16. How many shirts should be produced using the minimax regret criterion?
ANS:
REGRET Demand Max Minimax
Alternatives 2000 3000 4000 5000 Regret Regret
2000 $0 $5,000 $10,000 $15,000 $15,000
3000 $7,000 $0 $5,000 $10,000 $10,000 Best
4000 $14,000 $7,000 $0 $5,000 $14,000
5000 $21,000 $14,000 $7,000 $0 $21,000
ANS:
PAYOFFS Demand Expected
Alternatives 2000 3000 4000 5000 Payoff Choice
2000 $10,000 $10,000 $10,000 $10,000 $10,000
3000 $3,000 $15,000 $15,000 $15,000 $12,600 Best
4000 -$4,000 $8,000 $20,000 $20,000 $10,400
5000 -$11,000 $1,000 $13,000 $25,000 $6,400
18. The probabilities for demand levels 2000, 3000, 4000, and 5000 have been
estimated to be 20%, 40%, 15%, and 25% respectively. Which is the expected
value of perfect information?
ANS:
PAYOFFS Demand Expected
Alternatives 2000 3000 4000 5000 Payoff Choice
2000 $10,000 $10,000 $10,000 $10,000 $10,000
3000 $3,000 $15,000 $15,000 $15,000 $12,600 Best
4000 -$4,000 $8,000 $20,000 $20,000 $10,400
5000 -$11,000 $1,000 $13,000 $25,000 $6,400
Probability 0.20 0.40 0.15 0.25
Best Outcome $10,000 $15,000 $20,000 $25,000
ANS:
$11.4; $7.6; $5.6
ANS:
Design B
ANS:
Design C
ANS:
Market Condition
Alternative I II III
A 0 2 34
B 10 0 22
C 20 6 0
PTS: 1 OBJ: Section 19.7
23. What decision will be made using the minimax regret criterion?
ANS:
Design C
24. Assume that the probabilities for the three market conditions are estimated
as 0.1, 0.6, and 0.3, respectively. Which design should be selected in order to
maximize the firm’s expected profit?
ANS:
Design C
25. In what way is the lowest of the expected opportunity losses related to the
expected value of perfect information?
ANS:
The smallest of the expected opportunity losses for the alternative is the
expected value of perfect information (EVPI).
26. A company must decide whether or not to change its packaging now to a
more environmentally safe material. The impact of the decision on profits
depends on which future scenario develops. Scenario 1 is that the media does
not focus heavily on concerns about packaging and no new laws requiring
changes in packaging are passed. Under this scenario, the company will make
$30 million if they change their packaging now, but will make $70 million if they
do not change their packaging now. Scenario 2 is that the media does focus
heavily on concerns about packaging and no new laws requiring changes in
packaging are passed. Under this scenario, the company will make $45 million if
they change their packaging now, but will make $50 million if they do not
change their packaging now. Scenario 3 is that the media does focus heavily on
concerns about packaging and new laws requiring changes in packaging are
passed. Under this scenario, the company will make $55 million if they change
their packaging now, but will make only $10 million if they do not change their
packaging now. The probabilities of the three scenarios are 0.3, 0.5, and 0.2,
respectively.
ANS:
Scenario
123
Decision Prob = 0.3 Prob = 0.5 Prob = 0.2
Change $30 $45 $55
Don’t change $70 $50 $10
PTS: 1 OBJ: Section 19.7
NARRBEGIN: Scenario
A company must decide whether or not to change its packaging now to a more
environmentally safe material. The impact of the decision on profits depends on
which future scenario develops. Scenario 1 is that the media does not focus
heavily on concerns about packaging and no new laws requiring changes in
packaging are passed. Under this scenario, the company will make $30 million if
they change their packaging now, but will make $70 million if they do not
change their packaging now. Scenario 2 is that the media does focus heavily on
concerns about packaging and no new laws requiring changes in packaging are
passed. Under this scenario, the company will make $45 million if they change
their packaging now, but will make $50 million if they do not change their
packaging now. Scenario 3 is that the media does focus heavily on concerns
about packaging and new laws requiring changes in packaging are passed.
Under this scenario, the company will make $55 million if they change their
packaging now, but will make only $10 million if they do not change their
packaging now. The probabilities of the three scenarios are 0.3, 0.5, and 0.2,
respectively.
NARREND
27. What decision will be made using the maximin criterion?
ANS:
Change packaging
ANS:
Don’t change packaging
29. What decision will be made using the minimax regret criterion?
ANS:
Change packaging
ANS:
Don’t change packaging
ANS:
Don’t change packaging
PAYOFFS Demand
Alternatives 10 20 30
10 $140 $140 $140
20 -$20 $280 $280
30 -$180 $120 $420
NARREND
32. How many canoes should be leased using the maximax criterion?
ANS:
PAYOFFS Demand Maximax
Alternatives 10 20 30 Max Payoff Choice
10 $140 $140 $140 $140
20 -$20 $280 $280 $280
30 -$180 $120 $420 $420 Best
Lease 30 canoes.
33. How many canoes should be leased using the minimax regret criterion?
ANS:
REGRET Demand Minimax
Alternatives 10 20 30 Max Regret Choice
10 $0 $140 $280 $280
20 $160 $0 $140 $160 Best
30 $320 $160 $0 $320
Lease 20 canoes.
34. How many canoes should be leased using the maximin criterion?
ANS:
PAYOFFS Demand Maximum
Alternatives 10 20 30 Min Payoff Choice
10 $140 $140 $140 $140 Best
20 -$20 $280 $280 -$20
30 -$180 $120 $420 -$180
Lease 10 canoes.
35. John assumes the probabilities for demand levels of 10, 20, and 30 canoes is
40%, 25%, and 35% respectively. Which leasing alternative will provide the
largest expected payoff?
ANS:
PAYOFFS Demand Expected
Alternatives 10 20 30 Payoff Choice
10 $140 $140 $140 $140
20 -$20 $280 $280 $160 Best
30 -$180 $120 $420 $105
Probability 0.40 0.25 0.35
Lease 20 canoes.
36. John assumes the probabilities for demand levels of 10, 20, and 30 canoes is
40%, 25%, and 35% respectively. What is the expected value of perfect
information?
ANS:
PAYOFFS Demand Expected
Alternatives 10 20 30 Payoff Choice
10 $140 $140 $140 $140
20 -$20 $280 $280 $160 Best
30 -$180 $120 $420 $105
Probability 0.40 0.25 0.35
Best Outcome $140 $280 $420
ANS:
Alternative D
ESSAY
ANS:
The payoff table is a table in which the rows are decision alternatives, the
columns are states of nature, and the entry at each intersection of a row and
column is a numerical payoff such as a profit or loss. The table may also include
estimated probabilities for the states of nature.
2. What is meant when we say that the states of nature in the payoff table must
be mutually exclusive and exhaustive?
ANS:
The states of nature must be mutually exclusive (only one of them will occur),
and they must be exhaustive (one of them must occur). For example, the states
of nature for a roll of a single die are the numbers 1, 2, 3, 4, 5, and 6. This list is
mutually exclusive – only one of the states of nature can occur for a single roll of
the die. It is also exhaustive, because it lists all the possible outcomes and one
of the listed outcomes must occur.
3. The manager of a small local airport must extend a runway if business jets
are to be accommodated. However, the residents living close to the airport are
already complaining to the city council about the volume and noise of existing
air traffic at the facility. Identify the decision alternatives and the possible states
of nature that might occur after the decision is made.
ANS:
The decision alternatives for the airport manager are to extend the runway or
not to extend the runway. The possible states of nature would include increased
complaints from local residents, reduced complaints from local residents, or no
change in complaints from area residents.
4. What are the differences among risk, uncertainty, and ignorance? Into which
category of decision situation do most business decisions fall?
ANS:
Risk is the condition when all the possible states of nature and their associated
probabilities are known. Uncertainty is the condition when the possible states of
nature are known, but the exact probabilities for the states of nature are not
known. Ignorance is the condition in which neither the possible states of nature
nor any of the associated probabilities are known. Most business decisions are
made under conditions of uncertainty.
5. A new car salesman is being paid a straight salary, but has the choice to (a)
continue with straight salary, (b) change to straight commission, or (c) change
to a combination that includes a lower base salary and a commission on each
sale of a new car. What states of nature might be useful in constructing a payoff
table for this situation?
ANS:
The salesman should consider the possible levels of sales he might make as his
states of nature. Of course, the various levels of sales would affect his income
under the different types of income plans.
ANS:
The expected payoff of a decision alternative is the sum of the products of the
payoffs and the state of nature probabilities.
ANS:
The Expected Value of Perfect Information (EVPI) is the difference between (1)
the expected payoff if we are guaranteed perfect knowledge about the future
and (2) the expected payoff under uncertainty (present information).
ANS:
Rarely will we ever obtain perfect information, yet we can usually purchase
“better” information for a price. The expected value of perfect information
(EVPI) tells us the upper limit for what “perfect” information would be worth.
Certainly we would not want to pay more than the EVPI for “imperfect”
information.
10. In what way are the minimax regret criterion and the expected opportunity
loss (EOL) criterion similar? In what ways do they differ?
ANS:
Both the minimax regret criterion and the expected opportunity loss (EOL)
criterion call for the construction of a “regret matrix”. That is, they force the
decision maker to consider the possible lost opportunities in choosing one
alternative over another. The two criteria differ in that the minimax regret does
not take into account the probabilities of the various states of nature, and the
EOL criterion does.
TRUE/FALSE
1. True or False
Random variation occurs due to identifiable causes that have changed the
process.
2. True or False
Assignable variation occurs due to identifiable causes that have changed the
process.
4. True or False
Perceived quality refers to attributes that relate to the human senses, for
example, how a product looks, feels, tastes, or smells.
5. True or False
Six Sigma is a philosophy that focuses on small, ongoing improvements rather
than relying on large-scale innovations, such as might come from advances in
technology or equipment.
6. True or False
Competitive benchmarking involves studying, and attempting to emulate, the
strategies and practices of organizations already known to generate world-class
products and services.
7. True or False
In general, as long as random variation is absent and assignable variation
remains within acceptable limits, the process is deemed to be stable, or in
statistical control.
8. True or False
Although both statistical process control (SPC) and acceptance sampling make
inferences on the basis of samples, SPC uses sampled products to make
inferences about a larger population of the products, while acceptance sampling
uses the sampled products to make inferences about the process from which
they came.
ANS: F PTS: 1 OBJ: Section 20.2
9. True or False
Acceptance sampling involves inspection of a sample of products after an entire
production run or shipment lot has been produced. As such, it represents a
defect-detection approach to quality management.
MULTIPLE CHOICE
5. A random sample of 100 rods from a process with a mean of 20 feet produced
a sample mean of 20.01 feet and a standard deviation of 0.015. This difference
between the process mean and the sample mean is likely to be:
a. assignable variation.
b. special-cause variation.
c. random variation.
d. homoscedasticity.
e. costs of attrition.
ANS: C PTS: 1 OBJ: Section 20.1
7. Variations in process output that are caused by events that in general cannot
be eliminated without changing the process are:
a. assignable variations.
b. random variations.
c. out of control variations.
d. None of these.
e. All of these.
ANS: B PTS: 1 OBJ: Section 20.1
8. When no point lies outside the control limits of a chart, we conclude that:
a. variation in the process is caused by assignable causes and there is enough
evidence to infer that the process is in statistical control.
b. variation in the process is caused by chance and there is enough evidence to
infer that the process is out of control.
c. variation in the process is caused by chance and there is not enough evidence
to infer that the process is out of control.
d. variation in the process is caused by assignable causes and there is not
enough evidence to infer that the process is out of control.
e. None of these.
ANS: C PTS: 1 OBJ: Section 20.2
9. Variations in process output that are caused by specific events or factors that
are frequently temporary such as machine malfunction are:
a. assignable variations.
b. chance variations.
c. out of control variations.
d. random variations.
e. All of these.
ANS: A PTS: 1 OBJ: Section 20.1
10. Variations in process output that can usually be identified and eliminated
without changing the process are:
a. assignable variations.
b. chance variations.
c. out of control variations.
d. random variations.
e. All of these.
ANS: A PTS: 1 OBJ: Section 20.1
11. In statistical process control (SPC), a Type I error occurs if we conclude that
the process is:
a. in control when in fact it is not.
b. out of control when in fact it is not.
c. in control when this is really true.
d. out of control when this is really true.
e. None of these.
ANS: B PTS: 1 OBJ: Section 20.6
12. Which of the following describes the purpose of statistical process control
(defect-prevention)?
a. A consumer buys a TV and plugs it in. It doesn’t work and she returns it.
b. A soft drink company samples its product at 25 retail establishments to
determine if the canner’s mixture meets the soft drink company’s standard.
c. The Army purchases hammers and tests 10 of them before accepting the
entire shipment.
d. Computer chips received by a computer maker are sampled before the
shipment is accepted or rejected.
e. The manufacturer of pocket knifes samples its process at three stages before
allowing the pocket knives to be assembled.
ANS: E PTS: 1 OBJ: Section 20.2
14. When the only sources of variation in a production process are caused by
chance, and random variation remains within acceptable limits, the process is
said to be:
a. stable.
b. in statistical control.
c. out of control.
d. out of luck.
e. Both A and B since they are the same.
ANS: E PTS: 1 OBJ: Section 20.2
15. Perry Company, a variety store, sends a team of five executives to Wal-Mart
headquarters in Bentonville, Arkansas, to observe the distribution and logistics
practices so that Perry Company can improve in these areas. This firm is
engaging in:
a. worker training.
b. statistical process control.
c. a quality circle.
d. competitive benchmarking.
e. acceptance sampling.
ANS: D PTS: 1 OBJ: Section 20.4
17. Individual units can have higher quality than others, even while meeting
traditional specifications. Deviations from the target lead to lower and lower
quality of the unit. This concept is associated with a:
a. cause and effect diagram.
b. process flow chart.
c. check sheet.
d. Taguchi method.
e. Pareto diagram.
ANS: D PTS: 1 OBJ: Section 20.5
18. The visual display of effects and their possible causes as a method of
problem solving is associated with a:
a. cause and effect diagram.
b. process flow chart.
c. check sheet.
d. Taguchi method.
e. Pareto diagram.
ANS: A PTS: 1 OBJ: Section 20.5
20. Control charts that are used to monitor a process whose results are
categorized as either defective or non-defective are called:
a. control charts for attributes.
b. control charts for variables.
c. control charts for normality.
d. control charts for out of luck.
e. None of these.
ANS: A PTS: 1 OBJ: Section 20.6
21. For control chart, the lower and upper control limits are usually set at:
a. five standard deviations from the centerline.
b. four standard deviations from the centerline.
c. three standard deviations from the centerline.
d. two standard deviations from the centerline.
e. one standard deviation from the centerline.
ANS: C PTS: 1 OBJ: Section 20.7
22. A chart for a normally distributed random variable shows that one sample
mean is outside the control chart limits. The probability that this happened by
chance is:
a. 0.9554.
b. 0.5000.
c. 0.9974.
d. 0.0300
e. 0.0026
ANS: E PTS: 1 OBJ: Section 20.7
24. Forty samples of size 1,000 were drawn from a manufacturing process and
the number of defectives in each sample was counted. The mean sample
proportion was 0.05. The centerline for the p chart is:
a. 0.05.
b. 50.0.
c. 2.00.
d. 25.0.
ANS: A PTS: 1 OBJ: Section 20.8
25. Fifty samples of size 600 were drawn from a manufacturing process and the
number of defectives in each sample was counted. The mean sample proportion
was 0.025. The lower control limit for the 3-sigma p chart is:
a. 0.1561.
b. 0.0059.
c. 0.0064.
d. 0.0191.
ANS: B PTS: 1 OBJ: Section 20.8
26. Thirty samples of size 1200 were drawn from a manufacturing process and
the number of defectives in each sample was counted. The mean sample
proportion was 0.04. The upper control limit for the 3-sigma p chart is:
a. 0.0170.
b. 0.0570.
c. 0.1518.
d. 0.0230.
ANS: B PTS: 1 OBJ: Section 20.8
28. Which of the following are among the statistical tools discussed in your text
and used in applying TQM?
a. The process flow chart
b. The cause-and-effect diagram
c. The Pareto diagram
d. The Taguchi view of quality loss
e. All of these
ANS: E PTS: 1 OBJ: Section 20.10
NUMERIC RESPONSE
1. Ground coffee empties into a chute containing a roll of plastic, which forms a
sealed bag of coffee. From every ten bags filled, four are randomly sampled and
weighed to see if the mean is correct (the process in control). The average mean
for twenty samples of four equals 6.01 ounces. The average range of the twenty
samples is 0.08 ounces. What will be the 3-sigma lower control limit for the
mean chart?
ANS: 5.952
2. A 3-sigma mean chart for a normally distributed random variable shows that
one mean lies out of the control limits. What is the probability that this
observation is a result of random chance?
ANS: 0.0026
3. Ground coffee empties into a chute containing a roll of plastic, which forms a
sealed bag of coffee. The weight on the coffee bag states the weight is 6
ounces. From every ten bags filled, four are randomly sampled and weighed to
see if the mean is correct (the process in control). The standard deviation of the
filling process is 0.05 ounces. Compute the 3-sigma lower control limit for the
mean chart.
ANS: 5.925
ANS: 2025
COMPLETION
ANS:
Plan, Do, Check, and Act
Plan, Do, Check, Act
5. Costs that occur as the result of poor quality are also known as
____________________.
ANS: special-cause
9. The idea of total quality management in a word from the Japanese is “Kaizen”
or improvement. In his 1986 book, Kaizen, author ____________________ describes
the concept that Kaizen represents.
11. A bar chart describing the relative frequencies with which various kinds of
defects or nonconformities have occurred is called ____________________.
14. Each sample involves a 9 square foot piece of carpet. The number of flaws
found in a piece would be entered on a(n) ____________________ chart.
ANS: c-chart
15. Organizations seeking to improve the quality of their goods and services are
increasingly adopting the principles and practices of
________________________________________.
ANS:
total quality management
TQM
ANS: Kaizen
ANS: p; c
SHORT ANSWER
1. Use the following data to prepare the upper and lower control limits for a
mean chart. Each sample contains 3 values.
Compute the lower and upper control limits.
ANS:
29.724; 54.276
Centerline = ____________________
ANS:
4; 3.75; 4.25
NARRBEGIN: Bottles
A bottle filling machine fills bottles with 16 oz of soda. Each hour, 3 bottles are
weighed on a precision scale to determine if the process is under control. The
following data has been collected from the last 4 hours. Each value in the table
represents the weight of an individual bottle.
ANS:
UCL = 16.59
LCL = 15.62
4. Assuming this data was collected when the process was operating
satisfactorily, construct 3-sigma limits for the range chart.
ANS:
UCL = 1.22
LCL = 0
5. At 11:00 AM, the weights of the following three bottles were recorded: 16.0,
16.7, 15.2. According to this sample, is the process in control?
ANS:
Sample mean = 15.97
Sample range = 1.5
Since the sample range exceeds the range chart limits, the process is out if
control.
6. The Cattleman’s Bar and Grille would like to use statistical process control to
monitor the time that elapses between a customer’s arrival and when the food is
served. The manager has determined that the average time between customer
arrival and service is 19.5 minutes with a standard deviation of 2.5 minutes.
Each evening, the manager plans on selecting 4 tables at random and will
measure the elapsed time. What would the 3-sigma control limits be for the X-
bar chart?
ANS:
UCL = 23.25 minutes
LCL = 15.75 minutes
NARRBEGIN: Samples
Twenty-five samples of size 4 were taken from a production process. The mean
of the sample means is = 13.9. The process standard deviation is known to be =
3.5. The sample means are listed below.
NARREND
7. Calculate the centerline and control limits for the 3-sigma chart
Centerline = ____________________
ANS:
13.9; 8.65; 19.15
ANS:
ANS:
No; The process is out of control at samples 8, 9, 21, 22, and 25.
NARRBEGIN: Size 4
Twenty-five samples of size 4 were drawn from a production process. For each
sample, the mean and range were computed as shown below.
NARREND
10. Compute the centerline and control limits for the R chart.
Centerline = ____________________
ANS:
12.68; 0; 28.9358
ANS:
12. Compute the centerline and control limits for the chart.
Centerline = ____________________
ANS:
20.76; 11.5163; 30.0037
ANS:
ANS:
No; The chart indicates that the process went out of control at sample 13.
PTS: 1 OBJ: Section 20.7
15. A particular golfer is obsessed with making 10 foot putts. Each day that he
practices, he puts 60 times and counts his misses. The following table indicates
how he has done over the last 10 practices.
ANS:
UCL = 0.3549
LCL = 0.0451
16. In a set of data collected for construction of a c-chart, the average number
of defects per unit of output is = 9. What are the 3-sigma lower and upper
control limits for the control chart?
ANS:
0; 18
ANS:
UCL = 0.372
LCL = 0.156
Centerline = ____________________
19. Construct the p chart and determine if the production process is under
control.
ANS:
ESSAY
1. According to Greg Brue, author of Six Sigma for Managers, Six Sigma has a
strong profitability and project/problem orientation, and involves five phases.
List these phases.
ANS:
(1) define the project or goal and the internal and external customers involved;
(2) measure the current performance of the process;
(3) analyze the root causes of the defects generated by the process;
(4) improve the process so as to eliminate the defects;
(5) control the future performance of the process.
ANS:
Quality is important because customers value it, and because quality and the
lack of it cost money. Even spending a lot of money to prevent poor quality is a
bargain compared to the enormous costs incurred by poor quality.
ANS:
It is generally easier to find and eliminate the source for assignable variation
than for random variation. Assignable variation can be traced to some
identifiable cause, and the cause can be changed or eliminated. Random
variation, on the other hand, occurs because of chance.
ANS:
Worker empowerment (1) makes the individual worker responsible for the
quality of the output from his or her own workstation, and (2) provides the
worker with the authority to make changes deemed necessary for quality
improvement.
ANS:
Random variation in a manufacturing process is the variation in the process that
results from chance. The amount of cereal injected into a box may vary as a
result of the temperature of the cereal, or the temperature of the room, or the
different compositions of the grain.
ANS:
Assignable variation in the process of filling cereal boxes by machine could be a
malfunction of the weight control that might allow too much cereal into the
boxes. Other factors of assigned variation could be operator error or line voltage
“spikes”.
8. Two techniques introduced during the era of statistical quality control are
especially noteworthy. What are they?
ANS:
a) Statistical process control chart: graphical procedure for determining whether
a production process is operating properly. b) Acceptance sampling: set of
procedures whereby, based on a random sampling of the items within, an entire
production lot/shipment is accepted, rejected, or subjected to further sampling.
PTS: 1 OBJ: Section 20.2
ANS:
SPC uses the sampled products to make inferences about the process from
which they came, while acceptance sampling uses sampled products to make
inferences about a larger population of the products themselves.
A) automobile accidents
B) alcoholism
ANS:
A) Automobile accidents. Among strategies for preventing “defects”: driver
education programs, improved road signs and markings. Among strategies for
detecting “defects”: periodic driver retesting, automobile state inspection
programs, checking driver license lists with lists of persons known to have
serious alcohol, drug, medical problems that might impair their driving.;
B) Alcoholism. Preventing: educational programs, family support groups and
other approaches to dealing with some of the causes of alcoholism. Detecting:
various social and law enforcement agencies, identifying repeat DUI (driving
under the influence).
PTS: 1 OBJ: Section 20.2
ANS:
SPC method represents a defect-prevention strategy by monitoring the stability
of the manufacturing process while the products are actually being made.
Acceptance sampling is used after an entire production run has been completed.
It represents a defect-detection approach to quality management rather than a
detection prevention approach.
ANS:
a) Plan: Develop a plan for a possible improvement to the product or the process
through which the product is made.
b) Do: Apply potential improvement in a small-scale test.
c) Check: Evaluate the change to see if it was successful.
d) Act: Based on results: adopt the plan, abandon the plan, or use the
information gained to revise the plan and go through cycle again.
PTS: 1 OBJ: Section 20.3
13. What is meant by the statement that total quality management has a
“process orientation”?
ANS:
TQM stresses the prevention of defects through continual attention to, and
improvements in, the processes through which the product is designed,
produced, delivered, and supported.
14. What is the difference between total quality management and the traditional
approach to quality management?
ANS:
Traditional quality management primarily focuses on the end product and defect
detection. Total quality management is process-oriented and aims at defect
prevention.
15. What is the Deming PDCA cycle and how is it applicable to total quality
management?
ANS:
Deming’s PDCA (Plan-Do-Check-Act) cycle stresses the importance of continuous
and systematic quality-improvement efforts at all levels of the organization.
ANS:
First, the individual worker must be responsible for the quality of output that he
or she produces. Second, the worker must be provided with the authority to
make changes deemed necessary for quality improvement.
PTS: 1 OBJ: Section 20.4
ANS:
“Kaizen” is the name of a book written by Masaaki Imai. Kaizen is a philosophy
centered on ongoing improvement involving everyone, including workers and
managers.
18. What is process capability, and how can its presence be ascertained?
ANS:
The most important element in a quality audit is checking for process capability
– e.g., demanding that a prospective supplier provide evidence of its defect-
prevention strategies and programs, and of its ability to supply products that
meet the customer’s quality expectations.
ANS:
JIT operates with a minimum of inventory on hand, with materials and
components literally arriving “just in time” to enter the production process.
Mistakes that do occur tend to have high visibility and are more quickly
identified and corrected than in conventional systems that have large production
runs before shipment or transfer to either the final customer or the next
workstation.
ANS:
Kaizen focuses on small, ongoing improvements in quality rather than on large-
scale innovations that come from advances in technology and equipment.
Kaizen is the baseball equivalent of winning by hitting many singles.
ANS:
Competitive benchmarking involves studying and emulating the strategies and
practices of organizations already known to generate products and services that
are of world-class quality.
22. In what ways can large production runs and high inventory levels adversely
affect product quality?
ANS:
By the time defects or errors are discovered, there may be an extremely large
number of them. Also, due to the relatively long time that passes between the
occurrence of defects/errors and their discovery, the process problem(s) that led
to them may be difficult or impossible to identify and correct.
23. What is a quality audit, and for what purposes might one be performed?
ANS:
Quality audits involve evaluating a firm’s process capabilities. The goal is to
assess the firm’s strategies, and success, in defect prevention. These audits can
also be quantified to facilitate comparisons from one firm to the next.
ANS:
The traditional approach views a product as either “good” or “bad”, depending
on whether it is within certain specification limits. Taguchi’s method goes
beyond this to recognize the loss in quality that occurs as a dimension or
measurement deviates from the target value.
25. What is a process flow chart, and how can it contribute to the goals of total
quality management?
ANS:
By visually depicting the operations or sub-processes in a process, the process
flow chart contributes to our understanding of the process as well as to our
efforts to improve it.
26. What is the “80/20” rule, and how does it apply to total quality
management?
ANS:
The “80/20” rule is a guideline which states that 80% of the problems come
from only 20% of the problem areas.
27. What is a “check sheet” and what is its role in total quality management?
ANS:
A check sheet is a mechanism for entering and retaining data that may be the
subject of further analysis by another statistical tool.
28. What is a “cause-and-effect diagram”, and what role can it play in total
quality management?
ANS:
The cause and effect diagram provides a visual description of effects and their
possible causes. It facilitates the identification and solution of problems in the
production process.
PTS: 1 OBJ: Section 20.5
29. In testing the null hypothesis “H0: The process is in control”, explain the
difference between a Type I error and Type II error.
ANS:
A Type I error is rejecting a true null hypothesis; therefore, a Type I error in this
situation will result in a belief that the process is not in control when it really is.
A Type II error is failing to reject a false null hypothesis; therefore, a Type II error
in this situation will result in a belief that the process is in control when it really
is not.
ANS:
A variable monitors measurements such as weight, length or diameter. An
attribute is either present or it is not, and this type of control chart applies when
we are counting something associated with the process output.
31. In terms of random variation and assignable variation, what is meant by the
statement, “The process is in control”?
ANS:
The process is considered to be in control if the variation in the sample statistic
is judged to be the result of random variation alone.
ANS:
A control chart is a chart that displays a sample statistic for each of a series of
samples from a process. It has upper and lower control limits and is used in
determining whether the process is in control.
ANS:
A control chart’s upper and lower limits represent the boundaries for virtually all
of the random variation. Values that fall outside these limits are considered as
possibly having occurred as the result of assignable variation.
34. Identify the circumstances when the chart and R chart might be useful.
ANS:
Use the mean ( ) and range (R) charts when evaluating variables that are
quantitatively measured; usually continuous random variables. The chart helps
with questions about whether the process is producing the correct or acceptable
mean and the R-chart helps with questions about the variability of the process.
Often these charts use 3-sigma control limits.
35. How does the purpose of a range chart differ from that of a mean chart, and
why is it a good idea to use both of them in monitoring the same process?
ANS:
The range chart measures the variation in the measurements; it is similar to a
standard error chart. The mean chart, on the other hand, measures the central
tendency of the measurements. Both the range and the mean should be
monitored to assure product quality. The mean is important for obvious reasons:
we must be close to the specifications on average. The range is also important
because we cannot tolerate too much overall deviation.
36. What assumptions are made in the mean chart about the distribution of
individual measurements for the process that is being monitored?
ANS:
In the construction of a mean chart it is assumed that the individual
measurements come from a population that is normally distributed or
approximately so.
37. In general, how will the lower and upper control limits change if a 2-sigma
control chart is used instead of a 3-sigma control chart?
ANS:
The upper and lower control limits will be closer to the centerline if 2-sigma
limits are used instead of 3-sigma limits. (The lines will be only two standard
errors either side of the centerline instead of three).
38. In a 3-sigma mean control chart for a process that is in control, what is the
probability that a given sample mean will fall outside the control limits as the
result of random variation alone?
ANS:
Given 3-sigma control limits for a control chart, there is a 0.0026 (or 0.26%)
chance that a given sample will fall outside the control limits as the result of
random variation alone. (Three-sigma limits include 99.74% of the area under
the normal curve).
39. Identify the circumstances when the p-chart and c-chart might be useful.
ANS:
Use the c-chart and the p-chart for control questions about attributes of items
such as defects and bad parts. Often these charts use 3-sigma control limits.