Problem I

Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

Problem I

1 Visual Representation of Data

Which of the following statements are true?

I. In a stemplot the number of leaves equals the size of the set of data.
II. Both the dotplot and the stemplot are useful in identifying outliers.
III. Histograms do not retain the identity of individual scores; however, dotplots, stemplots, and
boxplots all do.

(A) I and II
(B) I and III
(C) II and III
(D) I, II, and III
(E) None of the above gives the complete set of true responses

2 Visual Representation of Data

A stemplot for the 1988 per capita personal income (in hundreds of dollars) for the 50 states is
Which of the following best describes these data?

(A) Skewed distribution, mean greater than median


(B) Skewed distribution, median greater than mean
(C) Symmetric distribution, mean greater than median
(D) Symmetric distribution, median greater than mean
(E) Symmetric distribution with outliers on high end
3 Linear Model. Bivariant Data

Following are advertising expenditures and total sales for six detergent products:

Predict the total sales if $5000 is spent on advertising and interpret the slope of the regression
line. What if $100,000 is spent on advertising?

4 Linear Model, Bivariant Data

The number of youngsters playing Little League baseball in Ithaca, New York, during the
years 1995–2003 is fitted with a least squares regression line. The graph of the residuals and
some computer output for their regression are as follows:

a. Does it appear that a line is an appropriate model for the data? Explain.
b. What is the equation of the regression line (in context)?
c. Interpret the slope of the regression line in the context of the problem.
d. Interpret the y-intercept of the regression line in the context of the problem
e. What is the predicted number of players in 1997?
f. What was the actual number of players in 1997?
g. What years, if any, did the number of players decrease from the previous year?
Explain.

5 Linear Model, Bivariant Data

Miles per gallon versus speed for a new model automobile is fitted with a least squares
regression line. The graph of the residuals and some computer output for the regression are
as follows:

a. Interpret the slope of the regression line in context.


b. What is the mean predicted MPG at a speed of 30 mph?
c. What was the actual MPG at a speed of 30 mph?
d. Is a line the most appropriate model? Explain.
e. What does “S = 7.252” refer to?
6 Linear Model

A random sample of 30 U.S. farm regions surveyed during the summer of 2003 produced the
following statistics:

Based on this study, what is the mean predicted corn yield for a region where the average
growing season temperature is 76.5°F?

7 Experiments

A study is to be designed to examine the life expectancies of tall people versus those of short
people. Which is more appropriate, an observational study or an experiment?

8 Experiments

There is a rising trend for star college athletes to turn professional without finishing their
degrees. A study is performed to assess whether reading an article about professional
salaries has an impact on such decisions. Randomization can be used to split the subjects
into two groups, and those in one group given the article before answering questions. How
can a block design be incorporated into the design of this experiment?

9 Probability

The table below gives the results of a survey of the drinking and smoking habits of 1200
college students. Rows and columns have also been summed.
What is the probability that someone in this group smokes?

What is the probability a student smokes given that she is a beer drinker?

What is the probability that someone drinks?

What is the probability that students drink given that they smoke?

10 Probability

If 90% of the households in a certain region have answering machines and 50% have both
answering machines and call waiting, what is the probability that a household chosen at
random and found to have an answering machine also has call waiting?

11 Probability

A grocery store manager notes that 35% of customers who buy a particular product make use
of a store coupon to receive a discount. If seven people purchase the product, what is the
probability that fewer than four will use a coupon?

12 Probability

Suppose that 60% of students who take the AP Statistics exam score 4 or 5, 25% score 3,
and the rest score 1 or 2.
Suppose further that 95% of those scoring 4 or 5 receive college credit, 50% of those scoring
3 receive such credit, and 4% of those scoring 1 or 2 receive credit.

If a student who is chosen at random from among those taking the exam receives college
credit, what is the probability that she received a 3 on the exam?
13 Probability

If a two-person committee is chosen at random from a group consisting of three supervisors


and six employees, what is the probability that the committee will have exactly one supervisor
and one employee? Use simulation to estimate the answer.

This random number table may be used.

14 Mean and Standard Deviation, Random Variable

A highway engineer knows that his crew can lay 5 miles of highway on a clear day, 2 miles on
a rainy day, and only 1 mile on a snowy day. Suppose the probabilities are as follows:

15 Mean and Standard Deviation, Random Variable

Concessionaires know that attendance at a football stadium will be 60,000 on a clear day,
45,000 if there is light snow, and 15,000 if there is heavy snow.

Furthermore, the probability of clear skies, light snow, or heavy snow on any particular day is
as followed:
(Here we have a random variable X that takes the values 60,000, 45,000, and 15,000.)

What average attendance should be expected for the season?

16 Independence

An automobile salesperson sells three models of vans with selling prices of $20,000, $25,000,
and $30,000, respectively. For each sale, the salesperson receives a bonus of either $500 or
$750. The probabilities of the various outcomes are given by the following table:

If X is the “amount of sale” random variable, what is the probability distribution of X?

17 Mean and SD for Sums and Differences of Independent Random Variables

Suppose that two random variables X and Y have the following joint probability distribution
table:

If one computes all the conditional probabilities, it can be shown that P(x|y) = P(x) for all
values of x and y, and thus X and Y are independent. What are the variances of the random
variables X, Y, and X + Y?

18 Mean and SD for Sums and Differences of Independent Random Variables

Suppose that two random variables X and Y have the following joint probability distribution
table:

What are the expected values (means) of the random variables X, Y, and X + Y?

19 Normal Distribution

Given a normal distribution with a standard deviation of 10, what is the mean if 21% of the
values are below 50?

20 Normal Distribution

A packing machine is set to fill a cardboard box with a mean of 16.1 ounces of cereal.
Suppose the amounts per box form a normal distribution with a standard deviation equal to
0.04 ounce.
a. What percentage of the boxes will end up with at least 1 pound of cereal?
b. Ten percent of the boxes will contain less than what number of ounces?
c. Eighty percent of the boxes will contain more than what number of ounces?
d. The middle 90% of the boxes will be between what two weights?

21 Sampling Distribution of Sample Mean, Central Limit Theorem

The strength of paper coming from a manufacturing plant is known to be 25 pounds per
square inch with a standard deviation of 2.3. In a simple random sample of 40 pieces of
paper, what is the probability that the mean strength is between 24.5 and 25.5 pounds per
square inch?

22 Sampling Distribution of Two Proportions, Central Limit Theorem

In urban America 43% of married couples own their own homes while only 19% of single
people own their own homes. In an SRS of 200 married couples and an independent SRS of
180 single people, the probability is .90 that the difference in percentages of married couples
and single people who are homeowners is greater than what percentage?

23 Sampling Distribution of Two Proportions, Central Limit Theorem

The strength of paper coming from a manufacturing plant is known to be 25 pounds per
square inch with a standard deviation of 2.3. In a simple random sample of 40 pieces of
paper, what is the probability that the mean strength is between 24.5 and 25.5 pounds per
square inch?

24 Sampling Distribution of Proportions, Central Limit Theorem

Suppose that the average outstanding credit card balance for young couples is $650 with a
standard deviation of $420. In an SRS of 100 couples, what is the probability that the mean
outstanding credit card balance exceeds $700?

25 Sampling Distribution of Sample Means, Central Limit Theorem


When fertilizer A is used, the vegetable yield is 4.5 tons per acre with a standard deviation of
0.7 tons, while the yield when fertilizer B is used is 4.3 tons per acre with a standard deviation
of 0.4 tons. In 45 sample plots using fertilizer A and 50 plots using fertilizer B, what is the
probability that the difference in average yields will be negative, that is, that the average yield
for the plots using fertilizer A will be less than the average yield for the plots using fertilizer B?

26 Normal Distribution

The mean income per household in a certain state is $9500 with a standard deviation of
$1750. The middle 95% of incomes are between what two values?

(A) $5422 and $13,578


(B) $6070 and $12,930
(C) $6621 and $12,379
(D) $7260 and $11,740
(E) $8049 and $10,951

27 Normal Distribution

Consider the following two normal curves:

Which has the larger mean and which has the larger standard deviation?

(A) Larger mean, a; larger standard deviation, a


(B) Larger mean, a; larger standard deviation, b
(C) Larger mean, b; larger standard deviation, a
(D) Larger mean, b; larger standard deviation, b
(E) Larger mean, b; same standard deviation

28 Normal Distribution
One company produces movie trailers with mean 150 seconds and standard deviation 40
seconds, while a second company produces trailers with mean 120 seconds and standard
deviation 30 seconds. What is the probability that two randomly selected trailers, one
produced by each company, will combine to less than three minutes?

29 Sampling Distribution

A jewelry company uses a machine to apply a coating of gold on a certain style of necklace.
The amount of gold applied to a necklace is approximately normally distributed. When the
machine is working properly, the amount of gold applied to a necklace has a mean of 300
milligrams (mg) and standard deviation of 5 mg.

(a) A necklace is randomly selected from the necklaces produced by the machine. Assuming
that the machine is working properly, calculate the probability that the amount of gold applied
to the necklace is between 296mg and 304mg.

The jewelry company wants to make sure the machine is working properly. Each day, Cleo, a
statistician at the jewelry company, will take a random sample of the necklaces produced that
day. Each selected necklace will be melted down and the amount of the gold applied to that
necklace will be determined. Because a necklace must be destroyed to determine the amount
of gold that was applied, Cleo will use random samples of size n = 2 necklaces. Cleo starts by
considering the mean amount of gold being applied to the necklaces. After Cleo takes a
random sample of n = 2 necklaces, she computes the sample mean amount of gold applied to
the two necklaces.

(b) Suppose the machine is working properly with a population mean amount of gold being
applied of 300mg and a population standard deviation of 5mg.

(i) Calculate the probability that the sample mean amount of gold applied to a random sample
of n = 2 necklaces will be greater than 303mg.

(ii) Suppose Cleo took a random sample of n = 2 necklaces that resulted in a sample mean
amount of gold applied of 303mg. Would that result indicate that the population mean amount
of gold being applied by the machine is different from 300mg? Justify your answer without
performing an inference procedure.

Now, Cleo will consider the variation in the amount of gold the machine applies to the
necklaces. Because of the small sample size, n = 2, Cleo will use the sample range of the
data for the two randomly selected necklaces, rather than the sample standard deviation.
Cleo will investigate the behavior of the range for samples of size n = 2.

She will simulate the sampling distribution of the range of the amount of gold applied to two
randomly sampled necklaces.

Cleo generates 100,000 random samples of size n = 2 independent values from a normal
distribution with mean m = 300 and standard deviation s = 5. The range is calculated for the
two observations in each sample.

The simulated sampling distribution of the range is shown in Graph I. This process is repeated
using s = 8, as shown in Graph II, and again using s = 12, as shown in Graph III.

(c) Use the information in the graphs to complete the following.

(i) Describe the sampling distribution of the sample range for random samples of size n = 2
from a normal distribution with standard deviation s = 5, as shown in Graph I.

(ii) Describe how the sampling distribution of the sample range for samples of size n = 2
changes as the value of the population standard deviation s increases
Recall that Cleo needs to consider both the mean and standard deviation of the amount of
gold applied to necklaces to determine whether the machine is working properly. Suppose that
one month later, Cleo is again checking the machine to make sure it is working properly. Cleo
takes a random sample of 2 necklaces and calculates the sample mean amount of gold
applied as 303mg and the sample range as 10mg.

(d) Recall that the machine is working properly if the amount of gold applied to the necklaces
has a mean of 300mg and standard deviation of 5mg.

(i) Consider Cleo’s range of 10mg from the sample of size n = 2. If the machine is working
properly with a standard deviation of 5mg, is a sample range of 10mg unusual? Justify your
answer.

(ii) Do Cleo’s sample mean of 303mg and range of 10mg indicate that the machine is not
working properly? Explain your answer.

30 Sampling Distribution

A promoter knows that 23% of males enjoy watching boxing matches; however, only 12% of
females enjoy watching this sport. In an SRS of 100 men and an independent SRS of 125
women, what is the probability that the difference in the percentages of men and women who
enjoy watching boxing is more than 10%?

You might also like