Statistics - Week VI
Statistics - Week VI
Examples
The mean speed of automobiles passing milepost 150 on the West Virginia Turnpike is
68 mph
The mean cost to remodel a kitchen is $20,000
Hypothesis Testing (2 of 3)
State the null hypothesis (H0) and the alternate hypothesis (H1)
Since the researcher cannot study every item or individual in the population, error is
possible
TYPE I ERROR Rejecting the null hypothesis, H0, when it is true.
In hypothesis testing for the mean, , when is known, the test statistic z is
computed with the following formula
CRITICAL VALUE The dividing point between the region where the null
hypothesis is rejected and the region where it is not rejected.
Critical Value
0: ≥ 60,000 miles
H 0: = $65,000 per year
H
H1: < 60,000 miles H1: ≠ $65,000 per year
with an with an
Left-tailed test Two-tailed test
Jamestown Steel Company manufactures and assembles desks and other office
equipment at several plants in New York State. At the Fredonia plant, the weekly
production of the Model A325 desk follows a normal distribution with a mean of
200 and a standard deviation of 16. New production methods have been
introduced and the vice president of manufacturing would like to investigate
whether there has been a change in weekly production of the Model A325. Is the
mean number of desks produced different from 200 at the .01 significance level?
Step 4: Formulate the decision rule by first determining the critical values of z.
Decision Rule: If the computed value of z is not between −2.576 and 2.576, reject
the null hypothesis. If z falls between −2.576 and 2.576, do not reject the null
hypothesis.
Two-Tailed Test Example, Known (3 of 3)
Decision: Because 1.547 does not fall in the rejection region, we decide not to
reject H0.
We did not reject the null hypothesis, so we have failed to show that the
population mean has changed from 200 per week.
One-Tailed Test
Suppose instead of wanting to know if there had been a change in the mean
number of desks assembled, the vice president wanted to know if there had
been an increase in the number of units assembled. Can we conclude, because
of the improved production methods, that the mean number of desks
assembled in the last 50 weeks was more than 200? Use .
Before: Now:
A two-tailed test A one-tailed test
H0: = 200 desks H0: ≤ 200 desks
H1: ≠ 200 desks H1: > 200 desks
The p-Value in Hypothesis Testing
• In the previous example about desk production, the computed z was 1.547 and H0
was not rejected
• Round the computed z-value to two decimal places, 1.55
• Using the z-table, find the probability of finding a z-value of 1.55 or more by .5000
− .4394 = .0606
• Since this is a two-tailed test 2(.0606) = .1212
• In this chart, we can
easily compare the
p-value with the
level of significance
Hypothesis Testing, "σ" Unknown
The Myrtle Beach International Airport provides a cell phone parking lot where people
can wait for a message to pick up arriving passengers. To decide if the cell phone lot
has enough parking places, the manager of airport parking needs to know if the mean
time in the lot is more than 15 minutes. A sample of 12 recent customers showed they
were in the lot the following lengths of time, in minutes (see below).
At the .05 significance level, is it reasonable to conclude that the mean time in the lot
is more than 15 minutes?
t = = = 2.818
The test statistic of 8.818 is greater than our critical value of 1.796.
Therefore, our decision is: Do not reject H0
Step 6: Interpret the result; The test results do not allow the claims manager to
conclude the cost-cutting measures have been
effective.
Type I and Type II Errors
Western Wire Products purchases steel bars to make cotter pins. Past
experience indicates that the mean tensile strength of all incoming shipments
is 10,000 psi and that the standard deviation is 400 psi. To monitor the quality
of the cotter pins, samples of 100 pins are randomly selected and tested for
their strength. In our hypothesis testing procedure the hypotheses are:
H0: μ = 10,000
H1: μ ≠ 10,000
Using a 0.05 significance level, accept the shipment if the sample mean
strength falls between the critical values 9.922 psi and 10.078 psi. If the sample
mean does not fall between the critical values, we conclude the shipment does
not meet the quality standard.
Type II Error Example (2 of 2)
The sample mean, 9.900 psi, is not within the specified range. To calculate the
probability of a Type II error, assume the sample mean is the true mean (see graph
B). Determine the probability of the sample mean falling between 9.900 and 9.922.
Then subtract this probability from .5000 to arrive at the probability of making a
Type II error, .2912
Two-Sample Tests of
Hypothesis
• Examples
• Is there a difference in the mean value of
residential real estate sold by male agents and
female agents in south Florida?
• Is there an increase in the production rate after
music is piped into the production area?
Comparing Two Population Means (2 of 2)
• We can use the following formula to compute z if the following conditions are met
• The two populations follow normal distributions
• The samples are from independent (unrelated) populations
• The population standard deviations are known
• In the z formula, 1- 2, is the difference in the sample means and the square
root of the variance found with formula 11-1 is the standard deviation
Comparing Two Population Means Example
Customers at the FoodTown Supermarket have a choice when paying for their
groceries. They may check out and pay using the standard cashier-assisted checkout
or they may use the new Fast Lane procedure (self-checkout). The store manager
would like to know if the mean checkout time using the standard checkout method is
longer than using the Fast Lane. The time was measured from when the customer
enters the line until all his or her bags are in the cart.
Step 6: Interpret the result. The difference of .20 minute is too large to have
occurred by chance. We conclude the Fast Lane method is faster.
Compare Two Means Using t
• There are two major differences in this test and the test just described in this chapter
• We assume the sampled populations have equal but unknown standard
deviations
• We use the t distribution
• The three requirements for the test:
• The sampled populations are approximately normally distributed
• The sampled populations are independent
• The standard deviations of the two populations are equal
Compare Two Means Using t (2 of 2)
Owens Lawn Care Inc. manufactures and assembles lawnmowers that are shipped to
dealers throughout the United States and Canada. Two different procedures have
been proposed for mounting the engine on the frame of the lawnmower, the Welles
method and the Atkins method. The question is, is there a difference in the methods’
mean time to mount the engines on the frames of the lawnmowers?
A time and motion study is conducted to evaluate.
Step 4: Formulate the decision rule, do not reject H0 if t falls between −1.833 and 1.833
First:
Calculate the sample standard deviations
Two-Sample Pooled Test Example Concluded
The decision is not to reject the null hypothesis because –0.662 falls in the
region between −1.833 and 1.833.
Step 6: Interpret the result; we conclude the sample data failed to show a
difference between the mean assembly times of the two methods.
Unequal Population Standard Deviations
• If we cannot assume the population standard deviations are equal, we adjust the
degrees of freedom and the formula for finding t
• We determine the degrees of freedom based on the following formula
• The value of the test statistic is computed from the following formula
Unequal Population Standard Deviations Example
We must adjust the degrees of freedom with formula 11-6 before finding
the critical values and round the result down to an integer; in this case, 10
Unequal Population Standard Deviations Example (3 of 3)
Step 4: State the decision rule, do not reject H0 if t falls between −1.812 and 1.812.
Step 5: Make decision
Step 6: Interpret; the mean absorption rate of the two types of towels is not the same.
Dependent Samples
•We first compute the mean and the standard deviation of the sample differences
• The value of the test statistic is computed with the following formula
• Note: the standard deviation of the differences will be computed with the formula
3-11, except d is substituted for x
• Example
• Nickel Savings and Loan employs two firms, Schadek Appraisals and Bowyer Real
Estate, to appraise the value of the real estate on which it makes loans. To review
the consistency of the two appraisal firms, Nickel randomly selects 10 homes and
has both of the firms appraise the values of the selected homes. Thus, there will be
a pair of values for each home; these appraised values are related to the home
selected. This is called a paired sample.
Dependent Samples Example