Chapter 07
Chapter 07
1
Chapter 7 Overview
Introduction
7-1 Confidence Intervals for the Mean When
σ Is Known and Sample Size
7-2 Confidence Intervals for the Mean When
σ Is Unknown
7-3 Confidence Intervals and Sample
Size for Proportions
7-4 Confidence Intervals and Sample Size for
Variances and Standard Deviations
2
Chapter 7 Objectives
1. Find the confidence interval for the mean when
σ is known.
2. Determine the minimum sample size for finding
a confidence interval for the mean.
3. Find the confidence interval for the mean when
σ is unknown.
4. Find the confidence interval for a proportion.
3
Chapter 7 Objectives
5. Determine the minimum sample size for
finding a confidence interval for a proportion.
6. Find a confidence interval for a variance and a
standard deviation.
4
7.1 Confidence Intervals for the
Mean When σ Is Known and
Sample Size
A point estimate is a specific
numerical value estimate of a
parameter.
The best point estimate of the
population mean µ is the
sample mean X .
5
Three Properties of a Good
Estimator
1. The estimator should be an
unbiased estimator. That is,
the expected value or the mean
of the estimates obtained from
samples of a given size is
equal to the parameter being
estimated.
6
Three Properties of a Good
Estimator
2. The estimator should be
consistent. For a consistent
estimator, as sample size
increases, the value of the
estimator approaches the value
of the parameter estimated.
7
Three Properties of a Good
Estimator
3. The estimator should be a
relatively efficient estimator;
that is, of all the statistics that
can be used to estimate a
parameter, the relatively
efficient estimator has the
smallest variance.
8
Confidence Intervals for the Mean
When σ Is Known and Sample Size
An interval estimate of a
parameter is an interval or a range
of values used to estimate the
parameter.
This estimate may or may not
contain the value of the parameter
being estimated.
9
Confidence Level of an Interval
Estimate
The confidence level of an interval
estimate of a parameter is the probability
that the interval estimate will contain the
parameter, assuming that a large
number of samples are selected and that
the estimation process on the same
parameter is repeated.
10
Confidence Interval
11
Formula for the Confidence Interval
of the Mean for a Specific α
σ σ
X − zα / 2 < µ < X + zα /2
n n
13
Maximum Error of the Estimate
The maximum error of the estimate is the
maximum likely difference between the point
estimate of a parameter and the actual value
of the parameter.
σ
E = zα / 2
n
14
Example 7-1: Days to Sell an Aveo
A researcher wishes to estimate the number of days it
takes an automobile dealer to sell a Chevrolet Aveo. A
sample of 50 cars had a mean time on the dealer’s lot of
54 days. Assume the population standard deviation to be
6.0 days. Find the best point estimate of the population
mean and the 95% confidence interval of the population
mean.
zα 2 = 2.33
18
Example 1
A sample of the reading scores of 35 fifth-graders has a
mean of 82. The standard deviation of the population is 15.
a. Find the 95% confidence
interval of the mean
reading scores of all fifth-
graders.
b. Find the 99% confidence
interval of the mean
reading scores of all fifth-
graders.
c. Which interval is larger?
Explain why.
19
n = 35 X = 82 s = 15
a. Find the 95% confidence interval of
mean reading scores of all fifth-graders.
s s
X – zα 2 n < µ < X +zα 2 n
15 15
82 – (1.96) < µ < 82 + (1.96)
35 35
77 < µ < 87
20
n = 35 X = 82 s = 15
b. Find the 99% confidence interval of the
mean reading scores of all fifth-graders.
s s
X – zα 2 n < µ < X +zα 2 n
15 15
82 – (2.58) < m < 82 + (2.58)
35 35
21
c. Which interval is larger? Explain why.
22
Example 7-3: Credit Union Assets
The following data represent a sample of the assets (in
millions of dollars) of 30 credit unions in southwestern
Pennsylvania. Find the 90% confidence interval of the
mean.
12.23 16.56 4.39
2.89 1.24 2.17
13.19 9.16 1.42
73.25 1.91 14.64
11.59 6.69 1.06
8.74 3.17 18.13
7.92 4.78 16.85
40.22 2.42 21.58
5.01 1.47 12.24
2.27 12.77 2.76
23
Example 7-3: Credit Union Assets
Step 1: Find the mean and standard deviation. Using
technology, we find X = 11.091 and s = 14.405.
.
.
.
24
Example 7-3: Credit Union Assets
Step 4: Substitute in the formula.
σ σ
X − zα 2 < µ < X + zα 2
n n
14.405 14.405
11.091 − 1.65 < µ < 11.091 + 1.65
30 30
11.091 − 4.339 < µ < 11.091 + 4.339
6.752 < µ < 15.430
One can be 90% confident that the population mean of the
assets of all credit unions is between $6.752 million and
$15.430 million, based on a sample of 30 credit unions.
25
Formula for Minimum Sample Size
Needed for an Interval Estimate of the
Population Mean
2
zα 2 ⋅ σ
n=
E
26
Example 7-4: Depth of a River
A scientist wishes to estimate the average depth of a river.
He wants to be 99% confident that the estimate is
accurate within 2 feet. From a previous study, the
standard deviation of the depths measured was 4.38 feet.
99% → z= 2.58, E= 2, σ= 4.38
2
z
α2 ⋅ σ 2.58 ⋅ 4.38
2
n= = = 31.92 = 32
E 2
Therefore, to be 99% confident that the estimate is within
2 feet of the true mean depth, the scientist needs at least
a sample of 32 measurements.
27
Example 3
s = 2.5
confidence level = 95%
maximum error = 1 day
28
s = 2.5
confidence level = 95%
maximum error = 1 day
2
zα 2 σ (1.96)(2.5) 2 2
n = = = (4.9)
E 1
n = 24.01
or
n = 25 workers
29
7.2 Confidence Intervals for the
Mean When σ Is Unknown
The value of σ, when it is not known, must be estimated
by using s, the standard deviation of the sample.
30
Characteristics of the t Distribution
The t distribution is similar to the standard
normal distribution in these ways:
1. It is bell-shaped.
2. It is symmetric about the mean.
3. The mean, median, and mode are equal to
0 and are located at the center of the
distribution.
4. The curve never touches the x axis.
31
Characteristics of the t Distribution
The t distribution differs from the standard
normal distribution in the following ways:
1. The variance is greater than 1.
2. The t distribution is actually a family of
curves based on the concept of degrees of
freedom, which is related to sample size.
3. As the sample size increases, the t
distribution approaches the standard
normal distribution.
32
Degrees of Freedom
The symbol d.f. will be used for degrees of
freedom.
The degrees of freedom for a confidence
interval for the mean are found by subtracting
1 from the sample size. That is, d.f. = n - 1.
Note: For some statistical tests used later in
this book, the degrees of freedom are not
equal to n - 1.
33
Formula for a Specific Confidence
Interval for the Mean When σ Is
Unknown and n < 30
s s
X − tα 2 < µ < X + tα 2
n n
34
Example 7-5: Using Table F
Find the tα/2 value for a 95% confidence interval when the
sample size is 22.
35
Example 7-6: Sleeping Time
Ten randomly selected people were asked how long they
slept at night. The mean time was 7.1 hours, and the
standard deviation was 0.78 hour. Find the 95%
confidence interval of the mean time. Assume the variable
is normally distributed.
Since σ is unknown and s must replace it, the t distribution
must be used for the confidence interval. Hence, with 9
degrees of freedom, tα/2 = 2.262.
s s
X − tα 2 < µ < X + tα 2
n n
0.78 0.78
7.1 − 2.262 < µ < 7.1 + 2.262
10 10
36
Example 7-6: Sleeping Time
0.78 0.78
7.1 − 2.262 < µ < 7.1 + 2.262
10 10
7.1 − 0.56 < µ < 7.1 + 0.56
37
Example 7-7: Home Fires by Candles
The data represent a sample of the number of home fires
started by candles for the past several years. Find the
99% confidence interval for the mean number of home
fires started by candles each year.
5460 5900 6090 6310 7160 8440 9930
38
Example 7-7: Home Fires by Candles
Step 3: Substitute in the formula.
s s
X − tα 2 < µ < X + tα 2
n n
1610.3 1610.3
7041.4 − 3.707 < µ < 7041.4 − 3.707
7 7
7041.4 − 2256.2 < µ < 7041.4 + 2256.2
4785.2 < µ < 9297.6
One can be 99% confident that the population mean
number of home fires started by candles each year is
between 4785.2 and 9297.6, based on a sample of home
fires occurring over a period of 7 years.
39
7.3 Confidence Intervals and
Sample Size for Proportions
p = population proportion
p̂ (read p “hat”) = sample proportion
For a sample proportion,
X n− X
pˆ = and qˆ = or qˆ = 1 − pˆ
n n
where X = number of sample units that possess the
characteristics of interest and n = sample size.
40
Example 7-8: Air Conditioned
Households
In a recent survey of 150 households, 54 had central air
conditioning. Find p̂ and q̂, where p̂ is the proportion of
households that have central air conditioning.
X 54
pˆ
= = = 0.36 = 36%
n 150
qˆ =1 − pˆ =1 − 0.36 =0.64 =64%
41
Formula for a Specific Confidence
Interval for a Proportion
ˆˆ
pq ˆˆ
pq
pˆ − zα 2 < p < pˆ + zα 2
n n
when np ≥ 5 and nq ≥ 5.
42
Example 7-9: Male Nurses
A sample of 500 nursing applications included 60 from
men. Find the 90% confidence interval of the true
proportion of men who applied to the nursing program.
=p X= = 0.12, qˆ = 0.88
n 60 500
ˆˆ
pq ˆˆ
pq
pˆ − zα 2 < p < pˆ + zα 2
n n
0.12 − 1.65
( 0.12 )( 0.88 )
< p < 0.12 + 1.65
( 0.12 )( 0.88 )
500 500
0.12 − 0.024 < p < 0.12 + 0.024
0.096 < p < 0.144
You can be 90% confident that the percentage of
applicants who are men is between 9.6% and 14.4%.
43
Example 7-10: Religious Books
A survey of 1721 people found that 15.9% of individuals
purchase books at a College bookstore. Find the 95%
confidence interval of the true proportion of people who
purchase their books at a College bookstore.
ˆˆ
pq ˆˆ
pq
pˆ − zα 2 < p < pˆ + zα 2
n n
0.159 − 1.96
( 0.159 )( 0.841) < p < 0.159 + 1.96
( 0.159 )( 0.841)
1721 1721
ˆˆ ˆˆ
(
pˆ – zα 2 ) pq
n (
< p < pˆ + zα 2 ) pq
n
45
ˆˆ ˆˆ
(
pˆ – zα 2 ) pq
n (
< p < pˆ + zα 2 ) pq
n
(0.12)(0.88) (0.12)(0.88)
0.12 – 1.96 < p < 0.12 + 1.96
450 450
46
Formula for Minimum Sample Size
Needed for Interval Estimate of a
Population Proportion
2
zα 2
ˆ ˆ
n = pq
E
47
Example 7-11: Home Computers
A researcher wishes to estimate, with 95% confidence,
the proportion of people who own a home computer. A
previous study shows that 40% of those interviewed had a
computer at home. The researcher wishes to be accurate
within 2% of the true proportion. Find the minimum sample
size necessary.
2 2
zα 2 1.96
ˆ ˆ
n = pq = ( 0.40 )( 0.60 ) 0.02 = 2304.96
E
48
Example 7-12: Car Phone Ownership
The same researcher wishes to estimate the proportion of
executives who own a car phone. She wants to be 90%
confident and be accurate within 5% of the true
proportion. Find the minimum sample size necessary.
49
7-4 Confidence Intervals for
Variances and Standard Deviations
When products that fit together (such as pipes) are
manufactured, it is important to keep the variations of
the diameters of the products as small as possible;
otherwise, they will not fit together properly and will
have to be scrapped.
In the manufacture of medicines, the variance and
standard deviation of the medication in the pills play
an important role in making sure patients receive the
proper dosage.
For these reasons, confidence intervals for variances
and standard deviations are necessary.
50
Chi-Square Distributions
The chi-square distribution must be used to calculate
confidence intervals for variances and standard
deviations.
The chi-square variable is similar to the t variable in
that its distribution is a family of curves based on the
number of degrees of freedom.
χ 2
The symbol for chi-square is (Greek letter chi,
pronounced “ki”).
A chi-square variable cannot be negative, and the
distributions are skewed to the right.
51
Chi-Square Distributions
At about 100 degrees of freedom, the chi-square
distribution becomes somewhat symmetric.
The area under each chi-square distribution is equal to
1.00, or 100%.
52
Formula for the Confidence Interval for
a Variance
( n − 1) s 2 2 ( n
<σ <
− 1) s 2
, d.f. = n − 1
χ 2
right χ 2
left
, d.f. = n − 1
χ 2
right χ 2
left
53
Example 7-13: Using Table G
Find the values for χ right
2
and χ left
2
for a 90% confidence
interval when n = 25.
χ 2
To find right, subtract 1 - 0.90 = 0.10. Divide by 2 to
get 0.05. 2
To find χ left , subtract 1 - 0.05 to get 0.95.
54
Example 7-13: Using Table G
Use the 0.95 and 0.05 columns and the row
corresponding to 24 d.f. in Table G.
The χ right
2
value is 36.415; the χ left
2
value is 13.848.
55
Example 7-14: Nicotine Content
Find the 95% confidence interval for the variance and
standard deviation of the nicotine content of cigarettes
manufactured if a sample of 20 cigarettes has a standard
deviation of 1.6 milligrams.
To find χ
2
right, subtract 1 - 0.95 = 0.05. Divide by 2 to
get 0.025.
χ 2
To find left , subtract 1 - 0.025 to get 0.975.
In Table G, the 0.025 and 0.975 columns with the d.f.
19 row yield values of 32.852 and 8.907, respectively.
56
Example 7-14: Nicotine Content
( n − 1) s 2 < σ 2 < ( n − 1) s 2
χ right
2
χ left
2
<σ <
2
32.852 8.907
1.5 < σ 2 < 5.5
You can be 95% confident that the true variance for the
nicotine content is between 1.5 and 5.5 milligrams.
58
Example 7-15: Cost of Ski Lift Tickets
( n − 1) s 2 < σ 2 < ( n − 1) s 2
χ right
2
χ left
2