Chap8 STAT 2 Merged
Chap8 STAT 2 Merged
Biased Estimator
Estimate
] ✓ˆbiased ]
E[✓ˆunbiasedE[
Example 8.4
The sample mean is an unbiased estimator of the population mean (E(X) = µ).
Sample variance is an unbiased estimator for population variance (E(S ) = ).
The sample proportion is an unbiased estimator for the population proportion
(E(P̂ ) = p).
The second rule for choosing a statistic is that, of all unbiased statistics, the best statistic
to use is the one with the smallest variance.
Probability Density
True value ✓
Low variance
High variance
Estimate
E[✓ˆ ] = E[✓ˆ ] = ✓
↵/2 ↵/2
confidence level
1 ↵
✓ z ✓ ✓+z
The sample mean X is a point estimator of the population mean µ. According to the
Central Limit Theorem, the sampling distribution of the sample mean is approximately
normal for large samples. For a large sample of size n 30.
1. Suppose either (a) population is normal, or (b) the sample size n is large, or
both, and the population standard deviation is known.
2. The sample mean X is (approximately) normal: X N (µ; /n).
3. A 100(1 ↵)% confidence interval for µ :
x z
n
where z can be found in the normal table.
Definition 8.6 The value z is called critical value. It is a value on the measurement
axis in a standard normal distribution such that P (Z z ) = 1 ↵/2.
Example 8.7 ( is known) Fifteen vehicles were observed at random for their speeds (in
mph) on a highway with speed limit posted as 70 mph, and it was found that their average
speed was 73.3 mph. Suppose that from past experience we can assume that vehicle speeds
are normally distributed with = 3.2.
Construct a 90% confidence interval for the true mean speed µ, of the vehicles on this
highway.
Solution.
We have . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Confidence level 1 ↵ = .............. Hence ↵ = ............. and z = ..................
A ................. confidence interval for the true mean speed µ is
x z <µ<x z
n n
or
71.681 < µ < 74.919.
We are 90% confident that the true mean speed µ of the vehicles on this highway is
between 71.681 and 74.919 mph.
Example 8.8 Suppose the weight of a 185/60/14 tire filled with air is normally distributed
with standard deviation 1.25 pounds. In a random sample of 15 filled tires, the sample
mean weight was x = 18.75 pounds. Find a 95% confidence interval for the true mean
weight of 185/60/14 tires.
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
Definition 8.9 The margin of error for the estimate of µ is z
n
Example 8.10 A manufacturing firm is interested in estimating the average distance trav-
eled to work by its employees. Past studies of this type indicate that the standard deviation
of these distances should be in the neighborhood of 2 miles. How many employees should
be sampled if the estimate is to be within 0.1 mile of the true average, with 95% confi-
dence?
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
Case 1: Assume that the population standard deviation is unknown and the sample size
n is large (n 30). The confidence interval is
s
x z
n
where s is the sample standard deviation.
Example 8.11 ( is unknown, large sample) Scholastic Aptitude Test (SAT) mathematics
scores of a random sample of 500 high school seniors in a city are collected, and the
sample mean and standard deviation are found to be 501 and 112, respectively. Find a
99% confidence interval on the mean SAT mathematics score for seniors in that city.
Solution.
Sample mean x = ..........., sample standard deviation ............, sample size n = .....
Confidence level: ........................... Hence ↵ = .............. and z = ....................
A 99% confidence interval of µ is
s s
x z <µ<x+z
n n
We are 99% confident that the true mean SAT mathematics score of seniors is
between 488.1 and 513.9.
Example 8.12 Suppose a PC manufacturer wants to evaluate the performance of its hard
disk memory system. One measure of performance is the average time between failures of
the disk drive. To estimate this value, a quality control engineer recorded the time between
failures for a random sample of 45 disk-drive failures. The following sample statistics were
computed: x = 1762 hours and s = 215 hours. Estimate the true mean time between
failures with a 90% confidence interval.
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
Case 2: Assume that the population from which the sample is selected has an approximate
normal distribution. The population standard deviation is unknown and the sample size
n is small (n < 30). Then the random variable
X µ
T =
S/ n
has a Student t-distribution with n 1 degrees of freedom. Here S is the sample standard
deviation.
If x and s are the sample mean and the sample standard deviation of a random
sample of size n from a normal population, then
s s
x t <µ<x+t
n n
is a (1 ↵)100% CI for the population mean µ.
Definition 8.13 t is a critical value related to a t-distribution with m degrees of freedom.
If T has a t-distribution with m degrees of freedom, then P (T t ) = ↵.
Example 8.14 The following is a random data set from a normal population:
7.2 5.7 4.9 6.2 8.5 2.8
Construct a 95% CI for the population mean µ.
Solution.
Mean and standard deviation of the sample: x = 5.883 and s = 1.959
Population follows .................. distribution, is ................., sample size n = ........,
we we use t-distributrion with ....... degrees of freedom.
Confidence level 95% means that ↵ = 1 0.95 = 0.05, we have from the t-table
that t = 2.571.
Hence, a 95% CI for µ is:
s s
x t ,x + t
n n
or
(3.827; 7.939)
We are 95% confident that the true mean µ will be between 3.827 and 7.939.
Example 8.15 A random sample of size 26 is drawn from a population having a normal
distribution. The sample mean and the sample standard deviation from the data are given,
respectively, as x = 2.22 and s = 1.67. Construct a 98% CI for the population mean µ
and interpret.
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
Example 8.16 The following data, expressed as an air pollution index, give the air quality
of a city for 10 randomly selected days:
57.3 ; 58.1 ; 58.7 ; 66.7 ; 58.6 ; 61.9 ; 59.0 ; 64.4 ; 62.6 ; 64.9
Assuming that the data may be looked upon as a random sample from a normal population,
construct a 95% CI for the actual average air pollution index for this city and interpret.
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
Example 8.19 A survey conducted of 1404 respondents found that 323 students paid for
their education by student loans. Find the 90% confidence interval of the true proportion
of students who paid for their education by student loans.
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
8.5 Determining sample size
We show in this section that the appropriate sample size for making an inference
about a population mean or proportion depends on the desired reliability.
The di↵erence between x and µ is the error of estimation resulting from the
sampling process. Let E = x µ = the error of estimation. Note that
x µ E
z= =
/ n / n
A formula is used to determine sample size
z
n=
E
The population variance is known or can be determined from past studies.
If population variance is unknown, then we can estimate
1
= range
4
↵/2 ↵/2
1 ↵
µ z µ µ+z
range
Example 8.20 The estimation of a new operating system’s mean response time to an
editing command should have an error bound of 5 milliseconds with 95% confidence. Ex-
perience with other operating systems suggests that = 25 is a reasonable approximation
to the population standard deviation. What sample size n should be used?
Solution.
For confidence level 1 ↵ = 0.95, we have z = 1.96.
Error of estimation: E = 5
Standard deviation: = 25
The sample size
z (1.96).(25)
n= = 96.04
E 5
We round up n = 97.
Example 8.21 The manufacturer of official NFL footballs uses a machine to inflate its
new balls to a pressure of 13.5 pounds. When the machine is properly calibrated, the
mean inflation pressure is 13.5 pounds, but uncontrollable factors cause the pressures of
individual footballs to vary randomly from about 13.3 to 13.7 pounds. For quality control
purposes, the manufacturer wishes to estimate the mean inflation pressure to within .025
pound of its true value with a 99% confidence interval. What sample size should be used?
Solution.
For confidence level 1 ↵ = 0.99, we have z = 2.575.
Error of estimation: E = 0.25
1
The range of observations 13.7 13.3 = 0.4, hence we can estimate = 0.4 = 0.1.
4
The sample size
z (2.575).(0.1)
n= = 106.09
E 0.25
We round up n = 107.
Example 8.22 A large manufacturing firm is interested in estimating the average distance
traveled to work by its employees. Past studies of this type indicate that the standard
deviation of these distances should be in the neighborhood of 2 km. How many employees
should be sampled if the estimate is to be within 0.1 km of the true average, with 95%
confidence?
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
p̂(1 p̂)
E=z
n
The formula for determining sample size
z p̂(1 p̂)
n=
E
Example 8.23 Suppose that a local TV station in a city wants to conduct a survey to
estimate support for the president’s policies on the economy within 3% error with 95%
confidence. Suppose they have an initial estimate that 70% of the people in the city support
the economic policies of the president. How many people should the station survey?
Solution.
For confidence level 1 ↵ = 0.95, we have z = 1.96.
Error of estimation: E 0.03
Sample proportion p̂ = 0.7
The sample size
z p̂(1 p̂) (1.96) .(0.7)(0.3)
n = 896.37.
E 0.05
Thus, the TV station must survey at least 897 people.
Example 8.24 A researcher wishes to estimate, with 95% confidence, the proportion of
people who did not have a tablet. A previous study shows that 40% of those interviewed did
not have a tablet. The researcher wishes to be accurate within 2% of the true proportion.
Find the minimum sample size necessary.
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
Example 8.25 Suppose you want to estimate the average age of all Boeing 737-300 air-
planes now in active domestic U.S. service. You want to be 95% confident, and you want
your estimate to be within one year of the actual figure. The 737-300 was first placed in
service about 24 years ago, but you believe that no active 737-300s in the U.S. domestic
fleet are more than 20 years old. How large of a sample should you take? (97)
Solution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................
....................................................................................