Chap08 PPT
Chap08 PPT
Chap08 PPT
Economics
Vũ Võ
[email protected]
8-1
Chapter 8
Sampling Distributions and Estimation
Chapter Contents
8-2
Chapter 8
Sampling Distributions and Estimation
(continued)
Chapter Learning Objectives (LOs)
8-3
Chapter 8
Sampling Distributions and Estimation
(continued, 2)
Chapter Learning Objectives (LOs) continued
8-4
Chapter 8
8.1 Sampling and Estimation
LO8-1: Define sampling error, parameter, and estimator.
8-5
Chapter 8
LO8-1: Define sampling error, parameter, and
estimator (continued).
8-6
Chapter 8
LO8-1: Define sampling error, parameter, and
estimator (continued, 2).
8-7
Chapter 8
LO8-1: Define sampling error, parameter, and
estimator (continued, 3).
8-8
Chapter 8
LO8-1: Define sampling error, parameter, and
estimator (continued, 4).
• The dot plots show that the sample means have much
less variation than the individual sample items.
Note: This chapter describes
the behavior of the sample
mean and other statistical
estimators of population
parameters, and explains
how to
make inferences about a
population.
8-9
Chapter 8
LO8-1: Define sampling error, parameter, and
estimator (continued, 5).
Estimators
• An estimator is a statistic derived from a sample to infer the value
of a population parameter.
• An estimate is the value of the estimator in a particular sample.
• Population parameters are usually represented by
Greek letters and the corresponding statistic
by Roman letters.
8-10
Chapter 8
LO8-1: Define sampling error, parameter, and
estimator (continued, 6).
Examples of Estimators
8-11
Chapter 8
LO8-1: Define sampling error, parameter, and
estimator (continued, 7).
Sampling Error
Properties of Estimators
8-13
Chapter 8
LO8-2: Explain the desirable properties of estimators
(continued).
Note: Also, a desirable property for an estimator is for it
to be unbiased.
8-14
Chapter 8
LO8-2: Explain the desirable properties of estimators
(continued, 2).
Efficiency
• Efficiency refers to the variance of the estimator’s sampling distribution.
• A more efficient estimator has smaller variance.
• Among all unbiased estimators, we prefer the minimum variance
estimator, referred to as MVUE (minimum variance unbiased estimator).
• Figure 8.5 (next slide) shows two unbiased estimators. Both patterns are
centered on the bull’s-eye, but the estimator on the left has less variation.
• A more efficient estimator is closer on average to the true value of the
parameter.
• You cannot assess efficiency from one sample, but it can be studied either
mathematically or by simulation.
• For a normal distribution, ̅ and s2 are minimum variance estimators
of µ and σ2, respectively. Similarly, the sample proportion p is an MVUE of
the population proportion π. That is one reason these statistics are widely
used.
8-15
Chapter 8
LO8-2: Explain the desirable properties of estimators
(continued, 3).
Efficiency (continued)
8-16
Chapter 8
LO8-2: Explain the desirable properties of estimators
(continued, 4).
Consistency
• A consistent estimator converges toward the parameter being
estimated as the sample size increases.
• That is, the sampling distribution collapses on the true parameter, as
illustrated in Figure 8.6.
8-17
Chapter 8
8.2 Central Limit Theorem
LO8-3: State and apply the Central Limit Theorem
for a mean.
• The sampling distribution of an estimator is the probability
distribution of all possible values the statistic may assume when a
random sample of size n is taken.
• An estimator has a probability distribution with a mean and variance.
• Consider the sample mean ̅ used to estimate the population
mean µ.
• Our objective is to use the sampling distribution of ̅ to say
something about the population that we are studying.
• To describe the sampling distribution, we need to know the
mean, variance, and shape of the distribution.
• Recall that the sample mean is an unbiased estimator for µ;
therefore, ̅ = (expected value of the mean).
8-18
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued).
8-19
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 2).
8-20
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 3).
The following are three important facts about the sample mean.
8-21
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 4).
1. If the population is exactly normal, then the sample mean
follows a normal distribution.
8-22
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 5).
2. As the sample size n increases, the distribution of sample means
converges to the population mean µ (i.e., the standard error of the
mean gets smaller).
8-23
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 6).
3. Even if your population is not normal, by the Central Limit
Theorem, if the sample size is large enough, the sample
means will have approximately a normal distribution.
8-24
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 7).
8-25
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 8).
Range of Sample Means
The Central Limit Theorem permits us to define an interval within
which the sample means are expected to fall. As long as the
sample size n is large enough, we can use the normal distribution
regardless of the population shape (or any n if the population is
normal to begin with).
8-26
Chapter 8
LO8-3: State and apply the Central Limit Theorem for a
mean (continued, 9).
90% Interval:
95% Interval:
99% Interval:
8-27
Chapter 8
8.3 Sample Size and Standard Error
LO8-4: Explain how sample size affects the standard error.
8-28
Chapter 8
LO8-4: Explain how sample size affects the standard error
(continued).
Sample Size and Standard Error (continued)
8-29
Chapter 8
LO8-4: Explain how sample size affects the standard error
(continued, 2).
Illustration: All Possible Samples from a Uniform Population
8-30
Chapter 8
LO8-4: Explain how sample size affects the standard error
(continued, 3).
Illustration: All Possible Samples from a Uniform Population
(continued)
8-31
Chapter 8
LO8-4: Explain how sample size affects the standard error
(continued, 4).
Illustration: All Possible Samples from a Uniform Population
(continued)
8-32
Chapter 8
8.4 Confidence Interval for a Mean
(µ) with known σ
LO8-5: Construct a confidence interval for a population
mean using z.
What Is a Confidence Interval?
• A sample mean ̅ calculated from a random sample x1, x2, …, xn is a point
estimate of the unknown population mean µ.
• Now, because samples vary, we need to indicate our uncertainty about the
true value of µ.
• Based on our knowledge of the sampling distribution of , we can create
an interval estimate for µ.
• We construct a confidence interval for the unknown mean µ by adding and
subtracting a margin of error from ̅ , the mean of our random sample.
• The confidence level for this interval is expressed as a percentage such as 90,
95, or 99 percent.
8-33
Chapter 8
LO8-5: Construct a confidence interval for a population
mean using z (continued).
8-34
Chapter 8
LO8-5: Construct a confidence interval for a population
mean using z (continued, 2).
If samples are drawn from a normal
population (or if the sample is large enough
that is approximately normal by the
Central Limit Theorem) and σ is known,
then the margin of error is calculated using
the standard normal distribution. The
value zα/2 is determined by the desired
level of confidence, which we call 1 − α.
Because the sampling distribution is
symmetric, α/2 is the area in each tail of
the normal distribution.
8-35
Chapter 8
LO8-5: Construct a confidence interval for a population
mean using z (continued, 3).
Choosing a Confidence Level
8-36
Chapter 8
LO8-5: Construct a confidence interval for a population
mean using z (continued, 4).
Interpretation
8-37
Chapter 8
LO8-5: Construct a confidence interval for a population
mean using z (continued, 5).
8-38
Chapter 8
8.5 Confidence Interval for a Mean
(µ) with Unknown σ
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean.
Student’s t Distribution
• In situations where the population is normal but its
standard deviation σ is unknown, the Student’s
t distribution should be used instead of the
normal z distribution.
• This is particularly important when the sample size is small.
• When σ is unknown, the formula for a confidence interval
resembles the formula for known σ except
that t replaces z and s replaces σ.
8-39
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued).
8-40
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 2).
8-41
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 3).
8-42
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 4).
Student’s t Distribution
• t distributions are symmetric and shaped like the standard normal
distribution.
• However, they are somewhat less peaked and have thicker tails.
• The t distribution is dependent on the size n of the sample.
8-43
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 5).
Degrees of Freedom
• Degrees of Freedom (d.f.) is a parameter based on the
sample size that is used to determine the value of the t
statistic.
• Degrees of freedom tell how many observations are used to
calculate σ, less the number of intermediate estimates used
in the calculation. The d.f. for the t distribution in this case,
is given by d.f. = n -1.
• As n increases, the t distribution approaches the shape of
the normal distribution.
• For a given confidence level, t is always larger than z, so a
confidence interval based on t is always wider than if z were
used.
8-44
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 6).
Comparison of z and t
8-45
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 7).
Example: GMAT Scores
8-47
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 9).
Example: GMAT Scores (continued)
For a 90% confidence interval, use Appendix D to find t0.05 = 1.729 with d.f. = 19.
8-48
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 10).
Note: One can use Excel, Minitab, etc. to obtain these values as well as to
construct confidence Intervals.
8-49
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 11).
8-50
Chapter 8
LO8-6: Know when and how to use Student’s t instead of z
to estimate a mean (continued, 12).
Using Appendix D
• Beyond d.f. = 50, Appendix D shows d.f. in steps of 5
or 10.
• If the table does not give the exact degrees of
freedom, use the t-value for the next lower degrees of
freedom.
• This is a conservative procedure since it causes the
interval to be slightly wider.
• A conservative statistician may use the t distribution for
confidence intervals when σ is unknown because
using z would underestimate the margin of error.
8-51
Chapter 8
8.6 Confidence Interval for a
Proportion (π)
LO8-7: Construct a confidence interval for a population
proportion.
8-52
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued).
8-53
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued, 2).
8-54
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued, 3).
Applying the CLT (continued)
8-55
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued, 4).
When Is It Safe to Assume Normality of p?
Rule of Thumb
The sample proportion p = x/n may be assumed normal when
the sample has at least 10 “successes” and at least 10 “failures,”
i.e., when x ≥ 10 and n − x ≥ 10.
Table 8.9 8-56
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued, 5).
When Is It Safe to Assume Normality of p? (continued)
Table 8.8 shows the minimum sample size needed to
assume normality for p = x/n.
where
8-58
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued, 7).
Example
A sample of 75 retail in-store purchases showed that 24 were paid in cash.
We will construct a 95 percent confidence interval for the proportion of all
retail in-store purchases that are paid in cash.
8-59
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued, 8).
Example (continued)
We cannot know with certainty whether or not the true proportion lies within
the interval [.214, .426]. Either it does, or it does not. And it is true that
different samples could yield different intervals. But we can say that, on
average, 95 percent of the intervals constructed in this way would contain the
true population proportion π. Therefore, we are 95 percent confident that the
true proportion π is between .214 and .426.
8-60
Chapter 8
LO8-7: Construct a confidence interval for a population
proportion (continued, 9).
Rule of Three
8-61
Chapter 8
8.7 Estimating from Finite
Populations
LO8-8: Know how to modify confidence intervals when the
population is finite.
• In Chapter 2 we discussed infinite and finite populations and the implication of a finite
population when sampling without replacement.
• If the sample size n is less than 5 percent of the population, and we are sampling
without replacement, then we consider the size of the population to be effectively
infinite.
• However, on occasion we will take samples without replacement where n is greater than
5 percent of the population.
• When this happens, our margin of error on the interval estimate is actually less than
when the sample size is “small” relative to the population size.
• As we sample more of the population, we get more precise estimates. We need to
account for the fact that we are sampling a larger percentage of the population.
• The finite population correction factor (FPCF) reduces the margin of error and provides
a more precise interval estimate.
8-62
Chapter 8
LO8-8: Know how to modify confidence intervals when the
population is finite (continued).
where
N = the number of items in the population
n = number of items in the sample
8-63
Chapter 8
LO8-8: Know how to modify confidence intervals when the
population is finite (continued, 2).
The FPCF can be omitted when the population is infinite (e.g., when we are
sampling from an ongoing production process) or effectively infinite (when
the population at least 20 times as large as the sample). When n/N < .05,
the FPCF is almost equal to 1 and will have a negligible effect on the
confidence interval.
Confidence Intervals for Finite Population
estimating π
8-64
Chapter 8
8.8 Sample Size Determination for
a Mean
LO8-9: Calculate sample size to estimate a mean.
Thus, the formula for the sample size can be written as:
How to Estimate σ?
• Method 1: Take a Preliminary Sample
Take a small preliminary sample and use the sample s in place of
σ in the sample size formula.
• Method 2: Assume Uniform Population
Estimate rough upper and lower limits a and b and set
σ = [(b-a)/12]½.
• Method 3: Assume Normal Population
Estimate rough upper and lower limits a and b and set σ = (b-a)/6.
This assumes normality with most of the data with μ ± 3σ so the
range is 6σ.
• Method 4: Poisson Arrivals
In the special case when λ is a Poisson arrival rate, then = l.
8-66
Chapter 8
8.9 Sample Size Determination for
a Proportion
LO8-10: Calculate sample size to estimate a
proportion.
8-67
Chapter 8
LO8-10: Calculate sample size to estimate a
proportion (continued).
Solving gives
8-68
Chapter 8
LO8-10: Calculate sample size to estimate a
proportion (continued, 2).
How to Estimate π?
Chi-Square Distribution
8-70
Chapter 8
LO8-11: Construct a confidence interval for a variance
(optional) (continued).
Confidence Interval
8-71
Chapter 8
LO8-11: Construct a confidence interval for a variance
(optional) (continued, 2).
Example
8-72
Chapter 8
LO8-11: Construct a confidence interval for a variance
(optional) (continued, 3).
Example (continued)
8-73
Chapter 8
LO8-11: Construct a confidence interval for a variance
(optional) (continued, 4).
Example (continued)
You can use Appendix E to find critical chi-square values.
8-74
Chapter 8
LO8-11: Construct a confidence interval for a variance
(optional) (continued, 5).
Example (continued)
8-75
Chapter 8
LO8-11: Construct a confidence interval for a variance
(optional) (continued, 6).
Example (continued)
The 95 percent confidence interval for the population
variance σ2 is
8-77