CH 9 and 10
CH 9 and 10
Sampling Distributions
1
Sampling Distributions…
A sampling distribution is created by, as the name suggests,
sampling.
The method we will employ on the rules of probability and
the laws of expected value and variance to derive the
sampling distribution.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 9.3
Sampling Distribution of Two Dice
A sampling distribution is created by looking at
all samples of size n=2 (i.e. two dice) and their means…
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 9.5
Compare…
Compare the distribution of X…
1 2 3 4 5 6 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0
6
Generalize…
We can generalize the mean and variance of the sampling of
two dice:
…to n-dice:
The standard deviation of the
sampling distribution is
called the standard error:
7
Central Limit Theorem…
The sampling distribution of the mean of a random sample
drawn from any population is approximately normal for a
sufficiently large sample size.
The larger the sample size, the more closely the sampling
distribution of X will resemble a normal distribution.
8
Central Limit Theorem…
If the population is normal, then X is normally distributed
for all values of n.
9
Sampling Distribution of the Sample Mean
1.
2.
10
Sampling Distribution of the Sample Mean
We can express the sampling distribution of the mean simple
as
X−µ
Z=
σ/ n
11
Sampling Distribution of the Sample Mean
The summaries above assume that the population is infinitely
large. However if the population is finite the standard error is
σ N−n
σx =
n N −1
N−n
N −1
12
Sampling Distribution of the Sample Mean
If the population size is large relative to the sample size the
finite population correction factor is close to 1 and can be
ignored.
14
Example 9.1(a)…
We want to find P(X > 32), where X is normally distributed
and µ = 32.2 and σ =.3
X − µ 32 − 32.2
P(X > 32) = P > = P( Z > − .67) = 1 − .2514 = .7486
σ .3
15
Example 9.1(b)…
The foreman of a bottling plant has observed that the amount
of soda in each “32-ounce” bottle is actually a normally
distributed random variable, with a mean of 32.2 ounces and
a standard deviation of .3 ounce.
16
Example 9.1(b)…
We want to find P(X > 32), where X is normally distributed
With µ = 32.2 and σ =.3
Things we know:
1) X is normally distributed, therefore so will X.
2) = 32.2 oz.
3)
17
Example 9.1(b)…
If a customer buys a carton of four bottles, what is the
probability that the mean amount of the four bottles will be
greater than 32 ounces?
18
Graphically Speaking…
mean=32.2
what is the probability that one bottle will what is the probability that the mean of
contain more than 32 ounces? four bottles will exceed 32 oz?
19
Chapter-Opening Example
Salaries of a Business School’s Graduates
In the advertisements for a large university, the dean of
the School of Business claims that the average salary
of the school’s graduates one year after graduation is
$800 per week with a standard deviation of $100.
20
Chapter-Opening Example
Salaries of a Business School’s Graduates
He does a survey of 25 people who graduated one year ago
and determines their weekly salary.
P( X < 750)
The distribution of X, the weekly income, is likely to be
positively skewed, but not sufficiently so to make the
distribution of X nonnormal. As a result, we may assume that X
is normal with mean
µ x = µ = 800
σ x = σ / n = 100 / 25 = 20
22
Chapter-Opening Example
Thus,
P( X < 750)
X − µx 750 − 800
= P <
σ x 20
= P( Z < − 2.5)
= .5 − .4938
= .0062
The probability of observing a sample mean as low as $750 when
the population mean is $800 is extremely small. Because this event
is quite unlikely, we would have to conclude that the dean's claim is
not justified.
23
Using the Sampling Distribution for Inference
Here’s another way of expressing the probability calculated from a
sampling distribution.
P(-1.96 < Z < 1.96) = .95
Substituting the formula for the sampling distribution
X−µ
P(−1.96 < < 1.96) = .95
σ/ n
σ σ
P(µ − 1.96 < X < µ + 1.96 ) = .95
n n
24
Using the Sampling Distribution for Inference
Returning to the chapter-opening example where µ = 800, σ = 100,
and n = 25, we compute
100 100
P(800 − 1.96 < X < 800 + 1.96 ) = .95
25 25
or
This tells us that there is a 95% probability that a sample mean will
fall between 760.8 and 839.2. Because the sample mean was
computed to be $750, we would have to conclude that the dean's
claim is not supported by the statistic.
25
Using the Sampling Distribution for Inference
Changing the probability from .95 to .90 changes the probability
statement to
σ σ
P(µ − 1.645 < X < µ + 1.645 ) = .90
n n
26
Using the Sampling Distribution for Inference
We can also produce a general form of this statement
σ σ
P(µ − z α / 2 < X < µ + zα / 2 ) =1− α
n n
In this formula α (Greek letter alpha) is the probability that
does not fall into the interval.
27
Using the Sampling Distribution for Inference
For example, with µ = 800, σ = 100, n = 25 and α= .01, we
produce
σ σ
P(µ − z .005 < X < µ + z .005 ) = 1 − .01
n n
100 100
P(800 − 2.575 < X < 800 + 2.575 ) = .99
25 25
28
Sampling Distribution of a Proportion…
The estimator of a population proportion of successes is the
sample proportion. That is, we count the number of
successes in a sample and compute:
29
Sampling Distribution of a Sample Proportion…
30
Example 9.2
In the last election a state representative received 52% of the
votes cast.
31
Example 9.2
The number of respondents who would vote for the representative
is a binomial random variable with n = 300 and p = .52.
32
Example 9.2
Thus, we calculate
34
Sampling Distribution: Difference of two means
The expected value and variance of the sampling
distribution of are given by:
mean:
standard deviation:
mean of
36
Example 9.3…
Starting salaries for MBA grads at two universities are
normally distributed with the following means and standard
deviations. Samples from each school are taken…
University 1 University 2
Mean 62,000 $/yr 60,000 $/yr
Std. Dev. 14,500 $/yr 18,300 $/yr
sample size n 50 60
37
Example 9.3…
“What is the probability that the sample mean starting salary
of University #1 graduates will exceed that of the #2 grads?”
Z
“there is about a 74% chance
that the sample mean starting salary of U. #1 grads will
exceed that of U. #2”
38
From Here to Inference
In Chapters 7 and 8 we introduced probability distributions,
which allowed us to make probability statements about
values of the random variable.
39
From Here to Inference
In Example 7.9, we needed to know that the probability that
Pat Statsdud guesses the correct answer is 20% (p = .2) and
that the number of correct answers (successes) in 10
questions (trials) is a binomial random variable.
40
From Here to Inference
In Example 8.2, we needed to know that the return on
investment is normally distributed with a mean of 10% and a
standard deviation of 5%.
41
From Here to Inference
The figure below symbolically represents the use of
probability distributions.
42
From Here to Inference
In this chapter we developed the sampling distribution,
wherein knowledge of the parameter(s) and some
information about the distribution allow us to make
probability statements about a sample statistic.
43
From Here to Inference
Statistical works by reversing the direction of the flow of
knowledge in the previous figure. The next figure displays
the character of statistical inference.
44
From Here to Inference
Sampling distribution
Statistic ------ Parameter
45
Chapter 10
Introduction to Estimation
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.46
Where we have been…
Chapters 7 and 8:
Binomial, Uniform, and Normal distributions allow us to
make probability statements about X (a member of the
population).
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.50
Statistical Inference…
Statistical inference is the process by which we acquire
information and draw conclusions about populations from
samples.
Statistics
Data Information
Population
Sample
Inference
Statistic
Parameter
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.51
Estimation…
There are two types of inference: estimation and hypothesis
testing; estimation is introduced first.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.52
Types of Statistics
We make inference about three types of statistics:
1.The Mean
2.The Variance
3.A Proportion of the data that meets some qualitative requirement
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Estimation…
The objective of estimation is to determine the approximate
value of a population parameter on the basis of a sample
statistic.
Point Estimator
Interval Estimator
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.54
Point Estimator…
A point estimator draws inferences about a population by
estimating the value of an unknown parameter using a single
value or point.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.56
Point & Interval Estimation…
For example, suppose we want to estimate the mean summer
income of a class of business students. For n = 25 students,
is calculated to be 400 $/week.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.57
Qualities of Estimators…
Qualities desirable in estimators include unbiasedness,
consistency, and relative efficiency:
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Unbiased Estimators…
An unbiased estimator of a population parameter is an
estimator whose expected value is equal to that parameter.
E(X) = µ
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.60
Unbiased Estimators…
An unbiased estimator of a population parameter is an
estimator whose expected value is equal to that parameter.
E(Sample median) = µ
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.61
Consistency…
An unbiased estimator is said to be consistent if the
difference between the estimator and the parameter grows
smaller as the sample size grows larger.
V(X) is σ2/n
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.62
Consistency…
An unbiased estimator is said to be consistent if the
difference between the estimator and the parameter grows
smaller as the sample size grows larger.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.63
Relative Efficiency…
If there are two unbiased estimators of a parameter, the one
whose variance is smaller is said to be relatively efficient.
E.g. both the the sample median and sample mean are
unbiased estimators of the population mean, however, the
sample median has a greater variance than the sample mean,
so we choose since it is relatively efficient when
compared to the sample median.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.64
Interval Estimate of the Mean
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Estimating μ when σ is known…
Known, i.e.
Known, i.e. sample mean
standard normal
Unknown, i.e. we
distribution
want to estimate
the population
mean
66
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Estimating when is known…
In Chapter 8 we produced the following general probability
statement about X
P( − Z α / 2 < Z < Z α / 2 ) = 1 − α
Thus
X −µ
Z=
σ/ n
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.67
Estimating when is known…
Thus, substituting Z we produce
x −µ
P(− z α / 2 < < zα / 2 ) = 1 − α
σ/ n
In Chapter 9 (with a little bit of algebra) we expressed the following
σ σ
P µ − z α / 2 < x < µ + zα / 2 =1− α
n n
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.68
Estimating µ when σ is known…
This
σ σ
P x − z α / 2 < µ < x + zα / 2 =1− α
n n
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.69
Estimating when is known…
The interval can also be expressed as
σ
Lower confidence limit = x − zα / 2
n
σ
Upper confidence limit = x + z α / 2
n
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.70
Confidence Interval Estimator for μ :
The probability 1– is called the confidence level.
Usually represented upper confidence
with a “plus/minus” ( limit (UCL)
± ) sign
width
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Interpreting Confidence Intervals
• The Confidence Level (1-α) is a probability
• (1-α) is the probability that the interval contains the true
mean
• Therefore a confidence interval is a probability statement
•the probability of the CI not containing the true mean is α
• Standard Confidence Intervals are on the next slide
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Confidence Intervals
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 10.1…
The Doll Computer Company makes its own computers and
delivers them directly to customers who order them via the
Internet.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.74
Example 10.1…
To lower these costs the operations manager wants to use an
inventory model. He notes demand during lead time is
normally distributed and he needs to know the mean to
compute the optimum inventory level.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.75
Example 10.1…
235 374 309 499 253
421 361 514 462 369
394 439 348 344 330
261 374 302 466 535
386 316 296 332 334
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.76
Example 10.1…
“We want to estimate the mean demand over lead time with
95% confidence in order to set inventory levels…”
IDENTIFY
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.77
Example 10.1… COMPUTE
1.96
75
Given
n 25
therefore:
The lower and upper confidence limits are 340.76 and 399.56.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.78
Example 10.1 Using Excel COMPUTE
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.79
Example 10.1
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.80
Example 10.1… INTERPRET
The estimation for the mean demand during lead time lies
between 340.76 and 399.56 — we can use this as input in
developing an inventory policy.
That is, we estimated that the mean demand during lead time
falls between 340.76 and 399.56, and this type of estimator
is correct 95% of the time. That also means that 5% of the
time the estimator will be incorrect.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.81
Interval Width…
A wide interval provides little information.
For example, suppose we estimate with 95% confidence that
an accountant’s average starting salary is between $15,000
and $100,000.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.82
Interval Width…
The width of the confidence interval estimate is a function of
the confidence level, the population standard deviation, and
the sample size…
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.83
Interval Width…
The width of the confidence interval estimate is a function of
the confidence level, the population standard deviation, and
the sample size…
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.84
Interval Width…
The width of the confidence interval estimate is a function of
the confidence level, the population standard deviation, and
the sample size…
Larger values of σ
produce w i d e r
confidence intervals.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.85
Interval Width…
The width of the confidence interval estimate is a function of
the confidence level, the population standard deviation, and
the sample size…
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.87
Selecting the Sample Size…
x−µ
P (− zα / 2 < < zα / 2 ) = 1 − α
σ/ n
2
z α / 2σ
n =
B
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.88
Selecting the Sample Size…
To illustrate suppose that in Example 10.1 before gathering
the data the manager had decided that he needed to estimate
the mean demand during lead time to with 16 units, which is
the bound on the error of estimation.
2 2
zα / 2σ (1.96)(75)
n = = = 84.41
B 16
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.89
Selecting the Sample Size…
Because n must be an integer and because we want the
bound on the error of estimation to be no more than 16 any
non-integer value must be rounded up.
© 2015 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website for classroom use. 10.90