Sampling and Sampling Distributions
Sampling and Sampling Distributions
Sampling Distributions
Tools of Business Statistics
Descriptive statistics
Collecting, presenting, and describing data
Inferential statistics
Drawing conclusions and/or making decisions
concerning a population based only on
sample data
Populations and Samples
Population Sample
a b cd b c
ef gh i jk l m n gi n
o p q rs t u v w o r u
x y z y
Why Sample?
Sample
Population
Inferential Statistics
Drawing conclusions and/or making decisions
concerning a population based on sample results.
Estimation
e.g., Estimate the population mean
weight using the sample mean
weight
Hypothesis Testing
e.g., Use sample evidence to test
the claim that the population mean
weight is 120 kg
Sampling Distributions
Sampling
Distributions
Random variable, X,
is age of individuals
Values of X:
18, 20, 22, 24 (years)
Developing a
Sampling Distribution
(continued)
μ
X i P(x)
N
.25
18 20 22 24
21
4
0
σ
(X i μ) 2
2.236
18 20 22 24 x
N A B C D
Uniform Distribution
Developing a
Sampling Distribution
(continued)
Now consider all possible samples of size n = 2
1st 2nd Observation
Obs 16 Sample
18 20 22 24
Means
18 18,18 18,20 18,22 18,24
1st 2nd Observation
20 20,18 20,20 20,22 20,24 Obs 18 20 22 24
22 22,18 22,20 22,22 22,24 18 18 19 20 21
24 24,18 24,20 24,22 24,24 20 19 20 21 22
16 possible samples 22 20 21 22 23
(sampling with
replacement)
24 21 22 23 24
Developing a
Sampling Distribution
(continued)
E(X)
X i
18 19 21 24
21 μ
N 16
σX
( X i μ) 2
N
(18 - 21)2 (19 - 21)2 (24 - 21)2
1.58
16
Comparing the Population with its
Sampling Distribution
Population Sample Means Distribution
N=4 n=2
μ 21 σ 2.236 μX 21 σ X 1.58
_
P(X) P(X)
.3 .3
.2 .2
.1 .1
0 X 0 18 19 20 21 22 23 24
_
18 20 22 24 X
A B C D
Expected Value of Sample Mean
σ
σX
n
Note that the standard error of the mean decreases as
the sample size increases
If the Population is Normal
σ
μX μ and σX
n
Z-value for Sampling Distribution
of the Mean
Z-value for the sampling distribution of X :
( X μ) ( X μ)
Z
σX σ
n
σ2 N n or σ σ Nn
Var( X)
n N 1
X
n N 1
Finite Population Correction
If the sample size n is not small compared to the
population size N , then use
( X μ)
Z
σ Nn
n N 1
Sampling Distribution Properties
Normal Population
μx μ Distribution
μ x
(i.e. x is unbiased ) Normal Sampling
Distribution
(has the same mean)
μx
x
Sampling Distribution Properties
(continued)
Smaller
sample size
μ x
If the Population is not Normal
We can apply the Central Limit Theorem:
Even if the population is not normal,
…sample means from the population will be
approximately normal as long as the sample size is
large enough.
σ
μx μ and σx
n
Central Limit Theorem
the sampling
As the n↑
distribution
sample
becomes
size gets
almost normal
large
regardless of
enough…
shape of
population
x
If the Population is not Normal
(continued)
Population Distribution
Sampling distribution
properties:
Central Tendency
μx μ
μ x
Variation Sampling Distribution
σ (becomes normal as n increases)
σx Larger
n Smaller
sample size
sample
size
μx x
How Large is Large Enough?
Solution:
Even if the population is not normally
distributed, the central limit theorem can be
used (n > 25)
… so the sampling distribution of x is
approximately normal
… with mean μx = 8
…and standard deviation σ x σ 3 0.5
n 36
Example
(continued)
Solution (continued):
7.8 - 8 μX -μ 8.2 - 8
P(7.8 μ X 8.2) P
3 σ 3
36 n 36
P(-0.5 Z 0.5) 0.3830
Population Sampling Standard Normal
Distribution Distribution Distribution .1915
??? +.1915
? ??
? ? Sample Standardize
? ? ?
?
7.8 8.2 -0.5 0.5
μ8 X μX 8 x μz 0 Z
Acceptance Intervals
Determine a range within which sample means are likely
to occur, given a population mean and variance.
By the Central Limit Theorem, we know that the distribution of X
is approximately normal if n is large enough, with mean μ and
standard deviation σ X
Let zα/2 be the z-value that leaves area α/2 in the upper tail of the
normal distribution (i.e., the interval - zα/2 to zα/2 encloses
probability 1 – α)
Then
μ z/2σ X
Sampling
Distributions
Properties:
X P(1 P)
E(P̂) P σ Var
and
2
Pˆ
n n
(where P = population proportion)
Z-Value for Proportions
Pˆ P Pˆ P
Z
σ Pˆ P(1 P)
n
Example
Standardized
Sampling Distribution Normal Distribution
.4251
Standardize
Sampling
Distributions
0 4 8 12 16 20 24 28 2 0 4 8 12 16 20 24 28 2 0 4 8 12 16 20 24 28 2