Lecture 9
Lecture 9
Chapter 9
Sampling distributions
9.3 Introduction to sampling distribution
9.4 Sampling distribution of the sample
mean 𝐗
9.5 Sampling distribution of the sample
proportion 𝐩
1
Recall what statistics is
Statistics
Information
Data
Subset
Statistic
Parameter
• Populations have parameters: , 2, , p (probability)
• Samples have statistics: 𝑥, s2, s, 𝑝 (relative frequency)
• Based on values of statistics of the sample,
inferences can be made about parameters of the
population.
2
9.4 Sampling Distribution
of the Sample Mean
Example, page 346: A fair die is thrown infinitely many
times, with the random variable X = Number of spots
showing on any throw. The probability distribution of X is:
x 1 2 3 4 5 6
P(x) 1/6 1/6 1/6 1/6 1/6 1/6
and the mean and variance (of the population of
numbers of spots observed) are:
3
Sampling Distribution of Two Dice
The sampling distribution of X is shown below:
P( )
1.0 1/36
1.5 2/36
2.0 3/36
2.5 4/36
3.0 5/36
3.5 6/36
4.0 5/36
4.5 4/36
5.0 3/36
5.5 2/36
6.0 1/36
Compare…
Compare the distribution of X
1 2 3 4 5 6 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0
𝝁
The sampling
with the sampling distribution of 𝐗 distribution of the
(approximately normal distribution). statistic is the
tool that tells us
Note that: how close the
statistic is to the
parameter 𝝁
8
4
Generalize…
We can generalize the mean and variance of the
sampling of 2 dice:
The standard
to sampling of n dice deviation of the
(roll n dice and calculate sampling
the sample mean 𝑿 ): distribution of the
sample mean is
called the
standard error.
n 5 (roll 5 dice)
x 3 .5
n 10 (roll 10 dice)
x2
x2 .5833 ( ) x 3.5 n 25 (roll 25 dice)
5
2 x 3 .5
x2 .2917 ( x
)
10 x2
x2 .1167 ( )
25
1 3.5 6
5
Central Limit Theorem
• If a random sample is drawn from a normal
population, then the sampling distribution of the
sample mean 𝐗 is normally distributed for all
values of n (sample size).
• If a random sample is drawn from any population,
the sampling distribution of the sample mean
𝑿 is approximately normal for a sufficiently
large sample size. The larger the sample size,
the more closely the sampling distribution of 𝐗 will
resemble a normal distribution.
• A sample size of 30 may be sufficiently large to
allow us to use the normal distribution as an
approximation for the sampling distribution of 𝐗 .
11
12
6
Sampling Distribution of the Sample Mean
The summaries above assume that the population is
infinitely large. However, if the population is finite the
standard error is
Nn
x
n N 1
Nn
where N is the population size and N 1
is the finite population correction factor.
13
7
Example, pages 342 and 358
Salaries of a business school’s graduates:
The average weekly income of graduates one year
after graduation is $1000. Suppose the distribution
of weekly income has a standard deviation of $100.
What is the probability that 25 randomly-selected
graduates have an average weekly income of less
than $950?
Solution
Let X be the weekly income of graduates one year
after graduation. We consider sampling distribution
of sample mean 𝐗 for sample size n = 25 and have
X 950 1000
P( X 950) P( ) P( Z 2.5) 0.0062
x 100 25
15
8
Using Sampling Distributions for
Inference
– To make inferences about / related to population
parameters we use sampling distributions.
– The symmetry of the normal distribution along with the
sample distribution of the mean lead to:
X
P (1.96 Z 1.96) 0.95, or P (1.96 1.96) 0.95
n
- z.025 z.025
This can be written as P (1.96 X 1.96 ) 0.95
n n
which becomes P ( 1.96 X 1.96 ) 0.95
n n
In general, P ( z 2 X z 2 ) 1
n n
17
0.025 0.025
Normal distribution of 𝑿
–1.96 0 1.96
0.025 0.025
1.96 1.96
n n
18
9
Example, pages 342 and 358 (contd.)
19
20
10
Normal Approximation
to Binomial Distribution
Binomial distribution with n=20 and p=0.5 can be
approximated by the normal distribution with the
same mean and standard deviation.
(µ= np = 10 and = 𝑛𝑝 1 − 𝑝 = 2.24)
21
Normal Approximation
to Binomial Distribution
• Normal approximation to the binomial distribution
works best when
– the number of experiments (sample size) is
large, and
– the probability of success p is close to 0.5.
22
11
Normal Approximation
to Binomial Distribution
To calculate
P(X=10) using
the normal
distribution, we
can find the area
under the
normal curve
between 9.5 and
10.5
Explanation
= np = 20(0.5) = 10; 2 = np(1 – p) = 20(0.5)(1 – 0.5) = 5; = 2.24
9.5 10 10.5
9.5 10 10.5 10
P(XBinomial = 10) ~= P(9.5<Y<10.5) P( Z ) .1742
2.24 2.24
24
12
More Normal Approximation Exercises
8
P(X 14) P(13.5 < Y < n + 0.5) 8.5
P(Y > 13.5)
13
Example 9.2 revised, page 363
The Laurier company’s brand has a market share of
45.6%. In a survey, 300 consumers were asked which
brand they prefer. What is the probability that more than
50% of the respondents say they prefer the Laurier
brand?
Solution: The number of respondents who prefer Laurier
is binomial with n = 300 and p = 0.456. Also, np =
300(0.456) = 136.8 > 5, and n(1 – p) = 300(1 – 0.456)
= 163.2 > 5. Therefore, 𝑝 is normal with mean p =
0.456 and standard error
p (1 p ) 0.456(1 0.4561)
pˆ 0.0288
n 300
Hence
pˆ p 0.50 0.456
P ( pˆ 0.50) P P ( Z 1.53) 0.0618
p (1 p ) n 0.0288
27
Home assignment:
28
14