Sp Sampling Lect 12
Sp Sampling Lect 12
Lecture 12
Stratified Random Sampling
Shalabh
Department of Mathematics and Statistics
Indian Institute of Technology Kanpur
Population (N units)
𝑌 𝑌 𝑌
… … … N Ni
n1 units n2 units nk units i 1
𝑦 𝑦 𝑦
2
Stratified Random Sampling: Example
• Find the average height of the students in a school of class 1 to 12.
• All the drawn samples combined together will constitute the final
stratified sample.
4
Advantages of Stratified Sampling :
1.Data of known precision may be required for certain parts of the
population. This can be accomplished with a more careful
investigation to few strata.
5
Advantages of Stratified Sampling :
2. Sampling problems may differ in different parts of the
population.
6
Advantages of Stratified Sampling :
3. Administrative convenience can be exercised in stratified
sampling.
7
Advantages of Stratified Sampling :
4. Full cross‐section of population can be obtained through
stratified sampling. It may be possible in SRS that some large part
of the population may remain unrepresented. Stratified sampling
enables one to draw a sample representing different segments of
the population to any desired extent. The desired degree of
representation of some specified parts of population is also
possible.
8
Advantages of Stratified Sampling :
6. In case of skewed population, use of stratification is of
importance since larger weight may have to be given for the few
extremely large units which in turn reduces the sampling
variability.
7. When estimates are required not only for the population but also
for the subpopulations, then stratified sampling is helpful.
10
Stratified Random Sampling:
We use the following symbols and notations:
N : Population size
k : Number of strata
Ni : Number of sampling units in ith strata
k
N N i Total population size
i 1
11
Stratified Random Sampling:
Let
Y : characteristic under study,
yij : value of jth unit in ith stratum j = 1,2,…,ni, i = 1,2,...,k,
1 Ni
Yi
Ni
y
j 1
ij : population mean of i th
stratum, j 1, 2,, ni , i 1, 2,..., k ,
1 ni
yi
ni
ij
y
j 1
: sample mean of units from i th
stratum
1 k k
Ni
Y
N
i 1
N iYi wiYi : population mean where wi .
i 1 N
12
Stratified Random Sampling:
There are k independent samples drawn through SRS from each of
the stratum.
13
Estimation of Population Mean:
In case of stratified sampling, the population mean is defined as
the weighted arithmetic mean of stratum means where the
weights are provided in terms of strata sizes.
1 k
Y NiYi ,
N i 1
Find the sample mean of the units drawn from each startum.
E ( yi ) Yi ,
1 k
yst Ni yi .
N i 1
1 k
E ( yst ) Ni E ( yi )
N i 1
1 k
Ni Y i
N i 1
Y.
15
Biased Estimator of Population Mean:
Since the sample in each stratum is drawn by SRS, so
E ( yi ) Yi ,
1 k
y ni yi
n i 1
1 k
E ( y ) ni E ( yi )
n i 1
1 k
ni Yi
n i 1
1 k
ni Y i
n i 1
Y
16
Variance of Stratum Mean:
k k ni
Var ( yst ) wi2 Var ( yi ) w w Cov( y , y )
i j i j
i 1 i ( j ) 1 j 1
Since all the samples have been drawn independently from each
of the strata by SRSWOR, so
Cov( yi , y j ) 0, i j
Ni ni 2
Var ( yi ) Si
Ni ni
where
1 Ni
Si2
Ni 1 j 1
(Yij Y i .
) 2
17
Variance of Stratum Mean:
k k ni
Var ( yst ) wi2 Var ( yi ) w w Cov( y , y )
i j i j
i 1 i ( j ) 1 j 1
Thus
k
Ni ni 2
Var ( yst ) w 2
i Si
i 1 Ni ni
k
n S 2
wi2 1 i i .
i 1 Ni ni
18
How to Construct Strata:
k
n S 2
Var ( yst ) wi2 1 i i .
i 1 Ni ni
If 𝑺𝟐𝒊 is small for all i = 1,2,...,k, then variance will also be small.
That is why the strata are constructed such that they are within