Stratified Sampling Notes

Statistics

Uploaded by

nkhasifm

ST3424: Survey Methods and Applications

Stratified Sampling
In stratified sampling, the population of $N$ units is first divided into $k$ subpopulations,
or strata, of $N_1, N_2, \dots, N_k$ units respectively. The strata are formed so that sampling units
within each stratum are as homogeneous as possible, while the strata differ from each other as much
as possible. The strata do not overlap, and together they comprise the whole population, so that
$$N_1 + N_2 + \dots + N_k = \sum_{i=1}^{k} N_i = N.$$
Sampling is done independently from each stratum and estimators from each of the strata are
pooled with suitable weights to obtain an estimator of the population mean ȲN . If simple random
sampling (SRS) is used to select a sample from each stratum, the method is referred to as stratified
random sampling.
Sizes of samples that are selected from the strata are denoted by $n_1, n_2, \dots, n_k$ respectively. The
total sample size is $n = \sum_{i=1}^{k} n_i$.

Principal Reasons Why Stratification Is Commonly Used in Sampling

1. If data of known precision are required for certain subdivisions of the population, it is
advisable to treat each subdivision as a population in its own right.

2. Stratification creates convenience for organization of field work requiring supervision since it
minimizes travel costs if it is done according to administrative places, e.g. districts in the
case of Lesotho.

3. Different sampling methods may be adopted for selecting samples from different strata. For
example, people living in institutions such as hotels, hospitals, prisons and cattle posts are
often placed in a different stratum from people living in ordinary homes, so that a different
sampling method can be used for each of the two groups.

4. Compared with simple random sampling, stratification ensures adequate representation of every
segment of the population.

5. Stratification may produce a gain in precision in the estimates of characteristics of the whole
population.

• If each stratum is homogeneous, in the sense that measurements vary little from one
unit to another, a precise estimate of any stratum mean can be obtained from a small
sample in that stratum.

• The estimates from strata can be combined into a precise estimate for the whole
population.

6. In case of extreme values in the population, such values may be segregated to form a separate
stratum.

Limitations of Stratified Sampling

1. The stratum sizes $N_i$ must be known.

2. The frame of sampling units in such strata should be available.

Notation
Let $Y_{ij}$ be the value of the $j$th unit in the $i$th stratum,
$N$ be the population size,
$N_i$ be the size of the $i$th stratum,
$Y_{N_i}$ be the total of the $i$th stratum,
$\bar{Y}_N$ or $\bar{Y}$ be the population mean,
$w_i = N_i / N$ be the stratum weights (known, because the stratum sizes $N_i$ are known).
Suppose there are k strata, then units in the population can be presented as follows:

Table 1: Units in the Population by Stratum

Stratum   1             ...   h             ...   k
          $Y_{11}$      ...   $Y_{h1}$      ...   $Y_{k1}$
          ...                 ...                 ...
          $Y_{1j}$      ...   $Y_{hj}$      ...   $Y_{kj}$
          ...                 ...                 ...
          $Y_{1N_1}$    ...   $Y_{hN_h}$    ...   $Y_{kN_k}$

Hence the population total can be presented as

$$Y = \sum_{i=1}^{k} \sum_{j=1}^{N_i} Y_{ij} = \sum_{i=1}^{k} Y_{N_i}$$

Table 2: Notation for the ith Stratum

Stratum         1                   ...   h                   ...   k
Stratum Size    $N_1$               ...   $N_h$               ...   $N_k$
Stratum Total   $Y_{N_1}$           ...   $Y_{N_h}$           ...   $Y_{N_k}$
Stratum Mean    $\bar{Y}_{N_1}$     ...   $\bar{Y}_{N_h}$     ...   $\bar{Y}_{N_k}$

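The mean of the $i$th stratum, $\bar{Y}_{N_i}$, is the mean of all units in the $i$th stratum:
$$\bar{Y}_{N_i} = \frac{1}{N_i} \sum_{j=1}^{N_i} Y_{ij}$$
The population mean is given as
$$\bar{Y}_N = \frac{1}{N} \sum_{i=1}^{k} \sum_{j=1}^{N_i} Y_{ij},$$
which can be presented as
$$\bar{Y}_N = \sum_{i=1}^{k} \frac{N_i}{N} \bar{Y}_{N_i} = \sum_{i=1}^{k} w_i \bar{Y}_{N_i}$$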

If a sample of size $n_i$ is drawn from the $i$th stratum of size $N_i$ by SRS, the sample
estimator for the $i$th stratum is given as
$$\bar{y}_{n_i} = \frac{1}{n_i} \sum_{j=1}^{n_i} y_{ij},$$
which is an unbiased estimator of the $i$th stratum mean $\bar{Y}_{N_i}$, i.e. $E(\bar{y}_{n_i}) = \bar{Y}_{N_i}$.

Weighted Mean of Sample Estimators ȳni

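$$\bar{y}_{st} = \sum_{i=1}^{k} w_i \bar{y}_{n_i} = \sum_{i=1}^{k} w_i \left[ \frac{1}{n_i} \sum_{j=1}^{n_i} y_{ij} \right]$$

If in every stratum the sample estimator $\bar{y}_{n_i}$ is an unbiased estimator of the $i$th stratum
mean $\bar{Y}_{N_i}$, then $\bar{y}_{st}$ is an unbiased estimator of the population mean $\bar{Y}_N$.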

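Proof
$$
\begin{aligned}
E(\bar{y}_{st}) &= \sum_{i=1}^{k} w_i E(\bar{y}_{n_i}) \\
&= \sum_{i=1}^{k} w_i \bar{Y}_{N_i} \\
&= \sum_{i=1}^{k} \frac{N_i}{N} \bar{Y}_{N_i} \\
&= \bar{Y}_N
\end{aligned}
$$
The second line holds because the sample estimators $\bar{y}_{n_i}$ are unbiased in the individual
strata, i.e. $E(\bar{y}_{n_i}) = \bar{Y}_{N_i}$. The last line holds because the population mean may be
written as
$$\bar{Y}_N = \frac{\sum_{i=1}^{k} \sum_{j=1}^{N_i} Y_{ij}}{N} = \sum_{i=1}^{k} \frac{N_i}{N} \bar{Y}_{N_i} = \sum_{i=1}^{k} w_i \bar{Y}_{N_i}$$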
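Because sampling is done independently in each stratum, the covariance terms vanish and the
variance of $\bar{y}_{st}$ is
$$\operatorname{Var}(\bar{y}_{st}) = \operatorname{Var}\left[ \sum_{i=1}^{k} w_i \bar{y}_{n_i} \right] = \sum_{i=1}^{k} w_i^2 \operatorname{Var}(\bar{y}_{n_i})$$
When samples are selected from the strata by SRS,
$$\operatorname{Var}(\bar{y}_{st}) = \sum_{i=1}^{k} w_i^2 \left( \frac{1}{n_i} - \frac{1}{N_i} \right) S_i^2$$
where $S_i^2 = \frac{1}{N_i - 1} \sum_{j=1}^{N_i} (Y_{ij} - \bar{Y}_{N_i})^2$ and $E(s_i^2) = S_i^2$.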

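The estimate of the variance is

$$\widehat{\operatorname{Var}}(\bar{y}_{st}) = \sum_{i=1}^{k} w_i^2 \left( \frac{1}{n_i} - \frac{1}{N_i} \right) s_i^2$$
where $s_i^2 = \frac{1}{n_i - 1} \sum_{j=1}^{n_i} (y_{ij} - \bar{y}_{n_i})^2$.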
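
The estimator and its estimated variance can be sketched in Python as follows. This is only an
illustration of the formulas above, not part of the course notes: the function name
`stratified_estimate`, its arguments and the sample values are all made up for the example, and
each stratum sample is assumed to have been drawn by SRS without replacement with at least two units.

```python
from statistics import mean, variance


def stratified_estimate(samples, stratum_sizes):
    """Return (y_st, estimated Var(y_st)) for SRS within each stratum.

    samples       -- list of lists; samples[i] holds the sampled values y_ij of stratum i
    stratum_sizes -- list of the known stratum sizes N_i
    (Both argument names are hypothetical, chosen for this sketch.)
    """
    N = sum(stratum_sizes)
    y_st = 0.0
    var_hat = 0.0
    for y_i, N_i in zip(samples, stratum_sizes):
        n_i = len(y_i)
        w_i = N_i / N                      # stratum weight w_i = N_i / N
        y_st += w_i * mean(y_i)            # weighted mean of the stratum sample means
        s2_i = variance(y_i)               # s_i^2 with divisor n_i - 1 (needs n_i >= 2)
        var_hat += w_i ** 2 * (1 / n_i - 1 / N_i) * s2_i
    return y_st, var_hat


# Made-up data: two strata of sizes 60 and 40 with samples of sizes 3 and 2
samples = [[10, 12, 14], [20, 24]]
stratum_sizes = [60, 40]
y_st, var_hat = stratified_estimate(samples, stratum_sizes)
print(y_st, var_hat)
```

With these illustrative values the stratum sample means are 12 and 22 and the weights are 0.6 and
0.4, so the stratified estimate is 0.6 × 12 + 0.4 × 22 = 16.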

In stratified sampling the values of the sample sizes ni in the respective strata are determined
by the sampler.
They may be selected to minimize the variance of the estimator (Var(ȳst )) for a specified cost of
studying the sample or to minimize the cost for a specified value of Var(ȳst ).

Allocation of Total Sample to the ith Stratum
Given the total sample size $n$, we may choose how to allocate it among the $k$ strata. Two types
of allocation are considered here, namely proportional allocation and optimum/Neyman
allocation.
Proportional Allocation
Under proportional allocation, the size of the sample selected from the $i$th stratum, $n_i$, is in
the same proportion to the total sample size $n$ as the size of the $i$th stratum $N_i$ is to the
population size $N$, i.e.
$$\frac{n_i}{n} = \frac{N_i}{N}.$$

For instance, if a stratum contains 15% of the population elements, the sample from this
stratum will consist of 15% of the total sample size $n$.
In such cases $n_i = \frac{N_i}{N} n = w_i n$, where $w_i = \frac{N_i}{N}$ and $N w_i = N_i$.
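
As a small illustration of the rule $n_i = w_i n$, the sketch below allocates a total sample across
strata in proportion to their sizes. The function name `proportional_allocation` and the stratum
sizes are hypothetical; the result is left unrounded, so turning it into whole sample sizes is up
to the user.

```python
def proportional_allocation(stratum_sizes, n):
    """Return the unrounded proportional allocation n_i = n * N_i / N."""
    N = sum(stratum_sizes)
    return [n * N_i / N for N_i in stratum_sizes]


# Made-up strata holding 15%, 35% and 50% of a population of 1000 units
prop_alloc = proportional_allocation([150, 350, 500], 100)
print(prop_alloc)  # [15.0, 35.0, 50.0]
```

The first stratum holds 15% of the population and receives 15% of the sample, matching the
example in the text.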

Variance of the Estimator ȳst Under Proportional Allocation

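$$
\begin{aligned}
\operatorname{Var}(\bar{y}_{st})_p &= \sum_{i=1}^{k} w_i^2 \left( \frac{1}{n_i} - \frac{1}{N_i} \right) S_i^2 \\
&= \sum_{i=1}^{k} w_i^2 \left( \frac{1}{n w_i} - \frac{1}{N w_i} \right) S_i^2 \\
&= \left( \frac{1}{n} - \frac{1}{N} \right) \sum_{i=1}^{k} w_i S_i^2 \\
&= \frac{N - n}{N n} \sum_{i=1}^{k} w_i S_i^2
\end{aligned}
$$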

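If the population size $N$ is large compared to the sample size $n$, then

$$\operatorname{Var}(\bar{y}_{st})_p = \left( \frac{1}{n} - \frac{1}{N} \right) \sum_{i=1}^{k} w_i S_i^2$$
becomes
$$\operatorname{Var}(\bar{y}_{st})_p \approx \frac{1}{n} \sum_{i=1}^{k} w_i S_i^2$$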

Optimum/Neyman Allocation
In optimum allocation a given total sample size n should be allocated among the k strata such that
the stratified sampling estimator ȳst will have the smallest possible variance.
The problem is to determine $n_1, n_2, \dots, n_k$ such that we minimize

$$\operatorname{Var}(\bar{y}_{st}) = \sum_{i=1}^{k} w_i^2 \left( \frac{1}{n_i} - \frac{1}{N_i} \right) S_i^2$$
subject to the constraint that the stratum sample sizes add up to the fixed total sample size $n$:
$$n_1 + n_2 + \dots + n_k = \sum_{i=1}^{k} n_i = n$$
Optimum allocation estimates the population mean or population total with the lowest variance
for a fixed total sample size $n$. In this case
$$n_i = \frac{n w_i S_i}{\sum_{h=1}^{k} w_h S_h}$$
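
The allocation rule can be sketched in Python as below. The function name `neyman_allocation`
and the numbers are hypothetical. The sketch also assumes that the stratum standard deviations
$S_i$ are supplied by the user; in practice they are usually unknown and have to be approximated,
for example from an earlier survey. The result is unrounded, and the sketch does not check that
an allocated $n_i$ stays below $N_i$.

```python
def neyman_allocation(stratum_sizes, stratum_sds, n):
    """Return the unrounded allocation n_i = n * w_i * S_i / sum_h(w_h * S_h)."""
    N = sum(stratum_sizes)
    # w_i * S_i for every stratum
    products = [(N_i / N) * S_i for N_i, S_i in zip(stratum_sizes, stratum_sds)]
    total = sum(products)
    return [n * p / total for p in products]


# Made-up strata: sizes 500, 300, 200 with standard deviations 2, 4, 10
neyman_alloc = neyman_allocation([500, 300, 200], [2.0, 4.0, 10.0], 84)
print(neyman_alloc)  # approximately 20, 24 and 40
```

Here the smallest stratum receives the largest share of the sample because it is by far the most
variable, which is the behaviour the formula is designed to produce.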

Variance of the Estimator ȳst Under Optimum Allocation

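Using the fact that
$$\operatorname{Var}(\bar{y}_{st}) = \sum_{i=1}^{k} w_i^2 \left( \frac{1}{n_i} - \frac{1}{N_i} \right) S_i^2 = \sum_{i=1}^{k} \left( \frac{1}{n_i} - \frac{1}{N_i} \right) w_i^2 S_i^2$$
and substituting $n_i$ with $\frac{n w_i S_i}{\sum_{h=1}^{k} w_h S_h}$, derive the variance of $\bar{y}_{st}$ under optimum
allocation and show that it is given by

$$\operatorname{Var}(\bar{y}_{st})_{opt} = \frac{1}{n} \left( \sum_{i=1}^{k} w_i S_i \right)^2 - \frac{1}{N} \sum_{i=1}^{k} w_i S_i^2$$

For large population size $N$

$$\operatorname{Var}(\bar{y}_{st})_{opt} \approx \frac{1}{n} \left( \sum_{i=1}^{k} w_i S_i \right)^2$$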
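
One way the requested derivation might go is sketched below. The shorthand $T$ for the sum of the
products $w_h S_h$ is introduced here only for brevity and is not part of the notes' notation. The
last step uses $N_i = N w_i$, so that $w_i^2 / N_i = w_i / N$.

```latex
\begin{align*}
T &= \sum_{h=1}^{k} w_h S_h, \qquad
\frac{1}{n_i} = \frac{T}{n\, w_i S_i} \\
\operatorname{Var}(\bar{y}_{st})_{opt}
  &= \sum_{i=1}^{k} \left( \frac{T}{n\, w_i S_i} - \frac{1}{N_i} \right) w_i^2 S_i^2 \\
  &= \frac{T}{n} \sum_{i=1}^{k} w_i S_i - \sum_{i=1}^{k} \frac{w_i^2}{N_i} S_i^2 \\
  &= \frac{1}{n} \left( \sum_{i=1}^{k} w_i S_i \right)^2 - \frac{1}{N} \sum_{i=1}^{k} w_i S_i^2
\end{align*}
```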

Comparison of Stratified Sampling and Simple Random Sampling (SRS)

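In particular, we compare proportional stratified sampling with SRS.
Recall that for proportional allocation
$$\operatorname{Var}(\bar{y}_{st})_p = \left( \frac{1}{n} - \frac{1}{N} \right) \sum_{i=1}^{k} w_i S_i^2$$

and for SRSWOR

$$\operatorname{Var}(\bar{y}_n)_{SRS} = \left( \frac{1}{n} - \frac{1}{N} \right) S^2,$$
where
$$
\begin{aligned}
S^2 &= \frac{1}{N - 1} \sum_{i=1}^{k} \sum_{j=1}^{N_i} (Y_{ij} - \bar{Y}_N)^2 \\
&= \frac{1}{N - 1} \sum_{i=1}^{k} \sum_{j=1}^{N_i} (Y_{ij} - \bar{Y}_{N_i} + \bar{Y}_{N_i} - \bar{Y}_N)^2 \\
&= \frac{1}{N - 1} \sum_{i=1}^{k} \sum_{j=1}^{N_i} (Y_{ij} - \bar{Y}_{N_i})^2 + \frac{1}{N - 1} \sum_{i=1}^{k} N_i (\bar{Y}_{N_i} - \bar{Y}_N)^2
\end{aligned}
$$
The cross-product term vanishes because $\sum_{j=1}^{N_i} (Y_{ij} - \bar{Y}_{N_i}) = 0$ in every stratum.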
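We have
$$(N - 1) S^2 = \sum_{i} \sum_{j} (Y_{ij} - \bar{Y}_N)^2 = \sum_{i} \sum_{j} (Y_{ij} - \bar{Y}_{N_i})^2 + \sum_{i} N_i (\bar{Y}_{N_i} - \bar{Y}_N)^2$$
which can be written as
$$(N - 1) S^2 = \sum_{i} (N_i - 1) S_i^2 + \sum_{i} N_i (\bar{Y}_{N_i} - \bar{Y}_N)^2$$
Dividing by $N$ we get
$$\frac{N - 1}{N} S^2 = \sum_{i} \frac{N_i - 1}{N} S_i^2 + \sum_{i} \frac{N_i}{N} (\bar{Y}_{N_i} - \bar{Y}_N)^2$$
If the $N_i$ are large (as is likely in practice), then $N_i - 1 \approx N_i$ and $N - 1 \approx N$, so that
$$S^2 \approx \sum_{i} w_i S_i^2 + \sum_{i} w_i (\bar{Y}_{N_i} - \bar{Y}_N)^2$$
Multiplying both sides by $\frac{1}{n} - \frac{1}{N}$ we get
$$\left( \frac{1}{n} - \frac{1}{N} \right) S^2 \approx \left( \frac{1}{n} - \frac{1}{N} \right) \sum_{i} w_i S_i^2 + \left( \frac{1}{n} - \frac{1}{N} \right) \sum_{i} w_i (\bar{Y}_{N_i} - \bar{Y}_N)^2$$
that is,
$$\operatorname{Var}(\bar{y}_n)_{SRS} \approx \operatorname{Var}(\bar{y}_{st})_p + \left( \frac{1}{n} - \frac{1}{N} \right) \sum_{i} w_i (\bar{Y}_{N_i} - \bar{Y}_N)^2$$
and therefore
$$\operatorname{Var}(\bar{y}_n)_{SRS} - \operatorname{Var}(\bar{y}_{st})_p \approx \left( \frac{1}{n} - \frac{1}{N} \right) \sum_{i} w_i (\bar{Y}_{N_i} - \bar{Y}_N)^2$$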

When the $N_i$ are large, the variance of the SRS estimator $\bar{y}_n$ is at least as large as that of
the stratified estimator based on proportional allocation with the same sample size $n$. The
difference between the two variances may be small, but it is nonnegative for large $N_i$, and it is
zero only when all the stratum means are equal.
Proportional stratified sampling is therefore at least as efficient as SRS in this setting, and it
should generally be preferred to SRS wherever it is feasible (that is, wherever the stratum sizes
$N_i$ are known).
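
The comparison can be checked numerically on a small made-up population, as in the sketch below.
All values are illustrative. Because the strata here are tiny, the approximation $N_i - 1 \approx N_i$
does not hold, so the sketch verifies the exact decomposition of $(N - 1) S^2$ and then compares the
two variances directly.

```python
from statistics import mean, variance

# Made-up population split into two strata (illustrative values only)
strata = [[1, 2, 3, 4], [11, 12, 13, 14, 15, 16]]
n = 5  # total sample size; proportional allocation gives n_1 = 2 and n_2 = 3

population = [y for stratum in strata for y in stratum]
N = len(population)
Y_bar = mean(population)
S2 = variance(population)                 # S^2 with divisor N - 1

weights = [len(s) / N for s in strata]    # w_i = N_i / N
S2_i = [variance(s) for s in strata]      # S_i^2 with divisor N_i - 1
stratum_means = [mean(s) for s in strata]

# Exact identity: (N - 1) S^2 = sum (N_i - 1) S_i^2 + sum N_i (Ybar_i - Ybar)^2
within = sum((len(s) - 1) * v for s, v in zip(strata, S2_i))
between = sum(len(s) * (m - Y_bar) ** 2 for s, m in zip(strata, stratum_means))

factor = 1 / n - 1 / N
var_srs = factor * S2
var_prop = factor * sum(w * v for w, v in zip(weights, S2_i))
print(var_srs, var_prop)
```

In this example the stratum means are far apart (2.5 and 13.5), so the between-strata term
dominates and the SRS variance is much larger than the proportional-allocation variance.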
