Sampling Formula
Sampling Formula
(4.1)
(4.2)
Where
(4.4)
Estimated variance of
(4.5)
(4.6)
(4.11)
where D=
1
Sample size required to estimate with a bound on the error of estimation B:
(4.13)
where D=
(4.14)
Estimated variance of :
(4.15)
(4.16)
(4.18)
2
(5.1)
Estimated variance of
(5.2)
(5.3)
= (5.4)
Approximate sample size required to estimate or with a bound B on the error of estimation.
(5.6)
where ai = ni/n
and D= B2 /4 for estimating mean
D=B2 /4N2 for estimating total
Sample allocation
1. Equal allocation
2. Proportional allocation
3
ai= Ni/N
3. Neyman allocation
(5.9)
4. Optimum allocation
Approximate allocation that minimizes cost for a fixed value of V ( ) or minimum
V( ) for a fixed cost:
(5.7)
(5.13)
Estimated variance of
(5.14)
Approximate sample size required to estimate p with a bound B on the error of estimation:
(5.15)
(5.16)
Systematic Sampling
4
Estimator of the population mean
(7.1)
Estimated variance of
(7.2)
(7.5)
Estimated variance of
(7.6)
(7.7)
Estimated variance of
(7.8)
(7.10)
where
5
(7.11)
Cluster Sampling
(8.1)
Estimated variance of
(8.2)
where
(8.3)
(8.4)
Estimated variance of
(8.5)
6
(8.6)
(8.7)
Estimated variance of N
(8.8)
where
(8.9)
Approximate sample size required to estimate , with a bound B on the error of estimation:
(8.12)
Approximate size required to estimate , using M , with a bound B on the error of estimation.
(8.13)
Approximate sample size required to estimate , using , with a bound B on the error of
estimation:
(8.15)
7
(8.16)
Estimated variance of
(8.17)
where
(8.18)
(8.19)
Estimated variance of
(8.20)
(8.21)
Estimated variance of
(8 .22)
8
(9.1)
Estimated variance of
(9.2)
(9.3)
( 9.5)
Estimated variance of
(9.6)
(9.7)
Estimated variance of
(9.8)
where
9
(9.9)
and
n=1,2,.....n (9.10)
(9.11)
Estimated variance of p:
(9.12)
where
(9.13)
(9.14)
(9.15)
10
When N is large
(9.16)
(9.17)
(9.19)
where c1 is associated with cost of sampling each cluster and c 2 is associated with cost of
sampling each element within a cluster, and c is the total cost.
The value of m that minimizes for fixed cost, or minimizes c for fixed variance is
(9.20)
(9.22)
Estimated variance of
(9.23)
(9.24)
Estimated variance of
11
(9.25)
f=
b= number of cases that we want to select, and N i is the cluster size for the i th cluster, f is the
sampling fraction and F is the inverse of f.
Design effects
_ _
Var (yst) / Var (y srs) for stratified random sample, and its value is less than 1 if the design is more
efficient than the srs
_ _
Var (ycl) / Var (ysrs) for cluster sampling
If one has the estimate of the design effect, and the variance from srs, one can estimate the
variance from complex sample, simply by
_ _
Var (ycl) = deft2 [Var (ysrs)]
_
Deft2 = 1+ (n-1)
_
= [deft2 -1]/ [n-1]
This implies that if the sample size if srs is reduced by half, the varians will be double, given
_
Var (y) = s2 / n
_ _
Thus, if s2 is constant, Var (y) =constant/100 will be that of Var (y) = constant/200.
12