Systematic Sampling2 (Edited)
Systematic Sampling2 (Edited)
=
( )
2
1
s
n
f
y e s
sys
=
7
Technology Oriented Business Driven Sustainable Development Environmental Friendly
2. Total Y or T
Estimate with
3. Proportion, P
Estimate with
( ) ( )
sys
sys sys
Y Ny
se Ny Nse y
=
=
1
n
i
i
sys
y
P
n
=
=
( ) ( )
1
1
1
sys sys
f
se p p p
n
8
Technology Oriented Business Driven Sustainable Development Environmental Friendly
Example
An investigator wishes to determine the quality of maple syrup contained in the
sap of trees on a Vermont farm. The total number of trees, N, is unknown; hence
it is impossible to conduct an SRS of trees. As an alternative procedure, the
investigator decides to use 1 in 7 systematic sample. The data from this survey
are listed below. Entries are the percentage of sugar content (in the sap) for the
trees sampled. Use these data to estimate , the average content of maple
trees on the farm.
Calculate the standard error of estimator.
9
Technology Oriented Business Driven Sustainable Development Environmental Friendly
Tree sampled Sugar content of the sap,y
1
2
3
.
.
.
210
211
212
82
76
83
.
.
.
84
80
79
6724
5776
6889
.
.
.
7056
6400
6241
1,486,800
2
y
212
1
17, 066
i
i
y
=
=
10
Technology Oriented Business Driven Sustainable Development Environmental Friendly
An estimate of is given by
95% CI for : 80.5 1.96 (1.47) 80.5 2.88
( )
( )
( )
1
2
2
2
2
17, 066
80.5
212
1
17, 066
1, 486,800
212
212
535.483
212
1 535.483
1484
212
2.16 1.47
n
i
i
sys
i
i
sys
y
y
n
y
y
n
s
n
se y
=
= = =
=
= =
11
Technology Oriented Business Driven Sustainable Development Environmental Friendly
Example
Let us take a systematic sample of one in six physicians. We first take a random number
between 1 and 6; let say number 5. The physicians selected in the sample, then, are 5,11,17
and 23
i. The estimated mean number of household visits per physician is
ii. The estimated total number of visits made by all physicians in the POP, is
iii. The estimated proportion of physicians making one or more household visits is
Physician Number Number of visitors, y
5
11
17
23
7
0
7
0
5 . 3
4
14
= =
sys
y
( )
25(3.5) 87.5
Y or T Ny =
= =
50 . 0
4
2
= = p
12
Technology Oriented Business Driven Sustainable Development Environmental Friendly
Sampling Distribution of Estimates
Suppose that a sample of one in five physicians is desired.
N = 25, k = 5
n = 5
No. of possible samples = 5
Random no.
chosen
Physicians in
sample
(one or more
visits)
1 1,6,11,16,21 2.6 64 0.4
2 2,7,12,17,22 4.8 120 0.6
3 3,8,13,18,23 1.4 35 0.4
4 4,9,14,19,24 9.2 230 0.8
5 5,10,15,20,25 7.4 185 0.6
y
Y
13
Technology Oriented Business Driven Sustainable Development Environmental Friendly
( ) ( )
( )
1
2.6 4.8 1.4 9.2 7.4
5
5.08
E y
or Y
= + + + +
= =
y is an unbiased estimate of
Similarly,
( )
( )
1
17
Technology Oriented Business Driven Sustainable Development Environmental Friendly
Probability of each sample being chosen = 1/6
Thus we see that in this instance systematic sampling does not lead to unbiased
estimates the impact made on the estimates is not the same for each element.
Illustrative example
Suppose that a list of appointments for a nurse practitioner is available to us and
that we will take a sample of 1 in 4 of the patients seen by this nurse on a given day
for purposes of estimating the average time spent per patient. Suppose that on the
day in which the sample was to be taken, the nurse saw a total of 12 patients in the
order shown in the table.
( ) ( )
( )
( )
( ) ( )
1
12 ...... 1.5 4.79
6
1
o
= =
=
= average correlation of all pairwise units in the same systematic sample
Random POP
Sampling units (su) are heterogeneous, ~ 0
As N
Periodic POP
su are homogenous,
As N
Ordered POP
su are heterogeneous, As N
2
n | |
|
\ .
( ) ( )
,
srs
sys
V y V y
o s
( ) ( )
,
sys srs
V y V y s
24
o >
( )
,
sys
srs
V y V y
| |
|
\ .
>
Environmental Friendly Sustainable Development Business Driven Technology Oriented
Repeated Systematic Sampling
Systematic sampling (sys):
i. Frame is random and either
i) (+ve integer)
ii) N is large
(sys) method (SRS) method
ii. Frame is not always random
i) monotonic relationship
ii) periodic relationship
Remedies
1. Rearrange randomly the units in the frame
2. Paired selections
3. Choose several random starts in the course of collecting the 1 in k sample.
N
k
n
=
...
i
j k j
i
+ k j
i
2 +
i i
j n k +
( )
2 1
1 j n k + +
25
Environmental Friendly Sustainable Development Business Driven Technology Oriented
4. m repeated systematic samples
a) original zone size
b) mk = new zone size
c) sample size
sample size for each 1 in mk systematic sample
d) obtain m independent systematic 1 in mk systematic samples as follows:
i) obtain m random starts
ii) obtain
N
k
n
=
N
N
m n
mk
n
| |
= =
|
\ .
1
,...
1
m
i
j j where
j mk i s s
1
... y y
m
26
Environmental Friendly Sustainable Development Business Driven Technology Oriented
e) Statistical inference
i) estimate
ii)
iii) 100 (1- )% CI for is:-
f) Statistical Inference (Y)
i) Estimate Y with
ii)
( )
Y
1
m
i
i
y
Y with y
m
=
=
( )
( )
1
2
1
;
1
m
i
i
y y
f n
se y f
m m N
=
= =
Y
( )
2
y Z se y
o
y N
( ) ( )
se Ny Nse y =
27
Environmental Friendly Sustainable Development Business Driven Technology Oriented
Example
Let us suppose that we wish to take a systematic sample of 18 workers from the list of 162
workers for purposes of estimating the mean number of work days lost per worker from acute
illnesses
N = 162 n = 18
k = 9
Notice that 18 = 6 x 3, therefore we can take a sample of 18 workers by taking six systematic
samples each containing 3 workers.
m = 6, k = 9
Repeated systematic samples 1in 54
Choose 6 random nos:-
Say 2, 31, 46, 13, 34, 53
54 mk =
1 54; 1,...,
i
j i m s s =
28
Environmental Friendly Sustainable Development Business Driven Technology Oriented
The six samples are :-
Random
number
Elements of
sample
Days Lost mean
2 2,56,110 6,2,7 5.00
13 13,67,121 4,4,5 4.33
31 31,85,139 6,4,2 4.00
34 34,88,142 5,3,2 3.33
46 46,100,154 6,12,3 7.00
53 53,107,161 7,3,0 3.33
29
Environmental Friendly Sustainable Development Business Driven Technology Oriented
( )
( )
( )
1
6
1
2
5.00 ... 3.33
6 6
4.5
1
1
18
1
162 1.9033
6
0.28197
0.53
m
i
i
i
i
y
y
y y
f
se y
m m
=
=
+ +
= =
=
=
=
=
30
Technology Oriented Business Driven Sustainable Development Environmental Friendly
31