Answer ALL questions in Part I and Part II. Please use separate answer books
for Part I and Part II. Marks are shown in square brackets.
Part I [25%]
1. Someone requires certain items of statistics and asks you for advice on whether a
sample survey should be conducted to obtain the statistics. What matters will you
raise for discussion with that person before you give your advice?
[Total: 4 marks]
[Total: 9 marks]
S&AS: STAT1304 Design & Analysis of Sample Surveys 2
4. Two series of surveys need to be conducted in order to provide data for the
Statistical Authority to compile the Consumer Price Index (CPI).
a) What are the two series of surveys? What are their respective aims?
b) With base year 2005, the CPI values for a certain territory were:
Year Index
2006 103
2007 105
2010 114
A re-basing was done in 2010. The index for the year 2011 under the new base
was 103.
i) What was the CPI for 2011 under the OLD base?
ii) What was the CPI for 2007 under the NEW base?
[Total: 8 marks]
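The re-basing arithmetic in part b) can be checked with a short script. This is a minimal sketch, assuming the new base year is 2010 (index 100 under the new base) and that the two series are linked through that year. The function names `to_old_base` and `to_new_base` are illustrative, not part of the paper.

```python
# Index values quoted in the question (base year 2005 = 100).
OLD_BASE = {2006: 103.0, 2007: 105.0, 2010: 114.0}
NEW_2011 = 103.0   # 2011 index under the new base, from the question
LINK_YEAR = 2010   # assumption: the new base year, where new index = 100


def to_old_base(new_index, old_index_at_link):
    """Convert a new-base index to the old base via the link year."""
    return new_index * old_index_at_link / 100.0


def to_new_base(old_index, old_index_at_link):
    """Convert an old-base index to the new base via the link year."""
    return old_index / old_index_at_link * 100.0


cpi_2011_old = to_old_base(NEW_2011, OLD_BASE[LINK_YEAR])
cpi_2007_new = to_new_base(OLD_BASE[2007], OLD_BASE[LINK_YEAR])
print(round(cpi_2011_old, 2), round(cpi_2007_new, 2))
```

Both conversions only rescale by the old-base index of the link year, so year-to-year ratios within each series are unchanged.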
S&AS: STAT1304 Design & Analysis of Sample Surveys 3
Part II [75%]
Let
$x_i$ = weight of battery $i$
$y_i$ = lifetime of battery $i$
Table 1: Battery weights and lifetimes in month A
Battery Weight (pounds) Lifetime (hours) $y_i - r x_i$
b) Estimate the average battery lifetime $\bar{y}_r$ using ratio estimation. Place a bound
on the error of estimation with 95% confidence level. [5 marks]
c) Estimate the average battery lifetime $\bar{y}_{lr}$ using regression estimation. Place a
bound on the error of estimation with 95% confidence level. [5 marks]
d) Comment on the precision of the three methods. Show that the regression
estimator is at least as efficient as the ordinary estimator. [5 marks]
[Total: 20 marks]
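The efficiency comparison asked for in part d) rests on a standard large-sample result. The sketch below assumes simple random sampling with sampling fraction $f = n/N$, population variance $S_y^2$ of the lifetimes, and population correlation $\rho$ between weight and lifetime; these symbols are introduced here for illustration and are not defined in the question.

```latex
V(\bar{y}_{lr}) \approx \frac{1-f}{n}\, S_y^2 \,(1-\rho^2)
\;\le\; \frac{1-f}{n}\, S_y^2 = V(\bar{y}),
\qquad \text{since } 0 \le \rho^2 \le 1 .
```

Equality holds only when $\rho = 0$, so under these assumptions the regression estimator cannot do worse than the ordinary sample mean, and the gain grows with $|\rho|$.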
Let
$x_i$ = weight of battery $i$
$y_i$ = lifetime of battery $i$
Table 2: Battery weights and lifetimes in month A and month B

Month A
Battery  Weight (pounds)  Lifetime (hours)  $y_i - r x_i$
1        61.5             1180              -21.9942
2        63.5             1250                8.9165
3        63.5             1245                3.9165
4        64.0             1245               -5.8558
5        63.8             1248                1.0531
6        65.8             1300               13.9639
mean     63.6833          1244.67             0.00
SD       1.3732           38.1663            12.7199
sum      382.1            7468

Month B
Battery  Weight (pounds)  Lifetime (hours)  $y_i - r x_i$
1        62.2             1180              -19.6053
2        63.8             1240                9.5367
3        63.5             1243               18.3226
4        66.5             1280               -2.5362
5        68.5             1310              -11.1087
6        69.2             1340                5.3909
mean     65.6167          1265.50             0.00
SD       2.8771           56.9342            13.9279
sum      393.7            7593.0
a) Estimate the stratified average battery lifetime $\bar{y}_{st}$ using ordinary estimation.
Place a bound on the error of estimation with 95% confidence level.
[5 marks]
b) Estimate the average battery lifetime $\bar{y}_{st,r}$ using combined ratio estimation.
Place a bound on the error of estimation with 95% confidence level.
[5 marks]
d) Comment on the precision of the three methods. Under what condition is the
combined ratio estimator more efficient than the separate ratio estimator?
[5 marks]
[Total: 20 marks]
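The residual column of Table 2 can be reproduced from the weights and lifetimes alone. The following is a minimal sketch, assuming that within each month $r$ is the ratio of the lifetime sum to the weight sum, and that the SD row uses the $n - 1$ divisor. The helper name `ratio_residuals` is illustrative.

```python
from statistics import stdev

# Data copied from Table 2.
month_a = {
    "weight": [61.5, 63.5, 63.5, 64.0, 63.8, 65.8],
    "lifetime": [1180, 1250, 1245, 1245, 1248, 1300],
}
month_b = {
    "weight": [62.2, 63.8, 63.5, 66.5, 68.5, 69.2],
    "lifetime": [1180, 1240, 1243, 1280, 1310, 1340],
}


def ratio_residuals(x, y):
    """Return the sample ratio r = sum(y) / sum(x) and residuals y_i - r * x_i."""
    r = sum(y) / sum(x)
    return r, [yi - r * xi for xi, yi in zip(x, y)]


r_a, res_a = ratio_residuals(month_a["weight"], month_a["lifetime"])
r_b, res_b = ratio_residuals(month_b["weight"], month_b["lifetime"])

for label, r, res in (("A", r_a, res_a), ("B", r_b, res_b)):
    print(label, round(r, 4), [round(e, 4) for e in res], round(stdev(res), 4))
```

Because the residuals sum to zero by construction, their sample variance equals the sum of squared residuals divided by $n - 1$, which is the quantity the ratio-estimator variance formulas use.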
3. a) The following table shows the number of births (in thousands) and the birth
rate (in births per thousand of population) in the United States for a systematic
sample of years between 1950 and 1990.
i) Estimate the total number of births during this 41-year period. Find an
appropriate estimate of the variance. [5 marks]
ii) Estimate the mean birth rate during this period and find an appropriate
estimator of the variance. Referring to the trends of the data, is the mean
birth rate a good predictor of the birth rate for 1995? Explain your answer.
[5 marks]
b) An auditor is confronted with a long list of accounts receivable for a firm. She
must verify the amounts on 10% of these accounts and estimate the average
difference between the audited and book values. Comment on her choice of
using simple random sampling, stratified random sampling, systematic
sampling or cluster sampling for the following situations.
iii) The accounts are grouped by department and then listed chronologically
within departments.
[10 marks]
[Total: 20 marks]
b) Estimate the proportion of voters favoring the candidate and place a bound on
the error of estimation with 95% confidence level. [5 marks]
c) The newspaper wants to conduct a similar survey during the next election. How
large a sample will be needed to estimate the proportion of voters favoring a
similar candidate with a bound of 0.05 on the error of estimation?
[5 marks]
[Total: 15 marks]
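The sample-size calculation in part c) can be sketched as follows. The sketch assumes simple random sampling and a bound of two standard errors (roughly 95% confidence), which gives $n = N p q / ((N-1) D + p q)$ with $D = B^2/4$ and $q = 1 - p$. The paper's extracted text does not show the survey data or the population size, so the inputs in the usage lines ($p = 0.5$ and $N = 10000$) are hypothetical placeholders, and `sample_size_for_proportion` is an illustrative name.

```python
import math


def sample_size_for_proportion(p, bound, population_size=None):
    """Sample size needed to estimate a proportion within `bound`.

    Assumes simple random sampling and a bound of two standard errors.
    With population_size=None the finite population correction is ignored.
    """
    q = 1.0 - p
    d = bound ** 2 / 4.0
    if population_size is None:
        n = p * q / d
    else:
        n = population_size * p * q / ((population_size - 1) * d + p * q)
    # Round first so floating-point noise cannot push an exact answer up by one.
    return math.ceil(round(n, 9))


# Hypothetical inputs: p = 0.5 is the conservative choice when no prior
# estimate is available; N = 10000 is an illustrative population size.
print(sample_size_for_proportion(0.5, 0.05))
print(sample_size_for_proportion(0.5, 0.05, population_size=10000))
```

Using the estimate of $p$ from the earlier survey instead of 0.5 would generally give a smaller required sample, since $p q$ is largest at $p = 0.5$.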
************END OF PAPER************
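$= s_y^2 - 2 r \rho s_x s_y + r^2 s_x^2$

Proportion $P$
Point estimate: $\hat{p} = \sum_l W_l \hat{p}_l$
Estimated variance: $\mathrm{var}(\hat{p}) = \sum_l W_l^2 \left(1 - \frac{n_l}{N_l}\right) \frac{\hat{p}_l (1 - \hat{p}_l)}{n_l - 1}$

Mean using separate ratio estimate
Point estimate: $\bar{y}_r = \sum_l W_l r_l \bar{X}_l$, where $r_l = \bar{y}_l / \bar{x}_l$
Estimated variance: $\mathrm{var}(\bar{y}_r) = \sum_l W_l^2 \left(1 - \frac{n_l}{N_l}\right) \frac{s_l^2}{n_l}$
where $s_l^2 = \frac{1}{n_l - 1} \sum_i (y_{li} - r_l x_{li})^2 = \frac{1}{n_l - 1} \left( \sum_i y_{li}^2 - 2 r_l \sum_i x_{li} y_{li} + r_l^2 \sum_i x_{li}^2 \right)$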
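Mean using combined ratio estimate
Point estimate: $\bar{y}_r = r \bar{X}$, where $r = \bar{y}_{st} / \bar{x}_{st}$
Estimated variance: $\mathrm{var}(\bar{y}_r) = \sum_l W_l^2 \left(1 - \frac{n_l}{N_l}\right) \frac{s_l^2}{n_l}$
where $s_l^2 = \frac{1}{n_l - 1} \sum_i \left[ (y_{li} - \bar{y}_l) - r (x_{li} - \bar{x}_l) \right]^2$

Mean using separate regression estimate
Point estimate: $\bar{y}_{lr} = \sum_j W_j \left[ \bar{y}_j - \beta_j (\bar{x}_j - \bar{X}_j) \right]$
Estimated variance: $\mathrm{var}(\bar{y}_{lr}) = \sum_j W_j^2 \left[ s_{yj}^2 - 2 \beta_j s_{xyj} + \beta_j^2 s_{xj}^2 \right] \frac{1 - f_j}{n_j}$
$\mathrm{Min}\{\mathrm{var}(\bar{y}_{lr,s})\} = \sum_j W_j^2 (1 - \rho_j^2) s_{yj}^2 \frac{1 - f_j}{n_j}$
$\mathrm{Min}\{\mathrm{var}(\bar{y}_{lr,c})\} = \mathrm{Min}\{\mathrm{var}(\bar{y}_{lr,s})\} + \sum_j q_j (\beta_j - \beta)^2$
where $q_j = W_j^2 s_{xj}^2 \frac{1 - f_j}{n_j}$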
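Systematic Sampling
Parameter; point estimate; estimated variance

Mean $\bar{Y}$
Point estimate: $\bar{y}_{sy} = \frac{1}{n} \sum_i y_i$
Estimated variance: $\mathrm{var}(\bar{y}_{sy,srs}) = \left(1 - \frac{n}{N}\right) \frac{s^2}{n}$

Population total $\tau$
Point estimate: $\hat{\tau}_{sy} = N \bar{y}_{sy}$
Estimated variance: $\mathrm{var}(\hat{\tau}_{sy}) = N^2 \left(1 - \frac{n}{N}\right) \frac{s^2}{n}$

Proportion $P$
Point estimate: $\hat{p} = \hat{p}_{sy}$
Estimated variance: $\mathrm{var}(\hat{p}) = \left(1 - \frac{n}{N}\right) \frac{\hat{p}_{sy} (1 - \hat{p}_{sy})}{n - 1}$

Repeated systematic sample mean $\bar{Y}$
Point estimate: $\bar{y}_{sy,rep} = \frac{1}{n_s} \sum_i \bar{y}_i$
Estimated variance: $\mathrm{var}(\bar{y}_{sy,rep}) = \left(1 - \frac{n}{N}\right) \frac{s_{\bar{y}}^2}{n_s}$

Half-sample
Point estimate: $\bar{y}_{sy,half} = \frac{1}{2} (\bar{y}_1 + \bar{y}_2)$
Estimated variance: $\mathrm{var}(\bar{y}_{sy,half}) = \left(1 - \frac{n}{N}\right) \frac{(\bar{y}_1 - \bar{y}_2)^2}{4}$

Difference
Point estimate: $\bar{y}_{sy,diff} = \frac{1}{n} \sum_i y_i$
Estimated variance: $\mathrm{var}(\bar{y}_{sy,diff}) = \left(1 - \frac{n}{N}\right) \frac{s_{diff}^2}{n}$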
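Cluster Sampling
Parameter; point estimate; estimated variance

Ratio
Point estimate: $\bar{y}_{cl,r} = \frac{\sum_i^n \sum_j^{m_i} y_{ij}}{\sum_i^n m_i}$
Estimated variance: $\mathrm{var}(\bar{y}_{cl,r}) = \frac{1}{\bar{M}^2} \left(1 - \frac{n}{N}\right) \frac{s_r^2}{n}$
where $s_r^2 = \frac{\sum_i m_i^2 (\bar{y}_i - \bar{y}_{cl,r})^2}{n - 1}$

Equal sizes cluster sample mean $\bar{Y}$
Point estimate: $\bar{y}_{cl,eq} = \frac{\sum_i^n \bar{y}_i}{n}$
Estimated variance: $\mathrm{var}(\bar{y}_{cl,eq}) = \frac{1}{m^2} \left(1 - \frac{n}{N}\right) \frac{s_e^2}{n}$
where $s_e^2 = \frac{\sum_i m^2 (\bar{y}_i - \bar{y}_{cl,r})^2}{n - 1}$

Probabilities Proportional to Size
Point estimate: $\bar{y}_{pps} = \frac{1}{n} \sum_i \bar{y}_i$
Estimated variance: $\mathrm{var}(\bar{y}_{pps}) = \frac{1}{n (n - 1)} \sum_i (\bar{y}_i - \bar{y}_{pps})^2$

Unbiased
Point estimate: $\bar{y}_t = \frac{\sum_i^n y_i}{n}$
Estimated variance: $\mathrm{var}(\bar{y}_t) = \left(1 - \frac{n}{N}\right) \frac{s_t^2}{n}$
where $s_t^2 = \frac{\sum_i^n (y_i - \bar{y}_t)^2}{n - 1}$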
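Stratum size
$n_l = n \left( \frac{N_l}{N} \right)$
$n_l = n \left( \frac{N_l S_l}{\sum_i N_i S_i} \right)$
$n_l = n \left( \frac{N_l S_l / \sqrt{c_l}}{\sum_i N_i S_i / \sqrt{c_i}} \right)$

Sample size $n$
$n \ge \frac{\left( \sum_l N_l S_l \sqrt{c_l} \right) \left( \sum_l N_l S_l / \sqrt{c_l} \right)}{N^2 d^2 + \sum_l N_l S_l^2}$
$n \ge \frac{\left( \sum_l N_l S_l \right)^2}{N^2 d^2 + \sum_l N_l S_l^2}$
$n \ge \frac{N \sum_l N_l S_l^2}{N^2 d^2 + \sum_l N_l S_l^2}$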
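The stratum-size and sample-size rows can be illustrated for the equal-cost case. This is a minimal sketch of Neyman allocation, $n_l = n N_l S_l / \sum_i N_i S_i$, and the matching sample size $(\sum_l N_l S_l)^2 / (N^2 d^2 + \sum_l N_l S_l^2)$. It assumes $d$ is half the desired bound on the error of estimation, which the sheet does not state, and the stratum sizes and standard deviations in the usage lines are hypothetical. The function names are illustrative.

```python
def neyman_allocation(n, stratum_sizes, stratum_sds):
    """Split a total sample n across strata in proportion to N_l * S_l."""
    weights = [N_l * S_l for N_l, S_l in zip(stratum_sizes, stratum_sds)]
    total = sum(weights)
    return [n * w / total for w in weights]


def neyman_sample_size(stratum_sizes, stratum_sds, d):
    """Total sample size under Neyman allocation (before rounding up)."""
    N = sum(stratum_sizes)
    numerator = sum(N_l * S_l for N_l, S_l in zip(stratum_sizes, stratum_sds)) ** 2
    denominator = N ** 2 * d ** 2 + sum(
        N_l * S_l ** 2 for N_l, S_l in zip(stratum_sizes, stratum_sds)
    )
    return numerator / denominator


# Hypothetical strata: sizes 100 and 200, equal standard deviations of 1.0.
sizes, sds = [100, 200], [1.0, 1.0]
print(neyman_allocation(30, sizes, sds))
print(neyman_sample_size(sizes, sds, d=0.1))
```

With equal stratum standard deviations Neyman allocation reduces to proportional allocation, which is why the hypothetical example splits the sample 1:2.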