Mstat PSB 2022
Mstat PSB 2022
GROUP B
x2 + 2U x + V 2 = 0.
1
5. Suppose r ≥ 1 distinct books are distributed at random among
n ≥ 3 children.
Y = 22 − 3X
X = 5.84 − 0.12Y
2
7. Suppose N students arriving at a college are all equally likely
to have a particular disease with an unknown probability p.
The disease status (affected / not affected) of all students are
independent. Blood samples are collected from all N students.
In order to estimate p, two strategies are proposed.
Strategy 1 Test all samples separately to obtain the status for all N
students.
(b) If a group tests positive, then all students within that group
are further tested individually. Suppose that each test (for
individual sample or pooled sample) has equal cost. Then,
which of the two strategies would you prefer to identify
all students affected with the disease when the underlying
p = 0.5, N = 200, and m = 20?
3
8. Based on historical data, a positive random variable X arising
from an unknown distribution is believed to have the chi-square
distribution with 1 degree of freedom. A new theory suggests
√
that it may be better to model X as exponentially distributed
with mean λ, where λ is such that E(X) for this model is the
same as that for the earlier model.
(a) Compute λ.
(b) Suppose X1 , X2 , . . . , Xn are independent observations from
this unknown distribution. For α ∈ (0, 1), consider the
most powerful level α test for the null hypothesis that the
earlier model is correct against the alternative that the new
model is correct. Show that the rejection region of this test
is
� �
� √ �√
(x1 , x2 , . . . , xn ) : xi − 2 2 xi > c
i i
4
9. The following ANOVA table for three factors A, B, and C was
obtained (under a suitable model) from some data, but several
values were illegible and marked ‘∗’. It is known that A had two
levels and there were 24 observations in all.