Isi Mstat Pyq Book (2024-2016)
Isi Mstat Pyq Book (2024-2016)
SYLLABUS
Mathematics
Elementary set theory. Functions and relations. Elementary combinatorics: Permutations and
combinations, Binomial and multinomial theorem.
Theory of equations.
Vector spaces. Determinant, rank, trace and inverse of a matrix. System of linear equations.
Eigenvalues and eigenvectors of matrices.
Limit and continuity of functions of one variable. Differentiation and integration. Applications of
differential calculus, maxima and minima.
Notions of sample space and probability. Combinatorial probability. Conditional probability and
independence. Bayes Theorem. Random variables and expectations. Moments and moment
generating functions. Standard univariate discrete and continuous distributions. Distribution of
functions of a random variable. Distribution of order statistics. Joint probability distributions.
Marginal and conditional probability distributions. Multinomial distribution. Bivariate normal
and multivariate normal distributions.
Sampling distributions of statistics. Statement and applications of Weak law of large numbers and
Central limit theorem.
Basic experimental designs such as CRD, RBD, LSD and their analyses. ANOVA. Elements of
factorial designs. Conventional sampling techniques (SRSWR/SRSWOR) including stratification.
PSA-2024
Notations and Abbreviations
Then
1
2. Let a, b, c be three distinct non-zero real numbers. The expres-
sion
(x − b)(x − c) (x − c)(x − a) (x − a)(x − b)
a +b +c −x
(a − b)(a − c) (b − c)(b − a) (c − a)(c − b)
(A) 60◦ , 60◦ and 60◦ (B) 90◦ , 45◦ and 45◦
(C) 120◦ , 30◦ and 30◦ (D) 50◦ , 60◦ and 70◦
equals
2
6. Let S be the set of all 3 × 3 matrices A such that A2 = 0 and
the entries of A belong to {0, 1}. Then the number of elements
in the set S is
3
10. Let f : [0, 1] → [0, 1] be a function such that
(i) There exists at least one point a ∈ [0, 1] such that f (a) = a.
(ii) There exists at most one point a ∈ [0, 1] such that f (a) = a.
4
13. Let X(1) ≤ · · · ≤ X(n) denote the order statistics from a random
sample of size n, drawn from an exponential distribution with
mean equal to 1. For z > y > 0, the conditional density of X(n)
given X(1) = y is
14. Let F and G be two cdfs and let α, β > 0. Which of the following
is not a cdf?
(A) (αF (αx) + βG(βx)) /(α + β) (B) F (αx)G(βx)
5
16. Suppose E1 , . . . , Em are disjoint events satisfying 0 < P(Ej ) < 1,
m
∪
for each j = 1, . . . , m. Let E = Ej . Assume 0 < P(E) < 1.
j=1
Let A be any other event and define p = P(A | E). Which of
the following statements is true?
∑
m
(A) p = P(A | Ej ) P(E)
j=1
∑
m
(B) p = P(A | Ej ) P (Ej | E)
j=1
∑
m
(C) p = P(A | Ej )
j=1
∑
m
(D) p = P(A | Ej ) P(Ej )
j=1
17. Suppose A is the set of all 6-digit numbers formed using each
of the digits 1, 2, 3, . . . , 6 exactly once. If a number X is chosen
uniformly at random from A, then what is the probability that
X is divisible by 6 but not by 9 or 11?
6
19. A coin with success probability 1/4 is independently tossed
N times and the outcomes are recorded. Given that the total
number of successes is m, what is the probability that the first
k tosses are successes, where k ≤ m?
k
(A) (1/4)k (3/4)N −k (B)
m )
(
m(m − 1) · · · (m − k + 1) m
(C) (D) (1/4)k (3/4)m−k
N (N − 1) · · · (N − k + 1) k
7
22. Suppose {Xi : i ≥ 1} are iid Poisson random variables with
standard deviation 2. Then as n → ∞
n
1∑
Xi (Xi − 2)
n i=1
converges in probability to
23. For x1 , x2 , . . . , x2n ∈ [−1, 1], where the xi ’s are not necessarily
distinct, define
2n 2n
1 ∑ 2 1 ∑
m= xi and s = (xi − m)2 .
2n i=1 2n i=1
Let s2max denote the maximum possible value that s2 can take
over all such x1 , x2 , . . . , x2n .
Based on the above information, choose the FALSE statement
from the options given below.
8
24. Consider a random variable with pmf given by
p if x = 1,
3p if x = 2,
P (X = x) =
2p if x = 3,
1 − 6p if x = 4,
9
26. Suppose that we have observations (x1 , y1 ), . . . , (xn , yn ). We fit
a simple linear regression of y on x, with intercept, using these
observations via the least-squares method. Let ei , i = 1, . . . , n,
be the residuals of the fitted regression model. Consider the
following statements.
1
∑
n
(i) n
ei = 0.
i=1
1
∑n
1
∑
n
1
∑
n
(ii) n
(ei − ē)(xi − x̄) = 0 where x̄ = n
xi and ē = n
ei .
i=1 i=1 i=1
1
∑n
(iii) n
ei xi = 0.
i=1
10
28. A random sample is obtained from a N (µ, 1) distribution, where
µ ∈ R is an unknown parameter. Using this sample, a most
powerful (MP) test ϕ of size α = 1/2 is constructed for testing
H0 : µ = 0 against H1 : µ = 1. Let β denote the power of this
test. Which of the following statements is correct?
11
30. Let N ≥ 3 be an integer. A box contains N balls numbered
{1, 2, . . . , N }. A simple random sample s of size n (1 < n < N )
is selected with replacement from the box. For i = 1, . . . , N ,
define
1 if the i-th ball is selected in s,
Wi =
0 otherwise.
12
PSB-2024
Notations and Abbreviations
The following are used throughout the question paper.
1
3. A box contains 7 indistinguishable green balls, 5 indistinguish-
able white balls and 6 indistinguishable black balls. Suppose
balls are drawn using simple random sampling with replacement
until balls of all colours are obtained. Let N denote the mini-
mum number of draws needed to achieve this.
2
7. Suppose that X1 , X2 are i.i.d. U (0, θ) random variables. We
want to test H0 : θ = 1 against H1 : θ = θ1 where θ1 > 1 is fixed.
For this, we adopt two testing strategies.
(a) Find κ such that both the tests have the same size.
(b) Find the powers of the two tests. For Test 2, use the value
of κ obtained in part (a).
(c) Which of the two tests would you prefer and why?
3
10. A city has N households numbered {1, . . . , N }. Let yi denote the
size of the i-th household. A simple random sample s1 of size
m is drawn without replacement from households {1, . . . , M }.
Another independent simple random sample s2 , also of size m, is
drawn without replacement from households {N −M +1, . . . , N }.
Assume m < M and M > N/2.
4
PSA-2023
1. A sequence of real numbers {an }n≥1 has a peak at n if an ≥ ak
for all k ≥ n. Consider the following statements.
Then
2. Let C denote the set of complex numbers and let Im(z) denote
the imaginary part of z ∈ C. Consider the set
Then,
1
3. For a set S, let S c denote the complement of S. Also, for two
sets P and Q, let P \ Q = P ∩ Qc . Let A, B1 , B2 and B3 be four
sets. Which of the following statements is NOT true?
has
2
6. Let A be a finite set of real numbers having m (≥ 2) elements.
Define a function f : R → R, given by
Then,
2x + y − 1 = 0, ax + 3y − 3 = 0, 3x + 2y − 2 = 0
are concurrent is
3
9. For a non-constant geometric progression for which the second
term is 2 and the common ratio is an integer, the 10th, 20th and
30th terms are in arithmetic progression. Then, the fourth term
is
√ √
x + 8 − 8x + 1
10. lim √ √ equals
x→1 5 − x − 7x − 3
2 1 7
(A) does not exist (B) (C) (D)
3 2 12
4
!
a 1
12. If the matrix A = has 1 as an eigenvalue, then the
2 3
determinant of A is
and
fn+1 (x) = f (fn (x)) for n > 1.
1 1
(A) (B) 32 (C) (D) 8
8 32
5
15. Let n = aaaaaaaaabcd be a 12-digited number divisible by 45
where the digits a, b, c, d are not necessarily distinct and a ̸= 0.
How many such numbers are there?
16. Let a, b and c be the sides of a triangle such that c2 = a2 +b2 −ab.
Then which of the following is always true?
(A) a ≤ c and b ≤ c
(B) a ≤ c ≤ b or b ≤ c ≤ a
(C) c ≤ a and c ≤ b
(D) None of the above
6
18. Suppose that the sample mean and sample standard deviation
for a set of n observations x1 , x2 , . . . , xn are m and s (> 0),
respectively. These values are updated to m1 and s1 after one
more observation xn+1 is added to the data set.
Based on the above information, choose the correct statement
from the options given below.
7
20. Let X1 , X2 , . . . be a sequence of random variables such that
1
E(Xi ) = 1, Var(Xi ) = 1 for all i and Cov(Xi , Xj ) = for
n
2
1X
all i ̸= j. Let Zn = Xi . Then, lim Var(Zn ) equals
n i=1 n→∞
1 1
(A) (B) (C) 0 (D) 1
2 4
21. Suppose that P(A|B) = 0.4 and P(Ac |B c ) = 0.6. Then, the two
equations are sufficient to find
8
!
X −Y
23. Consider a matrix M = where X and Y are indepen-
Y X
dent standard normal random variables. Then the probability
that M is a non-singular matrix is
1 1
(A) 0 (B) 1 (C) √ (D)
2 2
24. Let (U, V ) be a point chosen uniformly at random from the unit
circle {(u, v) ∈ R2 : u2 + v 2 = 1}. Then Var(U ) is
1 1 1
(A) (B) (C) 1 (D)
3 2 4
36 46 56 66
(A) (B) (C) (D)
190 190 190 190
9
26. Let X be a random variable with probability density function
displayed in the following graph.
0.20
0.15
Density f(x)
0.10
0.05
0.00
−10 −5 0 5 10
(a) (b)
0.4
0.3
0.2
0.1
Density f(x)
0.0
(c) (d)
0.4
0.3
0.2
0.1
0.0
−10 −5 0 5 10 −10 −5 0 5 10
10
27. Suppose that we want to fit the regression model
y = β1 x + β2 x2 + ϵ
11
29. Assume X1 , . . . , Xn are independent and identically distributed
N (µ, 1) random variables with µ ∈ R. We want to test H0 :
µ = 0 versus H1 : µ ̸= 0. Consider the following two one-sided
testing problems
Let ϕA,η (x) and ϕB,η (x) denote the most powerful tests of size
η ∈ (0, 1) for H0,A and H0,B , respectively. Then, for testing H0
versus H1 ,
12
30. Let ϕ denote the probability density function of the standard
normal distribution. Let fθ , for θ ∈ {0, 1}, be defined as
ϕ(x) if θ = 0,
fθ (x) =
1 ϕ x−1 if θ = 1.
2 2
13
PSB-2023
GROUP A
GROUP B
1
5. Suppose X1 , . . . , Xn (n ≥ 2) are independent and identically
distributed observations from a distribution having probability
density function
e−(x−θ) if x ≥ θ,
fθ (x) =
0 if x < θ,
where θ ∈ R. Let
Z ∞
ψ(θ) = fθ (x)dx.
1
(a) Show that Bn (θ) > 0 for every θ < 1 and Bn (θ) = 0 for
every θ ≥ 1.
(b) Show that lim Bn (θ) = 0 for every θ < 1.
n→∞
(a) X1 = 7 and X2 = 9.
(b) X1 = 4 and X2 = 9.
(c) X1 = 5 and X2 = 9.
2
7. Mr. X wants to compare five different varieties of wheat. He has
access to a piece of land which has been divided into 25 smaller
plots as shown in the figure below.
3
8. Let X1 , . . . , Xn be independent and identically distributed
Bernoulli(θ) random variables where θ ∈ (0, 1). The maximum
likelihood estimator of θ is the sample mean X n . However, a
statistician feels that the sample size n is too small, and decides
to increase the sample size. In order to do so, he records the
observed values of the data points, {x1 , . . . , xn }, and then se-
lects a random sample of size m = kn with replacement from
{x1 , . . . , xn }, where k is a positive integer. The values in the
observed sample are recorded as {Y1 , . . . , Ym }. The statistician
proposes the new estimator T = 12 (Y m + X n ), where Y m is the
sample mean of Y1 , . . . , Ym .
4
9. To test whether the heights of siblings are correlated, a re-
searcher devised the following plan: She identified a random
sample of n families with at least two adult male children.
For the ith family, suppose that Xi and Yi are the heights
of the first and second male child, respectively. Assume that
(X1 , Y1 ), . . . , (Xn , Yn ) are independent bivariate normal random
vectors with parameters (µ, µ, σ 2 , σ 2 , ρ), where µ and σ 2 are
known from previous studies. She is interested in testing the
null hypothesis H0 : ρ = 0 against the alternative H1 : ρ = 0.5.
Unfortunately, due to a mistake in the questionnaire, she was
only able to observe (Ui , Vi ) for each i, where Ui = max(Xi , Yi )
and Vi = min(Xi , Yi ).
(a) Based on the observed sample, obtain the test statistic cor-
responding to the most powerful test of H0 against H1 .
(b) Find a critical value so that the size of the test converges
to 0.05 as n → ∞.
5
PSA-2022
1. Let {xj }j≥1 be a sequence of positive numbers in geometric
progression. Let Pn denote the product of the first n terms
of {xj }. Which of the following statements is true for all n ≥ 1?
3
(A) P3n = P2n /Pn3
2
(B) P3n = P2n /Pn
1/2
(C) P3n = P2n Pn2
(D) P3n = Pn P2n
S = {x ∈ (a, b) : f � (x) = 0} .
1
nx
3. Let x > 1. Then lim x−n n(−1)
n→∞
(A) equals ex .
(B) does not exist.
(C) equals +∞.
(D) equals 0.
(I) f + g is monotonic on [ a, b ].
(II) f g is monotonic on [ a, b ].
(III) The maxima of f + g is attained at either a or b.
(IV) The maxima of f g is attained at either a or b.
2
5. What is the range of the function
� �
2 1 −x2
f (x) = x + e , x ∈ (−∞, ∞) ?
2
�1 � � � � � � �
(A) 2
, e−1/2 (B) 0, 12 (C) 0, e−1/2 (D) 0, e−1/2
3
8. For θ ∈ R with θ �= 0, what is the value of
� ∞
x 2 2
√ eθx−θ x /2 dx ?
−∞ 2π
√ √
1 e e 1
(A) (B) (C) (D)
θ|θ| θ2 θ|θ| θ2
9. Let S� be the� set of all real numbers a such that the matrix
1 −2
M= has no real eigenvalue. Then,
8 a
4
11. Let M (x) denote the matrix
x 0 1
0 1 0 ,
x 0 −x
5
13. A six digit number is chosen at random. What is the probability
that at least two consecutive digits in the chosen number are
equal?
36 76
(A) 37
(B) 77
(C) 0.95 (D) 0.8
6
16. Suppose that X has a uniform distribution on {1, 2, . . . , n}.
Further suppose that conditioned on the event {X = x}, Y
has a uniform distribution on {1, 2, ..., x}. Let
py (x) = P (X = x | Y = y).
ρ √
(A) ρ (B) 1−ρ
(C) 0 (D) nρ
7
19. Let X be a random variable with the Student’s t-distribution
with 1 degree of freedom. Let M (t) be its moment generating
function. Which of the following statements is correct?
1
(A) Beta distribution on (0, 1) with parameters 2
and 12 .
(B) Gamma distribution with scale parameter 1 and shape
parameter 12 .
(C) Gamma distribution with scale parameter 1 and shape
parameter 32 .
(D) Beta distribution on (0, 1) with parameters 1 and 1.
8
21. Suppose X1 , X2 , . . . , X100 are independent and identically
100
�
distributed as Bernoulli(p), where p ∈ (0, 1). If Y = Xi and
i=1
Z = min{X1 , X2 , . . . , X100 }, then
(A) Y and Z do not have the same mean but have the same
variance.
(B) Y and Z have the same mean but not the same variance.
(C) Y and Z neither have the same mean nor the same variance.
(D) Y and Z have the same distribution.
1
Then Y
2n n
converges in probability to
9
23. Let X be a random variable. Define the function
P(X < x) + P(X ≤ x)
g(x) = , for all x ∈ R.
2
Which of the following statements is always true?
10
26. Suppose X1 , X2 , . . . , Xn is a random sample from the uniform
distribution on (0, θ), and let Yi = log Xi for i = 1, 2, . . . , n.
Which of the following statistics is an unbiased estimator of
log θ?
(A) Y − 1
(B) max(Y1 , Y2 , . . . , Yn )
(C) Y + 1
(D) log(2X)
(A) 3 e−2 (B) 5 e−2 (C) 1.4 e−0.4 (D) 1.44 e−0.4
� � √ √
(A) 1 − (1 − α)5 (B) (1 − α)5 (C) 1 − α5 (D) α5
11
29. Suppose Y1 , Y2 , . . . , Y5 is a random sample from the exponential
distribution with mean µ. Which of the following intervals
contains µ with 100(1−α)% confidence? Here xk,p is the number
�
5
such that P (χ2k > xk,p ) = p, and T = Yi .
i=1
� �
T
(A) 0,
x5,1−α
� �
2x10,α
(B) 0,
T
� �
5T
(C) 0,
− log(1 − α)
� �
2T
(D) 0,
x10,1−α
� 0 : θ = −1 against H1 : θ = 2, φ is a test of
(A) For testing H
size α.
� 0 : θ = −1 against H1 : θ = 2, φ is a MP test
(B) For testing H
of size α� , where α� < α.
� 1 : θ = −2, φ is a MP test
(C) For testing H0 : θ = 0 against H
of size α.
� 0 : θ = 1/2 against H1 : θ = 2, φ is a test of
(D) For testing H
size α� , where α� < α.
12
PSB-2022
GROUP A
GROUP B
x2 + 2U x + V 2 = 0.
1
5. Suppose r ≥ 1 distinct books are distributed at random among
n ≥ 3 children.
Y = 22 − 3X
X = 5.84 − 0.12Y
2
7. Suppose N students arriving at a college are all equally likely
to have a particular disease with an unknown probability p.
The disease status (affected / not affected) of all students are
independent. Blood samples are collected from all N students.
In order to estimate p, two strategies are proposed.
Strategy 1 Test all samples separately to obtain the status for all N
students.
(b) If a group tests positive, then all students within that group
are further tested individually. Suppose that each test (for
individual sample or pooled sample) has equal cost. Then,
which of the two strategies would you prefer to identify
all students affected with the disease when the underlying
p = 0.5, N = 200, and m = 20?
3
8. Based on historical data, a positive random variable X arising
from an unknown distribution is believed to have the chi-square
distribution with 1 degree of freedom. A new theory suggests
√
that it may be better to model X as exponentially distributed
with mean λ, where λ is such that E(X) for this model is the
same as that for the earlier model.
(a) Compute λ.
(b) Suppose X1 , X2 , . . . , Xn are independent observations from
this unknown distribution. For α ∈ (0, 1), consider the
most powerful level α test for the null hypothesis that the
earlier model is correct against the alternative that the new
model is correct. Show that the rejection region of this test
is
� �
� √ �√
(x1 , x2 , . . . , xn ) : xi − 2 2 xi > c
i i
4
9. The following ANOVA table for three factors A, B, and C was
obtained (under a suitable model) from some data, but several
values were illegible and marked ‘∗’. It is known that A had two
levels and there were 24 observations in all.
5
PSA-2021
1. Let f be a di↵erentiable function on R such that the number of ele-
ments in the set {x : f (x) = 0} is 50. Then the minimum number of
elements in the set {x : f 0 (x) = 0} is
2. Let P3 be the vector space of all polynomials with real coefficients and
of degree less than or equal to 3 over the field of real numbers. The
dimension for the subspace of polynomials p(x) such that
p(1) = p(2) = 0 is
Then
1
sin x4
4. lim is
x!0 (sin x)3
4
(A) 1 (B) 1 (C) 0 (D) 3
(A) p = q
(B) p and q MUST be rational numbers
(C) p + q MUST be a rational number
p
(D) MUST be a rational number
q
6. Let
[x3 ] [x]3
↵ = lim and = lim ,
x!1 x3 x!1 x3
where [x] denotes the greatest integer less than or equal to x. Then
(A) ↵ = 1, =0
(B) ↵ = 1, =1
(C) ↵ = 1, does not exist
(D) Neither ↵ nor exists
2
7. For every natural number n, let
Z n
In = ({x}n + { x}n ) dx
0
where {x} = x [x] where [x] denotes the greatest integer less than
or equal to x. Then In equals
2n 2nn+1
(A) (B) 0 (C) (D) n
n+1 n+1
3
10. A matrix is chosen at random from the set of all 2 ⇥ 2 matrices with
elements 0 or 1 only. The probability that the determinant of the
chosen matrix is non-zero is
3 3 1 1
(A) 16
(B) 8
(C) 8
(D) 4
11. If ( n C0 + n C1 )( n C1 + n C2 ) . . . ( n Cn 1 + n Cn ) = k n C0 n C1 . . . n Cn 1 ,
then k is equal to
nn
(A) n!
(n+1)n
(B) n!
(n+1)n
(C) nn!
(n+1)n+1
(D) n!
Then
4
" # " # " #
1 3 2 0 4 0
13. Let A = ,B= and C = . Then the sum
4 2 0 3 0 1
BAC C 2 AB 2 B 2n 1 AC 2n 1
C 2n AB 2n
trA + tr + tr + · · · + tr + tr +···
9 92 92n 1 92n
(A) is equal to 14
(B) is equal to 9
(C) is 1
(D) is equal to 12
14. Let F be the set of all bijections f from the set S = {1, 2, 3, 4, 5, 6}
onto itself such that f (f (i)) 6= i for any i 2 S. The number of elements
in F are
15. Let f : [0, 1] ! R be a monotonic function such that the number of its
R1
discontinuity points is finite. For t 0 let mt = 0 xt f (x)dx. Then
which of the following statements is true?
5
16. Let F (x) and G(x) be cumulative distribution functions of random
variables X and Y , respectively. Then which of the following would
NOT be a cumulative distribution function?
(A) 1/25
(B) 4/5
5 4
(C) e /e
4 5 4
(D) e e / 1 e
6
19. Let X be a random variable. Let F (t) denote its cumulative dis-
tribution function and M (t) denote its moment generating function.
Consider the following two statements.
V (Yn )
lim =
n!1 n
(A) 2 2 (2 2
1)
2
(B) 2
4
(C) 20
4
(D)
7
21. The mean and the variance of a set of observations are 10 and 25
respectively. When two new observations are included in the set, both
the mean and the variance remain the same. Which of the following
is a correct statement?
8
23. Suppose that the probability of a diagnostic test detecting the presence
of an infection is 0.9 for a diabetic individual and 0.8 for a non-diabetic
individual. An individual is randomly selected from the population.
What should be the ratio of the proportions of diabetic to non-diabetic
individuals in the population so that the probability of the selected
individual being diabetic given that the above diagnostic test produces
a negative result is the same as that of the individual being non-
diabetic given that the test produces a positive result?
(A) the two sample t-test statistic yields more power than the
ANOVA statistic for any non-zero value of µ1 µ2 and for any
m, n.
(B) the two sample t-test statistic and the ANOVA statistic yield the
same power for any non-zero value of µ1 µ2 and for any m, n.
(C) the two sample t-test statistic yields less power than the ANOVA
statistic for any non-zero value of µ1 µ2 and for any m, n.
(D) the two sample t-test statistic and the ANOVA statistic will yield
the same power for any non-zero value of µ1 µ2 if m = n but
will not necessarily yield the same power if m 6= n.
9
25. In order to compare between t( 2) treatments, one carries out t
replications of an experiment using a randomized block design with
t blocks. The error degrees of freedom for testing the equality of
treatment e↵ects in such a design is
(A) t + 1
(B) 3(t 1)
(C) (t 1)(t2 + t 1)
(D) t(t 1)(t + 1)
10
27. Suppose X1 , X2 , X3 are independent normally distributed random
2
variables with mean µ and variance . However, instead of
X1 , X2 , X3 , we only observe Y1 = X2 X1 and Y2 = X3 X2 . Which
2
of the following statistics is sufficient for ?
28. Consider data on body mass index (BMI) and fasting blood sugar
levels (fgl) for two groups of individuals. Suppose that the correlation
coefficient between BMI and fgl is equal to 0.5 in each of the two
groups. If the two groups are combined, the correlation coefficient
between BMI and fgl:
11
29. A random variable Z is obtained as follows. Let X ⇠ U (0, 1), and
Y given X = x be Bernoulli with probability of success x. If Y = 1,
Z is defined to be X. Otherwise, the experiment is repeated until a
pair (X, Y ) with Y = 1 is obtained. Then, the probability density
function of Z on (0, 1) is
(A) 2(1 z)
(B) 2z
(C) 12z 2 (1 z)
(D) 6z(1 z)
12
PSB-2021
Group A
1. Define f : R → R as
cos(2x) if x is rational,
f (x) =
sin2 (x) if x is irrational.
(a) f is continuous,
(b) f is differentiable.
8 5 3×2
3. Consider the set of all five digit integers formed by permuting the
digits 1, 2, 4, 6 and 7. Let X denote a randomly chosen integer from
this set and let Y denote the position, from the right, of the digit 4
in the randomly chosen integer. For example, if the integer chosen is
12647, then Y is 2 and for 41276, Y is 5.
1
Group B
(i) When the player is in state 1 or 6: If the roll of the die results
in k then the player moves to state k, for k = 1, . . . , 6.
(ii) When the player is in state 2 or 3: If the roll of the die results in
1, 2 or 3 then the player moves to state 4. Otherwise the player
moves to state 5.
(iii) When the player is in state 4 or 5: If the roll of the die results in
4, 5 or 6 then the player moves to state 2. Otherwise the player
moves to state 3.
The player wins when s/he visits 2 more states, besides the starting
one.
(a) Calculate the probability that the player will eventually move
out of states 1 and 6.
(b) Calculate the expected time the player will remain within states
1 and 6.
(c) Calculate the expected time for a player to win, i.e., to visit 2
more states, besides the starting one.
2
6. Let X1 , . . . , Xn be i.i.d. random variables which are uniformly dis-
tributed on (θ, 2θ), θ > 0.
X(n)
(a) Show that 2
is the maximum likelihood estimator (MLE) of θ
where X(n) = max(X1 , . . . , Xn ).
(b) Find an unbiased estimator for θ based on the MLE.
(c) Given any > 0, show that
X(n)
lim P − θ > = 0.
n→∞ 2
3
9. Consider a linear regression model:
yi = α + βxi + ei , i = 1, 2, . . . , n
where xi ’s are fixed and ei ’s are i.i.d. random errors with mean 0 and
variance σ 2 .
Define two estimators of β as follows
Pn Pn
i=1 y i xi y i
βb1 = Pn and βb2 = Pi=1
n 2
.
i=1 xi i=1 xi
4
PSA-2020
1. How many distinct straight lines can one form that are given by an
equation ax + by = 0, where a and b are numbers from the set
{0, 1, 2, 3, 4, 5, 6, 7}?
(A) 63.
(B) 57.
(C) 37.
(D) 49.
(A) an = n2 − 3n, n ≥ 1.
(B) an = 3n2 − n, n ≥ 1.
(C) an = log( 34 )n , n ≥ 1.
(D) an = 7n − n2 , n ≥ 1.
1
ROUGH WORK
4. Let f be the function on the set of real numbers defined by
ex sin x cos x
f (x) = e2x sin 2x cos 2x .
e3x sin 3x cos 3x
Then f 0 (0) is
6. Let √ ! !
1 3 1 2 −1
X= √ and A = .
2 −1 3 3 −2
Let X t denote the transpose of the matrix X. If B = XAX t , then
X t B 105 X equals
2
ROUGH WORK
7. Suppose
n
X
aj (n) = kj , j = 0, 1, 2, 3, n = 1, 2, . . . .
k=1
a1 (n)a2 (n)
Then limn→∞ a0 (n)a3 (n) equals
8. Let S be the set of all 3 × 3 real matrices A = ((aij )) such that the matrix
((a3ij )) has rank one. Let R be the set
R = {rank(A) : A ∈ S}.
Then R is equal to
(A) {1}. (B) {1, 2}. (C) {1, 3}. (D) {1, 2, 3}.
S1 : x2 + y 2 + 4x + 2y − 4 = 0,
S2 : x2 + y 2 − 4x − 4y + 4 = 0.
3
ROUGH WORK
10. Let A be the 3 × 3 matrix
1 2 4
A = 0 0 3
0 0 −1
Am,n = x ∈ R : n2 x3 + 2020x2 + mx = 0 .
Then the number of pairs (m, n) for which Am,n has exactly two points
is
12. The set of all real solutions of the inequality 2|x| > |x − 1| is
4
ROUGH WORK
13. Let S and T be two non-empty sets and f : S → T be a function such
that for all subsets A and B of S, we have f (A ∩ B) = f (A) ∩ f (B).
Then
14. Let S be the set of all 3 × 3 matrices A such that among the 9 entries
of A, there are exactly three 0’s, exactly three 1’s and exactly three 2’s.
The number of matrices in S that have trace divisible by 3 is
where for 1 ≤ k ≤ n
1 if xk ≤ t,
fk (t) =
0 otherwise.
Then A contains
5
ROUGH WORK
16. Let X, Y be independent and identically distributed random variables
with
P (X = k) = (0.75)(0.25)k , k = 0, 1, 2, . . . .
6
ROUGH WORK
19. Let X and Y be random variables with V (X) = 9 and V (Y ) = 4. Then
which of the following is true for non negative reals a and b?
21. A multiple choice test uses the following scoring procedure to discourage
students from guessing. For each correct response the score is 5, for each
incorrect response the score is 0 and for no response the score is 2. If
there are 4 choices for each question, what is the minimum number of in-
correct choices that the student must eliminate before it is advantageous
to guess among the rest than leave the question unanswered?
7
ROUGH WORK
22. Each of 100 laboratory rats has available plain water and a mixture of
water and caffeine in their cages. After 24 hours, two measurements
were recorded for each rat: the amount of caffeine consumed (X) and
the blood pressure (Y ). Based on the data, the correlation coefficient
between X and Y was found to be 0.428. Which of the following con-
clusions is justified on the basis of the study?
19 17 13 11
(A) 3 . (B) 3 . (C) 3 . (D) 3 .
24. Suppose X and Y are independent observations from N (0, 1). For t > 0,
define g(t) = P (|X − Y | > t | (X + Y ) > t). Then
8
ROUGH WORK
25. Let X1 , X2 , . . . , Xn be independent and identically distributed N (1, σ 2 )
random variables. Let σ c2 denote the maximum likelihood estimator of
σ 2 . Then
1 Pn
c2 =
(A) σ n i=1 (Xi − X̄)2 and it is biased.
1 Pn
c2 =
(B) σ n−1 i=1 (Xi − X̄)2 and it is unbiased.
1 Pn
c2 =
(C) σ n−1 i=1 (Xi − 1)2 and it is biased.
1 Pn
c2 =
(D) σ n i=1 (Xi − 1)2 and it is unbiased.
(A) Sample median. (B) (log 2)X̄. (C) log 2/X̄. (D) X̄/ log 2.
27. Suppose X is a discrete random variable taking values −1, 0, 1, each with
probability 31 . Let Y = |X|. Which one of the following statements is
correct?
9
ROUGH WORK
28. Let X1 , X2 , . . . , Xn be independent and identically distributed random
variables with probability density function
1 −| x−1 |
f (x) = e λ , −∞ < x < ∞,
2λ
where λ > 0 is an unknown parameter. Which one of the following is
a form of the most powerful test of its size for testing H0 : λ = 2 vs
H1 : λ = 1?
Pn
(A) Reject H0 if i=1 (Xi − 1)2 > C.
Pn
(B) Reject H0 if i=1 |Xi − 1| > C.
Pn
(C) Reject H0 if i=1 (Xi − 1)2 < C.
Pn
(D) Reject H0 if i=1 |Xi − 1| < C.
30. Suppose T1 and T2 are two different real valued statistics each of which
is sufficient for an unknown parameter θ. Then, which of the following
is always true?
10
ROUGH WORK
ROUGH WORK
ROUGH WORK
PSB-2020
GROUP A
(a) Compute P
i∈S Xi
E Pn .
i=1 Xi
GROUP B
1
5. Let U1 , U2 , . . . , Un be independent and identically distributed random
variables each having a uniform distribution on (0, 1). Let
X = min{U1 , U2 , . . . , Un }, Y = max{U1 , U2 , . . . , Un }.
yi = βxi + εi xi , i = 1, 2, . . . , n,
2
8. Assume that X1 , . . . , Xn is a random sample from N (µ, 1), with µ ∈ R.
We want to test H0 : µ = 0 against H1 : µ = 1. For a fixed integer
m ∈ {1, . . . , n}, the following statistics are defined:
T1 = (X1 + . . . + Xm )/m,
T2 = (X2 + . . . + Xm+1 )/m,
.. ..
.= .
Tn−m+1 = (Xn−m+1 + . . . + Xn )/m.
3
PSA
PSA-2019 2019
1. Let A = ((aij )) be an m×n matrix with all non-zero real entries. Let B
be obtained from A by replacing a11 by 0 and keeping all other entries
unchanged. If r is the rank of A, then what is the set of possible values
for the rank of B?
3. If 3(cos 100◦ + i sin 100◦ )(cos 110◦ + i sin 110◦ ) = x + iy, where x and y
are real numbers, then
√
(A) x = − 3 2 3 , y = − 32 .
√
3 3
(B) x = 2 , y = 32 .
√
3 3
(C) x = 2 , y = − 32 .
√
(D) x = −323, y = 32 .
1
ROUGH WORK
4. What is the number of 6 digit positive integers in which the sum of
the digits is at least 52?
be . . . zyx in the decimal system, i.e., x is the unit’s digit, y the ten’s
digit, and so on. What is z?
6. How many times does the digit ‘2’ appear in the set of integers
{1, 2, ..., 1000}?
2
ROUGH WORK
0 1 t
7. Let t be a real number. Then the rank of 2 t −1 equals
2 2 0
�200�
8. The number 100 /4100 lies in
3
ROUGH WORK
10. Two friends, one from Kolkata and one from Delhi, start driving to-
wards each other at the same time. It is given that the distance between
Kolkata and Delhi is 1455 km. One of them drives at a constant speed
of 80 kmph (km per hour), while the other drives at a speed of 50 kmph
during the first hour, 55 kmph during the second hour, 60 kmph dur-
ing the third hour, and so on (i.e., his speeds over successive hours are
in an arithmetic progression). How long will it take for them to meet
each other?
12. What is the set of numbers x in (0, 2π) such that log log(sin x + cos x)
is well-defined?
(A) [ π8 , 3π
8 ] (B) (0, π2 ) (C) (0, π4 ] (D) (0, π) ∪ ( 3π
2 , 2π)
4
ROUGH WORK
13. Let C1 and C2 be concentric circles with centre at O and radii r1 and
r2 respectively. The line OA2 intersects C1 at A1 . The line A1 D is
tangent to C1 at A1 . What is the length of the line segment A2 D?
� � �
(A) 2r1 (r2 − r1 ) (B) r22 − r12 (C) r2 (D) 2r2 (r2 − r1 )
C2 D
C1
O A1 A2
f
14. The reflection of the point (1, 2) with respect to the line x + 2y = 15 is
(A) (3, 6). (B) (6, 3). (C) (10, 5). (D) (5, 10).
15. How many solutions does the equation cos2 x + 3 sin x cos x + 1 = 0
have for x ∈ [0, 2π)?
5
ROUGH WORK
16. The functions f, g : [0, 1] → [0, 1] are given by f (x) = 12 x(x + 1) and
g(x) = 12 x2 (x + 1). What is the area enclosed between the graphs of
f −1 and g −1 ?
18. Draw one observation N at random from the set {1, 2, . . . , 100}. What
is the probability that the last digit of N 2 is 1?
6
ROUGH WORK
19. Let X be the number of tosses of a fair coin required to get the first
head. If Y | X = n is distributed as Binomial(n, 12 ), then what is
P(Y = 1)?
7
ROUGH WORK
22. A shopkeeper has 12 bulbs of which 3 are defective. She sells the bulbs
by selecting them at random one at a time. What is the probability
that the seventh bulb sold is the last defective one?
23. The chances of Smitha getting admitted to colleges A and B are 60%
and 40% respectively. Assume that colleges admit students indepen-
dently of each other. If Smitha is told that she has been admitted to
at least one college, what is the probability that she got admitted to
college A?
8
ROUGH WORK
25. Suppose Y, X1 , X2 , . . . , Xn are i.i.d. N (µ, 1) random variables, and
In = (X̄n − an , X̄n + an ) is a 95% confidence interval for µ. Then
P(Y ∈ In )
(A) converges to 1 as n → ∞.
(B) is greater than 0.95 for all n ≥ 1.
(C) is less than 0.95 for all n ≥ 1.
(D) equals 0.95 for all n ≥ 1.
9
ROUGH WORK
28. Let X1 , X2 , . . . , Xn be a random sample from a distribution with prob-
ability density function
(λ + 1) xλ 0 ≤ x ≤ 1
fλ (x) =
0 otherwise,
1 �
n
(C) −1 − n log Xi
i=1
�n
(D) 1 − n/( log Xi )
i=1
√ √ √
(A) Φ(2/ 7) (B) Φ(2/ 3) (C) Φ(2/ 5) (D) Φ(1)
What is E(X)?
10
ROUGH WORK
PSB
PSB-2019 2019
GROUP A
GROUP B
1
5. Suppose X1 , X2 , . . . , Xn are independent random variables such that
� of N .
(a) Find the maximum likelihood estimator N
�.
(b) Find the probability mass function of N
n+1 �
(c) Show that n N − 1 is an unbiased estimator of N .
2
8. Suppose {(xi , yi , zi ) : i = 1, 2, . . . , n} is a set of trivariate observations
on three variables: X, Y , and Z, where zi = 0 for i = 1, 2, . . . , n − 1
and zn = 1. Suppose the least squares linear regression equation of Y
on X based on the first n − 1 observations is
�0 + α
y=α �1 x
�1 = β�1 .
Show that α
(a) Find the constant c such that the test that “rejects when X > c”
has size 0.05 for the null hypothesis H0 : µ = 0.
(b) Find the power of this test against the alternative hypothesis
H1 : µ = 2.
3
PSA-2018
1. Let A be a 2 × 2 nonzero real matrix. Which of the following is true?
1
ROUGH WORK
4. Let A and B be n × n matrices. Assuming all the inverses exist,
(A−1 − B −1 )−1
equals
(A) (I − AB −1 )−1 B.
(B) A(B − A)−1 B.
(C) B(B − A)−1 A.
(D) B(A − B)−1 A.
Then f is differentiable at
2
ROUGH WORK
7. Let f be a function defined from (0, ∞) to R such that
Then f is
1/x
8. The value of lim log x
x→∞
x7 + 5x5 + x3 − 3x2 + 3x − 7 = 0
is
3
ROUGH WORK
10. Let x be a real number. Then
lim lim cos2n (m!πx)
m→∞ n→∞
(A) {a2n }n≥1 is always convergent but {a2n+1 }n≥1 need not be con-
vergent.
(B) both {a2n }n≥1 and {a2n+1 }n≥1 are always convergent and have
the same limit.
(C) {a3n }n≥1 is not necessarily convergent.
(D) both {a2n }n≥1 and {a2n+1 }n≥1 are always convergent but may
have different limits.
12. Let {an }n≥1 be a sequence of positive numbers such that an+1 ≤ an
for all n, and lim an = a. Let pn (x) be the polynomial
n→∞
pn (x) = x2 + an x + 1,
and suppose pn (x) has no real roots for every n. Let α and β be the
roots of the polynomial p(x) = x2 + ax + 1. Then
4
ROUGH WORK
13. Consider the set of all functions from {1, 2, . . . , m} to {1, 2, . . . , n},
where n > m. If a function is chosen from this set at random, what is
the probability that it will be strictly increasing?
n n m+n−1 m+n−1
/mn . /nm . /mn . /nm .
(A) m (B) m (C) m (D) m−1
14. A flag is to be designed with 5 vertical stripes using some or all of the
four colours: green, maroon, red and yellow. In how many ways can
this be done so that no two adjacent stripes have the same colour?
5
ROUGH WORK
16. Suppose X is a random variable with P (X > x) = 1/x2 , for all x > 1.
The variance of Y = 1/X 2 is
6
ROUGH WORK
19. Let X and Y be random variables with mean λ. Define
min(X, Y ) with probability 1 ,
2
Z=
max(X, Y ) with probability 1 .
2
What is E(Z)?
√
(A) λ. (B) 4λ/3. (C) λ2 . (D) 3λ/2.
2 3 1 2 3
(A) N. (B) N. (C) (N −2) + N. (D) (N −2) .
(A) an → 14 , bn → 12 , cn → 14 .
(B) an → 13 , bn → 13 , cn → 13 .
(C) an → 0, bn → 0, cn → 1.
(D) an → p, bn → p, cn → 1 − 2p.
7
ROUGH WORK
22. Suppose X1 , X2 and X3 are i.i.d. positive valued random variables.
Xi
Define Yi = X1 +X2 +X3 , i = 1, 2, 3. The correlation between Y1 and Y3
is
yi = βxi + i , for i = 1, . . . , n,
Then,
8
ROUGH WORK
25. Suppose X is a random variable with finite variance. Define X1 = X,
X2 = αX1 , X3 = αX2 , . . . , Xn = αXn−1 , for 0 < α < 1. Then
Corr(X1 , Xn ) is
13 −1/2
(A) 41 . (B) 1 − 8 e . (C) 1 − 23 e−1/2 . (D) 78 .
9
ROUGH WORK
28. Let X be a random variable with
(1 − θ) 1 θ
Pθ (X = −1) = , Pθ (X = 0) = , and Pθ (X = 1) =
2 2 2
for 0 < θ < 1. In a random sample of size 20, the observed frequencies
of −1, 0 and 1 are 6, 4 and 10, respectively. The maximum likelihood
estimate of θ is
29. Two judges evaluate n individuals, with (Ri , Si ) the ranks assigned to
the i-th individual by the two judges. Suppose there are no ties and
Si = Ri + 1, for i = 1, . . . , (n − 1), and Si = 1 if Ri = n. If the
Spearman’s rank correlation between the two evaluations is 0, what is
the value of n?
equals
√ √
(A) Φ(x 2). (B) Φ x/ 2 . (C) Φ(x). (D) Φ(2x).
10
ROUGH WORK
PSB-2018
GROUP A
x2 − 3x3 − x1 λ = 0,
x1 − 3x3 − x2 λ = 0,
x1 + x2 + x3 λ = 0.
GROUP B
4. An urn contains r > 0 red balls and b > 0 black balls. A ball is drawn
at random from the urn, its colour noted, and returned to the urn.
Further, c > 0 additional balls of the same colour are added to the
urn. This process of drawing a ball and adding c balls of the same
colour is continued. Define Xi = 1 if at the i-th draw the colour of the
ball drawn is red, and 0 otherwise. Compute E( ni=1 Xi ).
P
1
7. Suppose {(X1 , Y1 ), . . . , (Xn , Yn )} is a random sample from a bivariate
normal distribution with E(Xi ) = E(Yi ) = 0, Var(Xi ) = Var(Yi ) = 1
and unknown Corr(Xi , Yi ) = ρ ∈ (−1, 1), for all i = 1, . . . , n. Define
Wn = n1 ni=1 Xi Yi .
P
Based on the above sample, obtain the most powerful test for testing
H0 : θ = 0 against H1 : θ = 1, at level α, with 0 < α < 1. Find the
critical region in terms of the quantiles of a standard distribution.
yi = α + βxi + i , for i = 1, . . . , n,
(a) Let α
e denote the least squares estimate of α obtained assuming
β = 5. Find the mean squared error (MSE) of α
e in terms of the
model parameters.
(b) Obtain the maximum likelihood estimator of this MSE.
2
PSA-2017
1. Let f : R → R be a nonconstant function satisfying f (x + 2) = f (x)
for every real x. Then lim f (x)
x→∞
1
ROUGH WORK
4. Let A be a nonzero 2 × 2 matrix such that A3 = 0. Then which of the
following statements is always false?
(A) trace(A − A2 ) = 0
(B) trace(A + A2 ) = 0
(C) det(I + A) = 0
(D) A2 = 0
(A) [0, 1]. (B) (0, 1]. (C) R. (D) [−3, −2] ∪ [2, 3].
2
ROUGH WORK
7. The coordinates of the point at which the tangent of the curve
y = x2 − 1 makes an angle of 45◦ with the positive X-axis are
√ √
5−1 1− 5
(A) ( 2 , 2 ). (B) ( 23 , − 59 ). (C) ( 34 , − 16
7
). (D) ( 12 , − 43 ).
8. Let f : R → R be defined by
x2 sin 1
x if x 6= 0 ,
f (x) =
0 if x = 0 .
Then f is
γ α β
3
ROUGH WORK
10. Suppose the rank of
1 1 2 2
1 1 1 3
a b b 1
is 2 for some real numbers a and b. What is the value of b ?
1 1
(A) 3 (B) 3 (C) 1 (D) 2
11. What is the infinite sum x + 2x2 + 3x3 + · · · for |x| < 1 ?
x x 1 x
(A) 1−x2
(B) (1+x)2
(C) (1−x)2
(D) (1−x)2
12. A function f : R → R, such that f (x) = f (−x) for all x, has left
derivative 5 at x = 0. Then, the right derivative of f at x = 0
4
ROUGH WORK
13. Let f : R → R be defined by
b|x|c
X 1
f (x) = ,
2k
k=0
where byc denotes the greatest integer less than or equal to y. Then,
Then, what is the set of all possible values that f 0 (0) can take?
15. Suppose that the events A, B, and C are pairwise independent such
that each of them occurs with probability p. Assume that all three of
them cannot occur simultaneously. What is P (A ∪ B ∪ C) ?
5
ROUGH WORK
16. Let X1 , X2 , . . . be a sequence of independent random variables dis-
tributed exponentially with mean 1. Suppose that N is a random
variable, independent of the Xi -s, that has a Poisson distribution with
mean λ > 0. What is the expected value of X1 + X2 + · · · + XN 2 ?
Var(X − Y ) ≥ |σX − σY |k
hold for all choices of random variables X and Y with finite variances
2 and σ 2 , respectively?
σX Y
18. Consider a diagnostic test for a disease that gives the correct result with
probability 0.9, independently of whether the subject being diagnosed
has the disease or not. Suppose that 20% of the population has the
disease. For a particular subject, given that the test indicates presence
of the disease, what is the conditional probability that the subject has
the disease?
9 9 1 1
(A) 10 (B) 13 (C) 2 (D) 5
6
ROUGH WORK
19. Let X1 , X2 , X3 , X4 be independent standard normal random variables.
Then what is the distribution of
(X1 + X2 + X3 + X4 )2
?
(X1 − X2 + X3 − X4 )2
21. Assume that the events A and B are such that A ⊂ B and P (B) > 0.
For every event E, define 1E to be the random variable which takes
value 1 or 0 depending on whether E occurs or not, respectively. Then,
Cov(1A , 1B ) equals
(A) P (A|B)P (B c ).
(B) P (A)P (B c ).
(C) P (A)P (B).
(D) P (B)P (Ac ).
7
ROUGH WORK
22. Suppose that X is chosen uniformly from {1, 2, . . . , 100}, and given
X = x, Y is chosen uniformly from {1, 2, . . . , x}. What is P (Y = 30) ?
1
(A) 100
70 1
(B) 100 × 30
1 1 1 1
(C) 100 × 1+ 2 + + ··· +
3 30
1 1 1
(D) 100 × 30 + · · · + 100
23. Three numbers are chosen at random from {1, 2, ..., 10} without re-
placement. What is the probability that the minimum of the chosen
numbers is 3 or their maximum is 7 ?
13 19 3 11
(A) 40 (B) 60 (C) 10 (D) 40
8
ROUGH WORK
25. Suppose that X1 , . . . , Xn are independent and identically distributed
normal random variables with mean θ and variance θ, where θ > 0.
Then,
n
X
(A) Xi is sufficient for θ.
i=1
Xn
(B) Xi2 is sufficient for θ.
i=1
n
X
Xi + Xi2 is sufficient for θ.
(C)
i=1
n
X n
X
(D) neither Xi nor Xi2 is sufficient for θ.
i=1 i=1
9
ROUGH WORK
28. For n ≥ 2, let X1 , . . . , Xn be independent and identically distributed
normal random variables with mean 0 and variance σ 2 . Consider the
most powerful test for the null hypothesis H0 : σ = 2 against the
alternative H1 : σ = 1, at significance level α for some 0 < α < 1.
What is the power of this test?
In what follows, Fm denotes the cumulative distribution function of a
χ2 random variable with m degrees of freedom, and χ2m,β satisfies the
equation Fm (χ2m,β ) = 1 − β, for every m ≥ 1 and β ∈ (0, 1).
10
PSB-2017
GROUP A
where f (k) denotes the k-th derivative of f for k ≥ 1. Prove that there
exists x ∈ (0, 1) such that f (n+1) (x) = 0.
GROUP B
1
6. Suppose you have a 4-digit combination lock, but you have forgotten
the correct combination. Consider the following three strategies to find
the correct one:
yi = β1 x1i + β2 x2i + i , i = 1, . . . , n ,
where x1i and x2i are scalar covariates, β1 and β2 are unknown scalar
coefficients, and i are uncorrelated errors with mean 0 and variance
σ 2 > 0. Instead of using the correct model, we obtain an estimate β̂1
of β1 by minimizing
n
X
(yi − β1 x1i )2 .
i=1
2
TEST CODE: PSA (Objective type) and PSB (Short answer type) 2016
SYLLABUS
Mathematics
Arithmetic, geometric and harmonic progressions. Trigonometry. Two dimensional
coordinate geometry: Straight lines, circles, parabolas, ellipses and hyperbolas.
Elementary set theory. Functions and relations. Elementary combinatorics: Per-
mutations and combinations, Binomial and multinomial theorem.
Theory of equations.
Complex numbers and De Moivre’s theorem.
Vectors and vector spaces. Algebra of matrices. Determinant, rank, trace and
inverse of a matrix. Solutions of linear equations. Eigenvalues and eigenvectors of
matrices.
Limits and continuity of functions of one variable. Differentiation. Leibnitz for-
mula. Applications of differential calculus, maxima and minima. Taylor’s theorem.
Indefinite integral. Fundamental theorem of calculus. Riemann integration and prop-
erties. Improper integrals.
1
PSA-2016
Basic experimental designs such as CRD, RBD, LSD and their analyses. Ele-
ments of factorial designs. Conventional sampling techniques (SRSWR/SRSWOR)
including stratification. Ratio and regression methods of estimation.
1. The number of functions f : {1, 2, . . . , 10} → {1, 2, . . . , 10} such that f (x) 6= x
for all x is
2. The set of all ordered pairs of real numbers (x, y) satisfying satisfying y 2 − 2y −
x2 + 4x = 3 is a
(A) circle (B) point (C) hyperbola (D) pair of straight lines.
3. Let
log(2 + x) − x2n sin x
f (x) = lim for x > 0.
n→∞ 1 + x2n
Then
(A) f is continuous at x = 1
(B) lim f (x) 6= lim f (x)
x→1+ x→1−
(C) lim f (x) = sin 1
x→1+
(D) lim f (x) does not exist.
x→1−
5. Let
1 1
f (x) = x2 + 2
+x+ , x>0
x x
and let m = min{f (x)}. Then
2
6. Let f be a convex function, i.e.,
7. Suppose A is a 100 × 100 real symmetric matrix whose diagonal entries are all
positive. Then which of the following is necessarily true?
(A) All eigenvalues of A are greater than 0
(B) no eigenvalue of A is greater than 0
(C) at least one eigenvalue of A is greater than 0
(D) none of these.
8. The integral
Z 1
sin x
dx
0 xα
10. How many 5 × 5 matrices are there such that each entry is 0 or 1 and each row
sum and each column sum is 4?
3
π k
11. For n ≥ 1, let Gn be the geometric mean of {sin( · ) : 1 ≤ k ≤ n}. Then
2 n
lim Gn is
n→∞
1
(A) 1/4 (B) log 2 (C) 2 log 2 (D) 1/2.
12. Suppose a, b, x, y are real numbers such that a2 + b2 = 81, x2 + y 2 = 121 and
ax + by = 99. Then the set of all possible values of ay − bx is
9 9 9
(A) {0} (B) 0, (C) 0, (D) ,∞ .
11 11 11
14. Three distinct squares are selected at random from a 8 × 8 chess board. Then
the probability that they form an L-shaped pattern (looked at from one fixed
side only) as drawn below is
196 49 36
(A) 64
(B) 64
(C) 64
(D) greater than 1/2.
3 3 3
eλ − 1 1 1 − e−λ 1 − e−λ
(A) (B) (C) (D) .
λ λ+1 λ λ+1
4
17. There are 10 boxes each containing 6 white and 7 red balls. Two different boxes
are chosen at random, one ball is drawn simultaneously at random from each
and transferred to the other box. Now a box is again chosen from the 10 boxes
and a ball is chosen from it. Then the probability that this ball is white is
18. The number of cars (X) arriving at a service station per day is a random variable
with mean 4. The service station can provide service to a maximum of 4 cars
per day. Then the expected number of cars per day that do not get serviced
equals
∞
X ∞
X
(A) 4 (B) 0 (C) iP (X = i + 4) (D) iP (X = i − 4).
i=0 i=4
20. Assume that (X, Y ) is bivariate Normal with E(X) = E(Y ) = 0, Var(X) = σ12 ,
Var(Y ) = σ22 and Cor(X, Y ) = ρ for some ρ ∈ (−1, 1). The probability that X
is larger than Y is
σ1 −σ2 σ12 +σ22
(A) 1/2 (B) Φ √ (C) Φ √ (D) none of the above.
1−ρ2 1−ρ2
21. Suppose that the bivariate data (x1 , y1 ), . . . , (xn , yn ) lie on the straight line
y = a + bx for some a, b ∈ R. Assume further that neither all the xi ’s are same,
nor are all the yi ’s. Which of the following values is not a possibility for the
correlation coefficient calculated from the above data?
22. In the randomised block design for ANOVA where k is the number of treatments
and b is the number of blocks, the degrees of freedom for error is given by
5
23. Suppose X1 , X2 , . . . , Xn is a random sample from a population with mean µ
and finite variance. Consider the following two estimators of µ2 :
1 XX
E1 = Xi Xj
n(n − 1)
1≤i6=j≤n
1 X 2
E2 = X̄ 2 − (Xi − X̄) .
n(n − 1)
1≤i≤n
Then,
24. It is known that the proportion of smokers (p) in a population lies in the interval
[1/3, 2/3]. In a random sample of N individuals selected from the population, it
was found that M were smokers. The maximum likelihood estimate of p based
on the above data is
(A) is √1
bd
p
(B) is d/b
p
(C) is b/d
(D) cannot be determined from the above information.
26. Let bxc denote the largest integer not larger than x. If X is distributed as
N (0, 1), then E(bXc)
6
27. X and Y are i.i.d. random variables with finite variances. Then
(A) E(α̂) = α
α
(B) E(α̂) = 1
1− np
α
(C) E(α̂) = 1−np
30. Let X1 , . . . , Xn be i.i.d. from N (0, σ 2 ). What is the form of the most powerful
test for testing the null hypothesis
H0 : σ = 1 ,
7
PSB-2016
SAMPLE QUESTIONS: PSB
1. In the diagram below, L(x) is a straight line that intersects the graph of a
polynomial P (x) of degree 2 at the points A = (−1, 0) and B = (5, 12). The
area of the shaded region is 36 square units.
Obtain the expression for P (x).
L(x)
A
P (x)
T r(A2 ) = T r(A) = n.
Tc x1 , x2 , x3 , x4 := (1 + c)x1 , x2 + cx3 , x3 + cx2 , (1 + c)x4 .
5. A box contains 50 red balls, 30 green balls and 20 blue balls. Suppose balls are
drawn successively at random with replacement from the box. Let N denote
the minimum number of draws required to obtain balls of all three colours.
Compute P (N > n) for all positive integers n.
8
6. Let X1 , . . . , Xn be i.i.d. random variables from a continuous distribution whose
density is symmetric around 0. Suppose E(|X1 |) = 2. Define
n
X n
X
Y = Xi and Z= 1(Xi > 0) .
i=1 i=1
If the regression is now performed using all (n1 + n2 ) observations, will the
regression equation remain the same? Justify your answer.
10. A cake weighing one kilogram is cut into two pieces, and each piece is weighed
separately. Denote the measured weights of the two pieces by X and Y . Assume
that the errors in obtaining X and Y are independent and normally distributed
with mean zero and the same (unknown) variance. Devise a test for the hy-
pothesis that the true weights of the two pieces are equal.
9
TEST CODE: PSA (Objective type) and PSB (Short answer type) 2016
SYLLABUS
Mathematics
Arithmetic, geometric and harmonic progressions. Trigonometry. Two dimensional
coordinate geometry: Straight lines, circles, parabolas, ellipses and hyperbolas.
Elementary set theory. Functions and relations. Elementary combinatorics: Per-
mutations and combinations, Binomial and multinomial theorem.
Theory of equations.
Complex numbers and De Moivre’s theorem.
Vectors and vector spaces. Algebra of matrices. Determinant, rank, trace and
inverse of a matrix. Solutions of linear equations. Eigenvalues and eigenvectors of
matrices.
Limits and continuity of functions of one variable. Differentiation. Leibnitz for-
mula. Applications of differential calculus, maxima and minima. Taylor’s theorem.
Indefinite integral. Fundamental theorem of calculus. Riemann integration and prop-
erties. Improper integrals.
1
PSA-2016
Basic experimental designs such as CRD, RBD, LSD and their analyses. Ele-
ments of factorial designs. Conventional sampling techniques (SRSWR/SRSWOR)
including stratification. Ratio and regression methods of estimation.
1. The number of functions f : {1, 2, . . . , 10} → {1, 2, . . . , 10} such that f (x) 6= x
for all x is
2. The set of all ordered pairs of real numbers (x, y) satisfying satisfying y 2 − 2y −
x2 + 4x = 3 is a
(A) circle (B) point (C) hyperbola (D) pair of straight lines.
3. Let
log(2 + x) − x2n sin x
f (x) = lim for x > 0.
n→∞ 1 + x2n
Then
(A) f is continuous at x = 1
(B) lim f (x) 6= lim f (x)
x→1+ x→1−
(C) lim f (x) = sin 1
x→1+
(D) lim f (x) does not exist.
x→1−
5. Let
1 1
f (x) = x2 + 2
+x+ , x>0
x x
and let m = min{f (x)}. Then
2
6. Let f be a convex function, i.e.,
7. Suppose A is a 100 × 100 real symmetric matrix whose diagonal entries are all
positive. Then which of the following is necessarily true?
(A) All eigenvalues of A are greater than 0
(B) no eigenvalue of A is greater than 0
(C) at least one eigenvalue of A is greater than 0
(D) none of these.
8. The integral
Z 1
sin x
dx
0 xα
10. How many 5 × 5 matrices are there such that each entry is 0 or 1 and each row
sum and each column sum is 4?
3
π k
11. For n ≥ 1, let Gn be the geometric mean of {sin( · ) : 1 ≤ k ≤ n}. Then
2 n
lim Gn is
n→∞
1
(A) 1/4 (B) log 2 (C) 2 log 2 (D) 1/2.
12. Suppose a, b, x, y are real numbers such that a2 + b2 = 81, x2 + y 2 = 121 and
ax + by = 99. Then the set of all possible values of ay − bx is
9 9 9
(A) {0} (B) 0, (C) 0, (D) ,∞ .
11 11 11
14. Three distinct squares are selected at random from a 8 × 8 chess board. Then
the probability that they form an L-shaped pattern (looked at from one fixed
side only) as drawn below is
196 49 36
(A) 64
(B) 64
(C) 64
(D) greater than 1/2.
3 3 3
eλ − 1 1 1 − e−λ 1 − e−λ
(A) (B) (C) (D) .
λ λ+1 λ λ+1
4
17. There are 10 boxes each containing 6 white and 7 red balls. Two different boxes
are chosen at random, one ball is drawn simultaneously at random from each
and transferred to the other box. Now a box is again chosen from the 10 boxes
and a ball is chosen from it. Then the probability that this ball is white is
18. The number of cars (X) arriving at a service station per day is a random variable
with mean 4. The service station can provide service to a maximum of 4 cars
per day. Then the expected number of cars per day that do not get serviced
equals
∞
X ∞
X
(A) 4 (B) 0 (C) iP (X = i + 4) (D) iP (X = i − 4).
i=0 i=4
20. Assume that (X, Y ) is bivariate Normal with E(X) = E(Y ) = 0, Var(X) = σ12 ,
Var(Y ) = σ22 and Cor(X, Y ) = ρ for some ρ ∈ (−1, 1). The probability that X
is larger than Y is
σ1 −σ2 σ12 +σ22
(A) 1/2 (B) Φ √ (C) Φ √ (D) none of the above.
1−ρ2 1−ρ2
21. Suppose that the bivariate data (x1 , y1 ), . . . , (xn , yn ) lie on the straight line
y = a + bx for some a, b ∈ R. Assume further that neither all the xi ’s are same,
nor are all the yi ’s. Which of the following values is not a possibility for the
correlation coefficient calculated from the above data?
22. In the randomised block design for ANOVA where k is the number of treatments
and b is the number of blocks, the degrees of freedom for error is given by
5
23. Suppose X1 , X2 , . . . , Xn is a random sample from a population with mean µ
and finite variance. Consider the following two estimators of µ2 :
1 XX
E1 = Xi Xj
n(n − 1)
1≤i6=j≤n
1 X 2
E2 = X̄ 2 − (Xi − X̄) .
n(n − 1)
1≤i≤n
Then,
24. It is known that the proportion of smokers (p) in a population lies in the interval
[1/3, 2/3]. In a random sample of N individuals selected from the population, it
was found that M were smokers. The maximum likelihood estimate of p based
on the above data is
(A) is √1
bd
p
(B) is d/b
p
(C) is b/d
(D) cannot be determined from the above information.
26. Let bxc denote the largest integer not larger than x. If X is distributed as
N (0, 1), then E(bXc)
6
27. X and Y are i.i.d. random variables with finite variances. Then
(A) E(α̂) = α
α
(B) E(α̂) = 1
1− np
α
(C) E(α̂) = 1−np
30. Let X1 , . . . , Xn be i.i.d. from N (0, σ 2 ). What is the form of the most powerful
test for testing the null hypothesis
H0 : σ = 1 ,
7
PSB-2016
SAMPLE QUESTIONS: PSB
1. In the diagram below, L(x) is a straight line that intersects the graph of a
polynomial P (x) of degree 2 at the points A = (−1, 0) and B = (5, 12). The
area of the shaded region is 36 square units.
Obtain the expression for P (x).
L(x)
A
P (x)
T r(A2 ) = T r(A) = n.
Tc x1 , x2 , x3 , x4 := (1 + c)x1 , x2 + cx3 , x3 + cx2 , (1 + c)x4 .
5. A box contains 50 red balls, 30 green balls and 20 blue balls. Suppose balls are
drawn successively at random with replacement from the box. Let N denote
the minimum number of draws required to obtain balls of all three colours.
Compute P (N > n) for all positive integers n.
8
6. Let X1 , . . . , Xn be i.i.d. random variables from a continuous distribution whose
density is symmetric around 0. Suppose E(|X1 |) = 2. Define
n
X n
X
Y = Xi and Z= 1(Xi > 0) .
i=1 i=1
If the regression is now performed using all (n1 + n2 ) observations, will the
regression equation remain the same? Justify your answer.
10. A cake weighing one kilogram is cut into two pieces, and each piece is weighed
separately. Denote the measured weights of the two pieces by X and Y . Assume
that the errors in obtaining X and Y are independent and normally distributed
with mean zero and the same (unknown) variance. Devise a test for the hy-
pothesis that the true weights of the two pieces are equal.