GTU Probability and Statistics Winter 2021 Pyq With Solution
GTU Probability and Statistics Winter 2021 Pyq With Solution
(b) A card is drawn at random from a pack of 52 cards. What is the probability
that the card is a spade or a king?
In a standard deck of 52 cards:
There are 13 spades.
There are 4 kings.
To find the probability, we use the formula for the probability of either of two events (A or B):
P (AorB) = P (A) + P (B) − P (AandB)
Here, A is the event of drawing a spade, and B is the event of drawing a king.
P (A) = T otalnumberofspades/T otalnumberofcards = 13/52
(c) State Baye’s theorem. There are three bags; first containing 1 white, 2 red
and 3green balls; second 2 white, 3 red and 1 green balls and third 3 white, 1
red and 2 green balls. Two balls are drawn from a beg chosen at random. These
are found to be 1 white and 1 red. Find the probability that the balls so drawn
came from the second bag.
Bayes' theorem describes the probability of an event based on prior knowledge of conditions that might be related to the
event.
P (A ∣ B) = (P (B ∣ A) ⋅ P (A))/P (B)
B = 1 white + 1 red
P(A2/B) = P(A2).P(B/A2) / P(A1).P(B/A1) + P(A2).P(B/A2) + P(A3).P(B/A3)
After doing the calculation, the result wil be 0.545
Q.2
(a) Two judges in a beauty contest rank the 12 contestants as follows:
x 1 2 3 4 5 6 7 8 9 10 11 12
y 12 9 6 10 3 5 4 7 6 2 11 1
(b) A book contains 100 misprints distributed randomly throughout its 100
pages. What is the probability that a page observed at random contains at
least 2 misprints.
Given that there are 100 misprints distributed randomly over 100 pages, we can assume that the average rate (λ) of misprints
per page is 1. The Poisson probability mass function is given by:
P(X=x)= e^(−λ) * (λ)ˣ / x!
n=100
λ=1
P(X≥2)=1-P(X<2)
= 1-[P(X=0)+P(X=1)]
= 1-[0.3679+0.3679]
= 0.2642
(c) A die is thrown six times. If getting an odd number is a success, find the
probability of (i) 5 success (ii) at least five success and (ii ) at most five success.
n=6. Because the value of n is less, we wil use Binomial distribution
P = 1/2 = 0.5
i) P(x=5) = nCxP x Qn−x
6C5(0.5)5 (0.5)1 = 0.0938
i ) P(X≥5) = P(X=5)+P(X=6)
= 0.0938 + 6C6(0.5)6 (0.5)0 = 0.938 + 0.0156 = 0.1094
i i) P(X≤5)
1 − P (x > 5) = 1 − P (x = 6) = 1 − 0.0156 = 0.9844
OR
(c) If a random variable x is Gamma distribution with parameter 𝜆 = 3, compute
the value of (i) P(x≤1) and (ii) P(1≤x≤2)
f(x) = Γr𝜆 xr−1 e−𝜆x , x > 0
r
=0
Let r = 1 (suppose)
𝜆 = 3 (given)
f(x) = 3x0 e−3x
= 3e−3x , x > 0
CV = x6ˉ ∗ 100%
= 13.9284
21
∗ 100 % = 66.3257%
x̄ = ∑∑ffixii = 420
20 = 21
Therefore,
√ 3880
20 = 13.9284
Class Fi CF
0-30 8 8
30-60 13 21
60-90 22 48
90-120 (L) 27 70
120-150 18 88
150-180 7 95
∑=95
n
M=l+ −m
n
2
∗c
f
∑f
n
2
= 2
= 75/2 = 47.5
= 95
(c) Compute the correlation coefficients between X and Y using following data:
X 2 4 5 6 8 11
Y 18 12 10 8 7 5
x̄ = ∑nxi = 6
Ȳ = ∑nyi = 10
r = √ ∑∑(x−(x−xˉ)xˉ√)(Y∑−(YYˉ )−Yˉ ) 2 2
−67
√50√106
= −0.9203
OR
Q.3 (a) Obtain correlation coefficient between x and y if two regression lines are
4x5y+33=0 and 20x-9y-107=0.
Ans. r = √byxbxy
5y=4x+33:
y=4/5x + 33/5
byx = 4/3 < 1
20x=9y+107: x = 9/20y + 107/20 bxy = 9/20 < 1
= 3/5 = 0.6
(b) Calculate the mode for the following data:
Class Interval 0-10 10-20 20-30 30-40 40-50
Frequency 10 14 19 7 13
Mode z = l + 2f 1−f
f 1−f 0
0−f 2 ∗ c
Class fi
0-10 10
10-20 14 f0
l=20-30 19 (highest freq. or f1)
30-40 7 f2
40-50 13
Ans.
x y x-x̄ Y-Ȳ (x-x̄)(Y-Ȳ) (x-x̄)^2
100 85 8 4 32 64
98 90 6 9 54 36
78 70 -14 -11 154 196
85 72 -7 -9 63 49
110 95 18 14 252 324
93 81 1 0 0 1
80 74 -12 -7 84 144
639 814
Reg y on x,
Y-Ȳ=byx(x-x̄)
byx = ∑(x− xˉ)(Y − Yˉ )
∑ (x− xˉ) 2
x̄ = ∑nxi = 92
Ȳ= ∑nyi = 81
= 639/814
= 0.7850
Q.4
(a) Explain the term related to testing of hypothesis:
(i) Null hypothesis
Null Hypothesis (H0): It means that there is no significant difference or effect observed, essential y stating that any
observed variation in data is due to chance or random occurrences.
(ii)Alternate hypothesis
Alternate Hypothesis (H1 or Ha): It proposes an effect or difference, suggesting that observed data variations are not
just due to random chance but due to a specific cause or factor.
(iii) Errors while accepting or rejecting a hypothesis
Errors in Hypothesis Testing:
Type I Error: Occurs when the null hypothesis is true, but is incorrectly rejected. It's also known as a "false positive."
Type II Error: Occurs when the null hypothesis is false, but is incorrectly accepted. It's known as a "false negative."
5. Z0.05 = 1.96
6. MOD(x) < MOD(Z0.05)
Therefore, the hypothesis is accepted.
(c) Fit a binomial distribution for the following data showing the survey of 800
families with 4 children and test the goodness of fit
No. of boys 0 1 2 3 4
No. of girls 4 3 2 1 0
No. of families 32 178 290 238 64
Ans.
No. of boys f0 fe (f0-fe)^2/fe
0 32 50 6.48
1 178 200 2.42
2 290 300 0.333
3 238 200 7.22
4 64 50 3.92
∑=20.3733
5. v=n-1 = 5-1 = 4
x2 0.05(v = 4) = 9.488
6. x2 > x2 0.05
Therefore, it is rejected
OR
Q.4 (a) A random sample of size 15 from bivariate normal distribution gave a
correlation coefficient r=0.5. Is this indicate the existence of correlation in the
population?
Ans. n=15, r=0.5
Testing the hypothesis:
1. H0: There is no co-relation 3=0
2. H1 = 3≠0
3. α=0.05
4. t = r√n−2
1−r2
= 2.0817
5. v=n-2=15-2=13
t0.05(v=13) = 2.16
6. t < t0.05
Therefore, it is accepted
(b) A tire company is suspicious to claim that the average lifetime of certain
tires is at least 28000 km. To check the claim, the company takes the sample
of 40 tires and gets a mean life time of 27463 km with standard deviation of
1348 km. Test the hypothesis at 1% level of significance
It has large sample & single mean.
μ=28000
n=40
x=27463
s=1348
Testing the hypothesis:
1. H0: μ=28000 km
2. H1: μ≠28000 km
3. α=0.01
4. z = Sxˉ/√n
−μ
= −2.5795
MOD(z) = 2.5795
5. Z0.01 = -2.333
MOD(Z0.01) = 2.333
6. MOD(z)>MOD(z0.01)
Therefore, it is rejected
(c) Fit a Poisson distribution for the following data and test the goodness of fit.
x 0 1 2 3 4
f 112 73 30 4 1
Ans.
x f0 fixi fe (f0-fe)^2/fe
0 112 0 42
1 73 73 76
2 30 60 2.5
3 4 12 6
4 1 4 1
x f0 fixi fe (f0-fe)^2/fe
=220
∑ ∑=149 =0.3997
∑
Q.5
(a) In 𝑦 = 𝑎 + 𝑏𝑥 if ∑ 𝑥 = 50, ∑ 𝑦 = 80, ∑ 𝑥𝑦 = 1030, ∑ 𝑥^2 = 750 and 𝑛 =
10, then find 𝑎 and 𝑏
y= a + bx
∑ y = na + b∑x
∑ xy = a∑x + b∑x^2
a = 1.7
b = 1.26
y = 1.7 + 1.26x
lny = lna + bx
Say, lny = Y, lna = A and bx = bx
n= 4
∑ Y = nA + b∑x
∑ xY = A∑x + b∑x^2
x y Y=lny xY x^2
1 7 1.9459 1.9459 1
2 11 2.3979 4.7958 4
3 17 2.8332 8.4996 9
4 27 3.2958 13.1832 6
10 10.47 28.42 30
A = 1.4969 b = 0.4485
a = eA = 4.4678
y = ae^bx = (4.4678)e^0.44485x
(c) State properties of the normal distribution. Suppose the marks of 800
students are normally distributed with mean 66 and standard deviation 5. Find
number of students getting marks (i) between 65 and 70 (ii) greater than or
equal to 72 (Given that P(0≤z≤0.20)=0.0793, that P(0≤z≤0.80)=0.2881 and that
P(0≤z≤1.2)=0.3849)
Properties of normal distribution:
1. It is symmetric
2. Its mean/median/mode are equal
n=800, μ=66, σ = 5
(i) P(65<x<70)
z1 = x1−μ
σ = −0.20
z2 = x2−μ
σ = 0.80
OR
Q.5 (a) A random variable x has the following probability distribution:
xi 0 1 2 3
pi 1/6 3/8 3/8 1/8
Q.5 (b) With usual notations, find the value of p for a binomial random variable
x when n=6 and 9P(x=4)=P(x=2).
q p(x=4) = p(x=2)
P(X=x) = nCxP x qn−x
q[6C4P 4 q2 ] = 6C2P 2 q4 q(15p4 q2 ) = 15p2 q4 qp2 = q2 qp2 = (1 − p)2 qp2 = 1 − 2p + p2 8p2 + 2p − 1 = 0
(2P+1)(4P-1)=0
P=− 12 but that’s not possible so
P= 1/4
Ans.
x y x^2 x^3 x^4 xy x^2y
.. .. .. .. .. .. ..
∑ =2 ∑ =5 ∑ =6 ∑ =8 ∑ =18 ∑ =12 ∑ =16
a = -0.25
b= 2.15
c=0.55
y = ax2 + bx + c
∑y = a∑x^2 + b∑x + nc
∑xy = a∑x^3 + b∑x^2 + c∑x
∑ x2 y = a ∑ x4 + b ∑ x3 + c ∑ x2