Data Analytics 8th TA Session
Data Analytics 8th TA Session
()( )
0 3−0
0 1 1
P(X=0)=C 3∗ ∗ 1− =0.5787…
6 6
(b) 1 five?
()( )
1 3 −1
1 1 1
P(X=1)=C 3∗ ∗ 1− =0.34722..
6 6
(c) 3 fives?
()( )
2 3 −2
1 2 1
P(X=2)=C ∗ ∗ 1−
3 =0.0046…
6 6
Q2. A very large shipment of parts contains 10% defectives. Two parts are chosen at random
from the shipment and checked. Let the random variable X denote the number of defectives found.
Find the probability distribution function of this random variable
N=2
π=0.1
X = number of defective parts found in the chosen 2 parts
X=k P(X=k)
0 0 2−0
0 C 2∗( 0.10 ) ∗( 1−0.10 ) =0.81
1 1 2−1
1 C 2∗( 0.10 ) ∗( 1−0.10 ) =0.18
2 2 2−2
2 C 2∗( 0.10 ) ∗( 1−0.10 ) =0.0 1
Q3. A (blindfolded) marksman finds that on the average he hits the target 4 times out of 5. If he
fires 4 shots, what is the probability of
1
(a) More than 2 hits?
(b) At least 3 miss?
(c) Expectation of X
(d) Variance of X
Q4. The following table shows the probabilities of music CD sales per minute of Tower Records
in a given month (back when there was a Tower Records, and back when people actually bought
music CDs) associated with the random distribution of the number of concerts scheduled in that
month. It appears that the two random variables may be correlated. (Note that the zeros at certain
points in the table are by themselves suggestive.) We note that the values in the table are
retrospective – we have simply tabulated observed results over a long history.
CDs sold Per Minute(X)
0 1 2 3 4 Total
0 0.02 0.05 0.04 0.01 0 0.12
Concerts
1 0.02 0.04 0.06 0.03 0 0.15
per month
2 0.01 0.03 0.3 0.04 0.02 0.4
(Y)
3 0 0.03 0.1 0.12 0.08 0.33
Total 0.05 0.15 0.5 0.2 0.1 1
2
Q6. Suppose X is a normal variable with mean 20.0 and standard deviation 10.0. Now find
the X value, , so that only 2.5% of all values are below (Tip: use the fact that a normal
distribution is symmetric).
.025
=?
Q7. Let a and b be two constant numbers. Construct a new random variable W as
W=aX+bY
Suppose that μX=0.4, μY=0.1, σ2X=0.5, σ2Y=0.2, Corr(X,Y)=-0.2
Now you invest 40% of your money in stock A, (i.e., a=0.4).
Compute E(W) and Var(W), and SD(W).
Q9. There is concern about the speed of automobiles travelling over a particular stretch of highway. For
a random sample of 30 automobiles, radar indicated the following speeds, in miles per hour:
82
88
64
78
90
57
74
70
81
60
75
3
78
85
77
78
65
73
79
73
66
71
70
61
77
66
69
72
67
64
74
Let the mean speed of all automobiles travelling over this stretch of highway be mph.
(a). Find the sample mean and variance, x , s 2 (you can use excel)
(b). Find the 95% confidence interval for the mean speed of all automibles travelling over this stretch of
highway.
(c). Test the hypothesis that people are speeding, if the legal speed on this highway is 65 mph. That is, test
H0: =65
H1: > 65
More exercise questions for Estimation and Hypothesis Testing will be given at the extra TA
sessions.