Data Analytics 2nd TA Session
Data Analytics 2nd TA Session
1. An insurance company estimated that 30% of all automobile accidents were partly caused by
weather conditions and that 20% of all automobile accidents involved bodily injury. Further, of
those accidents that involved bodily injury, 40% were partly caused by weather conditions.
a. What is the probability that a randomly chosen accident both was partly caused by
weather conditions and involved bodily injury?
A: accidents were partly caused by weather conditions
P(A)=0.3
B: accidents involved bodily injury
P(B)=0.2
P(A|B)= 0.4
P(A&B)=P(A|B)*P(B)=0.4*0.2=0.08
b. Are the events ″Partly caused by weather conditions” and ″Involved bodily injury″
independent?
P(A)*P(B)=0.3*0.2=0.06 different of P(A&B).
So A and B are not independent.
c. If a randomly chosen accident was partly caused by weather conditions, what is the
probability that it involved bodily injury?
P(B|A)= P(A&B)/ P(A) =0.08/0.3=0.26666…
d. What is the probability that a randomly chosen accident both was not partly caused by
weather conditions and did not involve bodily injury
P(A&B)= 1-P(A+B)= 1-P(A)-P(B)+ P(A&B)=1-0.3-0.2+0.08=0.58
2. A survey carried out for a supermarket classified customers according to whether their visits to
the store are frequent or infrequent and whether they often, sometimes, or never purchase generic
products. The accompanying table gives the proportions of people surveyed in each of the six
joint classifications.
Purchase of Generic Products
Frequency of Visit
Often (O) Sometimes (S) Never (N)
a. What is the probability that a customer both is a frequent shopper and often purchases
generic products? P(F&O)= 0.12
b. What is the probability that a customer who never buys generic products visits the store
frequently? P(F|N)=P(F&N)/P(N)=0.19/(0.19+0.08)=0.703…
1
c. Are the events “Never buys generic products” and “Visits the store frequently”
independent? P(N&F)=0.19 different from P(N)*P(F)=0.27*0.79 =0.2133
So the events are not independent.
d. What is the probability that a customer who infrequently visits the store often buys
generic products? P(O|I)=P(O&I)/P(I)=0.07/(0.07+0.06+0.08)=0.33…
e. Are the events “Often buys generic products” and “Visits the store infrequently”
independent? P(O&I)=0.07 different from P(O)*P(I)=0.19*0.21 =0.0399
So the events are not independent.
f. What is probability that a customer frequently visits the store?P(F)=0.12+0.48+0.19=0.79
g. What is the probability that a customer never buys generic products?
P(N)=0.19+0.08=0.27
h. What is the probability that a customer either frequently visits the store or never buys
generic products? P(F+N)=P(F)+P(N)-P(F&N)=0.79+0.27-0.19=0.87
3. The bar graph given below shows the percentage distribution of the total expenditures of a
company under various expense heads during 2003.
A. The total amount of expenditures of the company is how many times of expenditure on
research and development?
A. 27 B. 20
C. 18 D. 8
B. What is the ratio of the total expenditure on infrastructure and transport to the total
expenditure on taxes and interest on loans?
A. 5:4 B. 8:7
C. 9:7 D. 13:11
2
4. The following table shows the grouped data, in classes, for the heights of 50 people.
5. If the mean of numbers 28, x, 42, 78 and 104 is 62, then what is the mean of 128, 255,
511, 1023 and x?
62*5=28+x+42+78+104
X=310-252=58
M= (128+255+511+1023+58)/5=1975/5=395
6. The monthly salaries of 20 people are listed in Excel file: “Data-Analytics-2nd TA session-
Q6Data.xlsx”.
a). Calculate the mean and median of the salaries of the 20 people.
b). Draw a histogram to analyze the distribution of the salaries.
3
4