Assignment 2 - Set+1 - Descriptive+Statistics+Probability+ (2) A
Assignment 2 - Set+1 - Descriptive+Statistics+Probability+ (2) A
1. Look at the data given below. Plot the data, find the outliers and find out μ , σ , σ 2
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
data=pd.Series([24.23,25.53,25.41,24.14,29.62,28.25,25.81,24.39,40.26,32.95,91.36,25.99,39.42,26.71,
35.00])
names=["Allied Signal","Bankers Trust","General Mills","ITT Industries","J.P.Morgan & Co.","Lehman
Brothers","Marriott","MCI","Merrill Lynch","Microsoft","Morgan Stanley","Sun
Microsystems","Travelers","US Airways","Warner-Lambert"]
%matplotlib inline
fig=plt.figure(figsize=(8,8))
plt.pie(data,labels=names,autopct='%1.2f%%')
plt.show()
Questions referred to from Aczel A., Sounderpandian J., Complete Business Statistics (7ed.)
sns.boxplot(data)
Questions referred to from Aczel A., Sounderpandian J., Complete Business Statistics (7ed.)
round(data.mean(),4)
33.2713
round(data.std(),4)
16.9454
round(data.var(),4)
287.1466
2.
Questions referred to from Aczel A., Sounderpandian J., Complete Business Statistics (7ed.)
Answer the following three questions based on the box-plot above.
(i) What is inter-quartile range of this dataset? (please approximate the numbers) In one
line, explain what this value implies.
(iii) If it was found that the data point with the value 25 is actually 2.5, how would the new
box-plot be affected?
In that case there would have been no outliers, and it might have affected in the values
of mean and median slightly. The boxplot might have moved towards right slightly.
3.
Questions referred to from Aczel A., Sounderpandian J., Complete Business Statistics (7ed.)
Answer the following three questions based on the histogram above.
(i) Where would the mode of this dataset lie?
Between 5 – 8 (Most frequent data)
(iii) Suppose that the above histogram and the box-plot in question 2 are plotted for the
same dataset. Explain how these graphs complement each other in providing
information about any dataset.
By comparing both of them it is very clear that the data would be positively skewed.
Also, would help us finding mean, mode value.
4. AT&T was running commercials in 1990 aimed at luring back customers who had switched to
one of the other long-distance phone service providers. One such commercial shows a
businessman trying to reach Phoenix and mistakenly getting Fiji, where a half-naked native on a
beach responds incomprehensibly in Polynesian. When asked about this advertisement, AT&T
admitted that the portrayed incident did not actually take place but added that this was an
enactment of something that “could happen.” Suppose that one in 200 long-distance telephone
calls is misdirected. What is the probability that at least one in five attempted telephone calls
reaches the wrong number? (Assume independence of attempts.)
Questions referred to from Aczel A., Sounderpandian J., Complete Business Statistics (7ed.)
Hence probability of call not getting misdirected = 1-(1/200) = 199/200
Number of phone calls attempted = 5
Therefore, probability that at least one in 5 attempted call reaches the wrong number is:
=1-(199/200) ^5
= 0.025
5. Returns on a certain business venture, to the nearest $1,000, are known to follow the following
probability distribution
x P(x)
-2,000 0.1
-1,000 0.1
0 0.2
1000 0.2
2000 0.3
3000 0.1
(i) What is the most likely monetary outcome of the business venture?
(iii) What is the long-term average earning of business ventures of this kind? Explain
x P(x) Income (x*P(X)
-2,000 0.1 -200
-1,000 0.1 -100
0 0.2 0
1000 0.2 200
2000 0.3 600
3000 0.1 300
Total 800
800
(iv) What is the good measure of the risk involved in a venture of this kind? Compute this
measure
Questions referred to from Aczel A., Sounderpandian J., Complete Business Statistics (7ed.)
x P(x) x*P(X)
-2,000 0.1 -200
-1,000 0.1 -100
0 0.2 0
1000 0.2 200
2000 0.3 600
3000 0.1 300
var 86666.67
std 294.392
Questions referred to from Aczel A., Sounderpandian J., Complete Business Statistics (7ed.)