Assignment-3 Key-Graphical Representations
Assignment-3 Key-Graphical Representations
SP and Weight(WT)
Ans)
> kurtosis(ex3_csv$WT)
[1] 0.5979244
> kurtosis(ex3_csv$WT)
[1] 3.819284
Ans: The above boxplot suggests that the distribution has lots of outliers towards upper extreme
Q11) Suppose we want to estimate the average weight of an adult male in Mexico. We draw a random
sample of 2,000 men from a population of 3,000,000 men and weigh them. We find that the average
person in our sample weighs 200 pounds, and the standard deviation of the sample is 30 pounds.
Calculate 94%, 98%, 96% confidence interval?
Ans: n=2000
X = 200
s= 30
s 30
Confidence Interval Estimate= X ± Z => 200 ± Z
√n √2000
94% Confidence: qnorm(0.97)
[1] 1.880794=Z
30
200 ± 1.88* =198.74 – 201.26
√2000
98% Confidence: > qnorm(0.99)
[1] 2.326348=Z
30
200 ± 2.33* =198.44-201.56
√2000
96% Confidence: > qnorm(0.98)
[1] 2.053749
30
200 ± 2.05* = 198.62-201.38
√2000
34,36,36,38,38,39,39,40,40,41,41,41,41,42,42,45,49,56
Q13) what is the nature of skewness when mean, median of data are equal?
Q18) Answer the below questions using the below boxplot visualization.
What can we say about the distribution of the data?
Draw an Inference from the distribution of data for Boxplot 1 with respect Boxplot 2.
Ans) 1) The median of the two boxplots are same approximately 260.
1.
Answer the following three questions based on the box-plot above.
(i) What is inter-quartile range of this dataset? (please approximate the numbers) In one
line, explain what this value implies.
Ans) Inter-quartile range is the range between upper quartile (Q3) and lower quartile (Q1).
IQR= Q3-Q1= 12-5 = 7
50% of the data lies between IQR.
2.
Answer the following three questions based on the histogram above.
(i) Where would the mode of this dataset lie?
Ans) We need to have actual data to get the exact value of the mode. The mode can lie between 4
and 10 because there are many values in this range but this is just an assumption. The 2 bars of the
same height doesn’t indicate mode every time.
(ii) Comment on the skewness of the dataset.
Ans) It is right skewed or +ve skewed.
(iii) Suppose that the above histogram and the box-plot in question 2 are plotted for the
same dataset. Explain how these graphs complement each other in providing
information about any dataset.
Ans) from the above histogram and barplot we can confirm an outlier at 25 in Y value. Both the plots
indicate the +ve skewness of the dataset.