13. Exploratory Data Analysis

This document discusses exploratory data analysis techniques. It provides questions and hints for drawing inferences from boxplots and histograms. The questions examine interpreting inter-quartile ranges, skewness, outliers, and how boxplots and histograms complement each other in visualizing a dataset.

Exploratory Data Analysis

Q2) Draw inferences about the following boxplot & histogram.


Hint: [Draw insights about the data from the plots, such as whether the data is normally
distributed, whether outliers are present, and measures like mean, median, mode, variance, and standard deviation]
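
The measures named in the hint can also be computed directly and then checked against what the plots show. The following is a minimal sketch using Python's standard `statistics` module. The `sample` values are hypothetical and are not the data behind the plots in this question.

```python
import statistics

# Hypothetical sample (an assumption for illustration, not the assignment data).
sample = [2, 3, 3, 4, 4, 4, 5, 5, 6, 7, 9, 14]

mean = statistics.mean(sample)
median = statistics.median(sample)
mode = statistics.mode(sample)
variance = statistics.variance(sample)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(sample)      # sample standard deviation

# A mean noticeably above the median suggests a right (positive) skew,
# which is one sign that the data may not be normally distributed.
skew_hint = "right-skewed" if mean > median else "left-skewed or symmetric"

print(f"mean={mean:.2f} median={median} mode={mode}")
print(f"variance={variance:.2f} std_dev={std_dev:.2f} -> {skew_hint}")
```

For this sample the mean sits above the median, which is the pattern a long right tail in the histogram would produce.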

© 2013 - 2020 360DigiTMG. All Rights Reserved.


Q11) Comment on the boxplot visualizations below.

Draw an inference from the distribution of data for Boxplot 1 with respect to Boxplot 2.
Hint: [Compare both plots and check whether the data is normally distributed, whether outliers
are present, skewness, etc.]
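
A comparison of two boxplots comes down to comparing their five-number summaries. The sketch below computes them for two hypothetical samples, `group_1` and `group_2`, which stand in for Boxplot 1 and Boxplot 2 and are not the assignment data. The helper name `five_number_summary` is also an assumption made for illustration.

```python
import statistics

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max), the values a boxplot draws."""
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, q2, q3, max(values)

# Hypothetical samples standing in for Boxplot 1 and Boxplot 2.
group_1 = [10, 12, 13, 14, 15, 16, 17, 18, 20]
group_2 = [5, 9, 12, 14, 15, 16, 18, 21, 25]

for name, values in (("Boxplot 1", group_1), ("Boxplot 2", group_2)):
    low, q1, med, q3, high = five_number_summary(values)
    print(f"{name}: min={low} Q1={q1} median={med} Q3={q3} max={high} IQR={q3 - q1}")
```

Both samples here share the same median, but the second has a wider box and longer whiskers, so the two differ in spread rather than in centre.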


Q12)

Answer the following three questions based on the boxplot above.


(i) What is the inter-quartile range of this dataset? [Hint: IQR = Q3 - Q1]
In one line, explain what this value implies. (Hint: Base this on the definition of IQR.)
(ii) What can we say about the skewness of this dataset?
(iii) If it were found that the data point recorded as 25 is actually 2.5, how would the new
boxplot be affected?
(Hint: After changing the data point from 25 to 2.5 in the data, how does the new boxplot differ
from the current one?)
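
The effect of such a correction can be explored numerically. The sketch below computes IQR = Q3 - Q1 and the outlier fences under the common 1.5 * IQR convention (an assumption, since the question does not state which rule the plot uses). The `recorded` list is hypothetical data that happens to contain the value 25.

```python
import statistics

def iqr_and_fences(values):
    """Return (Q1, Q3, IQR, lower fence, upper fence) using the 1.5 * IQR rule."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1, q3, iqr, q1 - 1.5 * iqr, q3 + 1.5 * iqr

def outliers(values):
    """List the points that fall outside the boxplot fences."""
    _, _, _, lower, upper = iqr_and_fences(values)
    return [v for v in values if v < lower or v > upper]

# Hypothetical data containing the value 25 (not the assignment data).
recorded = [4, 5, 6, 6, 7, 7, 8, 9, 25]
# The same data after correcting 25 to 2.5.
corrected = [2.5 if v == 25 else v for v in recorded]

print("recorded :", iqr_and_fences(recorded), outliers(recorded))
print("corrected:", iqr_and_fences(corrected), outliers(corrected))
```

In this hypothetical case the point 25 is flagged as an outlier, while after the correction no point lies outside the fences and the box shifts slightly lower.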

Q13)


Answer the following three questions based on the histogram above.
(i) Where would the mode of this dataset lie? Hint: [In terms of values on the Y-axis]
Answer: At 20 (in terms of values on the Y-axis).
(ii) Comment on the skewness of the dataset.
Answer: Positive skewness.
(iii) Suppose that the above histogram and the boxplot in Question 2 are plotted for
the same dataset. Explain how these graphs complement each other in providing
information about any dataset. Hint: [Visualize both plots and draw the
insights]
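
One way to see what a histogram contributes is to bin the data by hand and locate the tallest bar. The sketch below does this with the standard library. The `sample` values, the bin width of 5, and the helper name `histogram_counts` are all assumptions for illustration, not taken from the assignment.

```python
from collections import Counter
import statistics

def histogram_counts(values, bin_width):
    """Count values per bin; each key is the left edge of a bin."""
    return Counter((v // bin_width) * bin_width for v in values)

# Hypothetical right-skewed sample (not the assignment data).
sample = [1, 2, 3, 3, 4, 4, 6, 7, 8, 9, 12, 13, 17, 24, 38]

counts = histogram_counts(sample, bin_width=5)
modal_bin, frequency = counts.most_common(1)[0]

# Text histogram: one '#' per observation in the bin.
for left_edge in sorted(counts):
    print(f"[{left_edge:>2}, {left_edge + 5:>2}) {'#' * counts[left_edge]}")

print(f"modal bin starts at {modal_bin} with frequency {frequency}")
print(f"mean={statistics.mean(sample):.2f} median={statistics.median(sample)}")
```

The histogram exposes the modal bin and the overall shape of the distribution, while a boxplot of the same data would show the median, the quartiles, and any outliers explicitly.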

Hints:
For each assignment, the solution should be submitted in the format below.


1. Research and perform all possible steps for obtaining the solution
2.
3. For statistics calculations, the explanation of the solutions should be documented in writing
along with the code.
Must follow these guidelines:
3.1. Be thorough with the concepts of probability and the Central Limit Theorem, and perform the
calculations stepwise
3.2. For true/false or short-answer questions, an explanation is a must
3.3. R and Python code for univariate analysis (histogram, box plot, bar plots, etc.) of the data
distribution must be attached
4. All code (executable programs) should execute without errors
5. Code modularization should be followed
6. Each line of code should have comments explaining the logic and why you are using that
