13.exploratory Data Analysis
13.exploratory Data Analysis
Instructions:
Please share your answers filled inline in the word document. Submit Python code and R code
files wherever applicable.
Name: _________________________
Problem Statements:
Q1) Calculate Skewness, Kurtosis using R/Python code & draw inferences on the following data.
Hint: [Insights drawn from the data such as data is normally distributed/not, outliers, measures
like mean, median, mode, variance, std. deviation]
a. Cars speed and distance
Q5) What is the nature of skewness when mean, median of data is equal?
Ans: Symmetrical
Q6) What is the nature of skewness when mean > median?
Ans: Right skewed
Q7) What is the nature of skewness when median > mean?
Ans: Left skewed
Q8) What does positive kurtosis value indicates for a data?
Ans: The data is normally distributed and kurtosis value is 0
Q9) What does negative kurtosis value indicates for a data?
Ans: The distribution of the data has lighter tails and a flatter peaks than the normal
distribution
Q10) Answer the below questions using the below boxplot visualization.
Draw an Inference from the distribution of data for Boxplot 1 with respect Boxplot 2.
Hint: [On comparing both the plots, and check if the data is normally distributed/not, outliers
present, skewness etc.]
Ans: By observing both the plots whisker’s level is high in boxplot 2, mean and median are
equal hence distribution is symmetrical.
Q13)
Hints:
For each assignment, the solution should be submitted in the below format
1. Research and Perform all possible steps for obtaining solution
2.
3.3. R & Python code for Univariate Analysis (histogram, box plot, bar plots etc.) the data
distribution to be attached
4. All the codes (executable programs) should execute without errors
5. Code modularization should be followed
6. Each line of code should have comments explaining the logic and why you are using that