13.exploratory Data Analysis
13.exploratory Data Analysis
Instructions:
Please share your answers filled inline in the word document. Submit Python code and R code
files wherever applicable.
Problem Statements:
Q1) Calculate Skewness, Kurtosis using R/Python code & draw inferences on the following data.
Hint: [Insights drawn from the data such as data is normally distributed/not, outliers, measures
like mean, median, mode, variance, std. deviation]
a. Cars speed and distance
boxplot(dist) #boxplot
Sol:
It is not normally distributed .
It is positively skewed because it has long tail to words right and it is a uni-model
> median(Mks.)
[1] 40.5
> skewness(Mks.)
[1] 1.542885
> kurtosis(Mks.)
[1] 5.621631
> var(Mks.) # variance
[1] 25.52941
> sd(Mks.) #standard deviation
[1] 5.052664
> hist(Mks.) #histogram
Q5) What is the nature of skewness when mean, median of data is equal?
Ans-Normally distributed
Draw an Inference from the distribution of data for Boxplot 1 with respect Boxplot 2.
Hint: [On comparing both the plots, and check if the data is normally distributed/not, outliers
present, skewness etc.]
Q12)
Q13)
3.3. R & Python code for Univariate Analysis (histogram, box plot, bar plots etc.) the data
distribution to be attached
4. All the codes (executable programs) should execute without errors
5. Code modularization should be followed
6. Each line of code should have comments explaining the logic and why you are using that