0% found this document useful (0 votes)

17 views4 pages

Exploratory Data Analysis

Yes

Uploaded by

mahipalsinghrathore9993

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views4 pages

Exploratory Data Analysis

Yes

Uploaded by

mahipalsinghrathore9993

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Exploratory Data Analysis

Lab Exercise 1: Summary Statistics and Data Visualization

Problem Statement:
Use the mtcars dataset available in R. Calculate summary statistics (mean, median, standard deviation)
for the mpg (miles per gallon) column. Then, create a histogram and a boxplot for the same column.

Lab Exercise 2: Correlation Analysis

Problem Statement:
Use the iris dataset. Calculate the correlation matrix for the numerical variables in the dataset. Create a
pairs plot to visualize the relationships between these variables.

Lab Exercise 3: Data Cleaning and Handling Missing Values

Problem Statement:
Create a sample dataset with some missing values. Handle the missing values by imputing the mean for
numerical columns and the mode for categorical columns.

Lab Exercise 4: Outlier Detection

Problem Statement:
Using the mtcars dataset, detect outliers in the hp (horsepower) column using the IQR method. Display
the rows that contain outliers.

Lab Exercise 5: Data Transformation and Visualization

Problem Statement:
Use the iris dataset. Normalize the Sepal.Length column and create a density plot for the normalized
values. Also, create a scatter plot between the normalized Sepal.Length and Sepal.Width.
Answers

Lab Exercise 1:
# Load the dataset
data(mtcars)

# Calculate summary statistics

mean_mpg <- mean(mtcars$mpg)
median_mpg <- median(mtcars$mpg)
sd_mpg <- sd(mtcars$mpg)

# Display the summary statistics

mean_mpg
median_mpg
sd_mpg

# Create a histogram
hist(mtcars$mpg, main="Histogram of MPG", xlab="Miles Per Gallon", col="blue")

# Create a boxplot
boxplot(mtcars$mpg, main="Boxplot of MPG", ylab="Miles Per Gallon", col="green")

Lab Exercise 2
# Load the dataset
data(iris)

# Calculate the correlation matrix

cor_matrix <- cor(iris[, 1:4])

# Display the correlation matrix

cor_matrix

# Create a pairs plot

pairs(iris[, 1:4], main="Pairs Plot of Iris Dataset", col=iris$Species)

Lab Exercise 3
# Create a sample dataset with missing values
sample_data <- data.frame(
Age = c(25, 30, NA, 22, 40, NA, 35),
Gender = c("Male", "Female", "Female", NA, "Male", "Male", NA)
)
# Define a function to impute the mean for numerical columns
impute_mean <- function(x) {
x[is.na(x)] <- mean(x, na.rm = TRUE)
return(x)
}

# Define a function to impute the mode for categorical columns

impute_mode <- function(x) {
x[is.na(x)] <- names(sort(table(x), decreasing = TRUE))[1]
return(x)
}

# Impute missing values

sample_data$Age <- impute_mean(sample_data$Age)
sample_data$Gender <- impute_mode(sample_data$Gender)

# Display the cleaned dataset

sample_data

Lab Exercise 4
# Load the dataset
data(mtcars)

# Calculate the IQR for the hp column

Q1 <- quantile(mtcars$hp, 0.25)
Q3 <- quantile(mtcars$hp, 0.75)
IQR_hp <- IQR(mtcars$hp)

# Define the outlier boundaries

lower_bound <- Q1 - 1.5 * IQR_hp
upper_bound <- Q3 + 1.5 * IQR_hp

# Detect outliers
outliers <- mtcars[mtcars$hp < lower_bound | mtcars$hp > upper_bound, ]

# Display the rows containing outliers

outliers

Lab Exercise 5
# Load the dataset
data(iris)
# Normalize the Sepal.Length column
normalize <- function(x) {
return((x - min(x)) / (max(x) - min(x)))
}
iris$Sepal.Length.Normalized <- normalize(iris$Sepal.Length)

# Create a density plot for the normalized values

plot(density(iris$Sepal.Length.Normalized), main="Density Plot of Normalized Sepal Length",
xlab="Normalized Sepal Length")

# Create a scatter plot between the normalized Sepal.Length and Sepal.Width

plot(iris$Sepal.Length.Normalized, iris$Sepal.Width, main="Scatter Plot of Normalized Sepal Length vs
Sepal Width", xlab="Normalized Sepal Length", ylab="Sepal Width", col=iris$Species)

Shamsundar M2 Project2
50% (2)
Shamsundar M2 Project2
14 pages
Inequalities and Modulus PDF
100% (2)
Inequalities and Modulus PDF
71 pages
Machine Learning Assignment Report - Cars
100% (4)
Machine Learning Assignment Report - Cars
42 pages
Lab 5
0% (1)
Lab 5
5 pages
Detailed Lesson Plan in English Grade
75% (4)
Detailed Lesson Plan in English Grade
4 pages
Project 4 - Cars-Datasets PDF
100% (2)
Project 4 - Cars-Datasets PDF
44 pages
Exercises For R
No ratings yet
Exercises For R
40 pages
R Intro 2011
No ratings yet
R Intro 2011
115 pages
R For Data Exploration
No ratings yet
R For Data Exploration
52 pages
Linear and Generalized Linear Models: Nicholas Christian BIOST 2094 Spring 2011
No ratings yet
Linear and Generalized Linear Models: Nicholas Christian BIOST 2094 Spring 2011
22 pages
Advanced R Data Analysis Training PDF
No ratings yet
Advanced R Data Analysis Training PDF
72 pages
Aditya Garg DMDW
No ratings yet
Aditya Garg DMDW
40 pages
Business Analytics-1: STR (Crew - Data)
No ratings yet
Business Analytics-1: STR (Crew - Data)
16 pages
Analysis Using Statistical: Introduction & Data Exploration
No ratings yet
Analysis Using Statistical: Introduction & Data Exploration
23 pages
Mtcars: Choosing The Most Related Variable (S) To The Response
No ratings yet
Mtcars: Choosing The Most Related Variable (S) To The Response
13 pages
Final Cost Practical
No ratings yet
Final Cost Practical
29 pages
All Values in The First Column
No ratings yet
All Values in The First Column
7 pages
Data Science Using R
No ratings yet
Data Science Using R
11 pages
Descriptive and Inferential Statistics With R
No ratings yet
Descriptive and Inferential Statistics With R
6 pages
MTH 4407 - Group 2 (Dr. Farid Zamani) - Lecture 6
No ratings yet
MTH 4407 - Group 2 (Dr. Farid Zamani) - Lecture 6
22 pages
Quiz 4 - Exploratory Data Analysis - Courserav2
No ratings yet
Quiz 4 - Exploratory Data Analysis - Courserav2
1 page
Using R For Basic Statistical Analysis
No ratings yet
Using R For Basic Statistical Analysis
11 pages
Workshop Activity: X Seq y Length
No ratings yet
Workshop Activity: X Seq y Length
3 pages
2023 Tutorial 12
No ratings yet
2023 Tutorial 12
6 pages
Concepts of EDA, Outliers-Detection and Treatment
No ratings yet
Concepts of EDA, Outliers-Detection and Treatment
99 pages
Da Lab It
No ratings yet
Da Lab It
20 pages
Some Exercises
No ratings yet
Some Exercises
9 pages
BAB 5-2 MTK Graph in R PT 2 Materi Line Plot
No ratings yet
BAB 5-2 MTK Graph in R PT 2 Materi Line Plot
9 pages
Functions and Packages
No ratings yet
Functions and Packages
7 pages
CTS Architectural Draughtsman - CTS - NSQF-5
No ratings yet
CTS Architectural Draughtsman - CTS - NSQF-5
54 pages
Exercise 3
No ratings yet
Exercise 3
4 pages
HB 2.2-2003 Australian Standards For Civil Engineering Students Structural Engineering
No ratings yet
HB 2.2-2003 Australian Standards For Civil Engineering Students Structural Engineering
8 pages
Experiment 1
No ratings yet
Experiment 1
4 pages
Experiment 1
No ratings yet
Experiment 1
4 pages
Hamsterxproposal
No ratings yet
Hamsterxproposal
21 pages
Theories of Reading
No ratings yet
Theories of Reading
15 pages
R Practicals
No ratings yet
R Practicals
32 pages
Sommers Revision
No ratings yet
Sommers Revision
12 pages
MDPN460 Lecture05
No ratings yet
MDPN460 Lecture05
32 pages
ITTC Surface Treatment
No ratings yet
ITTC Surface Treatment
63 pages
Procedures Step Test
No ratings yet
Procedures Step Test
1 page
UNIT02
No ratings yet
UNIT02
41 pages
All Templates For Pte (Updated)
No ratings yet
All Templates For Pte (Updated)
3 pages
Lab File AD PDF
No ratings yet
Lab File AD PDF
25 pages
EDA - Final
No ratings yet
EDA - Final
7 pages
R-Programming Lab Mannual
No ratings yet
R-Programming Lab Mannual
33 pages
Dsi237 Group 2
No ratings yet
Dsi237 Group 2
27 pages
Dav Pracs
No ratings yet
Dav Pracs
9 pages
sample - end - test - cuối kì 1
No ratings yet
sample - end - test - cuối kì 1
4 pages
Normal Probability Distribution
No ratings yet
Normal Probability Distribution
32 pages
R Module 11 - Statistics
No ratings yet
R Module 11 - Statistics
35 pages
Statistics and Data Science With R Part - 4
No ratings yet
Statistics and Data Science With R Part - 4
23 pages
DP Prog
No ratings yet
DP Prog
10 pages
02 Horizontal Boundaries - Av
No ratings yet
02 Horizontal Boundaries - Av
19 pages
ProbList2 24 SLN
No ratings yet
ProbList2 24 SLN
20 pages
Lab Manual
No ratings yet
Lab Manual
7 pages
An Introducion of PLECS
No ratings yet
An Introducion of PLECS
30 pages
Begla 136
No ratings yet
Begla 136
4 pages
Affidavit James
No ratings yet
Affidavit James
26 pages
Missing Values and Outliers in R-Software
No ratings yet
Missing Values and Outliers in R-Software
17 pages
F24 Lab-01
No ratings yet
F24 Lab-01
4 pages
Smart Credentials - Guia de Instalacion
No ratings yet
Smart Credentials - Guia de Instalacion
13 pages
My Fair Lady Essay
No ratings yet
My Fair Lady Essay
5 pages
How To Add An Estimate Field To A Sub-Task
No ratings yet
How To Add An Estimate Field To A Sub-Task
12 pages
Importance of Planning in Management Developing Organization by L. Jeseviciute-Ufartiene
No ratings yet
Importance of Planning in Management Developing Organization by L. Jeseviciute-Ufartiene
6 pages
Nokia Support Guide PC Suite 5.8
No ratings yet
Nokia Support Guide PC Suite 5.8
14 pages
583 - Reading Comprehension Passage 13 MCQ Test With Answers The Honeymoon Couple
No ratings yet
583 - Reading Comprehension Passage 13 MCQ Test With Answers The Honeymoon Couple
3 pages
Synopsis On Hunan Rights
No ratings yet
Synopsis On Hunan Rights
3 pages
Conversion Program For KNA1 Table
No ratings yet
Conversion Program For KNA1 Table
4 pages
Worksheet - Eapp - Week 2
No ratings yet
Worksheet - Eapp - Week 2
2 pages
Autolab Application Note AUT01
No ratings yet
Autolab Application Note AUT01
2 pages
Understanding Modern Electronics
100% (1)
Understanding Modern Electronics
3 pages
FRQ
No ratings yet
FRQ
2 pages
R Practical
No ratings yet
R Practical
9 pages
Machinelearninglabmanual
No ratings yet
Machinelearninglabmanual
47 pages
Model Lab
No ratings yet
Model Lab
6 pages
in Italiano Chiuchiu 2 PDF Free
No ratings yet
in Italiano Chiuchiu 2 PDF Free
8 pages
FDS Lab Manual
No ratings yet
FDS Lab Manual
32 pages
S3 SetB Mid-Year GE TP7 TE
No ratings yet
S3 SetB Mid-Year GE TP7 TE
12 pages
Da Lab File 2
No ratings yet
Da Lab File 2
13 pages
ML Lab Manual Bcsl602
No ratings yet
ML Lab Manual Bcsl602
108 pages
DEV Lab Manual
No ratings yet
DEV Lab Manual
27 pages
Lab Manual ML
No ratings yet
Lab Manual ML
26 pages
Lab Manual - DSR
No ratings yet
Lab Manual - DSR
32 pages
Practice Questions On Central Tendency On Mtcars
No ratings yet
Practice Questions On Central Tendency On Mtcars
3 pages
CS605 Labcf
No ratings yet
CS605 Labcf
30 pages
Experiment 8
No ratings yet
Experiment 8
4 pages
DSBDA Practicals
No ratings yet
DSBDA Practicals
16 pages