0% found this document useful (0 votes)
40 views11 pages

What Is EDA in Data Science - Everything About Exploratory Data - by Aman Kharwal - Medium

What is EDA in Data Science_. Everything about Exploratory Data… _ by Aman Kharwal _ Medium

Uploaded by

sarsij mishra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
40 views11 pages

What Is EDA in Data Science - Everything About Exploratory Data - by Aman Kharwal - Medium

What is EDA in Data Science_. Everything about Exploratory Data… _ by Aman Kharwal _ Medium

Uploaded by

sarsij mishra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

12/27/23, 8:58 PM What is EDA in Data Science?.

Everything about Exploratory Data… | by Aman Kharwal | Medium

Open in app Sign up Sign in

Search

What is EDA in Data Science?


Everything about Exploratory Data Analysis for Data Science!

Aman Kharwal · Follow


3 min read · Jun 1

Listen Share

Exploratory data analysis (EDA) is a Data Science concept where we analyze a


dataset to discover patterns, trends, and relationships within the data. If you want to
understand Exploratory Data Analysis, this article is for you. In this article, I will
take you through everything about Exploratory Data Analysis (EDA) you should
know as a Data Science professional.

What is EDA & How Does it Help?


Exploratory data analysis (EDA) is a Data Science concept where we analyze a
dataset to discover patterns, trends, and relationships within the data. It helps us

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 1/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

better understand the information contained in the dataset and guides us in making
informed decisions and formulating strategies to solve real business problems.

For example, suppose a retail business is facing a drop in sales.

By performing an EDA on their sales data, we can explore various factors that may
influence declining sales, such as changes in customer preferences, changes in
market trends, or the impact of promotional campaigns.

EDA can help identify these factors, allowing us to design targeted marketing
strategies and make data-driven decisions to increase sales.

Below are some resources you can follow to learn about the practical
implementation of Exploratory Data Analysis:

1. EDA using Python

2. EDA using SQL

3. EDA using R

Questions You Need to Ask from Data While Performing EDA


When we perform Exploratory Data Analysis, we ask questions from data using Data
Science tools like Python, R, or SQL. So it would help if you always started by asking
the right questions from your data while performing EDA. Below are some questions
that you should always ask from your data while performing EDA:

1. How many variables/features are present?

2. What are the range, minimum, maximum, mean, and median values?

3. Are the variables normally distributed or skewed?

4. Are there any extreme values or outliers that need to be addressed?

5. Are there any strong positive or negative correlations between variables?

6. Which variables have the most significant impact on the target variable?

7. Are there any seasonality or periodic patterns?

8. Are there any increasing or decreasing trends over time?

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 2/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

9. Are there any clusters or groups within the data?

10. Are there any anomalies or unusual observations?

11. How does the data vary across different categories or groups?

Process of EDA
It doesn’t matter which language or tool you use for EDA. Below is the process you
should follow while performing Exploratory Data Analysis:

1. Data Collection: Gather relevant data from various sources, ensuring its
accuracy and completeness.

2. Data cleaning: Perform data cleaning operations to fix missing values, manage
outliers, and eliminate inconsistencies.

3. Data Visualization: Create visual representations of data using graphs,


histograms, scatterplots, or heatmaps. Visualization helps identify patterns,
trends, and anomalies in the data set.

4. Descriptive Statistics: Calculate and analyze key descriptive statistics, such as


mean, median, mode, standard deviation, and quartiles. These statistics provide
insight into the data’s central tendencies, distributions, and variabilities.

5. Correlation Analysis: Explore relationships between variables by calculating


correlation coefficients. This analysis helps identify dependencies and
understand how changes in one variable affect others.

Summary
Exploratory data analysis (EDA) is a Data Science concept where we analyze a
dataset to discover patterns, trends, and relationships within the data. It helps us
better understand the information contained in the dataset and guides us in making
informed decisions and formulating strategies to solve real business problems. I
hope you liked this article on what is EDA in Data Science. Feel free to ask valuable
questions in the comments section below.

Data Science Data Data Analysis Artificial Intelligence Machine Learning

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 3/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

Follow

Written by Aman Kharwal


20K Followers

I write stories behind the data📈 | instagram.com/amankharwal.official/

More from Aman Kharwal

Aman Kharwal in Coders Camp

60 Python Projects with Source Code


60 Python Projects with Source code solved and explained for free

2 min read · Jan 14, 2021

1.5K 12

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 4/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

Aman Kharwal

190 Python Projects with Source Code


190 Python Projects with Source Code: solved and explained for free

4 min read · Jun 11, 2021

1.2K 10

Aman Kharwal in Coders Camp

290+ Machine Learning Projects with Python

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 5/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

290+ Machine Learning Projects Solved & Explained using Python programming language.

6 min read · Jun 4, 2021

2K 16

Aman Kharwal

60+ Data Analysis Projects with Python


60+ Amazing Data Analysis Projects with Python: solved and explained.

2 min read · May 14, 2021

728 7

See all from Aman Kharwal

Recommended from Medium

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 6/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

Adegboyega (Vividus)

How to Break Into Data Science & Analytics: A Roadmap


Last Updated: 23rd of Dec, 2023

3 min read · 3 days ago

Tony Sebastian

Data Science Road Map 2023–2024

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 7/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

Data science involves utilizing domain knowledge, programming skills, mathematics, and
statistics to derive meaningful insights from data…

6 min read · Jul 9

268 4

Lists

Predictive Modeling w/ Python


20 stories · 726 saves

Practical Guides to Machine Learning


10 stories · 834 saves

Natural Language Processing


1035 stories · 508 saves

ChatGPT prompts
34 stories · 863 saves

K S Ashish

Call Center Data Analysis Using Microsoft Excel


How to use Call Center Data to Improve Customer Satisfaction

6 min read · Aug 13

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 8/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

36 1

Nishi Paul

Dataframe and Series in Pandas: Complete Detail


Since you are here to learn about Pandas, you already know the essence of it. This blog is about
the in-depth details on Series and…

18 min read · Dec 14

56

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 9/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

Afreen

The 2023 Roadmap to Become a Data Analyst: Free Resources to


Kickstart Your Career
The world of data analytics is expanding rapidly, and becoming a data analyst in 2023 is an
exciting and rewarding career choice. As…

· 3 min read · Jul 25

11

Sneha Bajaj

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 10/11
12/27/23, 8:58 PM What is EDA in Data Science?. Everything about Exploratory Data… | by Aman Kharwal | Medium

Exploratory Data Analysis: Netflix


Analyse data using visual methods to gain business insights & actionable recommendations.

7 min read · Jun 30

122

See more recommendations

https://amankharwal.medium.com/what-is-eda-in-data-science-abb5a646f21c 11/11

You might also like