SlideShare a Scribd company logo
2
Most read
3
Most read
6
Most read
EXPLOLATORY DATA ANALYSIS
Davis David
Data Scientist at ParrotAI
CONTENT:
1. Introduction to EDA
2. Importance of EDA
3. Data Types
4. Python Packages for EDA
5. List of Graphs
6. Practical EDA
1.INTRODUCTION TO EDA
 Exploratory Data Analysis refers to the critical process of performing
initial investigations on data so as to discover patterns, to spot
anomalies, to test hypothesis and to check assumptions with the help
of summary statistics and graphical representations.
It is a good practice to understand the data first and try to gather as
many insights from it.
2. IMPORTANCE OF EDA
Identifying the most important variables/features in your dataset.
Testing a hypothesis or checking assumptions related to the dataset.
To check the quality of data for further processing and cleaning.
Deliver data-driven insights to business stakeholders.
Verify expected relationships actually exist in the data.
To find unexpected structure or insights in the data.
Exploratory data analysis with Python
Exploratory data analysis with Python
Exploratory data analysis with Python
Two Categories of Data
 Structured Data types
Example: csv file, excel file, database file
 Unstructured Data types
Examples: Images, videos, audio,
Data Types
Structured Data Types
Categorical - This is any data that isn’t a number.
 Ordinal - have a set of order e.g. rating happiness on a scale of 1-10.
 Binary - have only two values .e.g. Male or Female
 Nominal - no set of order e.g. Countries
Numerical – Data inform of numbers
 Continuous - numbers that don’t have a logical end to them e.g heights
 Discrete - have a logical end to them e.g. days in the month
Python Packages for EDA
1.Bar Chart
2. Pie Chart
3.Histogram
4.Scatter Plot
5. Heatmap
6. Box Plot
7. Line Plot
8. Violin Plot
9.Bubble Plot
10. 3D Scatter Plot
Exploratory data analysis with Python

More Related Content

PPTX
Exploratory Data Analysis
Umair Shafique
 
PDF
Exploratory data analysis data visualization
Dr. Hamdan Al-Sabri
 
PPTX
Exploratory data analysis
Gramener
 
PPTX
Exploratory data analysis
Peter Reimann
 
PDF
Data Visualization in Exploratory Data Analysis
Eva Durall
 
PDF
Exploratory Data Analysis - Satyajit.pdf
AmmarAhmedSiddiqui2
 
PPT
EXPLORATORY DATA ANALYSIS
BabasID2
 
PPTX
Statistics and data science
Mohammad Azharuddin
 
Exploratory Data Analysis
Umair Shafique
 
Exploratory data analysis data visualization
Dr. Hamdan Al-Sabri
 
Exploratory data analysis
Gramener
 
Exploratory data analysis
Peter Reimann
 
Data Visualization in Exploratory Data Analysis
Eva Durall
 
Exploratory Data Analysis - Satyajit.pdf
AmmarAhmedSiddiqui2
 
EXPLORATORY DATA ANALYSIS
BabasID2
 
Statistics and data science
Mohammad Azharuddin
 

What's hot (20)

PPTX
Classification in data mining
Sulman Ahmed
 
PPTX
Big Data Analytics
Ghulam Imaduddin
 
PPTX
Introduction to Data Science.pptx
Vrishit Saraswat
 
PPTX
Naive bayes
Ashraf Uddin
 
PPTX
Data preprocessing in Machine learning
pyingkodi maran
 
PPTX
Predictive analytics
SAS Singapore Institute Pte Ltd
 
PPT
01 Data Mining: Concepts and Techniques, 2nd ed.
Institute of Technology Telkom
 
PPTX
Kdd process
Rajesh Chandra
 
PPT
1.2 steps and functionalities
Krish_ver2
 
PPTX
Clustering in Data Mining
Archana Swaminathan
 
PPTX
Machine Learning - Splitting Datasets
Andrew Ferlitsch
 
PPTX
Association rule mining.pptx
maha797959
 
PPTX
Data mining: Classification and prediction
DataminingTools Inc
 
PDF
Dimensionality Reduction
Saad Elbeleidy
 
PDF
Data Visualization in Data Science
Maloy Manna, PMP®
 
PPTX
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Md. Main Uddin Rony
 
PPTX
Introduction to Data Mining
DataminingTools Inc
 
PPSX
Frequent itemset mining methods
Prof.Nilesh Magar
 
PDF
Data preprocessing using Machine Learning
Gopal Sakarkar
 
PPTX
All data models in dbms
Naresh Kumar
 
Classification in data mining
Sulman Ahmed
 
Big Data Analytics
Ghulam Imaduddin
 
Introduction to Data Science.pptx
Vrishit Saraswat
 
Naive bayes
Ashraf Uddin
 
Data preprocessing in Machine learning
pyingkodi maran
 
Predictive analytics
SAS Singapore Institute Pte Ltd
 
01 Data Mining: Concepts and Techniques, 2nd ed.
Institute of Technology Telkom
 
Kdd process
Rajesh Chandra
 
1.2 steps and functionalities
Krish_ver2
 
Clustering in Data Mining
Archana Swaminathan
 
Machine Learning - Splitting Datasets
Andrew Ferlitsch
 
Association rule mining.pptx
maha797959
 
Data mining: Classification and prediction
DataminingTools Inc
 
Dimensionality Reduction
Saad Elbeleidy
 
Data Visualization in Data Science
Maloy Manna, PMP®
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Md. Main Uddin Rony
 
Introduction to Data Mining
DataminingTools Inc
 
Frequent itemset mining methods
Prof.Nilesh Magar
 
Data preprocessing using Machine Learning
Gopal Sakarkar
 
All data models in dbms
Naresh Kumar
 
Ad

Similar to Exploratory data analysis with Python (20)

PDF
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
JamieDornan2
 
PPTX
Introduction of data science
TanujaSomvanshi1
 
PDF
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
StephenAmell4
 
PDF
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
JamieDornan2
 
PPTX
Data Exploration in Python.pptx
SIDDHARTH435426
 
PDF
Master Data Analyst Course in Bangalore with ProITBridge's Expert Course.pdf
proitbridgePvtLtd
 
PPTX
EDA_Unit1_Charts_Code for your reference.pptx
MrsKavithaG
 
PDF
ugc carelist journals ugc carelist journals
mounikadopenventio
 
PPTX
Exploratory Data Analysis (EDA) .pptx
ZahidRiazHaans
 
PDF
EDA-Unit 1.pdf
Nirmalavenkatachalam
 
PDF
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
lsharkey602
 
PPTX
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
rakeshreghu98
 
PPTX
UNIT - 5 : 20ACS04 – PROBLEM SOLVING AND PROGRAMMING USING PYTHON
Nandakumar P
 
PDF
data science course in hyderabad
maneesha2312
 
PPTX
Unit2.pptx Statistical Interference and Exploratory Data Analysis
Priyanka Jadhav
 
PDF
Lecture 3 - Exploratory Data Analytics (EDA), a lecture in subject module Sta...
Maninda Edirisooriya
 
PPTX
R.SOWMIYA (30323U09086).pptx data science with python
ksaravanakumar450
 
PPTX
Understanding the Primary Goal of Exploratory Data Analysis.pptx
MindCypress .
 
PPTX
Exploratory Data Analysis.pptx for Data Analytics
harshrnotaria
 
PDF
Exploratory data analysis handbook (from www.nist.gov, Engineering Statistic...
Stella Tsank
 
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
JamieDornan2
 
Introduction of data science
TanujaSomvanshi1
 
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
StephenAmell4
 
Exploratory Data Analysis - A Comprehensive Guide to EDA.pdf
JamieDornan2
 
Data Exploration in Python.pptx
SIDDHARTH435426
 
Master Data Analyst Course in Bangalore with ProITBridge's Expert Course.pdf
proitbridgePvtLtd
 
EDA_Unit1_Charts_Code for your reference.pptx
MrsKavithaG
 
ugc carelist journals ugc carelist journals
mounikadopenventio
 
Exploratory Data Analysis (EDA) .pptx
ZahidRiazHaans
 
EDA-Unit 1.pdf
Nirmalavenkatachalam
 
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
lsharkey602
 
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
rakeshreghu98
 
UNIT - 5 : 20ACS04 – PROBLEM SOLVING AND PROGRAMMING USING PYTHON
Nandakumar P
 
data science course in hyderabad
maneesha2312
 
Unit2.pptx Statistical Interference and Exploratory Data Analysis
Priyanka Jadhav
 
Lecture 3 - Exploratory Data Analytics (EDA), a lecture in subject module Sta...
Maninda Edirisooriya
 
R.SOWMIYA (30323U09086).pptx data science with python
ksaravanakumar450
 
Understanding the Primary Goal of Exploratory Data Analysis.pptx
MindCypress .
 
Exploratory Data Analysis.pptx for Data Analytics
harshrnotaria
 
Exploratory data analysis handbook (from www.nist.gov, Engineering Statistic...
Stella Tsank
 
Ad

Recently uploaded (20)

PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
Chapter 1 Introduction to CV and IP Lecture Note.pdf
Getnet Tigabie Askale -(GM)
 
PPT
L2 Rules of Netiquette in Empowerment technology
Archibal2
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Building High-Performance Oracle Teams: Strategic Staffing for Database Manag...
SMACT Works
 
PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
DevOps & Developer Experience Summer BBQ
AUGNYC
 
PPTX
Smart Infrastructure and Automation through IoT Sensors
Rejig Digital
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Doc9.....................................
SofiaCollazos
 
PPTX
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
Best ERP System for Manufacturing in India | Elite Mindz
Elite Mindz
 
PDF
This slide provides an overview Technology
mineshkharadi333
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Chapter 1 Introduction to CV and IP Lecture Note.pdf
Getnet Tigabie Askale -(GM)
 
L2 Rules of Netiquette in Empowerment technology
Archibal2
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Building High-Performance Oracle Teams: Strategic Staffing for Database Manag...
SMACT Works
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
DevOps & Developer Experience Summer BBQ
AUGNYC
 
Smart Infrastructure and Automation through IoT Sensors
Rejig Digital
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Doc9.....................................
SofiaCollazos
 
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
Best ERP System for Manufacturing in India | Elite Mindz
Elite Mindz
 
This slide provides an overview Technology
mineshkharadi333
 

Exploratory data analysis with Python

  • 1. EXPLOLATORY DATA ANALYSIS Davis David Data Scientist at ParrotAI
  • 2. CONTENT: 1. Introduction to EDA 2. Importance of EDA 3. Data Types 4. Python Packages for EDA 5. List of Graphs 6. Practical EDA
  • 3. 1.INTRODUCTION TO EDA  Exploratory Data Analysis refers to the critical process of performing initial investigations on data so as to discover patterns, to spot anomalies, to test hypothesis and to check assumptions with the help of summary statistics and graphical representations. It is a good practice to understand the data first and try to gather as many insights from it.
  • 4. 2. IMPORTANCE OF EDA Identifying the most important variables/features in your dataset. Testing a hypothesis or checking assumptions related to the dataset. To check the quality of data for further processing and cleaning. Deliver data-driven insights to business stakeholders. Verify expected relationships actually exist in the data. To find unexpected structure or insights in the data.
  • 8. Two Categories of Data  Structured Data types Example: csv file, excel file, database file  Unstructured Data types Examples: Images, videos, audio,
  • 10. Structured Data Types Categorical - This is any data that isn’t a number.  Ordinal - have a set of order e.g. rating happiness on a scale of 1-10.  Binary - have only two values .e.g. Male or Female  Nominal - no set of order e.g. Countries Numerical – Data inform of numbers  Continuous - numbers that don’t have a logical end to them e.g heights  Discrete - have a logical end to them e.g. days in the month