
Technical Content-EDA For TE

The document outlines a 5-day course on Exploratory Data Analysis (EDA) covering topics such as data preparation, exploration, advanced techniques, real-world applications, and insights reporting. Each day consists of structured hours dedicated to specific topics, including data types, descriptive statistics, feature engineering, and hands-on projects. The course aims to provide participants with practical skills and knowledge to perform EDA effectively using various tools and techniques.

Uploaded by

Mishita Ingawale

Day  Total Hours  Hours    Topic

Day 1: Introduction to EDA and Data Preparation

1    6 hours      30 mins  1. Course Introduction
                  30 mins  2. Python Setup for EDA
                  1.5 hrs  3. Data Types and Structures
                  1 hr     4. Data Loading and Saving
                  2 hrs    5. Data Cleaning Basics
                  30 mins  Revision

Day 2: Data Exploration and Descriptive Statistics

2    6 hours      1 hr     1. Understanding Data
                  1.5 hrs  2. Descriptive Statistics
                  1.5 hrs  3. Data Visualization Basics
                  2 hrs    4. Exploring Relationships in Data

Day 3: Advanced EDA Techniques

3    6 hours      2 hrs    1. Feature Engineering
                  2 hrs    2. Advanced Data Visualization
                  2 hrs    3. Data Transformation

Day 4: EDA on Real-World Datasets

4    6 hours      3 hrs    1. Working with Case Studies
                  1.5 hrs  2. Time Series Analysis Basics
                  1.5 hrs  3. Automation in EDA

Day 5: Insights and Reporting

5    6 hours      2 hrs    1. Interpreting and Communicating Results
                  3 hrs    2. Hands-On Project
                  1 hr     3. Presentation of Findings

Sub-topic
Objectives of EDA
Overview of the data analysis workflow
Importance of EDA in data science
Installing and importing libraries
Overview of NumPy, Pandas, Matplotlib, and Seaborn
Numeric, categorical, and datetime types
Series and DataFrames in Pandas
Indexing and slicing data
Importing data from CSV
Exporting data
Handling missing and null data
Working with duplicates
Replacing, dropping, and filling missing values
Renaming columns and indices
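
The Day 1 loading and cleaning items above can be sketched in a few lines of Pandas. This is a minimal illustration, not course material: the CSV content, the column names, and the choice to fill missing ages with the median are all assumptions made for the example.

```python
import io

import pandas as pd

# Hypothetical CSV content; in practice a file path would be passed to pd.read_csv.
raw_csv = io.StringIO(
    "Name,Age,City\n"
    "Asha,28,Pune\n"
    "Ravi,,Mumbai\n"
    "Asha,28,Pune\n"
    "Meera,35,\n"
)

df = pd.read_csv(raw_csv)

# Count missing values per column before cleaning.
missing_counts = df.isna().sum()

# Drop exact duplicate rows (the repeated "Asha" row).
df = df.drop_duplicates().reset_index(drop=True)

# Fill the missing age with the column median and the missing city with a placeholder.
df["Age"] = df["Age"].fillna(df["Age"].median())
df["City"] = df["City"].fillna("Unknown")

# Rename columns to a consistent lowercase style.
df = df.rename(columns={"Name": "name", "Age": "age", "City": "city"})

# Export the cleaned data; an in-memory buffer stands in for a file on disk.
buffer = io.StringIO()
df.to_csv(buffer, index=False)
cleaned_csv = buffer.getvalue()
```

Whether to fill or drop missing values depends on the dataset; `df.dropna()` is the usual alternative when the affected rows can be discarded.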

Shape, size, and basic information
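
As a small sketch of what "shape, size, and basic information" might look like in Pandas, assuming a made-up three-row DataFrame (the column names and values are hypothetical):

```python
import pandas as pd

# Hypothetical sample data, used only to illustrate the inspection calls.
df = pd.DataFrame(
    {
        "passenger_id": [1, 2, 3],
        "fare": [7.25, 71.28, 8.05],
        "sex": ["male", "female", "female"],
    }
)

n_rows, n_cols = df.shape  # number of rows and columns
n_cells = df.size          # total number of cells (rows * columns)
column_types = df.dtypes   # data type of each column

# Prints column names, non-null counts, dtypes, and memory usage.
df.info()
```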


Summary statistics with `describe()`
Measures of central tendency: Mean, median, mode
Measures of dispersion: Range, variance, standard deviation
Skewness and kurtosis
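
The descriptive statistics listed above map directly onto Pandas Series methods. The sketch below assumes a small made-up series with one large value so that the skew is visible; the variable names are illustrative only.

```python
import pandas as pd

# Hypothetical scores; the value 25 pulls the mean above the median.
scores = pd.Series([2, 4, 4, 5, 7, 9, 25], name="score")

# Count, mean, std, min, quartiles, and max in one call.
summary = scores.describe()

# Measures of central tendency.
mean = scores.mean()
median = scores.median()
modes = scores.mode()  # a Series, because there can be more than one mode

# Measures of dispersion (Pandas uses the sample formulas, ddof=1, by default).
value_range = scores.max() - scores.min()
variance = scores.var()
std_dev = scores.std()

# Shape of the distribution; kurt() reports excess kurtosis (normal distribution = 0).
skewness = scores.skew()
excess_kurtosis = scores.kurt()
```

Comparing `mean` with `median` is a quick first check for skew: here the mean is 8.0 while the median is 5.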

Line plots, scatter plots, bar charts


Customizing Matplotlib plots (titles, labels, legends)
Introduction to Seaborn

Correlation analysis
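
Correlation analysis in Pandas can be sketched as follows. The data and column names are invented for the example; `DataFrame.corr` computes Pearson correlation by default and accepts `method="spearman"` for rank correlation.

```python
import pandas as pd

# Hypothetical study data, used only for illustration.
df = pd.DataFrame(
    {
        "hours_studied": [1, 2, 3, 4, 5],
        "exam_score": [52, 58, 65, 70, 78],
        "hours_gaming": [9, 7, 6, 4, 2],
    }
)

# Pairwise Pearson correlation between all numeric columns.
corr_matrix = df.corr()

# Rank-based (Spearman) correlation, which captures monotonic relationships.
spearman = df.corr(method="spearman")

# Correlation between two specific columns.
r = df["hours_studied"].corr(df["exam_score"])
```

A strong correlation here only shows that the columns move together; it does not by itself show that one causes the other.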

Creating new features


Encoding categorical variables (label encoding, one-hot encoding)
Normalization and standardization
Violin plots, swarm plots, and joint plots (Seaborn)
Interactive visualizations with Plotly
Handling outliers (IQR, Z-score methods)
Scaling and binning
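
Several of the Day 3 items above (encoding, normalization and standardization, the IQR outlier rule, and binning) can be shown on one small table. This is a sketch under assumptions: the data, the column names, and the bin edges are hypothetical, and the 1.5 × IQR fence is the conventional choice rather than something the outline specifies.

```python
import pandas as pd

# Hypothetical data; the income of 200 is an obvious outlier.
df = pd.DataFrame(
    {
        "city": ["Pune", "Mumbai", "Pune", "Delhi", "Mumbai", "Pune"],
        "income": [30, 35, 40, 45, 50, 200],
    }
)
income = df["income"]

# Label encoding: one integer code per category (categories are sorted alphabetically).
df["city_code"] = df["city"].astype("category").cat.codes

# One-hot encoding: one indicator column per category.
one_hot = pd.get_dummies(df["city"], prefix="city")

# Normalization (min-max scaling to the range 0..1).
df["income_minmax"] = (income - income.min()) / (income.max() - income.min())

# Standardization (z-scores: zero mean, unit sample standard deviation).
df["income_z"] = (income - income.mean()) / income.std()

# IQR rule: flag values more than 1.5 * IQR outside the quartiles.
q1, q3 = income.quantile(0.25), income.quantile(0.75)
iqr = q3 - q1
df["is_outlier_iqr"] = (income < q1 - 1.5 * iqr) | (income > q3 + 1.5 * iqr)

# Binning: group a numeric column into labeled ranges (edges chosen arbitrarily here).
df["income_band"] = pd.cut(
    income, bins=[0, 40, 100, float("inf")], labels=["low", "mid", "high"]
)
```

The Z-score method would instead threshold the absolute value of `income_z`; it is not applied here because on a sample this small a single extreme value inflates the standard deviation and keeps its own z-score low.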

Choose any one project, using one dataset.


Dataset 1: Analyzing Titanic data (survival rates, gender-based analysis)
Dataset 2: Customer segmentation (e.g., mall customers dataset)
Parsing datetime data
Time-based indexing and slicing
Plotting time series trends
Automated EDA tools (`pandas_profiling`)
Creating custom reusable functions
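
The time series and reusable-function items above can be combined in one sketch: a small helper that parses a date column, indexes by it, and aggregates by month. The function name `monthly_trend`, the sample data, and the choice of a monthly mean are all assumptions made for illustration.

```python
import pandas as pd


def monthly_trend(df, date_col, value_col):
    """Hypothetical helper: return the monthly mean of value_col, indexed by month start."""
    out = df.copy()
    # Parse strings into datetime values so time-based operations work.
    out[date_col] = pd.to_datetime(out[date_col])
    # Use the dates as a sorted index, then aggregate per calendar month.
    out = out.set_index(date_col).sort_index()
    return out[value_col].resample("MS").mean()


# Hypothetical sales records with dates stored as text.
sales = pd.DataFrame(
    {
        "date": ["2024-01-05", "2024-01-20", "2024-02-10", "2024-02-25", "2024-03-15"],
        "amount": [100, 140, 90, 110, 200],
    }
)

trend = monthly_trend(sales, "date", "amount")

# Time-based slicing: with a DatetimeIndex, a partial date string selects a whole month.
indexed = sales.assign(date=pd.to_datetime(sales["date"])).set_index("date")
february = indexed.loc["2024-02"]
```

Calling `trend.plot()` would draw the trend as a line chart, assuming Matplotlib is installed. Wrapping the steps in a function means the same analysis can be rerun on another dataset by changing only the column names.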

Deriving actionable insights


Pitfalls to avoid during EDA
Choose any one real-world dataset (e.g., COVID-19, e-commerce, or finance)
Perform end-to-end EDA: Cleaning, visualization
Summarizing findings in a report
Best practices for visuals
