Pandas
Introduction
• Pandas contains data structures and data manipulation
tools designed to make data cleaning and analysis fast
and easy in Python.
• Pandas is designed for working with tabular or
heterogeneous data.
• NumPy, by contrast, is best suited for working with
homogeneous numerical array data.
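The contrast between heterogeneous and homogeneous data can be illustrated with a minimal sketch. The student records and column names below are made up for illustration and are not taken from any real dataset.

```python
import numpy as np
import pandas as pd

# Hypothetical student records: each column has its own data type
students = pd.DataFrame({
    "name": ["Aiko", "Bruno", "Chen"],   # text
    "age": [21, 23, 22],                 # integers
    "gpa": [3.4, 3.8, 3.1],              # floats
})
print(students.dtypes)  # one dtype per column

# A NumPy array stores every element with the same dtype
scores = np.array([[3.4, 3.8], [3.1, 3.9]])
print(scores.dtype)     # a single dtype for the whole array
```

Each DataFrame column keeps its own type, while the NumPy array has one type for all of its elements.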
Why Data Handling?
● We need to manage multi-dimensional data
for our AI projects.
● Organized, well-managed data is a
requirement for all steps in AI (training,
validation, testing, and prediction).
Example: International Students
Different Types of Data
● Textual data (reviews, comments, news, posts)
● Numerical data (labour force, literacy rate, economic
indicators)
● Pictures
● Videos
● Time Series (Daily temperatures for 5 years)
● Other Patterns
Common Actions on Data
● Collecting data
● Filling missing data
● Removing unnecessary data
● Handling large data
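Two of these actions, filling missing values and removing unnecessary data, might look like the sketch below in pandas. The table, its column names, and the choice of the column mean as the fill value are assumptions made for illustration only.

```python
import numpy as np
import pandas as pd

# Hypothetical raw table with one missing value and one unneeded column
raw = pd.DataFrame({
    "country": ["A", "B", "C", "D"],
    "literacy_rate": [91.0, np.nan, 78.5, 88.0],
    "notes": ["", "check", "", ""],
})

# Filling missing data: replace NaN with the column mean (one possible strategy)
mean_rate = raw["literacy_rate"].mean()
filled = raw.assign(literacy_rate=raw["literacy_rate"].fillna(mean_rate))

# Removing unnecessary data: drop the column we do not need
cleaned = filled.drop(columns=["notes"])
print(cleaned)
```

Both steps return new objects, so the original `raw` table is left unchanged.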
For starters
● Series
● DataFrame
● Data loading, storage and manipulation
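As a first look at the two core structures, here is a minimal sketch. The temperature values and labels are invented for illustration.

```python
import pandas as pd

# A Series is a one-dimensional labeled array
temps = pd.Series([21.5, 23.0, 19.8], index=["Mon", "Tue", "Wed"], name="temp_c")
print(temps["Tue"])  # look up a value by its label

# A DataFrame is a two-dimensional table of labeled columns
df = pd.DataFrame(
    {"temp_c": [21.5, 23.0, 19.8], "rainy": [False, True, False]},
    index=["Mon", "Tue", "Wed"],
)
print(df)

# Selecting a single column of a DataFrame gives back a Series
column = df["temp_c"]
print(type(column))
```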
Reading data in pandas
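A common starting point is `pd.read_csv`. The sketch below reads CSV text from an in-memory string so that it runs without any file on disk. The records are hypothetical, and in practice a file path would usually be passed instead.

```python
import io

import pandas as pd

# Hypothetical CSV content; normally this would live in a file
csv_text = """name,country,age
Aiko,Japan,21
Bruno,Brazil,23
Chen,China,22
"""

# read_csv accepts a file path or any file-like object
students = pd.read_csv(io.StringIO(csv_text))
print(students.head())  # show the first rows
```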