0% found this document useful (0 votes)
8 views10 pages

Pandas

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views10 pages

Pandas

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 10

Data Analysis and Handling with

Pandas
Introduction
• It contains data structures and data manipulation
tools designed to make data cleaning and analysis fast
and easy in Python.
• Pandas is designed for working with tabular or
heterogeneous data.
• NumPy, by contrast, is best suited for working with
homogeneous numerical array data.
● Why Data Handling
● We need to manage Multi -dimensional data
for our AI projects
● Organized and well managed data is a
requirment for All Steps in AI (Training,
Validation, Testing, and Predicting)
Example: International Students
Different Types of data….
● Textual data (Reviews, comments, News, posts)
● Numerical Data (Labour Force, litracy rate, economics
indicators
● Pictures
● Videos
● Time Series (Daily temperatures for 5 years)
● Other Patterns
Common Action over data

● Collecting and Storing,


● Filtering Sampling,
● Filling and removing Duplicates
● Merging and Splitting
Data processing pipeline: a typical example

Filling Removing
Collecting
Missing Uncessary Data

Adding new data Data Spliting Feeding to ML

Large Data
handling
For starters
● Series.
● Dataframe.
● Data loading, storage and manipulation.
Reading data in pandas

You might also like