Pandas

Pandas is a Python library for working with datasets. It provides functions for analyzing, cleaning, exploring, and manipulating data, so messy datasets can be made readable and large datasets can be summarized statistically. Its main data structures are Series and DataFrame; the older Panel structure has been removed from current releases.
PANDAS

PYTHON
Python is a high-level, interpreted, general-purpose programming language.

It supports multiple programming paradigms.

Python is meant to be an easily readable language.
WHY IS PYTHON PREFERRED FOR AI AND MACHINE LEARNING?
• A great library ecosystem, for example Pandas and Matplotlib
• A low entry barrier
• Flexibility
• Platform independence
• Readability
• Good visualization options
• Community support
• Growing popularity
What is Pandas?

• Pandas is a Python library used for working with data sets.

• It has functions for analyzing, cleaning, exploring, and manipulating data.

• The name "Pandas" refers to both "Panel Data" and "Python Data Analysis". The library was created by Wes McKinney in 2008.

• The source code for Pandas is located in this GitHub repository: https://github.com/pandas-dev/pandas
Why Use Pandas?
Pandas allows us to analyze large data sets and draw conclusions based on statistical theory.

Pandas can clean messy data sets and make them readable and relevant.

Relevant data is very important in data science.

Pandas can also delete rows that are not relevant or that contain wrong values, such as empty or NULL values. This is called cleaning the data.
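As a minimal sketch of the cleaning step described here, the example below drops rows that contain missing values. The table and its column names are made up for illustration; dropna() is only one of several cleaning tools pandas offers.

```python
import pandas as pd

# Hypothetical sample data; None marks a missing value.
raw = pd.DataFrame({
    "name": ["Asha", "Ben", None, "Dev"],
    "score": [82.0, None, 67.0, 91.0],
})

# dropna() returns a new DataFrame without the rows that contain missing values.
cleaned = raw.dropna()

print(len(raw), "rows before,", len(cleaned), "rows after")
```

The original DataFrame is left unchanged, so the cleaned result has to be assigned to a name to be kept.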
INSTALLATION OF PANDAS
If you are using pip, run the command below to install the pandas module:

pip install pandas

To import pandas in your Python script:

import pandas as pd
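One quick way to confirm that the installation worked is to import the library and print its version. This is only a sanity check, and the version printed depends on what is installed.

```python
import pandas as pd

# If the import succeeds, pandas is installed; the version string shows which release.
print(pd.__version__)
```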
DATA STRUCTURES IN PANDAS
The Pandas module has historically provided 3 data structures:

• Series: a 1-D labeled array holding data of a single type (homogeneous data).

• DataFrame: a 2-D, size-mutable tabular structure with heterogeneously typed columns.

• Panel: a 3-D, size-mutable array. Panel was deprecated in pandas 0.20 and removed in pandas 0.25, so current releases provide only Series and DataFrame.
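The difference between the 1-D and 2-D structures can be sketched as follows, using the two structures available in current pandas releases. The names ages and people and the sample values are illustrative assumptions.

```python
import pandas as pd

# A Series holds one-dimensional data of a single dtype.
ages = pd.Series([25, 32, 47])
print(ages.dtype)

# A DataFrame holds columns that may each have a different dtype.
people = pd.DataFrame({
    "name": ["Asha", "Ben", "Dev"],
    "age": [25, 32, 47],
})

# Adding a column grows the DataFrame, which is what "size-mutable" refers to.
people["is_adult"] = people["age"] >= 18
print(people.dtypes)
print(people.shape)
```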


PANDAS SERIES
CREATING A SERIES
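The original slide content for this step is not available, so the following is a small sketch of common ways to create a Series. The values and labels are made up for illustration.

```python
import pandas as pd

# From a list: pandas assigns a default integer index 0, 1, 2.
from_list = pd.Series([10, 20, 30])

# From a list with explicit index labels.
labelled = pd.Series([10, 20, 30], index=["a", "b", "c"])

# From a dict: the keys become the index labels.
from_dict = pd.Series({"x": 1.5, "y": 2.5})

print(from_list)
print(labelled["b"])
print(from_dict.index.tolist())
```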
PANDAS DATAFRAME
Creating a data frame
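One common way to create a DataFrame is from a dict of equal-length lists, where each key becomes a column name. The students table below is a hypothetical example, not data from the original slides.

```python
import pandas as pd

# Each key is a column name; each list holds that column's values.
data = {
    "name": ["Asha", "Ben", "Chen", "Dev"],
    "city": ["Delhi", "Pune", "Delhi", "Pune"],
    "marks": [82, 67, 91, 74],
}

students = pd.DataFrame(data)
print(students)
```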
Importing data from CSV to DataFrame
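A minimal sketch of loading CSV data with read_csv follows. To keep the example self-contained it reads from an in-memory string; in practice a file path such as a hypothetical "students.csv" would be passed instead.

```python
import io

import pandas as pd

# Hypothetical CSV content; the first line is used as the column headers.
csv_text = """name,city,marks
Asha,Delhi,82
Ben,Pune,67
Chen,Delhi,91
"""

# read_csv accepts a file path or a file-like object such as StringIO.
students = pd.read_csv(io.StringIO(csv_text))
print(students.head())
```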
Data Exploration
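Typical first steps when exploring a DataFrame are to look at a few rows, check its shape and column types, and compute summary statistics. The sketch below uses a hypothetical students table.

```python
import pandas as pd

students = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen", "Dev"],
    "city": ["Delhi", "Pune", "Delhi", "Pune"],
    "marks": [82, 67, 91, 74],
})

# First two rows.
print(students.head(2))

# (number of rows, number of columns).
print(students.shape)

# Column names, non-null counts, and dtypes (printed directly by info()).
students.info()

# Summary statistics for the numeric columns.
summary = students.describe()
print(summary)
```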
DATA SLICING
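Slicing selects a subset of rows and columns. The sketch below shows position-based selection with iloc and label-based selection with loc on a hypothetical students table.

```python
import pandas as pd

students = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen", "Dev"],
    "city": ["Delhi", "Pune", "Delhi", "Pune"],
    "marks": [82, 67, 91, 74],
})

# iloc slices by position; the end position is excluded.
first_two = students.iloc[0:2]

# A single column is returned as a Series.
names = students["name"]

# loc slices by label; the end label is included.
subset = students.loc[1:2, ["name", "marks"]]

print(first_two)
print(names)
print(subset)
```

Note the difference in the end bound: iloc[0:2] returns two rows, and loc[1:2] also returns two rows because label 2 is included.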
DATA GROUPING
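Grouping splits the rows by the values of a column and then aggregates each group. A minimal sketch, assuming a hypothetical students table grouped by city:

```python
import pandas as pd

students = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen", "Dev"],
    "city": ["Delhi", "Pune", "Delhi", "Pune"],
    "marks": [82, 67, 91, 74],
})

# Group rows by city, then compute the mean of marks within each group.
mean_by_city = students.groupby("city")["marks"].mean()
print(mean_by_city)
```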
DATA FILTERING
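Filtering keeps only the rows that satisfy a condition. One common approach, sketched here on a hypothetical students table, is to index the DataFrame with a boolean mask.

```python
import pandas as pd

students = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen", "Dev"],
    "city": ["Delhi", "Pune", "Delhi", "Pune"],
    "marks": [82, 67, 91, 74],
})

# A comparison on a column produces a boolean Series used as a row mask.
high_scorers = students[students["marks"] > 75]

# Combine conditions with & (and) or | (or); each condition needs parentheses.
delhi_top = students[(students["city"] == "Delhi") & (students["marks"] > 85)]

print(high_scorers)
print(delhi_top)
```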
DATA SORTING
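Sorting reorders the rows by the values in one or more columns. The sketch below uses sort_values on a hypothetical students table.

```python
import pandas as pd

students = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen", "Dev"],
    "city": ["Delhi", "Pune", "Delhi", "Pune"],
    "marks": [82, 67, 91, 74],
})

# Sort by one column, highest marks first.
by_marks = students.sort_values("marks", ascending=False)

# Sort by city (A to Z), then by marks within each city (highest first).
by_city_then_marks = students.sort_values(
    ["city", "marks"], ascending=[True, False]
)

print(by_marks)
print(by_city_then_marks)
```

sort_values returns a new DataFrame and leaves the original row order untouched.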
THANK YOU!

BY -
SHUBHAM SONI
KIMREET KAUR
KRITIKA MANCHANDA
RACHIT KAPOOR