0% found this document useful (0 votes)
13 views11 pages

4a Introduction To Pandas - PPTX - Lyst5943

Uploaded by

srajgiri40
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views11 pages

4a Introduction To Pandas - PPTX - Lyst5943

Uploaded by

srajgiri40
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

Python Libraries : PANDAS

Agenda

• Introduction of Pandas
• Pandas Dataframes
• Pandas Series
• Converting Dataframe & Series into arrays
• Functions to work view data with files
What is Pandas
• Pandas is an open-source Python Library providing
high-performance data manipulation and analysis tool

• The name Pandas is derived from word ‘Panel Data’,


and was developed by Wes McKinney for high
performance, flexible tool for data analysis.

• Using Pandas, we can accomplish five typical steps in


the processing and analysis of data, load, prepare,
manipulate, model, and analyze.

• Python with Pandas is used in a wide range of fields


including academic and commercial domains including
finance, economics, Statistics, analytics, etc.
Key Features
• Fast and efficient Dataframe objects with
customized indexing.
• Tools for loading data into in-memory from
different file formats.
• Data alignment and integrated handling of missing
data.
• Reshaping and pivoting of date sets.
• Label-based slicing, indexing and sub setting of
large data sets.
• Group by data for aggregation and
transformations.
• High performance merging and joining of data.
Time Series functionality.
Environment Setup

Standard Python distribution doesn't come


bundled with Pandas module. A lightweight
alternative is to install NumPy using popular
Python package installer, pip.

pip install pandas


Introduction to Data Structures

Pandas deals with the following three data structures −

1 Series
2 DataFrame
3 Panel

These data structures are built on top of Numpy array,


which means they are fast.
Series
• Series is a one-dimensional array like structure with
homogeneous data. For example, the following series is a
collection of integers 10, 23, 56, …
• Key Points:
• Homogeneous data
• Size Immutable
• Values of Data Mutable
DataFrame
DataFrame is a two-dimensional array with
heterogeneous data.
Data Type

The data types of the four columns are as follows

Name String
Score Float
Attempt Integer
Qualify String

Key Points:-
• Heterogeneous data
• Size Mutable
• Data Mutable
Panel

Panel is a three-dimensional data structure with


heterogeneous data.
A panel can be illustrated as a container of DataFrame.

Key Points:-
• Heterogeneous data
• Size Mutable
• Data Mutable
• Define the Pandas/Python pandas?

• Mention the different types of Data Structures in Pandas?

• Define DataFrame with example in Pandas?

• Define Series with example in Pandas?

• Define panel with example in Pandas?

You might also like