0% found this document useful (0 votes)

29 views45 pages

Python Pandas Series

Pandas is a software library for data manipulation and analysis in Python. It allows importing and storing data in Series and DataFrame objects, and performing operations on these structures. Key features include loading data from various sources, handling missing data, reshaping and pivoting datasets, and merging and joining datasets.

Uploaded by

chandram654321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views45 pages

Python Pandas Series

Uploaded by

chandram654321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 45

Python

Pandas
Series
The advantages of Pandas over Excel are
 Scalability - Pandas is only limited by hardware and can manipulate larger
quantities of data.
 Speed - Pandas is much faster than Excel, which is especially noticeable when
working with larger quantities of data.
 Automation - A lot of the tasks that can be achieved with Pandas are extremely
easy to automate, reducing the amount of tedious and repetitive tasks that need
to be performed daily.
 Interpretability - It is very easy to interpret what happens when each task is
run, and it is relatively easy to find and fix errors.
 Advanced Functions - Performing advanced statistical analysis and creating
complex visualizations is very straightforward.
Module: Module is a file which contains python functions. It is .py
file which has python executable code or statements.
Package: Package is namespace which contains multiple
packages or modules. It is a directory which contains a special
file __init__.py.
A namespace is a system that has a unique name for each and
every object in Python. An object might be a variable or a
method.
Library: It is collection of various packages. There is no difference
between package and python library conceptually.

Framework: It is a collection of various libraries which architects th

code flow.
 Data in a structure

 It will store in specific manner

 It is a collection of data values and operations that

can be applied to that data.

 It will enables efficient storage, retrieval and

modification to the data
Pandas:
Pandas is a software library for the Python programming language
written by Wes McKinney for data manipulation and analysis. The name
Pandas is derived from the term “Panel Data”. It is an open source and
free to use
We can analyze the data in pandas in two ways-
 Series

 Dataframe
Installation of pandas:

pip(preferred installation program) install

pandas
KEY FEATURES OF PANDAS
 Fast and efficient DataFrame object with default and customized
indexing.
 Tools for loading data from different file formats.
 Data alignment and integrated handling of missing data.
 Reshaping and pivoting of date sets.
 Label-based slicing and indexing of large data sets.
 Deletion/Insertion of columns from/to a data structure.
 Group by data for aggregation and transformations.
 High performance merging and joining of data.
SERIES

 Series is a one-dimensional array like structure with

homogeneous data. For example, the following series is a
collection of integers 10, 23, 56, …
SERIES (CONTD.)
 Pandas series is a one-dimensional labeled array capable of
holding data of any type (integer, string, float, python objects,
etc.).
 The axis labels are collectively called index.
 Pandas series is nothing but a column in an excel sheet.
 The object supports both integer and label-based indexing and
provides a host of methods for performing operations
involving the index.
Characteristics of series

series is a one-dimensional labeled array capable of holding

homogenous data of any type (integer, string, float etc.).

The data labels in series are numeric starting from 0 by default.

The data labels are called as indexes.

The data in series is mutable i.E. It can be changed but the size of
series is immutable i.E. Size of the series cannot be changed.
CREATING A SERIES
Pandas series can be created from the lists, dictionary, and from a
scalar value etc.
Syntax
Pandas.Series( data, index, name)
Where
Data: takes various forms like ndarray, list, constants/scalar
values, dictionary, mathematical expression
Index: are unique and hashable with same length as data.
Default is np.Arange(n) if no index is passed.
Name: allows you to give a name to a series object
Series() with arguments

SYNTAX:

<SERIES OBJECT> = PANDAS.SERIES(DATA, INDEX = IDX, [DTYPE =

<DATA TYPE>])

THE DATA SUPPLIED TO SERIES() CAN BE EITHER:

 A SEQUENCE (LIST)
 AN NDARRAY
 A SCALAR VALUE
 A PYTHON DICTIONARY
 A MATHEMATICAL EXPRESSION/FUNCTION
Here, keys of the dictionary become the indexes of the
series.
Creating a series with index of string type
String can be used as an index to the elements of a series.
Creating a series using two different lists
The two lists are passed as arguments to Series() method, out of which
the first list will be index and the other one will be the value.
Creating a series using missing values (nan)
In certain situations, we need to create a series object for which size is
defined but some elements or data are missing. This is handled by defining
NaN (Not a number) value(s), which is an attribute of Numpy library and
this can be achieved by defining a missing value using np.NaN.
Creating a series using a range()
 To create a series using range() method.

CODE:

 We can change the index in place also by

ser.index = [ ‘first’, ‘second’, ‘third’, ‘fourth’, ‘fifth’]

Creating a series with range() & for loop
Creating a series from scalar or constant values
A series can be created using a scalar or constant value as shown below. Here,
data is a scalar value for which it is a must to provide an index and the constant
value shall be repeated to match the length of the index.
CREATING A SERIES USING MATHEMATICAL
EXPRESSION/FUNCTION
A series object can be created by defining a function or a mathematical
expression that determines the values for data sequence using the syntax as
follows:
<Series Object> = pd.Series (index = None, data = <expression [function]>)
CREATING A SERIES USING A MATHEMATICAL
FUNCTION
A series using a mathematical exponentiation function.
SERIES OBJECT ATTRIBUTES
Some common attributes related to series object are described below
and are accessed using the syntax:
<series object>.<Attributename>
Attribute Description
Series.index Returns index of the series
Series.values Returns ndarray
Series.dtype Returns dtype object of the underlying data
Series.shape Returns tuple of the shape of underlying data
Series.nbytes Returns number of bytes of underlying data
Series.ndim Returns the number of dimension
Series.size Returns the number of elements

Series.hasnans Returns true if there are any NaN

Series.empty Returns true if series object is empty
INDEXING AND SELECTING DATA IN SERIES
ACCESSING DATA FROM A SERIES WITH
POSITION
We can access data from a series by passing the position
value and even through slicing.
Accessing data through iloc & loc
● indexing and accessing can also be done using iloc and loc.
● iloc :- iloc is used for indexing and selecting based on position,
i.e. by row no. and column no. it refers to position-based
indexing.

syntax: iloc = [<row no. range>, <column no. range>]

● loc :- loc is used for indexing and selecting based on name, i.e.
by row name and column name. it refers to name-based
indexing.

syntax: loc = [<list of row names>, <list of column names>]

Accessing data using iloc & loc
Retrieving values from a series using head()
And tail() functions
The Series.head() function displays first ‘n’ from a pandas object. By
default, it gives us the top 5 rows of data in the series.

The Series.tail() function displays the last 5 elements by default.

Retrieving values from a series using head()
And tail() functions
The Series.head() function displays first ‘n’ from a pandas object. By
default, it gives us the top 5 rows of data in the series.

The Series.tail() function displays the last 5 elements by default.

Mathematical operations on a series
Mathematical processing can be performed on series using scalar
values and functions. All the arithmetic operators such as +, -, *, /,
etc. can be successfully performed on series.
Example:

Note:
Arithmetic
operation is
possible on objects
of same index;
otherwise will
result as NaN.
Vector operations on a series
Vector operations mean that if you apply a function or expression
then it is individually applied on each item of the object. Since Series
objects are built upon Numpy arrays (ndarrays), they also support
vectorized operations, just like ndarrays.

All these are

vector operations.
Retrieving values using conditions
We can also give conditions to retrieve values from a series that
satisfies the given condition.
Example:

Here, it is performing the

filter operation and returns
filtered result containing only
those values that return True
for the given Boolean
expression.
DELETING ELEMENTS FROM A SERIES
We can delete an element from a series using drop( ) method by
passing the index of the element to be deleted as the argument
to it.
Example:

Series will Series after

all the deleting
elements item at
intact. index 3.
Sorting on the Values and Index
To sort a Series object on the basis of values and index,
you may use sort_values() and and sort_index().
Sorting on the Values and Index
To sort a Series object on the basis of values and index,
you may use sort_values() and and sort_index().

pandas series.sort_values() function is used to sort values on series

object. it sorts the series in ascending order or descending order, by
default it does in ascending order. you can specify your preference
using the ascending parameter which is true by default.

you may use sort_values() and and sort_index().

# syntax of series.sort_values()
● series.sort_values(axis=0, ascending=True)
sort pandas series in an ascending order:
sortedseries = myseries.sort_values()
sortedseries = myseries.sort_values(ascending=true)

# sort series contains numeric values.

import pandas as pd
myseries = pd.series([25000,30000,23000,15000,80000])
# sort pandas series in a descending order.
sortedseries = myseries.sort_values(ascending=False)

# sort inplace
myseries.sort_values(ascending=false, inplace=True)

A. Python Programming and SQL Bible. 7-In-1 Mastery.. (Lenichenko a.)
No ratings yet
A. Python Programming and SQL Bible. 7-In-1 Mastery.. (Lenichenko a.)
306 pages
Module 2 Pandas 1 (1)
No ratings yet
Module 2 Pandas 1 (1)
79 pages
Essential Guide To Data Science For Petroleum Engineers
No ratings yet
Essential Guide To Data Science For Petroleum Engineers
150 pages
Sr Ip Pandas i Full Notes
No ratings yet
Sr Ip Pandas i Full Notes
30 pages
Unit 1 FUNDAMENTALS OF DATA SCIENCE-1
No ratings yet
Unit 1 FUNDAMENTALS OF DATA SCIENCE-1
27 pages
235487-23es1201 – Python Programming
No ratings yet
235487-23es1201 – Python Programming
2 pages
Pandas - Series - Introduction
No ratings yet
Pandas - Series - Introduction
19 pages
Panda
No ratings yet
Panda
46 pages
DATA HANDLING WITH PANDAS - 1 NOTES XII IP
No ratings yet
DATA HANDLING WITH PANDAS - 1 NOTES XII IP
28 pages
DV
No ratings yet
DV
53 pages
Titanic eda
No ratings yet
Titanic eda
17 pages
Panda Ncert 1
No ratings yet
Panda Ncert 1
36 pages
pandas notes
No ratings yet
pandas notes
19 pages
Syllabus_5_Sem_B.Tech_CyberSecurity
No ratings yet
Syllabus_5_Sem_B.Tech_CyberSecurity
14 pages
Pandas
No ratings yet
Pandas
20 pages
4.
No ratings yet
4.
7 pages
Pandas Notes 1
No ratings yet
Pandas Notes 1
6 pages
ARIMA
No ratings yet
ARIMA
11 pages
Python Pandas - Series Notes
No ratings yet
Python Pandas - Series Notes
13 pages
Zomoto Data analysis using python
No ratings yet
Zomoto Data analysis using python
10 pages
Harshil Project Proposal BTECH
No ratings yet
Harshil Project Proposal BTECH
3 pages
Chapter 2 Data Handling using pandas - I(Series)
No ratings yet
Chapter 2 Data Handling using pandas - I(Series)
13 pages
Unit-1 Python Pandas (1)
No ratings yet
Unit-1 Python Pandas (1)
56 pages
PANDAS - PPT 32q
No ratings yet
PANDAS - PPT 32q
24 pages
Pandas
No ratings yet
Pandas
163 pages
Unit I: Data Handling Using Pandas and Data Visualization: Marks:25
No ratings yet
Unit I: Data Handling Using Pandas and Data Visualization: Marks:25
135 pages
Python_Learning_Planner
No ratings yet
Python_Learning_Planner
6 pages
XII - I.P. -SPLITUP-SYL.
No ratings yet
XII - I.P. -SPLITUP-SYL.
1 page
HW210-1
No ratings yet
HW210-1
14 pages
Unit 5 PythonPackages (Numpy,Pandas,Tkinter)
No ratings yet
Unit 5 PythonPackages (Numpy,Pandas,Tkinter)
68 pages
Himadri Mishra res without photo
No ratings yet
Himadri Mishra res without photo
1 page
XII-IP-QuickRevision
No ratings yet
XII-IP-QuickRevision
26 pages
Isha Iitd Pm
No ratings yet
Isha Iitd Pm
1 page
JETIR publication
No ratings yet
JETIR publication
7 pages
DV FINAL QB
No ratings yet
DV FINAL QB
60 pages
Reading Material For Data Handling Using Pandas-I
No ratings yet
Reading Material For Data Handling Using Pandas-I
51 pages
Syllabus 4th SEM B.tech(CSE) Autonomous
No ratings yet
Syllabus 4th SEM B.tech(CSE) Autonomous
21 pages
Ncert Pandas
No ratings yet
Ncert Pandas
36 pages
Class 12 IP Ch-1, 2 3
No ratings yet
Class 12 IP Ch-1, 2 3
28 pages
Python Pandas Series
No ratings yet
Python Pandas Series
30 pages
Data Analyst Interview Questions
No ratings yet
Data Analyst Interview Questions
9 pages
Cranes Resume Format
No ratings yet
Cranes Resume Format
2 pages
Mid Defence MRS
No ratings yet
Mid Defence MRS
44 pages
Resume of Data Analyst
No ratings yet
Resume of Data Analyst
2 pages
About
No ratings yet
About
3 pages
Data Handling using Pandas-1
No ratings yet
Data Handling using Pandas-1
23 pages
Python Pandas
No ratings yet
Python Pandas
22 pages
Python Pandas
100% (1)
Python Pandas
35 pages
Pandas
No ratings yet
Pandas
14 pages
Data Handlinng Using Pandas
No ratings yet
Data Handlinng Using Pandas
46 pages
Data Handlinng Using Pandas-I
No ratings yet
Data Handlinng Using Pandas-I
46 pages
XII_ip_Panda_I_Part_I_2023 (1) 1 1
No ratings yet
XII_ip_Panda_I_Part_I_2023 (1) 1 1
25 pages
12_IP_PA2_2024-25
No ratings yet
12_IP_PA2_2024-25
7 pages
XII IP Ch 1 Python Pandas - I Series
No ratings yet
XII IP Ch 1 Python Pandas - I Series
45 pages
Walmart Data Analyst Interview Experience
No ratings yet
Walmart Data Analyst Interview Experience
10 pages
Pandas basics
No ratings yet
Pandas basics
21 pages
Exp 25_26
No ratings yet
Exp 25_26
17 pages
Unit_III_part_2_1725700061785
No ratings yet
Unit_III_part_2_1725700061785
85 pages
Python Code
No ratings yet
Python Code
44 pages
1 Data Handlinng Using Pandas-I
No ratings yet
1 Data Handlinng Using Pandas-I
46 pages
Pandas
No ratings yet
Pandas
11 pages
Python Pandas
No ratings yet
Python Pandas
230 pages
Sample Question Paper Set-1
No ratings yet
Sample Question Paper Set-1
7 pages
Pandas_1_Series
No ratings yet
Pandas_1_Series
14 pages
Working With Pandas Notes
No ratings yet
Working With Pandas Notes
27 pages
Pandas Class 12 Ncertttt
No ratings yet
Pandas Class 12 Ncertttt
48 pages
LAST MINUTES REVISION Pandas Series
No ratings yet
LAST MINUTES REVISION Pandas Series
6 pages
Python Pandas
No ratings yet
Python Pandas
96 pages
Ln. 1 - Data handling using Pandas - Series & Dataframe
No ratings yet
Ln. 1 - Data handling using Pandas - Series & Dataframe
14 pages
IP TERM-1 Study Material (Session 2021-22)
No ratings yet
IP TERM-1 Study Material (Session 2021-22)
84 pages
1 IP 12 NOTES PythonPandas 2022 PDF
100% (3)
1 IP 12 NOTES PythonPandas 2022 PDF
66 pages
Study Material IP 2022
No ratings yet
Study Material IP 2022
55 pages
Class12 Pandas Notes
No ratings yet
Class12 Pandas Notes
23 pages
Unit I: Data Handling Using Pandas and Data Visualization: Marks:30
No ratings yet
Unit I: Data Handling Using Pandas and Data Visualization: Marks:30
75 pages
Data Handling Using Pandas - 1-2-1
No ratings yet
Data Handling Using Pandas - 1-2-1
10 pages
Informatics Practices Class 12 Study Material
No ratings yet
Informatics Practices Class 12 Study Material
128 pages
ML Lab8
No ratings yet
ML Lab8
28 pages
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
No ratings yet
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
47 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
25 pages
Pandas Notoes For XII PDF
No ratings yet
Pandas Notoes For XII PDF
12 pages
Final Copy of Class Xii Ip 2022-23-Worksheets-Amd
75% (8)
Final Copy of Class Xii Ip 2022-23-Worksheets-Amd
114 pages
XII-IP-QuickRevision 2 in 1
No ratings yet
XII-IP-QuickRevision 2 in 1
13 pages
Data Handling Using Pandas I - Series
No ratings yet
Data Handling Using Pandas I - Series
11 pages
Data Analytics Pandas
No ratings yet
Data Analytics Pandas
33 pages
python-notes-BCC-302 (Unit - 05)
No ratings yet
python-notes-BCC-302 (Unit - 05)
25 pages
Pandas - Series - Short - Notes
No ratings yet
Pandas - Series - Short - Notes
7 pages
Ip Practice Questions Class 12
No ratings yet
Ip Practice Questions Class 12
5 pages
Class XII Data Handlinng Using PandasI
No ratings yet
Class XII Data Handlinng Using PandasI
46 pages
Ian Talks Python A-Z
From Everand
Ian Talks Python A-Z
Ian Eress
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet

Python Pandas Series

Uploaded by

Python Pandas Series

Uploaded by

Python

Framework: It is a collection of various libraries which architects th

 It will store in specific manner

 It is a collection of data values and operations that

 It will enables efficient storage, retrieval and

pip(preferred installation program) install

 Series is a one-dimensional array like structure with

series is a one-dimensional labeled array capable of holding

The data labels in series are numeric starting from 0 by default.

<SERIES OBJECT> = PANDAS.SERIES(DATA, INDEX = IDX, [DTYPE =

THE DATA SUPPLIED TO SERIES() CAN BE EITHER:

 We can change the index in place also by

ser.index = [ ‘first’, ‘second’, ‘third’, ‘fourth’, ‘fifth’]

Series.hasnans Returns true if there are any NaN

syntax: iloc = [<row no. range>, <column no. range>]

syntax: loc = [<list of row names>, <list of column names>]

The Series.tail() function displays the last 5 elements by default.

The Series.tail() function displays the last 5 elements by default.

All these are

Here, it is performing the

Series will Series after

pandas series.sort_values() function is used to sort values on series

you may use sort_values() and and sort_index().

# sort series contains numeric values.

You might also like