0% found this document useful (0 votes)

8 views13 pages

Python Pandas - Series Notes

Pandas is an open-source Python library created in 2008 for high-performance data manipulation and analysis, featuring data structures like Series and DataFrame. Series is a one-dimensional labeled array, while DataFrame is a two-dimensional structure that can hold heterogeneous data types. The library provides various functions for data operations, including indexing, slicing, and mathematical operations.

Uploaded by

Daniel Mathew

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views13 pages

Python Pandas - Series Notes

Uploaded by

Daniel Mathew

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Python Pandas

The term "Pandas" refers to an open-source library for manipulating high-performance data in Python. It
was created in 2008 by Wes McKinney and is used for data analysis in Python. Pandas is an open-source
library that provides high-performance data manipulation in Python. Before Pandas, Python was able for
information planning, however it just offered restricted help for information investigation. As a result,
Pandas entered the picture and enhanced data analysis capabilities.
DataFrame and Series are the two data structures that Pandas provides for processing data.
The best way to think of these data structures is that the higher dimensional data structure is a container
of its lower dimensional data structure. For example, DataFrame is a container of Series, Panel is a
container of DataFrame. These data structures are discussed below
Python Pandas Series
A one-dimensional array capable of storing a variety of data types is how it is defined. The term "index"
refers to the row labels of a series. We can without much of a stretch believer the rundown, tuple, and
word reference into series utilizing "series' technique. Multiple columns cannot be included in a Series.
Only one parameter exists:
Data: It can be any list, dictionary, or scalar value.
Key Points
• Homogeneous data
• Size Immutable
• Values of Data Mutable

Python Pandas DataFrame

It is a generally utilized information design of pandas and works with a two-layered exhibit with named
tomahawks (lines and segments). As a standard method for storing data, DataFrame has two distinct
indexes-row index and column index. It has the following characteristics:
The sections can be heterogeneous sorts like int, bool, etc.

• Heterogeneous data
• Size Mutable
• Data Mutable
Series: Creation of series from NDArray, Dictionary, Scaler
values
Series
A Pandas Series is a one-dimensional labeled ndarray structure. A Pandas Series can be thought of as a
column in a spreadsheet. It consists of two main components: the labels and the data.

For example
0 'Nirmal'
1 20
2 5.3
3 False
dtype: object

Here, the series has two columns, labels (0, 1, 2 and 3) and data ('nirmal', 20, 5.3, False).

The labels are the index values assigned to each data point, while the data represents the actual values
stored in the Series.

Note: Pandas Series can store homogeneous data elements. It uses a concept called dtype (data type) to
manage and represent the underlying data in a Series.

Creating a Pandas Series

To create Series any of the following methods can be used. Make sure to import pandas library.
Creating an empty Series: Series() function of Pandas is used to create a series. A basic series, which
can be created, is an Empty Series.

# import pandas as pd
import pandas as pd
# Creating empty series
ser = pd.Series()
print(ser)
Output:
Series([], dtype: float64)

By default, the data type of Series is float.

Creating a series from array: In order to create a series from NumPy array, we have to import numpy
module and have to use array() function.
# import pandas as pd
import pandas as pd
# import numpy as np
import numpy as np
# simple array
data = np.array(['g', 'e', 'e', 'k', 's'])
ser = pd.Series(data)
print(ser)
Mathematical Operation on Series object
We can do arithmetic operations ( +, -, *, /) on more than one series objects.
The arithmetic operation is performed only on matching index. For non-matching
index it produces NaN values.

If data items of matching indexes are not compatible for the operation, it produces
NaN values as a result.
Program-1
import pandas as pd
S1 = pd.Series([12,23,34])
S2 = pd.Series([10,20,10])
print(“Addition of Series with matching indexes”)
print(S1 + S2)

Output –
Addition of Series with matching indexes
0 22
1 43
2 44
dtype: int64
Program-2
import pandas as pd
S1 = pd.Series([12,23,34,56])
S2 = pd.Series([10,20,10])
print(“Addition of Series of Different sizes”)
print(S1 + S2)

Output –
Addition of Series of Different sizes
0 22
1 43
2 44
3 NaN
dtype: int64

Program-3
import pandas as pd
S1 = pd.Series([12,23,34])
S2 = pd.Series([10,20,10],index=[‘a’,’b’,’c’])
print(“Addition of Series With Non Matching Index”)
print(S1 + S2)

Output –
Addition of Series with Non Matching Index
0 NaN
1 NaN
2 NaN
a NaN
b NaN
c NaN
dtype: float64

Program-4
What will be the output produced by the following programming statements-1 & 2?
import pandas as pd
S1=pd.Series (data=[31,41,51])
print(S1>40) -->Statement1
print(S1[S1>40]) -->Statement2
Output –
Statement-1
0 False
1 True
2 True
Statement-2
1 41
2 51

Summary
 Pandas Series is a one dimensional array like labeled structure.
 Series labels need not be unique but must be a hashable type.
 Homogenous – Series elements must be of the same data type.
 Size-immutable – Once created, the size of a Series object cannot be
changed.
 The series object supports both integer and label-based indexing and
provides various methods for performing operations involving the index.
 Series can be created using List, array, dictionary and scalar value.
Head function
The head function in Python displays the first five rows of the dataframe by default.
It takes in a single parameter: the number of rows. We can use this parameter to
display the number of rows of our choice.

Syntax of head function is defined as follows:

dataframe.head(N)

N refers to the number of rows. If no parameter is passed, the first five rows are
returned.

import pandas as pd
# Creating a dataframe
df = pd.DataFrame({'Sports': ['Football', 'Cricket', 'Baseball', 'Basketball',
'Tennis', 'Table-tennis', 'Archery', 'Swimming', 'Boxing']})
print(df.head()) # By default
print('\n')
print(df.head(3)) # Printing first 3 rows
print('\n')
print(df.head(-2)) # Printing all except the last 2 rows

Sports
0 Football
1 Cricket
2 Baseball
3 Basketball
4 Tennis

Sports
0 Football
1 Cricket
2 Baseball

Sports
0 Football
1 Cricket
2 Baseball
3 Basketball
4 Tennis
5 Table-tennis
6 Archery
Tail function
The tail function in Python displays the last five rows of the dataframe by default. It
takes in a single parameter: the number of rows. We can use this parameter to
display the number of rows of our choice.

Syntax
The tail function is defined as follows:
dataframe.tail(N)

N refers to the number of rows. If no parameter is passed, the last five rows are
returned.
The tail function also supports negative values of N. In that case, all rows except
the first N rows are returned.
# Creating a dataframe
df = pd.DataFrame({'Sports': ['Football', 'Cricket', 'Baseball', 'Basketball',
'Tennis', 'Table-tennis', 'Archery', 'Swimming', 'Boxing']})
print(df.tail()) # By default
print('\n')
print(df.(3)) tail# Printing last 3 rows
print('\n')
print(df.tail(-2)) # Printing all except the first 2 rows

Sports
4 Tennis
5 Table-tennis
6 Archery
7 Swimming
8 Boxing

Sports
6 Archery
7 Swimming
8 Boxing

Sports
2 Baseball
3 Basketball
4 Tennis
5 Table-tennis
6 Archery
7 Swimming
8 Boxing

Indexing/Slices from Series Object

A slice object is created from Series object using a syntax of <object>[Start : end :
step] but the start and stop signify the positions of elements not the indexes. The
slice object of a series object is also a panda Series type object.

Slicing takes place position wise and not the index wise in a series object

The index [] operator can be used to perform indexing and slicing operations on a
Series object. The index[] operator can accept either-Index/labels
Integer index positions
Using the index operator with labels-
The index operator can be used in the following ways-
Using a single label inside the square brackets- Using a single label/index inside
the square brackets will return only the corresponding element referred to by that
label/index.
Using multiple labels- We can pass multiple labels in any order that is present in
the Series object. The multiple labels must be passed as a list i.e. the multiple
labels must be separated by commas and enclosed in double square brackets.
Passing a label is passed that is not present in the Series object, should be avoided
as it right now gives NaN as the value but in future will be considered as an error
by Python.

# indexing a Series object multiple labels

import pandas as pd
d={'a':101, 'b':102, 'c':103, 'd':104, 'e':105, 'f':106}
s=pd.Series(d)
u=s[['b', 'a', 'f']]
print(u)

o/p:

b 102
a 101
f 106
dtype: int64

Using slice notation start label : end label-

Inside the index operator we can pass start label : end label. Here contrary to the
slice concept all the items from start label values till the end label values including
the end label values is returned back.
# indexing a Series object using startlabel : endlabel

import pandas as pd
d={'a':101, 'b':102, 'c':103, 'd':104, 'e':105, 'f':106}
s=pd.Series(d)
u=s['b':'e’]
print(u)
Output

b 102
c 103
d 104
e 105
dtype: int64
Slicing a Series object using Integer Index positions-
The concept of slicing a Series object is similar to that of slicing python lists, strings
etc. Even though the data type of the labels can be anything each element of the
Series object is associated with two integer numbers:

In forward indexing method the elements are numbered from 0,1,2,3, … with 0
being assigned to thefirst element, 1 being assigned to the second element and so
on.

In backward indexing method the elements are numbered from -1,-2, -3,
… with -1 being assigned tothe last element, -2 being assigned to the second last
element and so on.
d={'a':101, 'b':102, 'c':103, 'd':104, 'e':105, 'f':106}
s=pd.Series(d)
The Series object is having the following integer index positions-

Slice concept-
The basic concept of slicing using integer index positions is common to Python
object such as strings, list, tuples, Series, Dataframe etc. Slice creates a new object
using elements of an existing object. It is created as: ExistingObjectName[start :
stop : step] where start, stop , step are integers

# Slicing a Series object

import pandas as pd
d={'a':101, 'b':111, 'c':121, 'd':131, 'e':141, 'f':151}
s=pd.Series(d)
x=s[1: :2]
print('x=\n', x)
y=s[-1: :-1]
print('y=\n', y)
z=s[1: -2: 2]
print('z=\n', z)

Output
x=
b 111
d 131
f 151
dtype: int64
y=
f 151
e 141
d 131
c 121
b 111
a 101
dtype: int64
z=
b 111
d 131
dtype: int64
Modifying elements of Series object-
The elements of a Series object can be modified using any of the following
methods-
Using index [ ] operator to modify single/multiple values
# Modifying a Series object index [ ] method
import pandas as pd
d={'a':101, 'b':111, 'c':121, 'd':131, 'e':141, 'f':151}
a 777
b 111
c 555
d 131
e 141
f 666
dtype: int64 s
s=
a 777
0
1
2
e 141
f 666
dtype: int64

string at/iat property to modify a single value

# Modifying a Series object at/iat property
import pandas as pd
d={'a':101, 'b':111, 'c':121, 'd':131, 'e':141, 'f':151}
s=pd.Series(d)
s['c'] = 555
s[['f','a']] = [666,777]
print('s=\n', s)
s['b':'d']=[0,1,2]
print('s=\n', s)

Output s=
a 101
b 111
c 121
d 999
e 141
f 777
dtype : int64

Using loc, iloc property to modify single /multiple values

#Modifying a Series object loc/iloc property
import pandas as pd
d={'a':101, 'b':111, 'c':121, 'd':131, 'e':141, 'f':151}
s=pd.Series(d)
s.loc['b'] = 9
s.loc['e':'f'] = [8,7]
print('s=\n', s)
s.iloc[1: :2] = [33,44,55]
print('s=\n', s)

Output s=
a 101
b9
c 121
d 131
e8
f7
dtype: int64

s=
a 101
b 33
c 121
d 44
e8
f 55

e) Using slice method to modify multiple values

# Modifying a Series object slice method

import pandas as pd
d={'a':101, 'b':111, 'c':121, 'd':131, 'e':141, 'f':151}
s=pd.Series(d)
s[1: :2] = [1,2,3]
print('s=\n', s)

Output s=
a 101
b1
c 121
d2
e 141
f3
dtype : int64

Changing indexes of Series object-

The index property can be used to change the indexes of a Series object import
pandas as pd

# Changing indexes of Series object

import pandas as pd
d={'a':101, 'b':111, 'c':121, 'd':131}
s=pd.Series(d)
s.index = ['have','a','nice', 'day']
print('s=\n', s)

Output
s=
have 101
A 111
Nice 121
Day 131
dtype: int64

Working With Pandas Notes
No ratings yet
Working With Pandas Notes
27 pages
1 IP 12 NOTES PythonPandas 2022 PDF
100% (3)
1 IP 12 NOTES PythonPandas 2022 PDF
66 pages
Class XII IP IMP Notes & Sample Papers
No ratings yet
Class XII IP IMP Notes & Sample Papers
125 pages
Python Pandas
100% (1)
Python Pandas
35 pages
Class12 Pandas Notes
No ratings yet
Class12 Pandas Notes
23 pages
Study Material IP 2022
No ratings yet
Study Material IP 2022
55 pages
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
No ratings yet
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
47 pages
Data Analytics Pandas
No ratings yet
Data Analytics Pandas
33 pages
Python Pandas
No ratings yet
Python Pandas
22 pages
Data Handling Using Pandas I - Series
No ratings yet
Data Handling Using Pandas I - Series
11 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
25 pages
12ip 22 23
No ratings yet
12ip 22 23
188 pages
Unit 1 Pandas - Series and DataFrame
No ratings yet
Unit 1 Pandas - Series and DataFrame
19 pages
On Data Handling Using Pandas-I
100% (2)
On Data Handling Using Pandas-I
64 pages
ML Lab8
No ratings yet
ML Lab8
28 pages
Unit I: Data Handling Using Pandas and Data Visualization: Marks:30
No ratings yet
Unit I: Data Handling Using Pandas and Data Visualization: Marks:30
75 pages
Class XII Data Handlinng Using PandasI
No ratings yet
Class XII Data Handlinng Using PandasI
46 pages
Python Pandas Series
No ratings yet
Python Pandas Series
45 pages
CH 02 - Data Handling Using Pandas Leip102 EDITED Smaller 01 Codes Only
No ratings yet
CH 02 - Data Handling Using Pandas Leip102 EDITED Smaller 01 Codes Only
15 pages
Exp8 SBLC
No ratings yet
Exp8 SBLC
9 pages
Chapter 1 and 2 Series and Data Frame
No ratings yet
Chapter 1 and 2 Series and Data Frame
45 pages
Reading Material For Data Handling Using Pandas-I
No ratings yet
Reading Material For Data Handling Using Pandas-I
51 pages
Data Handling Using Pandas - 1-2-1
No ratings yet
Data Handling Using Pandas - 1-2-1
10 pages
Class 12 IP Ch-1, 2 3
No ratings yet
Class 12 IP Ch-1, 2 3
28 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
38 pages
Python Data Processing
No ratings yet
Python Data Processing
36 pages
Unit I: Data Handling Using Pandas and Data Visualization: Marks:25
No ratings yet
Unit I: Data Handling Using Pandas and Data Visualization: Marks:25
135 pages
1 Data Handlinng Using Pandas-I
No ratings yet
1 Data Handlinng Using Pandas-I
46 pages
Unit III Part 2 1725700061785
No ratings yet
Unit III Part 2 1725700061785
85 pages
12 IP Questions
No ratings yet
12 IP Questions
181 pages
Pandas 1 Series
No ratings yet
Pandas 1 Series
14 pages
Pandas Class 12 Ncertttt
No ratings yet
Pandas Class 12 Ncertttt
48 pages
Java Mod1 Part1
No ratings yet
Java Mod1 Part1
80 pages
Bca Syllabus Bbmku 2021-25
No ratings yet
Bca Syllabus Bbmku 2021-25
60 pages
Pandas
No ratings yet
Pandas
20 pages
Chapter 5 Javascriptdocument
No ratings yet
Chapter 5 Javascriptdocument
122 pages
The Studio 3T Field Guide To MongoDB Aggregation
No ratings yet
The Studio 3T Field Guide To MongoDB Aggregation
148 pages
WBP Epa
100% (1)
WBP Epa
56 pages
Ln. 1 - Data Handling Using Pandas - Series & Dataframe
No ratings yet
Ln. 1 - Data Handling Using Pandas - Series & Dataframe
14 pages
XII - Ip - Panda - I - Part - I - 2023 (1) 1 1
No ratings yet
XII - Ip - Panda - I - Part - I - 2023 (1) 1 1
25 pages
CSE488 Lab5 Pandas
No ratings yet
CSE488 Lab5 Pandas
27 pages
Mongo DB
No ratings yet
Mongo DB
77 pages
XII IP CH 1 Python Pandas - I Series
No ratings yet
XII IP CH 1 Python Pandas - I Series
45 pages
Data Handlinng Using Pandas-I
No ratings yet
Data Handlinng Using Pandas-I
46 pages
Viva Manual-1
No ratings yet
Viva Manual-1
18 pages
Unit-1 Python Pandas
No ratings yet
Unit-1 Python Pandas
56 pages
4 SC
No ratings yet
4 SC
7 pages
CDL in CDS
No ratings yet
CDL in CDS
14 pages
cc102 Module
No ratings yet
cc102 Module
80 pages
BigQuery Remote Function User Guide
No ratings yet
BigQuery Remote Function User Guide
7 pages
PF Theory Course Outline
No ratings yet
PF Theory Course Outline
8 pages
Collection Framework
No ratings yet
Collection Framework
51 pages
Chapter 6 PHP
No ratings yet
Chapter 6 PHP
57 pages
Chapter 2 Data Handling Using Pandas - I (Series)
No ratings yet
Chapter 2 Data Handling Using Pandas - I (Series)
13 pages
Object Oriented Programming: Lab 01 Introduction To C++ (Revision of Control Structure, Arrays, Functions)
No ratings yet
Object Oriented Programming: Lab 01 Introduction To C++ (Revision of Control Structure, Arrays, Functions)
9 pages
01 Data Handlinng Using Pandas-I-1-9
No ratings yet
01 Data Handlinng Using Pandas-I-1-9
9 pages
D2XX Programmers Guide
No ratings yet
D2XX Programmers Guide
109 pages
PYTHON UNIT-5 Part-C
No ratings yet
PYTHON UNIT-5 Part-C
4 pages
Cs 508 Mines Mcqs Midterm
No ratings yet
Cs 508 Mines Mcqs Midterm
14 pages
Exp 25 - 26
No ratings yet
Exp 25 - 26
17 pages
The Go Programming Language Specification - The Go Programming Language
No ratings yet
The Go Programming Language Specification - The Go Programming Language
110 pages
Data Analysis and Visualization Using Python Libraries and Streamlit - RTF Pre Read Materials
No ratings yet
Data Analysis and Visualization Using Python Libraries and Streamlit - RTF Pre Read Materials
29 pages
Arrays (1D, 2D, 3D)
No ratings yet
Arrays (1D, 2D, 3D)
20 pages
Object-Oriented Approach To Programming Logic and Design 4th Edition Joyce Farrell Test Bank 1
100% (75)
Object-Oriented Approach To Programming Logic and Design 4th Edition Joyce Farrell Test Bank 1
7 pages
Data Handling Using Pandas-1 - Series Object Notes PDF
No ratings yet
Data Handling Using Pandas-1 - Series Object Notes PDF
25 pages
Computer Sample Paper
No ratings yet
Computer Sample Paper
66 pages
CSC 204 Session 1
No ratings yet
CSC 204 Session 1
16 pages
Data Handlinng Using Pandas
No ratings yet
Data Handlinng Using Pandas
46 pages
Python UnitIV
No ratings yet
Python UnitIV
20 pages
OOP Using Java Unit 1 Notes
No ratings yet
OOP Using Java Unit 1 Notes
47 pages
Vectors Lists Sequences Skip Lists
No ratings yet
Vectors Lists Sequences Skip Lists
56 pages
Python Pandas Series
No ratings yet
Python Pandas Series
30 pages
Module 3 Notes Arrays
No ratings yet
Module 3 Notes Arrays
20 pages
Ip Notes
No ratings yet
Ip Notes
20 pages
9618 Specimen Paper Answers Paper 2 (For Examination From 2021)
100% (1)
9618 Specimen Paper Answers Paper 2 (For Examination From 2021)
28 pages
Pandas Notes
No ratings yet
Pandas Notes
19 pages
Java A Detailed Approach To Practical Coding (Step-By-Step Java Book 2)
No ratings yet
Java A Detailed Approach To Practical Coding (Step-By-Step Java Book 2)
129 pages
Pra 3
No ratings yet
Pra 3
7 pages
Unit 9 - V1
No ratings yet
Unit 9 - V1
16 pages
Introducing Python Pandas
No ratings yet
Introducing Python Pandas
54 pages
SR Ip Pandas I Full Notes
No ratings yet
SR Ip Pandas I Full Notes
30 pages
Data Handling With Pandas - 1 Notes Xii Ip
No ratings yet
Data Handling With Pandas - 1 Notes Xii Ip
28 pages
Panda
No ratings yet
Panda
46 pages
Pandas
No ratings yet
Pandas
57 pages
Student Support Material 25-26 Subject Ip065
No ratings yet
Student Support Material 25-26 Subject Ip065
104 pages
Pandas
No ratings yet
Pandas
163 pages
Introduction To Pandas
No ratings yet
Introduction To Pandas
9 pages
MLL Ip Xii
No ratings yet
MLL Ip Xii
22 pages

Python Pandas - Series Notes

Uploaded by

Python Pandas - Series Notes

Uploaded by

Python Pandas

Python Pandas DataFrame

Creating a Pandas Series

By default, the data type of Series is float.

Syntax of head function is defined as follows:

Indexing/Slices from Series Object

# indexing a Series object multiple labels

Using slice notation start label : end label-

# Slicing a Series object

string at/iat property to modify a single value

Using loc, iloc property to modify single /multiple values

e) Using slice method to modify multiple values

Changing indexes of Series object-

# Changing indexes of Series object

You might also like