Pandas Practice

The document outlines a practice lab for using the Pandas library in Python, focusing on creating DataFrames and Series, as well as selecting and slicing data. It includes exercises on using the loc() and iloc() functions for data selection, along with practical coding examples. The lab aims to enhance understanding of data manipulation using Pandas within a 30-minute timeframe.

Pandas_Practice

March 6, 2025

1 Practice Lab: Selecting data in a Dataframe

Estimated time needed: 30 minutes

1.1 Objectives
After completing this lab you will be able to:
• Use Pandas Library to create DataFrame and Series
• Locate data in the DataFrame using loc() and iloc() functions
• Use slicing

1.1.1 Exercise 1: Pandas: DataFrame and Series

Pandas is a popular library for data analysis built on top of the Python programming language.
Pandas provides two main data structures for manipulating data:
• DataFrame
• Series
A DataFrame is a two-dimensional data structure, i.e., data is aligned in a tabular fashion in rows
and columns.
• A Pandas DataFrame is usually created by loading a dataset from existing storage.
• Storage can be a SQL database, a CSV file, an Excel file, etc.
• It can also be created from lists, dictionaries, or a list of dictionaries.
A Series represents a one-dimensional array of indexed data. It has two main components:
1. An array of actual data.
2. An associated array of indexes or data labels.
The index is used to access individual data values. You can also get a column of a DataFrame as a
Series. You can think of a Pandas Series as a single column of a DataFrame.
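
The ways of building these structures described above can be sketched as follows. This is a minimal illustration rather than part of the lab itself; the names `fruits`, `from_dict`, and `from_records`, and the sample values, are hypothetical.

```python
import pandas as pd

# A Series: one-dimensional data plus an index of labels (made-up sample values)
fruits = pd.Series([3, 5, 7], index=['apple', 'banana', 'cherry'])
print(fruits['banana'])   # the index label gives access to a single value

# A DataFrame from a dictionary of lists: each key becomes a column label
from_dict = pd.DataFrame({'Name': ['Rose', 'John'], 'ID': [1, 2]})

# A DataFrame from a list of dictionaries: each dictionary becomes one row
from_records = pd.DataFrame([{'Name': 'Rose', 'ID': 1},
                             {'Name': 'John', 'ID': 2}])

# Both routes build the same table
print(from_dict.equals(from_records))

# A single column of a DataFrame is a Series
print(type(from_dict['Name']).__name__)
```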

[1]: !pip install pandas

Collecting pandas
Downloading pandas-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (89 kB)
Collecting numpy>=1.26.0 (from pandas)
Downloading numpy-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (62 kB)
Requirement already satisfied: python-dateutil>=2.8.2 in /opt/conda/lib/python3.12/site-packages (from pandas) (2.9.0.post0)
Requirement already satisfied: pytz>=2020.1 in /opt/conda/lib/python3.12/site-packages (from pandas) (2024.2)
Collecting tzdata>=2022.7 (from pandas)
Downloading tzdata-2025.1-py2.py3-none-any.whl.metadata (1.4 kB)
Requirement already satisfied: six>=1.5 in /opt/conda/lib/python3.12/site-packages (from python-dateutil>=2.8.2->pandas) (1.17.0)
Downloading pandas-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (12.7 MB)
Downloading numpy-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (16.1 MB)
Downloading tzdata-2025.1-py2.py3-none-any.whl (346 kB)
Installing collected packages: tzdata, numpy, pandas
Successfully installed numpy-2.2.3 pandas-2.2.3 tzdata-2025.1

[2]: # let us import the Pandas Library

import pandas as pd

Once you have imported pandas, you can use its built-in functions to create and analyze data.
In this practice lab, we will learn how to create a DataFrame out of a dictionary.
Let us consider a dictionary ‘x’ with keys and values as shown below.
We then create a DataFrame from the dictionary by passing it to pd.DataFrame().

[3]: # Define a dictionary 'x'

x = {'Name': ['Rose', 'John', 'Jane', 'Mary'],
     'ID': [1, 2, 3, 4],
     'Department': ['Architect Group', 'Software Group', 'Design Team', 'Infrastructure'],
     'Salary': [100000, 80000, 50000, 60000]}

# casting the dictionary to a DataFrame

df = pd.DataFrame(x)

# display the result
df

[3]:
   Name  ID       Department  Salary
0  Rose   1  Architect Group  100000
1  John   2   Software Group   80000
2  Jane   3      Design Team   50000
3  Mary   4   Infrastructure   60000

We can see the direct correspondence between the dictionary and the table. The keys correspond to
the column labels, and each list of values fills the column under its key, one entry per row.

Column Selection: To select a column in a Pandas DataFrame, we access it by its column name.
Let’s retrieve the data present in the ID column.

[4]: #Retrieving the "ID" column and assigning it to a variable x

x = df[['ID']]
x

[4]: ID
0 1
1 2
2 3
3 4

Let’s use the type() function and check the type of the variable.

[5]: #check the type of x

type(x)

[5]: pandas.core.frame.DataFrame

The output shows us that the type of the variable is a DataFrame object.
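
The result is a DataFrame because the column name was passed inside a list (double brackets). The following sketch, which rebuilds a smaller version of the lab's table and uses the hypothetical names `as_frame` and `as_series`, contrasts that with a single bracket:

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4]})

as_frame = df[['ID']]   # a list of column names -> DataFrame with one column
as_series = df['ID']    # a single column name   -> Series

print(type(as_frame).__name__)           # DataFrame
print(type(as_series).__name__)          # Series
print(as_frame.shape, as_series.shape)   # (4, 1) (4,)
```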

Access to multiple columns: Let us retrieve the data for the Department, Salary and ID columns.

[6]: # Retrieving the Department, Salary and ID columns and assigning it to a variable z

z = df[['Department','Salary','ID']]
z

[6]: Department Salary ID

0 Architect Group 100000 1
1 Software Group 80000 2
2 Design Team 50000 3
3 Infrastructure 60000 4

1.1.2 Try it yourself

Problem 1: Create a dataframe to display the result as below:
[7]: #write your code here

Click here for the solution

a = {'Student':['David', 'Samuel', 'Terry', 'Evan'],
'Age':['27', '24', '22', '32'],
'Country':['UK', 'Canada', 'China', 'USA'],
'Course':['Python','Data Structures','Machine Learning','Web Development'],
'Marks':['85','72','89','76']}
df1 = pd.DataFrame(a)
df1

Problem 2: Retrieve the Marks column and assign it to a variable b

[8]: #write your code here

Click here for the solution

b = df1[['Marks']]
b

Problem 3: Retrieve the Country and Course columns and assign it to a variable c
[9]: #write your code here

Click here for the solution

c = df1[['Country','Course']]
c

To view the column as a series, just use one bracket:

[10]: # Get the Student column as a series Object

x = df1['Student']
x

NameError: name 'df1' is not defined

This error appears when the cell is run before df1 exists. Run the Problem 1 solution, which
creates df1, and then re-run this cell.

[ ]: #check the type of x

type(x)

The output shows us that the type of the variable is a Series object.
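
As a runnable sketch of the single-bracket case, the block below first rebuilds df1 exactly as in the Problem 1 solution and then pulls out the Student column. The final two prints show the two components of a Series: its index and its data.

```python
import pandas as pd

# df1 as defined in the Problem 1 solution
a = {'Student': ['David', 'Samuel', 'Terry', 'Evan'],
     'Age': ['27', '24', '22', '32'],
     'Country': ['UK', 'Canada', 'China', 'USA'],
     'Course': ['Python', 'Data Structures', 'Machine Learning', 'Web Development'],
     'Marks': ['85', '72', '89', '76']}
df1 = pd.DataFrame(a)

# One bracket returns the column as a Series
x = df1['Student']
print(type(x).__name__)   # Series

print(x.index.tolist())   # the index labels: [0, 1, 2, 3]
print(x.tolist())         # the data: ['David', 'Samuel', 'Terry', 'Evan']
```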

1.1.3 Exercise 2: loc() and iloc() functions
loc is a label-based data selection method, which means that we have to pass the name (label) of the
row or column that we want to select. It is written with square brackets rather than called like a
function. When a range of labels is passed, loc includes the last element of the range.
Simple syntax for your understanding:
• loc[row_label, column_label]
iloc is an integer-position-based selection method, which means that we have to pass an integer
position to select a specific row or column. It is also written with square brackets. When a range is
passed, iloc does not include the last element of the range.
Simple syntax for your understanding:
• iloc[row_index, column_index]
Let us see some examples of both.

[ ]: # Access the value on the first row and the first column

df.iloc[0, 0]

[ ]: # Access the value on the first row and the third column

df.iloc[0,2]

[ ]: # Access the value in the row labelled 0 using the column name

df.loc[0, 'Salary']
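
The three cells above are shown without output. As a self-contained sketch, the block below rebuilds the lab's DataFrame and prints what each lookup returns:

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4],
                   'Department': ['Architect Group', 'Software Group',
                                  'Design Team', 'Infrastructure'],
                   'Salary': [100000, 80000, 50000, 60000]})

print(df.iloc[0, 0])         # first row, first column  -> Rose
print(df.iloc[0, 2])         # first row, third column  -> Architect Group
print(df.loc[0, 'Salary'])   # row label 0, column 'Salary' -> 100000
```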

Let us create a new dataframe called ‘df2’ and assign ‘df’ to it. Now, let us set the “Name” column
as an index column using the method set_index().

[ ]: df2=df
df2=df2.set_index("Name")

[ ]: #To display the first 5 rows of new dataframe

df2.head()

[ ]: # Now, let us access a value using the row name and the column name

df2.loc['Jane', 'Salary']
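
A short sketch of what set_index() changes, using the same data as the lab. Note that set_index() returns a new DataFrame and leaves df untouched, and that "Name" stops being a regular column in df2, so the positions of the remaining columns shift by one.

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4],
                   'Department': ['Architect Group', 'Software Group',
                                  'Design Team', 'Infrastructure'],
                   'Salary': [100000, 80000, 50000, 60000]})

# set_index returns a new DataFrame; df itself keeps its 0..3 index
df2 = df.set_index('Name')

print(df.columns.tolist())    # ['Name', 'ID', 'Department', 'Salary']
print(df2.columns.tolist())   # ['ID', 'Department', 'Salary']
print(df2.index.tolist())     # ['Rose', 'John', 'Jane', 'Mary']

# Rows can now be looked up by name
print(df2.loc['Jane', 'Salary'])   # 50000

# Position 0 among the columns is now 'ID', not 'Name'
print(df2.iloc[0, 0])              # 1
```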

1.1.4 Try it yourself

Use the loc() function to get the Department of Jane in the newly created dataframe df2.

[ ]: #write your code here

Click here for the solution

df2.loc['Jane', 'Department']

Use the iloc() function to get the Salary of Mary in the newly created dataframe df2.

[ ]: #write your code here

Click here for the solution

df2.iloc[3,2]

1.1.5 Exercise 3: Slicing

Slicing uses the [] operator to select a set of rows and/or columns from a DataFrame.
To slice out a set of rows, use the syntax data[start:stop], where start is the index of the first
row to select and stop is the index one step BEYOND the last row you want to select. You can slice
by integer position or by label.
NOTE: When slicing in pandas, the start bound is included in the output.
So if you want to select rows 0, 1, and 2, your code would look like this: df.iloc[0:3].
This tells Python to start at index 0 and select rows 0, 1, and 2, up to but not including 3.
NOTE: Labels must be found in the DataFrame or you will get a KeyError.
Indexing by labels (i.e., using loc) differs from indexing by integers (i.e., using iloc). With loc,
both the start bound and the stop bound are inclusive. Integers can be used with loc, but they
refer to the index label and not the position.
For example, selecting 1:4 with loc can give a different result than selecting rows 1:4 with iloc.
We can also select a specific data value using a row and column location within the DataFrame
and iloc indexing.
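
The difference between the two kinds of bounds can be sketched on the lab's DataFrame. The names `by_position` and `by_label` are hypothetical, and the label 10 is simply one that does not exist in the index.

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4],
                   'Department': ['Architect Group', 'Software Group',
                                  'Design Team', 'Infrastructure'],
                   'Salary': [100000, 80000, 50000, 60000]})

by_position = df.iloc[1:2]   # stop bound excluded -> row 1 only
by_label = df.loc[1:2]       # stop bound included -> rows 1 and 2

print(len(by_position), len(by_label))   # 1 2

# Plain [] row slicing follows the positional rule as well
print(len(df[0:3]))                      # 3

# Looking up a label that is not in the index raises a KeyError
try:
    df.loc[10, 'Name']
except KeyError as err:
    print('KeyError:', err)
```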

[ ]: # let us do the slicing using old dataframe df

df.iloc[0:2, 0:3]

[ ]: # let us do the slicing using loc() function on old dataframe df
# where index column is having labels as 0,1,2

df.loc[0:2,'ID':'Department']

[ ]: # let us do the slicing using loc() function on new dataframe df2
# where index column is Name having labels: Rose, John and Jane

df2.loc['Rose':'Jane', 'ID':'Department']
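
Since the slicing cells above are shown without output, here is a self-contained sketch of what each one selects. The variable names `a`, `b`, and `c` are hypothetical and are used only to hold the results.

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4],
                   'Department': ['Architect Group', 'Software Group',
                                  'Design Team', 'Infrastructure'],
                   'Salary': [100000, 80000, 50000, 60000]})
df2 = df.set_index('Name')

# Positions: rows 0-1 and columns 0-2 (both stop bounds excluded)
a = df.iloc[0:2, 0:3]
print(a.shape, a.columns.tolist())   # (2, 3) ['Name', 'ID', 'Department']

# Labels: rows 0-2 and columns 'ID' through 'Department' (stop bounds included)
b = df.loc[0:2, 'ID':'Department']
print(b.shape, b.columns.tolist())   # (3, 2) ['ID', 'Department']

# Labels on the Name index: rows 'Rose' through 'Jane', inclusive
c = df2.loc['Rose':'Jane', 'ID':'Department']
print(c.index.tolist())              # ['Rose', 'John', 'Jane']
```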

Try it yourself
Using the loc() function, slice the old dataframe df to retrieve the Name, ID and Department
columns for the rows with index labels 2 and 3.

[ ]: # Write your code below and press Shift+Enter to execute

Click here for the solution

df.loc[2:3,'Name':'Department']

Congratulations, you have completed this lesson and the practice lab on Pandas

Date (YYYY-MM-DD)   Version   Changed By            Change Description
2022-03-31          0.1       Appalabhaktula Hema   Created initial version
