0% found this document useful (0 votes)

11 views7 pages

Python Filtering

The document is a Jupyter Notebook that demonstrates various methods of filtering data in a pandas DataFrame using Python. It includes examples of filtering based on conditions, selecting specific columns, and comparing the use of loc and iloc functions. The notebook provides a practical guide for users to manipulate and analyze data effectively.

Uploaded by

paulrajarshi7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views7 pages

Python Filtering

Uploaded by

paulrajarshi7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

2/24/2020 3.

2_filtering - Jupyter Notebook

Filtering
In [1]:

import numpy as np
import pandas as pd

In [2]:

# Create a Dictionary
d = {
'Name':['Amarend','Ajay','Preety','Rakesh','Raju','Shyam',
'Kiran','Rishi','Prem','Raj','Ravina','Premjit'],
'Exam':['Semester 1','Semester 1','Semester 1','Semester 1','Semester 1','Semester 1',
'Semester 2','Semester 2','Semester 2','Semester 2','Semester 2','Semester 2'],

'Subject':['Mathematics','Mathematics','Mathematics','Science','Science','Science',
'Mathematics','Mathematics','Mathematics','Science','Science','Science'],
'Score':[62,47,55,74,31,77,85,63,42,67,89,81]}

# Create a dataframe
df = pd.DataFrame(d,columns=['Name','Exam','Subject','Score'])
df

Out[2]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

1 Ajay Semester 1 Mathematics 47

2 Preety Semester 1 Mathematics 55

3 Rakesh Semester 1 Science 74

4 Raju Semester 1 Science 31

5 Shyam Semester 1 Science 77

6 Kiran Semester 2 Mathematics 85

7 Rishi Semester 2 Mathematics 63

8 Prem Semester 2 Mathematics 42

9 Raj Semester 2 Science 67

10 Ravina Semester 2 Science 89

11 Premjit Semester 2 Science 81

View a column of the dataframe in pandas python:

localhost:8888/notebooks/Machine Learning/Python/3.2_filtering.ipynb 1/7

2/24/2020 3.2_filtering - Jupyter Notebook

In [5]:

df['Name']

Out[5]:

0 Amarend
1 Ajay
2 Preety
3 Rakesh
4 Raju
5 Shyam
6 Kiran
7 Rishi
8 Prem
9 Raj
10 Ravina
11 Premjit
Name: Name, dtype: object

View two or more columns of the dataframe in pandas:

In [18]:

df[['Name', 'Score']]

Out[18]:

Name Score

0 Amarend 62

1 Ajay 47

2 Preety 55

3 Rakesh 74

4 Raju 31

5 Shyam 77

6 Kiran 85

7 Rishi 63

8 Prem 42

9 Raj 67

10 Ravina 89

11 Premjit 81

View first two rows of the dataframe in pandas:

localhost:8888/notebooks/Machine Learning/Python/3.2_filtering.ipynb 2/7

2/24/2020 3.2_filtering - Jupyter Notebook

In [6]:

df[:2]

Out[6]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

1 Ajay Semester 1 Mathematics 47

In [7]:

df.head(2)

Out[7]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

1 Ajay Semester 1 Mathematics 47

View last two rows of the dataframe in pandas:

In [20]:

df[-2:]

Out[20]:

Name Exam Subject Score

10 Ravina Semester 2 Science 89

11 Premjit Semester 2 Science 81

Filter pandas dataframe by column value

Method 1 : DataFrame Way

localhost:8888/notebooks/Machine Learning/Python/3.2_filtering.ipynb 3/7

2/24/2020 3.2_filtering - Jupyter Notebook

In [21]:

# based on one condition

df1 = df[df['Score']>60]
df1

Out[21]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

3 Rakesh Semester 1 Science 74

5 Shyam Semester 1 Science 77

6 Kiran Semester 2 Mathematics 85

7 Rishi Semester 2 Mathematics 63

9 Raj Semester 2 Science 67

10 Ravina Semester 2 Science 89

11 Premjit Semester 2 Science 81

In [22]:

# based on multiple conditions

df1A = df[(df['Score']>60) & (df['Subject']=='Mathematics')]
df1B = df[(df.Score>60) & (df.Subject=='Mathematics')]
#df1A
df1B

Out[22]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

6 Kiran Semester 2 Mathematics 85

7 Rishi Semester 2 Mathematics 63

In [31]:

# Select only a few columns under some conditions

df1C = df[(df.Score>60) & (df.Subject=='Mathematics')][['Name','Score']]
df1C

Out[31]:

Name Score

0 Amarend 62

6 Kiran 85

7 Rishi 63

Method 2 : Query Function

In pandas package, there are multiple ways to perform filtering. The above code can also be written like the
code shown below. This method is elegant and more readable and you don't need to mention dataframe name
localhost:8888/notebooks/Machine Learning/Python/3.2_filtering.ipynb 4/7
2/24/2020 3.2_filtering - Jupyter Notebook

everytime when you specify columns (variables).

In [33]:

df2 = df.query('Score > 60 & Subject == "Mathematics"')

df2

Out[33]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

6 Kiran Semester 2 Mathematics 85

7 Rishi Semester 2 Mathematics 63

Method 3 : loc function

loc is an abbreviation of location term. All these 3 methods return same output. It's just a different ways of doing
filtering rows.

In [36]:

df3 = df.loc[(df.Score>60) & (df.Subject=='Mathematics')]

df3

Out[36]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

6 Kiran Semester 2 Mathematics 85

7 Rishi Semester 2 Mathematics 63

Difference between loc and iloc function

loc considers rows based on index labels. Whereas iloc considers rows based on position in the index so it only
takes integers. Let's create a sample data for illustration

localhost:8888/notebooks/Machine Learning/Python/3.2_filtering.ipynb 5/7

2/24/2020 3.2_filtering - Jupyter Notebook

In [38]:

x = pd.DataFrame({"col1" : np.arange(1,20,2)}, index=[9,8,7,6,0, 1, 2, 3, 4, 5])

Out[38]:

col1

9 1

8 3

7 5

6 7

0 9

1 11

2 13

3 15

4 17

5 19

iloc - Index Position

In [39]:

x.iloc[0:5]

Out[39]:

col1

9 1

8 3

7 5

6 7

0 9

loc - Index Label

localhost:8888/notebooks/Machine Learning/Python/3.2_filtering.ipynb 6/7

2/24/2020 3.2_filtering - Jupyter Notebook

In [40]:

x.loc[0:5]

Out[40]:

col1

0 9

1 11

2 13

3 15

4 17

5 19

Note : x.loc[0:5] returns 6 rows (inclusive of 5 which is 6th element)

It is because loc does not produce output based on index position. It considers labels of index only which can
be alphabet as well and includes both starting and end point. Refer the example below.

In [41]:

# more examples - (offline) Data Analytics - Preprocessing 4

In [3]:

df.head()

Out[3]:

Name Exam Subject Score

0 Amarend Semester 1 Mathematics 62

1 Ajay Semester 1 Mathematics 47

2 Preety Semester 1 Mathematics 55

3 Rakesh Semester 1 Science 74

4 Raju Semester 1 Science 31

In [9]:

#df.sortby('Name')

In [ ]:

localhost:8888/notebooks/Machine Learning/Python/3.2_filtering.ipynb 7/7

Pandas 3
No ratings yet
Pandas 3
33 pages
Python
No ratings yet
Python
16 pages
DataFrame 2
No ratings yet
DataFrame 2
38 pages
Python Pandas-Data Frames
No ratings yet
Python Pandas-Data Frames
41 pages
Pandas
No ratings yet
Pandas
5 pages
Data Frame Demo
No ratings yet
Data Frame Demo
73 pages
PDF&Rendition 1
No ratings yet
PDF&Rendition 1
47 pages
Pandas Practice
No ratings yet
Pandas Practice
7 pages
Dataframes-I(Create _ Selection) (1)
No ratings yet
Dataframes-I(Create _ Selection) (1)
12 pages
Dataframes-I (Create & Selection)
No ratings yet
Dataframes-I (Create & Selection)
10 pages
Pandas Dataframe1
No ratings yet
Pandas Dataframe1
43 pages
Ip Lab File Python
No ratings yet
Ip Lab File Python
9 pages
Pandas Filtering
No ratings yet
Pandas Filtering
19 pages
Practical File ANKIT RAJ CLASS 12-F
No ratings yet
Practical File ANKIT RAJ CLASS 12-F
48 pages
Python Pandas - 2 2020-21
No ratings yet
Python Pandas - 2 2020-21
21 pages
Lab Record IP
No ratings yet
Lab Record IP
13 pages
Label Indexing Wrsht-1 (With Solutions)
No ratings yet
Label Indexing Wrsht-1 (With Solutions)
6 pages
Pandas Dataframe
No ratings yet
Pandas Dataframe
8 pages
Xii Record (Dataframe & CSV)
No ratings yet
Xii Record (Dataframe & CSV)
11 pages
Dataframe in Pandas
No ratings yet
Dataframe in Pandas
23 pages
IP Record-5
No ratings yet
IP Record-5
9 pages
Python-for-Data-Analysis (Pandas
No ratings yet
Python-for-Data-Analysis (Pandas
31 pages
Answers Practical File
No ratings yet
Answers Practical File
19 pages
Pandas Basics Guide
No ratings yet
Pandas Basics Guide
4 pages
Case Base Practice Question
No ratings yet
Case Base Practice Question
7 pages
Python Data Science 101
100% (1)
Python Data Science 101
41 pages
Unit 2 notes-II
No ratings yet
Unit 2 notes-II
47 pages
Pandas & Mysql
No ratings yet
Pandas & Mysql
20 pages
Lab Session 06: Perform Following Operations Using Pandas Lab Session 06: Perform Following Operations Using Pandas
No ratings yet
Lab Session 06: Perform Following Operations Using Pandas Lab Session 06: Perform Following Operations Using Pandas
5 pages
Pandas DataFrame
No ratings yet
Pandas DataFrame
70 pages
MCQ On Dataframe
No ratings yet
MCQ On Dataframe
11 pages
Pandas, Numpy, Matplotlib
No ratings yet
Pandas, Numpy, Matplotlib
11 pages
Ip Study
No ratings yet
Ip Study
18 pages
List of Practical Ip065 Xii Session 2025 CKC Academy
No ratings yet
List of Practical Ip065 Xii Session 2025 CKC Academy
19 pages
IP Imp Notes
No ratings yet
IP Imp Notes
5 pages
ICT2103 Full Book-Part-3
No ratings yet
ICT2103 Full Book-Part-3
14 pages
Revision Point - Dataframe
No ratings yet
Revision Point - Dataframe
11 pages
DataFrame Ac Win Final
No ratings yet
DataFrame Ac Win Final
30 pages
IP Practical File Project
No ratings yet
IP Practical File Project
60 pages
Practical File IP
No ratings yet
Practical File IP
27 pages
Chapter 2 Data Handling Using Pandas - I (DATA FRAME)
No ratings yet
Chapter 2 Data Handling Using Pandas - I (DATA FRAME)
15 pages
Dataframe
No ratings yet
Dataframe
2 pages
Loc Iloc at Dataframe
No ratings yet
Loc Iloc at Dataframe
9 pages
Creation of Series Using List, Dictionary & Ndarray
No ratings yet
Creation of Series Using List, Dictionary & Ndarray
65 pages
PYTHON PROGRAMMING: Data Handling
No ratings yet
PYTHON PROGRAMMING: Data Handling
12 pages
Python For Data Science 1662157639
No ratings yet
Python For Data Science 1662157639
6 pages
Lab Session 06: Perform Following Operations Using Pandas
No ratings yet
Lab Session 06: Perform Following Operations Using Pandas
5 pages
Iteration
No ratings yet
Iteration
40 pages
Pandas 2 Complete Notes Class XII
No ratings yet
Pandas 2 Complete Notes Class XII
18 pages
Data Frame
No ratings yet
Data Frame
17 pages
PANDAS Python
No ratings yet
PANDAS Python
2 pages
Data Frame Notes1
No ratings yet
Data Frame Notes1
7 pages
Numpy Boolean Indexing: Filter
No ratings yet
Numpy Boolean Indexing: Filter
39 pages
Practice Questions (Unsolved)
No ratings yet
Practice Questions (Unsolved)
8 pages
CSC - 310 Advanced Python Programming Continuous Assessment-2 Assignment:Ca2
No ratings yet
CSC - 310 Advanced Python Programming Continuous Assessment-2 Assignment:Ca2
33 pages
Any One of The Following Will Modify The Value
No ratings yet
Any One of The Following Will Modify The Value
2 pages
12 Pandas
100% (1)
12 Pandas
21 pages
Ip Practical File
No ratings yet
Ip Practical File
20 pages
Pandas
No ratings yet
Pandas
26 pages
Neo4j Graph Data Science Certified - Exam Practice Tests
From Everand
Neo4j Graph Data Science Certified - Exam Practice Tests
Cristian Scutaru
No ratings yet
4th ETE
No ratings yet
4th ETE
4 pages
4th ME
No ratings yet
4th ME
4 pages
Mobile Communication Systems: Part II-Part II
No ratings yet
Mobile Communication Systems: Part II-Part II
79 pages
Large Language Models For Propaganda Detection: University of Zurich University of Zurich University of Zurich
No ratings yet
Large Language Models For Propaganda Detection: University of Zurich University of Zurich University of Zurich
7 pages
Pollution and Congestion in Urban Areas
No ratings yet
Pollution and Congestion in Urban Areas
19 pages
Chi-Square Test
No ratings yet
Chi-Square Test
6 pages
Group - by Python Code
No ratings yet
Group - by Python Code
11 pages
Covariance Matrix
No ratings yet
Covariance Matrix
6 pages
Merge Append Python Code
No ratings yet
Merge Append Python Code
5 pages
Hands On Practical Examples On Sequential Feature Selection in Python
No ratings yet
Hands On Practical Examples On Sequential Feature Selection in Python
26 pages
Juniji-Hogo by Zen Master Daichi Sokei Zenji
No ratings yet
Juniji-Hogo by Zen Master Daichi Sokei Zenji
3 pages
Librarian S Guide To Online Searching Cultivating Database Skills For Research and Instruction 4th Edition Suzanne S. Bell
No ratings yet
Librarian S Guide To Online Searching Cultivating Database Skills For Research and Instruction 4th Edition Suzanne S. Bell
47 pages
XXX Ref E-BOT Brochure
No ratings yet
XXX Ref E-BOT Brochure
8 pages
Nirma University: ? !,'' XTLT"
No ratings yet
Nirma University: ? !,'' XTLT"
3 pages
Fidelity Bond Forms
No ratings yet
Fidelity Bond Forms
28 pages
Cetprospectus 2025
No ratings yet
Cetprospectus 2025
56 pages
A Deep Learning Approach To The Geometry Friends Game (Artículo)
No ratings yet
A Deep Learning Approach To The Geometry Friends Game (Artículo)
10 pages
(PDF) The Elusive Definition of Knowledge
0% (1)
(PDF) The Elusive Definition of Knowledge
13 pages
Guidelines For The Post of Assistant Professor 14E2023
No ratings yet
Guidelines For The Post of Assistant Professor 14E2023
2 pages
Arithmatic Circuit
No ratings yet
Arithmatic Circuit
7 pages
Lesson 1&2
No ratings yet
Lesson 1&2
6 pages
MUET Weekly LESSON PLAN SEM 2
100% (2)
MUET Weekly LESSON PLAN SEM 2
7 pages
MAPEH 10 Exam
100% (2)
MAPEH 10 Exam
5 pages
What Is Anthropology 2nd Edition Thomas Hylland Eriksen Download
No ratings yet
What Is Anthropology 2nd Edition Thomas Hylland Eriksen Download
48 pages
Ucsp Answer Sheet
No ratings yet
Ucsp Answer Sheet
16 pages
Alumni: in Their Own Words: Admissions Essays That Worked
No ratings yet
Alumni: in Their Own Words: Admissions Essays That Worked
10 pages
Statement of Marks: Examination Held In: June: 2023 Seat No. Name
No ratings yet
Statement of Marks: Examination Held In: June: 2023 Seat No. Name
1 page
Newsflash December 2012 FINAL
No ratings yet
Newsflash December 2012 FINAL
60 pages
KrupaCon 2018 - Bengaluru
No ratings yet
KrupaCon 2018 - Bengaluru
8 pages
ITIL Practitioner 160317
No ratings yet
ITIL Practitioner 160317
26 pages
R-01-POL-PC Policy On Registration in Professional Categories
No ratings yet
R-01-POL-PC Policy On Registration in Professional Categories
31 pages
Hsgraduation6 28
No ratings yet
Hsgraduation6 28
4 pages
Theories and Models in Social Marketing Social Marketing - Lecture 3
100% (1)
Theories and Models in Social Marketing Social Marketing - Lecture 3
53 pages
Integumentary Physical Therapy
No ratings yet
Integumentary Physical Therapy
3 pages
INVITATION For Speaker
No ratings yet
INVITATION For Speaker
3 pages
The Influence of Using English Song Toward Students' Pronunciation Mastery at The Seventh Grade of SMPN 6 Kota Serang
No ratings yet
The Influence of Using English Song Toward Students' Pronunciation Mastery at The Seventh Grade of SMPN 6 Kota Serang
162 pages
Background in A Research Paper
No ratings yet
Background in A Research Paper
2 pages
New Developments in The Bioarchaeology of Care Further Case Studies and Expanded Theory Complete Digital Book
100% (8)
New Developments in The Bioarchaeology of Care Further Case Studies and Expanded Theory Complete Digital Book
14 pages
MD Outline 2016-2017
No ratings yet
MD Outline 2016-2017
16 pages
The Monogamy Gap Men Love and the Reality of Cheating 1st Edition Eric Anderson Full Chapters Included
No ratings yet
The Monogamy Gap Men Love and the Reality of Cheating 1st Edition Eric Anderson Full Chapters Included
129 pages