0% found this document useful (0 votes)

48 views

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

The document describes various operations performed on a pandas DataFrame containing bird observation data. The DataFrame is created from a dictionary of data and list of index labels. Summary statistics are displayed and various data selections, filters, and transformations are applied, such as calculating group means, sorting, and replacing values.

Uploaded by

Abhishek Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

Uploaded by

Abhishek Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

pandas_basics_practice

April 7, 2021

Consider the following Python dictionary data and Python list labels:
data = {‘birds’: [‘Cranes’, ‘Cranes’, ‘plovers’, ‘spoonbills’, ‘spoonbills’, ‘Cranes’, ‘plovers’, ‘Cranes’,
‘spoonbills’, ‘spoonbills’], ‘age’: [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4], ‘visits’: [2, 4, 3, 4, 3, 4,
2, 2, 3, 2], ‘priority’: [‘yes’, ‘yes’, ‘no’, ‘yes’, ‘no’, ‘no’, ‘no’, ‘yes’, ‘no’, ‘no’]}
labels = [‘a’, ‘b’, ‘c’, ‘d’, ‘e’, ‘f’, ‘g’, ‘h’, ‘i’, ‘j’]
1. Create a DataFrame birds from this dictionary data which has the index labels.

[1]: import pandas as pd

import numpy as np

data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',␣

,→'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],

'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],

'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no',␣
,→'no']}

labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

df=pd.DataFrame(data, index=labels)
df

[1]: birds age visits priority

a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

2. Display a summary of the basic information about birds DataFrame and its data.

[114]: df.columns

1
[114]: Index(['birds', 'age', 'visits', 'priority'], dtype='object')

[115]: df.describe()

[115]: age visits

count 8.000000 10.000000
mean 4.437500 2.900000
std 2.007797 0.875595
min 1.500000 2.000000
25% 3.375000 2.000000
50% 4.000000 3.000000
75% 5.625000 3.750000
max 8.000000 4.000000

3. Print the first 2 rows of the birds dataframe

[43]: df.iloc[[0,1]]

[43]: birds age visits priority

labels
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes

4. Print all the rows with only ‘birds’ and ‘age’ columns from the dataframe

[52]: df[['birds','age']]

[52]: birds age

labels
a Cranes 3.5
b Cranes 4.0
c plovers 1.5
d spoonbills NaN
e spoonbills 6.0
f Cranes 3.0
g plovers 5.5
h Cranes NaN
i spoonbills 8.0
j spoonbills 4.0

5. select [2, 3, 7] rows and in columns [‘birds’, ‘age’, ‘visits’]

[60]: df.loc[['b','c','g'],['birds','age','visits']]

[60]: birds age visits

labels
b Cranes 4.0 4
c plovers 1.5 3
g plovers 5.5 2

2
6. select the rows where the number of visits is less than 4

[59]: filt=df['visits']<4
df[filt]

[59]: birds age visits priority

labels
a Cranes 3.5 2 yes
c plovers 1.5 3 no
e spoonbills 6.0 3 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

7. select the rows with columns [‘birds’, ‘visits’] where the age is missing i.e NaN

[2]: filt= df['age'].isnull()

df[filt][['birds','visits']]

[2]: birds visits

d spoonbills 4
h Cranes 2

8. Select the rows where the birds is a Cranes and the age is less than 4

[68]: filt=(df['birds']=='Cranes') & (df['age']<4)

df[filt]

[68]: birds age visits priority

labels
a Cranes 3.5 2 yes
f Cranes 3.0 4 no

9. Select the rows the age is between 2 and 4(inclusive)

[70]: filt=(df['age']>=2) & (df['age']<=4)

df[filt]

[70]: birds age visits priority

labels
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
f Cranes 3.0 4 no
j spoonbills 4.0 2 no

10. Find the total number of visits of the bird Cranes

[71]: df[df['birds']=='Cranes']['visits'].sum()

3
[71]: 12

11. Calculate the mean age for each different birds in dataframe.

[76]: birds_grp=df.groupby('birds')
birds_grp['age'].mean()

[76]: birds
Cranes 3.5
plovers 3.5
spoonbills 6.0
Name: age, dtype: float64

12. Append a new row ‘k’ to dataframe with your choice of values for each column.
Then delete that row to return the original DataFrame.

[106]: df.loc['k']=['Sparrow',3,4,'yes']
df

[106]: birds age visits priority

labels
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no
k Sparrow 3.0 4 yes

[111]: df=df.drop('k')
df

[111]: birds age visits priority

4
13. Find the number of each type of birds in dataframe (Counts)

[73]: df['birds'].value_counts()

[73]: spoonbills 4
Cranes 4
plovers 2
Name: birds, dtype: int64

14. Sort dataframe (birds) first by the values in the ‘age’ in decending order, then by
the value in the ‘visits’ column in ascending order.

[116]: df.sort_values(by=['age','visits'],ascending=[False,True])

[116]: birds age visits priority

i spoonbills 8.0 3 no
e spoonbills 6.0 3 no
g plovers 5.5 2 no
j spoonbills 4.0 2 no
b Cranes 4.0 4 yes
a Cranes 3.5 2 yes
f Cranes 3.0 4 no
c plovers 1.5 3 no
h Cranes NaN 2 yes
d spoonbills NaN 4 yes

15. Replace the priority column values with’yes’ should be 1 and ‘no’ should be 0

[101]: def replace_priority(x):

if x=='yes':
return 1
else:
return 0
df['priority'].apply(replace_priority)
df

[101]: birds age visits priority

labels
a trumpeters 3.5 2 1
b trumpeters 4.0 4 1
c plovers 1.5 3 0
d spoonbills NaN 4 1
e spoonbills 6.0 3 0
f trumpeters 3.0 4 0
g plovers 5.5 2 0
h trumpeters NaN 2 1
i spoonbills 8.0 3 0
j spoonbills 4.0 2 0

16. In the ‘birds’ column, change the ‘Cranes’ entries to ‘trumpeters’.

5
[91]: df['birds']=df['birds'].replace({'Cranes':'trumpeters'})
df

[91]: birds age visits priority

labels
a trumpeters 3.5 2 yes
b trumpeters 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f trumpeters 3.0 4 no
g plovers 5.5 2 no
h trumpeters NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

[ ]:

All chapters of Solutions Manual for Systems Analysis and Design, 9/E 9th Edition Kenneth E. Kendall, Julie E. Kendall are available for quick PDF download
100% (7)
All chapters of Solutions Manual for Systems Analysis and Design, 9/E 9th Edition Kenneth E. Kendall, Julie E. Kendall are available for quick PDF download
41 pages
Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
Introduction To SQL Test Your Understanding
100% (1)
Introduction To SQL Test Your Understanding
71 pages
Databricks Questions
No ratings yet
Databricks Questions
23 pages
Iti Pdfs
No ratings yet
Iti Pdfs
10 pages
SQL Vs PySpark 1678871778
No ratings yet
SQL Vs PySpark 1678871778
8 pages
ALX SE Guide - November - 2022
No ratings yet
ALX SE Guide - November - 2022
31 pages
Contoso Sample DAX Formulas
100% (1)
Contoso Sample DAX Formulas
19 pages
Introduction To Data Visualization With Seaborn Chapter3
100% (1)
Introduction To Data Visualization With Seaborn Chapter3
32 pages
MySQL Cheatsheet - CodeWithHarry
100% (1)
MySQL Cheatsheet - CodeWithHarry
13 pages
COS4840 Oncology Assignment1
No ratings yet
COS4840 Oncology Assignment1
5 pages
Solutions To Pandas Basic Questions
No ratings yet
Solutions To Pandas Basic Questions
1 page
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
No ratings yet
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
8 pages
Mastering SQL Window Functions - 01
No ratings yet
Mastering SQL Window Functions - 01
39 pages
100 SQL Formulas Each Student Should Know
No ratings yet
100 SQL Formulas Each Student Should Know
10 pages
T-GCPBDML-B - M2 - Data Engineering For Streaming Data - ILT Slides
No ratings yet
T-GCPBDML-B - M2 - Data Engineering For Streaming Data - ILT Slides
71 pages
EDA with Pandas
No ratings yet
EDA with Pandas
8 pages
SQL Syntax
No ratings yet
SQL Syntax
321 pages
DAX CheetSheat
No ratings yet
DAX CheetSheat
20 pages
Power BI Themes Samples
No ratings yet
Power BI Themes Samples
11 pages
Rank, Dense Rank
100% (1)
Rank, Dense Rank
3 pages
Pandas: Import
100% (1)
Pandas: Import
13 pages
Ch-2 Panda: #Import The Pandas Library and Aliasing As PD
No ratings yet
Ch-2 Panda: #Import The Pandas Library and Aliasing As PD
5 pages
Pandas in Python 16sept2022
No ratings yet
Pandas in Python 16sept2022
8 pages
Django - Overview: MVC Pattern
No ratings yet
Django - Overview: MVC Pattern
3 pages
Brainalyst's SQL Interview Guide
No ratings yet
Brainalyst's SQL Interview Guide
112 pages
Power BI Cheat Sheet
No ratings yet
Power BI Cheat Sheet
10 pages
Python Pandas Cheatsheety
No ratings yet
Python Pandas Cheatsheety
7 pages
Snowflake Demo
No ratings yet
Snowflake Demo
13 pages
Dax 1
No ratings yet
Dax 1
27 pages
6 Different Ways To Compensate For Missing Values in A Dataset
No ratings yet
6 Different Ways To Compensate For Missing Values in A Dataset
6 pages
Pandas Complete Notes
No ratings yet
Pandas Complete Notes
105 pages
Choice of Charts Power BI
No ratings yet
Choice of Charts Power BI
14 pages
Lecture 7 - Using Subqueries To Solve Queries
No ratings yet
Lecture 7 - Using Subqueries To Solve Queries
19 pages
PYTHON notes by devaraj
100% (1)
PYTHON notes by devaraj
40 pages
4 - Power BI - Query Editor - Text Transformation
100% (1)
4 - Power BI - Query Editor - Text Transformation
88 pages
Select Joins: SQL Cheat Sheet
100% (1)
Select Joins: SQL Cheat Sheet
3 pages
Alteryx Topic
No ratings yet
Alteryx Topic
2 pages
Elite SQL Queries For Practice PDF
0% (1)
Elite SQL Queries For Practice PDF
20 pages
Appendix B DAX Reference
100% (1)
Appendix B DAX Reference
174 pages
Informatica Course Content
No ratings yet
Informatica Course Content
5 pages
Power BI
No ratings yet
Power BI
47 pages
SQL - Basics
No ratings yet
SQL - Basics
25 pages
Power BI Question Bank
No ratings yet
Power BI Question Bank
40 pages
Subqueries
No ratings yet
Subqueries
22 pages
Class XII Data Handlinng Using PandasI
No ratings yet
Class XII Data Handlinng Using PandasI
46 pages
Window Function in Pyspark
100% (1)
Window Function in Pyspark
8 pages
Tableau Tutorial
No ratings yet
Tableau Tutorial
65 pages
SQL Functions
100% (1)
SQL Functions
16 pages
Spark DataFrames Project Exercise - Jupyter Notebook
No ratings yet
Spark DataFrames Project Exercise - Jupyter Notebook
7 pages
PL-300
No ratings yet
PL-300
13 pages
Power BI
No ratings yet
Power BI
62 pages
SQL Queries and PL/SQL
No ratings yet
SQL Queries and PL/SQL
92 pages
SQL Interview Questions and Answers G
No ratings yet
SQL Interview Questions and Answers G
67 pages
SQL Subquery
100% (1)
SQL Subquery
57 pages
Tableau Notes: (Dependent Variables) Role. The Field's Data Type Defines If The Field Is, For Example, A
No ratings yet
Tableau Notes: (Dependent Variables) Role. The Field's Data Type Defines If The Field Is, For Example, A
6 pages
SCD Type-1,2 Implementation in Pyspark
No ratings yet
SCD Type-1,2 Implementation in Pyspark
6 pages
Day65 - Day70 Power BI Interview
No ratings yet
Day65 - Day70 Power BI Interview
31 pages
PL 300
No ratings yet
PL 300
13 pages
Get Data With Power BI Desktop: Angeles University Foundation College of Computer Studies
No ratings yet
Get Data With Power BI Desktop: Angeles University Foundation College of Computer Studies
35 pages
Mongodb Cheat Sheet
No ratings yet
Mongodb Cheat Sheet
10 pages
HBase Administration Cookbook
From Everand
HBase Administration Cookbook
Yifeng Jiang
No ratings yet
Instant Pentaho Data Integration Kitchen
From Everand
Instant Pentaho Data Integration Kitchen
Sergio Ramazzina
No ratings yet
Presentation On Android OS
No ratings yet
Presentation On Android OS
25 pages
PYTHON QBANKf
No ratings yet
PYTHON QBANKf
3 pages
Hytera SmartDispatch-Net Installation Guide V4.0
No ratings yet
Hytera SmartDispatch-Net Installation Guide V4.0
79 pages
MCQ 2
No ratings yet
MCQ 2
15 pages
19.Project-Online Resume Builder
100% (2)
19.Project-Online Resume Builder
14 pages
Test Your Understanding - Constructor (Copy) - Attempt Review
100% (1)
Test Your Understanding - Constructor (Copy) - Attempt Review
7 pages
Quarter 2: Week 1-2 Module 1-2: Common Competencies
100% (1)
Quarter 2: Week 1-2 Module 1-2: Common Competencies
14 pages
3 Assessment For SVJC
No ratings yet
3 Assessment For SVJC
5 pages
Hardhat Beginners To Advanced Guides
No ratings yet
Hardhat Beginners To Advanced Guides
212 pages
Quiz Section4 Java Fundamental
75% (4)
Quiz Section4 Java Fundamental
2 pages
CPE 202 Lecture Notes
No ratings yet
CPE 202 Lecture Notes
41 pages
Oracle9I (9.2.0.4.0) Installation On Redhat Advanced Server 3.0 Linux
No ratings yet
Oracle9I (9.2.0.4.0) Installation On Redhat Advanced Server 3.0 Linux
5 pages
DDD
No ratings yet
DDD
45 pages
Vaish Reliance
No ratings yet
Vaish Reliance
1 page
Jersey Hk2 Layers Basic Example Using JDBC - Part-1: Mysql DB Table
No ratings yet
Jersey Hk2 Layers Basic Example Using JDBC - Part-1: Mysql DB Table
8 pages
Oracle: Primavera P6 EPPM BPM Configuration Guide For On-Premises
No ratings yet
Oracle: Primavera P6 EPPM BPM Configuration Guide For On-Premises
24 pages
2023 - MP02 Reliability Management
100% (1)
2023 - MP02 Reliability Management
88 pages
Multi User Collaboration With Autodesk Revit Worksharing
100% (1)
Multi User Collaboration With Autodesk Revit Worksharing
11 pages
CS8651 QB Internet Programming PDF
No ratings yet
CS8651 QB Internet Programming PDF
10 pages
(Ebook) XML - XSLT and Xpath - A Guide To XML Transformations - Prentice Hall - 2001
No ratings yet
(Ebook) XML - XSLT and Xpath - A Guide To XML Transformations - Prentice Hall - 2001
341 pages
Chapter 11 Solution
No ratings yet
Chapter 11 Solution
6 pages
EXP - 4 - To - 6CD - Lab Manual - ODD - 2024 - Removed
No ratings yet
EXP - 4 - To - 6CD - Lab Manual - ODD - 2024 - Removed
16 pages
WinCC V7.3_ Working with WinCC - Managing WinCC Projects and Objects in the SIMATIC Manager
No ratings yet
WinCC V7.3_ Working with WinCC - Managing WinCC Projects and Objects in the SIMATIC Manager
4 pages
PHP - Sessions: Starting A PHP Session
No ratings yet
PHP - Sessions: Starting A PHP Session
4 pages
Resume As Executive
No ratings yet
Resume As Executive
3 pages
3250817_E_20241219
No ratings yet
3250817_E_20241219
6 pages
Concepts
No ratings yet
Concepts
500 pages

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

Uploaded by

Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels

Uploaded by

pandas_basics_practice

[1]: import pandas as pd

data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',␣

'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],

[1]: birds age visits priority

[115]: age visits

3. Print the first 2 rows of the birds dataframe

[43]: birds age visits priority

[52]: birds age

5. select [2, 3, 7] rows and in columns [‘birds’, ‘age’, ‘visits’]

[60]: birds age visits

[59]: birds age visits priority

[2]: filt= df['age'].isnull()

[2]: birds visits

[68]: filt=(df['birds']=='Cranes') & (df['age']<4)

[68]: birds age visits priority

9. Select the rows the age is between 2 and 4(inclusive)

[70]: filt=(df['age']>=2) & (df['age']<=4)

[70]: birds age visits priority

10. Find the total number of visits of the bird Cranes

[106]: birds age visits priority

[111]: birds age visits priority

[116]: birds age visits priority

[101]: def replace_priority(x):

[101]: birds age visits priority

16. In the ‘birds’ column, change the ‘Cranes’ entries to ‘trumpeters’.

[91]: birds age visits priority

You might also like