Pandas - Basics - Practice - Assignment 2 - PDF

The document discusses using pandas to analyze a dictionary of bird data. It includes creating a DataFrame from the data, selecting subsets of rows and columns, calculating summary statistics, and finding total visits by bird type. A series of code cells demonstrate different pandas operations on the bird data.

Uploaded by

Ashutosh Kushwaha


9/14/22, 11:02 PM pandas_basics_practice_assignment 2

Consider the following Python dictionary data and Python list labels:

data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
                  'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
        'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
        'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
        'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']}

labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

1. Create a DataFrame birds from the dictionary data, using labels as the index.

        birds  age  visits priority
a      Cranes  3.5       2      yes
b      Cranes  4.0       4      yes
c     plovers  1.5       3       no
d  spoonbills  NaN       4      yes
e  spoonbills  6.0       3       no
f      Cranes  3.0       4       no
g     plovers  5.5       2       no
h      Cranes  NaN       2      yes
i  spoonbills  8.0       3       no
j  spoonbills  4.0       2       no
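
The cell that builds the frame is not shown next to question 1. A minimal sketch of how it could look, assuming the usual `pd` and `np` aliases and the variable name `df` that the later cells use:

```python
import numpy as np
import pandas as pd

data = {
    'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
              'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
    'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
    'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
    'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no'],
}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

# Pass the labels list as the row index instead of the default 0..9.
df = pd.DataFrame(data, index=labels)
print(df)
```

Passing `index=labels` at construction time avoids a separate step to relabel the rows afterwards.
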
2. Display a summary of the basic information about birds DataFrame and its data.

In [ ]:
df.info()

<class 'pandas.core.frame.DataFrame'>
Index: 10 entries, a to j
Data columns (total 4 columns):
 #   Column    Non-Null Count  Dtype
---  ------    --------------  -----
 0   birds     10 non-null     object
 1   age       8 non-null      float64
 2   visits    10 non-null     int64
 3   priority  10 non-null     object
dtypes: float64(1), int64(1), object(2)
memory usage: 400.0+ bytes
3. Print the first 2 rows of the birds dataframe

In [ ]:
df.iloc[:2]

Out[ ]: birds age visits priority

a Cranes 3.5 2 yes

b Cranes 4.0 4 yes

4. Print all the rows with only 'birds' and 'age' columns from the dataframe


birds age visits priority

a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no
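
The output above still lists all four columns. A sketch of one way to keep only 'birds' and 'age', assuming the frame is built from the data and labels given at the top (rebuilt here so the block runs on its own; the name `subset` is only illustrative):

```python
import numpy as np
import pandas as pd

data = {
    'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
              'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
    'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
    'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
    'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no'],
}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
df = pd.DataFrame(data, index=labels)

# A list of column names inside [] returns a new DataFrame with just those columns.
subset = df[['birds', 'age']]
print(subset)
```

`df.loc[:, ['birds', 'age']]` should give the same result if the label-based form is preferred.
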
5. Select the rows at positions [2, 3, 7] and the columns ['birds', 'age', 'visits']

In [ ]:
#df.loc[['c','d','h'],['birds','age','visits']]
df.iloc[[2,3,7],[0,1,2]]

Out[ ]: birds age visits

c plovers 1.5 3

d spoonbills NaN 4

h Cranes NaN 2

6. Select the rows where the number of visits is less than 4

In [ ]:
df[df['visits']<4]

Out[ ]: birds age visits priority

a Cranes 3.5 2 yes

c plovers 1.5 3 no

e spoonbills 6.0 3 no

g plovers 5.5 2 no

h Cranes NaN 2 yes

i spoonbills 8.0 3 no

j spoonbills 4.0 2 no

7. Select the columns ['birds', 'visits'] for the rows where the age is missing, i.e. NaN


In [ ]:
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
df = pd.DataFrame(data, index = labels)
#print(df)
# Keep the rows whose age is NaN, then only the 'birds' and 'visits' columns.
df[df['age'].isna()].loc[:,['birds', 'visits']]

Out[ ]: birds visits

d spoonbills 4

h Cranes 2

8. Select the rows where the bird is 'Cranes' and the age is less than 4

In [ ]:
#df[df['birds'] == 'Cranes'].loc[df['age']<4]
df.loc[(df['birds']== 'Cranes') & (df['age']<4)]

Out[ ]: birds age visits priority

a Cranes 3.5 2 yes

f Cranes 3.0 4 no

9. Select the rows where the age is between 2 and 4 (inclusive)

In [ ]:
# >= and <= make both ends of the range inclusive.
df.loc[(df['age']>=2) & (df['age']<=4)]

Out[ ]: birds age visits priority

a Cranes 3.5 2 yes

b Cranes 4.0 4 yes

f Cranes 3.0 4 no

j spoonbills 4.0 2 no
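
The same inclusive range can also be written with `Series.between`, which includes both endpoints by default. A small sketch using only the 'age' column from the data above (the name `in_range` is illustrative):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(
    {'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4]},
    index=list('abcdefghij'),
)

# between(2, 4) is True for 2 <= age <= 4; NaN ages evaluate to False and drop out.
in_range = df.loc[df['age'].between(2, 4)]
print(in_range)
```
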

10. Find the total number of visits of the bird Cranes

In [ ]:
bc = df[df['birds'] == 'Cranes'].loc[:,['visits']]
print(bc)
print(bc.sum(axis=0))

visits
a 2
b 4
f 4
h 2
visits 12
dtype: int64
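
If a single number is wanted rather than a one-element Series, one option is to select the 'visits' column as a Series before summing. A sketch using just the two columns involved (the name `total` is illustrative):

```python
import pandas as pd

df = pd.DataFrame(
    {
        'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
                  'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
        'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
    },
    index=list('abcdefghij'),
)

# Row mask and column label in one .loc call give a Series, so .sum() is a scalar.
total = df.loc[df['birds'] == 'Cranes', 'visits'].sum()
print(total)  # 12
```
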
11. Calculate the mean age for each type of bird in the DataFrame.

In [ ]:
import pandas as pd
import numpy as np
data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
                  'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
        'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
        'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
        'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
df = pd.DataFrame(data, index = labels)
# Group the rows by bird type.
gb = df.groupby('birds')
# numeric_only=True skips the text column 'priority', which cannot be averaged.
print(gb.mean(numeric_only=True))

age visits
birds
Cranes 3.5 3.0
plovers 3.5 2.5
spoonbills 6.0 3.0
12. Append a new row 'k' to dataframe with your choice of values for each column. Then
delete that row to return the original DataFrame.

In [ ]:
import pandas as pd
import numpy as np
data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
                  'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
        'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
        'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
        'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
df = pd.DataFrame(data, index = labels)
# Build a one-row DataFrame labelled 'k' and append it with concat.
data1 = {'birds': ['seagul'], 'age' : [5], 'visits' : [5], 'priority' : ['yes']}
labels1 = ['k']
df1 = pd.DataFrame(data1, index= labels1)
df2 = pd.concat([df,df1])
print(df2)
print("\n***********************\n")
# Dropping the label 'k' gives back the original ten rows.
df3 = df2.drop(['k'], axis=0)
print(df3)

birds age visits priority

***********************

birds age visits priority

a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes

e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no
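
To confirm that dropping 'k' really restores the original frame, the two can be compared with `DataFrame.equals`. A sketch under the same data; the new row's values and the names `extra`, `with_k` and `restored` are illustrative choices, and the age is written as a float so the column dtype is unchanged by the round trip:

```python
import numpy as np
import pandas as pd

data = {
    'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
              'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
    'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
    'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
    'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no'],
}
df = pd.DataFrame(data, index=list('abcdefghij'))

# One extra row labelled 'k' (values chosen arbitrarily for the exercise).
extra = pd.DataFrame(
    {'birds': ['seagull'], 'age': [5.0], 'visits': [5], 'priority': ['yes']},
    index=['k'],
)
with_k = pd.concat([df, extra])

# Remove the row again and compare with the starting frame.
restored = with_k.drop(index='k')
print(restored.equals(df))
```

`equals` treats NaN values in matching positions as equal, so the two missing ages do not make the comparison fail.
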
13. Find the number of birds of each type in the DataFrame (counts)

In [ ]:
import pandas as pd
import numpy as np
data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
                  'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
        'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
        'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
        'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
df = pd.DataFrame(data, index = labels)
# size() counts the rows in each group, including rows whose age is NaN.
gb = df.groupby('birds').size()
print(gb)

birds
Cranes 4
plovers 2
spoonbills 4
dtype: int64
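
An alternative that avoids the explicit groupby is `Series.value_counts`. A sketch on the 'birds' column alone (the name `counts` is illustrative):

```python
import pandas as pd

birds = pd.Series(
    ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
     'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
    index=list('abcdefghij'),
    name='birds',
)

# value_counts tallies each distinct value, most frequent first.
counts = birds.value_counts()
print(counts)
```

Unlike `groupby('birds').size()`, which orders the result by group label, `value_counts` orders it by frequency.
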
14. Sort the DataFrame (birds) first by the values in the 'age' column in descending order, then by the
values in the 'visits' column in ascending order.

In [ ]:
import pandas as pd
import numpy as np
data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills',
                  'Cranes', 'plovers', 'Cranes', 'spoonbills', 'spoonbills'],
        'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4],
        'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2],
        'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
df = pd.DataFrame(data, index = labels)
# Sort by age (largest first), breaking ties by visits (smallest first); NaN ages go last.
df.sort_values(by=['age','visits'], ascending=[False,True],inplace=True)
df

Out[ ]: birds age visits priority

i spoonbills 8.0 3 no

e spoonbills 6.0 3 no

g plovers 5.5 2 no

j spoonbills 4.0 2 no

b Cranes 4.0 4 yes

a Cranes 3.5 2 yes

f Cranes 3.0 4 no

c plovers 1.5 3 no

h Cranes NaN 2 yes

d spoonbills NaN 4 yes

15. Replace the priority column values: 'yes' should become 1 and 'no' should become 0

Out[ ]: birds age visits priority

a Cranes 3.5 2 1

b Cranes 4.0 4 1

c plovers 1.5 3 0

d spoonbills NaN 4 1

e spoonbills 6.0 3 0

f Cranes 3.0 4 0

g plovers 5.5 2 0

h Cranes NaN 2 1

i spoonbills 8.0 3 0

j spoonbills 4.0 2 0
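
The cell that produced this output is not shown. One way to get it, sketched here with `Series.map` and a small lookup dictionary (the dictionary name `mapping` is illustrative):

```python
import pandas as pd

df = pd.DataFrame(
    {'priority': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']},
    index=list('abcdefghij'),
)

# map looks each value up in the dictionary; anything not listed would become NaN.
mapping = {'yes': 1, 'no': 0}
df['priority'] = df['priority'].map(mapping)
print(df)
```

`df['priority'].replace(mapping)` is a close alternative that leaves unlisted values untouched instead of turning them into NaN.
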

16. In the 'birds' column, change the 'Cranes' entries to 'trumpeters'.

In [ ]:
df.replace(to_replace=['Cranes'],value=['trumpeters'],inplace=True)
df

Out[ ]: birds age visits priority

a trumpeters 3.5 2 1

b trumpeters 4.0 4 1

c plovers 1.5 3 0

d spoonbills NaN 4 1

e spoonbills 6.0 3 0

f trumpeters 3.0 4 0

g plovers 5.5 2 0

h trumpeters NaN 2 1

i spoonbills 8.0 3 0

j spoonbills 4.0 2 0
