0% found this document useful (0 votes)

136 views6 pages

Python Pandas Handson

The document provides an overview of Python Pandas HandsOn exercises covering topics such as: 1. Pandas data structures including Series, DataFrames, and generating random data. 2. Accessing Pandas data including selecting rows and columns from DataFrames. 3. Working with CSV files including reading from and writing DataFrames to CSV. 4. Indexing DataFrames including datetime indexing and multi-indexing. 5. Data cleaning techniques like dropping null values. 6. Data aggregation including filtering, grouping, and computing statistics. 7. Merging DataFrames by appending rows and merging on indexes.

Uploaded by

mohamed yasin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

136 views6 pages

Python Pandas Handson

Uploaded by

mohamed yasin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Join our channel if you haven’t joined yet https://t.

me/fresco_milestone ( @fresco_milestone )

Python Pandas HandsOns

1. Pandas Data Structures

import pandas as pd
import numpy as np

heights_A= pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']
print(heights_A.shape)

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']
print(weights_A.dtypes)

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A
print(df_A.shape)

my_mean = 170.0
my_std = 25.0
np.random.seed(100)
heights_B= pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
heights_B.index = ['s1', 's2', 's3', 's4','s5']

my_mean1 = 75.0
my_std1 = 12.0
weights_B =pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
weights_B.index = ['s1', 's2', 's3', 's4','s5']
print(heights_B.mean())

df_B = pd.DataFrame()
df_B['Student_height'] = heights_B
df_B['Student_weight'] = weights_B
print(df_B.columns.values.tolist() )
2. Accessing Pandas Data Structures

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']
print(heights_A[1])
print(heights_A[[1,2,3]])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

df_A['Student_weight'] = weights_A

height = df_A['Student_height']
print(type(height))

df_s1s2 = df_A[df_A.index.isin(['s1','s2'])]
print(df_s1s2)

df_s2s5s1 = df_A[df_A.index.isin(['s1','s2','s5'])]
df_s2s5s1 = df_s2s5s1.reindex(['s2', 's5', 's1'])
print(df_s2s5s1)

df_s1s4 = df_A[df_A.index.isin(['s1','s4'])]
print(df_s1s4)

3. Working with CSV files

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A.to_csv('classA.csv')

df_A2 = pd.read_csv('classA.csv')
print(df_A2)

df_A3 = pd.read_csv('classA.csv',index_col='Unnamed: 0')

print(df_A3)

my_mean = 170.0
my_std = 25.0
np.random.seed(100)
heights_B = pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
heights_B.index = ['s1', 's2', 's3', 's4','s5']
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

my_mean1 = 75.0
my_std1 = 12.0
np.random.seed(100)
weights_B = pd.Series(np.random.normal(loc=my_mean1, scale=my_std1, size=5))
weights_B.index = ['s1', 's2', 's3', 's4','s5']

df_B = pd.DataFrame()
df_B['Student_height'] = heights_B
df_B['Student_weight'] = weights_B

df_B.to_csv('classB.csv', index=False)

df_B2 = pd.read_csv('classB.csv')
print(df_B2)

df_B3 = pd.read_csv('classB.csv',header=None)
print(df_B3)

df_B4 = pd.read_csv('classB.csv',header=None,skiprows=2)
print(df_B4)

4. Indexing Dataframes

#Write your code here

import pandas as pd
import numpy as np

DatetimeIndex = pd.date_range(start='09/1/2017', end='09/15/2017')

print(DatetimeIndex[2])

datelist = ['14-Sep-2017', '9-Sep-2017']

dates_to_be_searched = pd.to_datetime(datelist)

print(dates_to_be_searched)

print(dates_to_be_searched.isin(DatetimeIndex))

arraylist = [['classA']5 + ['classB']5, ['s1', 's2', 's3','s4', 's5']*2]

mi_index = pd.MultiIndex.from_product(arraylist, names=['First Level','Second Level'])
print(mi_index.levels)

5. Data Cleaning
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A.loc['s3'] = np.nan
df_A.loc['s5'][1] = np.nan

df_A2 = df_A.dropna(how ='any')

print(df_A2)

6. Data Aggregation

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A_filter1 = df_A[(df_A.Student_height > 160.0) & (df_A.Student_weight < 80.0)]

print(df_A_filter1)

df_A_filter2 = df_A[df_A.index.isin(['s5'])]
print(df_A_filter2)
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

df_A['Gender'] = ['M', 'F', 'M', 'M', 'F']

df_groups = df_A.groupby('Gender')
print(df_groups.mean())

7. Data Merge 1

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A['Gender'] = ['M', 'F', 'M', 'M', 'F']

s = pd.Series([165.4, 82.7, 'F'],index=['Student_height', 'Student_weight', 'Gender'],name='s6')

df_AA = df_A.append(s)
print(df_AA)

my_mean = 170.0
my_std = 25.0
np.random.seed(100)
heights_B = pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
heights_B.index = ['s1', 's2', 's3', 's4','s5']

my_mean1 = 75.0
my_std1 = 12.0
np.random.seed(100)
weights_B = pd.Series(np.random.normal(loc=my_mean1, scale=my_std1, size=5))
weights_B.index = ['s1', 's2', 's3', 's4','s5']

df_B = pd.DataFrame()
df_B['Student_height'] = heights_B
df_B['Student_weight'] = weights_B
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

df_B.index = [ 's7', 's8', 's9', 's10', 's11']

df_B['Gender'] = ['F', 'M', 'F', 'F', 'M']

df = pd.concat([df_AA,df_B])
print(df)

8. Data Merge – 2

#Write your code here

import pandas as pd
import numpy as np

nameid = pd.Series(range(101, 111))

name = pd.Series(['person' + str(i) for i in range(1, 11)])
master = pd.DataFrame()
master['nameid'] = nameid
master['name'] = name

transaction = pd.DataFrame({'nameid':[108, 108, 108,103], 'product':['iPhone', 'Nokia', 'Micromax', 'Viv

o']})

mdf = pd.merge(master,transaction,on='nameid')
print(mdf)

PANDAS
No ratings yet
PANDAS
74 pages
Practical File 2024
No ratings yet
Practical File 2024
25 pages
CM-1 Worksheets
No ratings yet
CM-1 Worksheets
26 pages
Dmgss
No ratings yet
Dmgss
75 pages
Python Pandas - Hands On 1
No ratings yet
Python Pandas - Hands On 1
1 page
StudentMgmStystme ProjectFinal
100% (1)
StudentMgmStystme ProjectFinal
23 pages
KSTV
No ratings yet
KSTV
19 pages
Citra Log - Txt.old
No ratings yet
Citra Log - Txt.old
33 pages
Data Frame Demo
No ratings yet
Data Frame Demo
73 pages
CPE221 (2023-2024) - Lesson 3 - Pandas
No ratings yet
CPE221 (2023-2024) - Lesson 3 - Pandas
10 pages
Project 4
No ratings yet
Project 4
8 pages
DSP Unit-5 Updated
No ratings yet
DSP Unit-5 Updated
23 pages
NumPy and Pandas Step
No ratings yet
NumPy and Pandas Step
9 pages
Misrate2 10.94.141.55 GG
No ratings yet
Misrate2 10.94.141.55 GG
141 pages
Data Science Programs
No ratings yet
Data Science Programs
11 pages
Programs of Python Pandas
No ratings yet
Programs of Python Pandas
15 pages
Lecture 8 - Data Wrangling Using Pandas
No ratings yet
Lecture 8 - Data Wrangling Using Pandas
31 pages
12 Pandas
100% (1)
12 Pandas
21 pages
Adobe Scan 11 Jan 2025
No ratings yet
Adobe Scan 11 Jan 2025
13 pages
Mayank Chaudhary DEV Practicals
No ratings yet
Mayank Chaudhary DEV Practicals
14 pages
Pandas - Ipynb - Colab
No ratings yet
Pandas - Ipynb - Colab
8 pages
Lecture - 2 - Digital Control System
No ratings yet
Lecture - 2 - Digital Control System
77 pages
Unit 4 DSE
No ratings yet
Unit 4 DSE
9 pages
Machine Learning - Exploring The Model
50% (2)
Machine Learning - Exploring The Model
3 pages
Sensors 20 05603
No ratings yet
Sensors 20 05603
20 pages
10) Merging Dataframes: # Detecting Duplicates
No ratings yet
10) Merging Dataframes: # Detecting Duplicates
7 pages
Week 3 GGG
No ratings yet
Week 3 GGG
17 pages
Pandas 2 Complete Notes Class XII
No ratings yet
Pandas 2 Complete Notes Class XII
18 pages
Exam Cutoff Data
No ratings yet
Exam Cutoff Data
4 pages
IP Practical File - Reference
No ratings yet
IP Practical File - Reference
98 pages
Pandas
No ratings yet
Pandas
44 pages
Data Science Practicals - Ipynb
No ratings yet
Data Science Practicals - Ipynb
54 pages
Code Snippets
No ratings yet
Code Snippets
7 pages
IP Project File
No ratings yet
IP Project File
14 pages
Descriptive Statistics With Pandas: Data Handling Using Pandas - II
100% (1)
Descriptive Statistics With Pandas: Data Handling Using Pandas - II
37 pages
Davp Pyq 2023 Solution
No ratings yet
Davp Pyq 2023 Solution
15 pages
Python
No ratings yet
Python
32 pages
Series and DataFrame
No ratings yet
Series and DataFrame
2 pages
Exp 3
No ratings yet
Exp 3
10 pages
What Is Mobile Switching Center and How MSC Functions?
No ratings yet
What Is Mobile Switching Center and How MSC Functions?
6 pages
Student Management System
No ratings yet
Student Management System
9 pages
SRS Document-1
No ratings yet
SRS Document-1
8 pages
Mahesh Laxman Wagh: Professional Summary
No ratings yet
Mahesh Laxman Wagh: Professional Summary
5 pages
Hrithik Saini Class 12th c1, Roll No 1033
No ratings yet
Hrithik Saini Class 12th c1, Roll No 1033
25 pages
Class 5 - 2D Maxima Sweep-Line Algorithm
No ratings yet
Class 5 - 2D Maxima Sweep-Line Algorithm
28 pages
AD3301 - Data - Transformation - Ipynb - Colaboratory
No ratings yet
AD3301 - Data - Transformation - Ipynb - Colaboratory
27 pages
Assignments IP Class 12
No ratings yet
Assignments IP Class 12
9 pages
Data Science Practical Book - Ipynb
No ratings yet
Data Science Practical Book - Ipynb
21 pages
Data Frame Notes1
No ratings yet
Data Frame Notes1
7 pages
Quick Start Guide: Ethernet Communication With Mitsubishi Q Plcs
No ratings yet
Quick Start Guide: Ethernet Communication With Mitsubishi Q Plcs
34 pages
II B.Tech II-sem Timetables R20, R19, R16
No ratings yet
II B.Tech II-sem Timetables R20, R19, R16
5 pages
Mantra MFS100 RD Service Manual Windows 1.1.0
No ratings yet
Mantra MFS100 RD Service Manual Windows 1.1.0
16 pages
Corelation 22.1
No ratings yet
Corelation 22.1
9 pages
Ap Python
No ratings yet
Ap Python
12 pages
Pandas - Datastructures
No ratings yet
Pandas - Datastructures
19 pages
Automating Tasks Using The Automation 360 Excel Advanced Package
No ratings yet
Automating Tasks Using The Automation 360 Excel Advanced Package
18 pages
3a Data Frame - Jupyter Notebook
No ratings yet
3a Data Frame - Jupyter Notebook
5 pages
Project Report - E-Shopping For Clothes
100% (3)
Project Report - E-Shopping For Clothes
11 pages
Oacon LMI3d Lazer Profil Ve Snapshot Sensor
No ratings yet
Oacon LMI3d Lazer Profil Ve Snapshot Sensor
28 pages
12 Pandas
No ratings yet
12 Pandas
9 pages
Data Structures in Pandas Solution.: Code
No ratings yet
Data Structures in Pandas Solution.: Code
9 pages
Computer Architecture (Bcs504) Unit I
No ratings yet
Computer Architecture (Bcs504) Unit I
51 pages
Pandas Cheat Sheet PDF
67% (3)
Pandas Cheat Sheet PDF
1 page
4 PythonPandas
No ratings yet
4 PythonPandas
8 pages
Ip Project
No ratings yet
Ip Project
27 pages
Editorial
No ratings yet
Editorial
4 pages
SQL Questions For Journal
No ratings yet
SQL Questions For Journal
9 pages
Resume Tata Consultancy Services
No ratings yet
Resume Tata Consultancy Services
3 pages
HNS Level III COC Knowledge Test
88% (16)
HNS Level III COC Knowledge Test
3 pages
Fuzzy Logic Based Algorithm For Context Awareness in Lot For Smart Home Environment
No ratings yet
Fuzzy Logic Based Algorithm For Context Awareness in Lot For Smart Home Environment
4 pages
Data Merging Hands-On (2) Solution: Python Pandas
No ratings yet
Data Merging Hands-On (2) Solution: Python Pandas
1 page
Bower
No ratings yet
Bower
1 page
Python Pandas
No ratings yet
Python Pandas
9 pages
On-Board Control Interface Icu 602
No ratings yet
On-Board Control Interface Icu 602
3 pages
Python Cheat Sheet Code Academy
100% (1)
Python Cheat Sheet Code Academy
1 page
Color Theory
No ratings yet
Color Theory
4 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Data Science With Python
No ratings yet
Data Science With Python
12 pages
Python Pandas Hands-On CID 55937
No ratings yet
Python Pandas Hands-On CID 55937
10 pages
Pandas Cheat Sheet
100% (2)
Pandas Cheat Sheet
6 pages
Unstructured Data Classification Handson
No ratings yet
Unstructured Data Classification Handson
4 pages
Data Merge - Hands-On 1
No ratings yet
Data Merge - Hands-On 1
2 pages
R-Format Instructions: Op Rs RT RD Shamt Funct
No ratings yet
R-Format Instructions: Op Rs RT RD Shamt Funct
4 pages
Python Pandas
No ratings yet
Python Pandas
6 pages
Mini-Project - Java Fullstack Developer - MySQL - FP (63426)
No ratings yet
Mini-Project - Java Fullstack Developer - MySQL - FP (63426)
1 page
Python Program To Find All Numbers in A Range Which Are Perfect Squares and Sum of All Digits in The Number Is Less Than 10 - Sanfoundry
No ratings yet
Python Program To Find All Numbers in A Range Which Are Perfect Squares and Sum of All Digits in The Number Is Less Than 10 - Sanfoundry
5 pages
Data Science Cheat Sheet: KEY Imports
100% (1)
Data Science Cheat Sheet: KEY Imports
1 page
History of Information Technology: Premechanical
No ratings yet
History of Information Technology: Premechanical
5 pages
Cs It Dept PDF
No ratings yet
Cs It Dept PDF
3 pages
Pandas
No ratings yet
Pandas
4 pages
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
100% (3)
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
9 pages
Internet of Things Prime
No ratings yet
Internet of Things Prime
3 pages
Teacher's Notes - Lab Chapter 1 - Intro To Solaris
No ratings yet
Teacher's Notes - Lab Chapter 1 - Intro To Solaris
3 pages

Python Pandas Handson

Uploaded by

Python Pandas Handson

Uploaded by

Join our channel if you haven’t joined yet https://t.

Python Pandas HandsOns

1. Pandas Data Structures

heights_A= pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

#Write your code here

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

3. Working with CSV files

#Write your code here

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

df_A3 = pd.read_csv('classA.csv',index_col='Unnamed: 0')

#Write your code here

DatetimeIndex = pd.date_range(start='09/1/2017', end='09/15/2017')

datelist = ['14-Sep-2017', '9-Sep-2017']

arraylist = [['classA']*5 + ['classB']*5, ['s1', 's2', 's3','s4', 's5']*2]

#Write your code here

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

df_A2 = df_A.dropna(how ='any')

#Write your code here

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

df_A_filter1 = df_A[(df_A.Student_height > 160.0) & (df_A.Student_weight < 80.0)]

df_A['Gender'] = ['M', 'F', 'M', 'M', 'F']

#Write your code here

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

df_A['Gender'] = ['M', 'F', 'M', 'M', 'F']

s = pd.Series([165.4, 82.7, 'F'],index=['Student_height', 'Student_weight', 'Gender'],name='s6')

df_B.index = [ 's7', 's8', 's9', 's10', 's11']

#Write your code here

nameid = pd.Series(range(101, 111))

transaction = pd.DataFrame({'nameid':[108, 108, 108,103], 'product':['iPhone', 'Nokia', 'Micromax', 'Viv

You might also like

arraylist = [['classA']5 + ['classB']5, ['s1', 's2', 's3','s4', 's5']*2]