Week 2

Uploaded by

srujjanbelamgi12


import pandas as pd

# DataFrame 1
data1 = {'Name': ['Pankaj', 'Meghna', 'Lisa'],
         'Country': ['India', 'India', 'USA'],
         'Role': ['CEO', 'CTO', 'CTO']}
df1 = pd.DataFrame(data1)

# DataFrame 2
data2 = {'ID': [1, 2, 3],
         'Name': ['Pankaj', 'Anupam', 'Amit']}
df2 = pd.DataFrame(data2)
print("DataFrame 1:")
print(df1)
print("\nDataFrame 2:")
print(df2)

DataFrame 1:
     Name Country Role
0  Pankaj   India  CEO
1  Meghna   India  CTO
2    Lisa     USA  CTO

DataFrame 2:
   ID    Name
0   1  Pankaj
1   2  Anupam
2   3    Amit

# Inner Join (the default for pd.merge): keeps only names present in both frames
result_row = pd.merge(df1, df2, on='Name')
print(result_row)

     Name Country Role  ID
0  Pankaj   India  CEO   1

# Left Join
result_left = pd.merge(df1, df2, on='Name', how='left')
print("\nResult Left Join:")
print(result_left)

# Right Join
result_right = pd.merge(df1, df2, on='Name', how='right')
print("\nResult Right Join:")
print(result_right)

# Outer Join
result_outer = pd.merge(df1, df2, on='Name', how='outer')
print("\nResult Outer Join:")
print(result_outer)

Result Left Join:
     Name Country Role   ID
0  Pankaj   India  CEO  1.0
1  Meghna   India  CTO  NaN
2    Lisa     USA  CTO  NaN

Result Right Join:
     Name Country Role  ID
0  Pankaj   India  CEO   1
1  Anupam     NaN  NaN   2
2    Amit     NaN  NaN   3

Result Outer Join:
     Name Country Role   ID
0    Amit     NaN  NaN  3.0
1  Anupam     NaN  NaN  2.0
2    Lisa     USA  CTO  NaN
3  Meghna   India  CTO  NaN
4  Pankaj   India  CEO  1.0
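
In the outer join output, the NaN cells are the only hint about which frame a row came from. As a small sketch (not part of the original notebook), the `indicator=True` option of `pd.merge` adds a `_merge` column that labels each row explicitly. The names `tagged` and `left_only_names` are made up for this illustration.

```python
import pandas as pd

df1 = pd.DataFrame({'Name': ['Pankaj', 'Meghna', 'Lisa'],
                    'Country': ['India', 'India', 'USA'],
                    'Role': ['CEO', 'CTO', 'CTO']})
df2 = pd.DataFrame({'ID': [1, 2, 3],
                    'Name': ['Pankaj', 'Anupam', 'Amit']})

# indicator=True adds a '_merge' column holding
# 'left_only', 'right_only', or 'both' for every row
tagged = pd.merge(df1, df2, on='Name', how='outer', indicator=True)
print(tagged[['Name', '_merge']])

# Names found only in df1, i.e. rows with no matching ID in df2
left_only_names = sorted(tagged.loc[tagged['_merge'] == 'left_only', 'Name'])
print(left_only_names)
```

Filtering on `_merge` is one way to find unmatched keys before deciding which join type is appropriate.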


# Sales Dictionary and Region Dictionary

sales_dict = {'ID': [1, 2, 3, 4],
              'Amount': [100, 200, 300, 400]}
region_dict = {'ID': [1, 2, 3, 5],
               'Region': ['East', 'West', 'North', 'South']}
# Create DataFrames
sales_df = pd.DataFrame.from_dict(sales_dict)
region_df = pd.DataFrame.from_dict(region_dict)
print("Sales DataFrame:")
print(sales_df)
print("\nRegion DataFrame:")
print(region_df)

Sales DataFrame:
   ID  Amount
0   1     100
1   2     200
2   3     300
3   4     400

Region DataFrame:
   ID Region
0   1   East
1   2   West
2   3  North
3   5  South

# b) Merging with Inner Join
result_inner = pd.merge(sales_df, region_df, on='ID', how='inner')
print("\nInner Join:")
print(result_inner)

# c) Merging with Left Join
result_left = pd.merge(sales_df, region_df, on='ID', how='left')
print("\nLeft Join:")
print(result_left)

# d) Merging with Right Join
result_right = pd.merge(sales_df, region_df, on='ID', how='right')
print("\nRight Join:")
print(result_right)

# e) Merging with Outer Join
result_outer = pd.merge(sales_df, region_df, on='ID', how='outer')
print("\nOuter Join:")
print(result_outer)

Inner Join:
   ID  Amount Region
0   1     100   East
1   2     200   West
2   3     300  North

Left Join:
   ID  Amount Region
0   1     100   East
1   2     200   West
2   3     300  North
3   4     400    NaN

Right Join:
   ID  Amount Region
0   1   100.0   East
1   2   200.0   West
2   3   300.0  North
3   5     NaN  South

Outer Join:
   ID  Amount Region
0   1   100.0   East
1   2   200.0   West
2   3   300.0  North
3   4   400.0    NaN
4   5     NaN  South
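
In the right and outer join output, `Amount` prints as 100.0 rather than 100 because the default `int64` dtype cannot hold NaN, so pandas upcasts the column to float. The sketch below is an addition to the notebook and assumes a pandas version with the nullable `Int64` extension dtype; casting `Amount` to it before merging should keep whole numbers and show the gap as `<NA>`. The names `plain` and `nullable` are illustrative only.

```python
import pandas as pd

sales_df = pd.DataFrame({'ID': [1, 2, 3, 4],
                         'Amount': [100, 200, 300, 400]})
region_df = pd.DataFrame({'ID': [1, 2, 3, 5],
                          'Region': ['East', 'West', 'North', 'South']})

# Default behaviour: the unmatched ID 5 introduces NaN, so Amount becomes float64
plain = pd.merge(sales_df, region_df, on='ID', how='right')
print(plain['Amount'].dtype)

# Assumption: casting to the nullable 'Int64' dtype first keeps integer values
nullable = pd.merge(sales_df.astype({'Amount': 'Int64'}),
                    region_df, on='ID', how='right')
print(nullable['Amount'].dtype)
print(nullable)
```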

import numpy as np
import pandas as pd

# Data with Missing Values
data = {'A': [1, np.nan, 3, 4],
        'B': [5, 6, np.nan, 8],
        'C': [np.nan, np.nan, 9, 10]}
df = pd.DataFrame(data)
print("Original DataFrame:")
print(df)

# 1. Drop rows with any missing value
print("\nDrop rows with any missing values:")
print(df.dropna())

# 2. Drop columns with at least one missing value
print("\nDrop columns with at least one missing value:")
print(df.dropna(axis=1))

# 3. Drop rows in which every value is missing (no such row here, so nothing is dropped)
print("\nDrop rows/columns with all missing values:")
print(df.dropna(how='all'))

# 4. Keep only rows with at least 2 non-NaN values
print("\nDrop rows/columns based on threshold:")
print(df.dropna(thresh=2))

# 5. Replace NaN with the previous value (Forward Fill)
print("\nReplace NaN with the previous value:")
print(df.ffill())  # Using ffill() instead of fillna(method='pad')

# 6. Replace NaN with the previous value, filling at most 1 consecutive NaN (Forward Fill with Limit)
print("\nReplace NaN with the previous value, limit=1:")
print(df.ffill(limit=1))  # Using ffill() with limit

# 7. Replace NaN with the next value (Backward Fill)
print("\nReplace NaN with the forward value:")
print(df.bfill())  # Using bfill() instead of fillna(method='bfill')

Original DataFrame:
     A    B     C
0  1.0  5.0   NaN
1  NaN  6.0   NaN
2  3.0  NaN   9.0
3  4.0  8.0  10.0

Drop rows with any missing values:
     A    B     C
3  4.0  8.0  10.0

Drop columns with at least one missing value:
Empty DataFrame
Columns: []
Index: [0, 1, 2, 3]

Drop rows/columns with all missing values:
     A    B     C
0  1.0  5.0   NaN
1  NaN  6.0   NaN
2  3.0  NaN   9.0
3  4.0  8.0  10.0

Drop rows/columns based on threshold:
     A    B     C
0  1.0  5.0   NaN
2  3.0  NaN   9.0
3  4.0  8.0  10.0

Replace NaN with the previous value:
     A    B     C
0  1.0  5.0   NaN
1  1.0  6.0   NaN
2  3.0  6.0   9.0
3  4.0  8.0  10.0

Replace NaN with the previous value, limit=1:
     A    B     C
0  1.0  5.0   NaN
1  1.0  6.0   NaN
2  3.0  6.0   9.0
3  4.0  8.0  10.0

Replace NaN with the forward value:
     A    B     C
0  1.0  5.0   9.0
1  3.0  6.0   9.0
2  3.0  8.0   9.0
3  4.0  8.0  10.0
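
The calls above work row by row or copy neighbouring values. A possible complement, sketched here as an addition to the notebook, is to count the gaps per column, apply the threshold along `axis=1`, and fill with the column mean instead of a neighbour. The names `missing_per_column`, `dense_cols`, and `filled` are hypothetical.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'A': [1, np.nan, 3, 4],
                   'B': [5, 6, np.nan, 8],
                   'C': [np.nan, np.nan, 9, 10]})

# Number of missing values in each column
missing_per_column = df.isna().sum()
print(missing_per_column)

# Column-wise threshold: keep columns with at least 3 non-NaN values (drops 'C')
dense_cols = df.dropna(axis=1, thresh=3)
print(dense_cols)

# Fill every NaN with the mean of its own column
filled = df.fillna(df.mean())
print(filled)
```

Mean filling leaves no gaps at the top of column `C`, which forward fill cannot do because there is no earlier value to copy.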

import pandas as pd

fruit = { 'orange' : [3,2,0,1], 'apple' : [0,3,7,2], 'grapes' : [7,14,6,15] }

df1 = pd.DataFrame(fruit)
df1

   orange  apple  grapes
0       3      0       7
1       2      3      14
2       0      7       6
3       1      2      15


fruit = { 'grapes' : [13,12,10,2,55,98], 'mango' : [10,13,17,2,9,76], 'banana' : [20,23,27,4,np.nan,np.nan]} # Added np.nan

df2 = pd.DataFrame(fruit)
df2

   grapes  mango  banana
0      13     10    20.0
1      12     13    23.0
2      10     17    27.0
3       2      2     4.0
4      55      9     NaN
5      98     76     NaN


df2 = df2.drop(df2.index[2])
df2

   grapes  mango  banana
0      13     10    20.0
1      12     13    23.0
3       2      2     4.0
4      55      9     NaN
5      98     76     NaN


pd.concat((df1, df2), axis = 0)

   orange  apple  grapes  mango  banana
0     3.0    0.0       7    NaN     NaN
1     2.0    3.0      14    NaN     NaN
2     0.0    7.0       6    NaN     NaN
3     1.0    2.0      15    NaN     NaN
0     NaN    NaN      13   10.0    20.0
1     NaN    NaN      12   13.0    23.0
3     NaN    NaN       2    2.0     4.0
4     NaN    NaN      55    9.0     NaN
5     NaN    NaN      98   76.0     NaN

df1

   orange  apple  grapes
0       3      0       7
1       2      3      14
2       0      7       6
3       1      2      15


pd.concat([df1, df2], ignore_index=True)

   orange  apple  grapes  mango  banana
0     3.0    0.0       7    NaN     NaN
1     2.0    3.0      14    NaN     NaN
2     0.0    7.0       6    NaN     NaN
3     1.0    2.0      15    NaN     NaN
4     NaN    NaN      13   10.0    20.0
5     NaN    NaN      12   13.0    23.0
6     NaN    NaN       2    2.0     4.0
7     NaN    NaN      55    9.0     NaN
8     NaN    NaN      98   76.0     NaN
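
Both concatenations above keep the union of the columns, which is where the NaN padding comes from. The sketch below, which is not in the original notebook, shows two other `pd.concat` options: `join='inner'` keeps only the columns the frames share, and `keys=` labels each source frame so the original row labels survive. The names `shared` and `labelled` are made up here.

```python
import numpy as np
import pandas as pd

df1 = pd.DataFrame({'orange': [3, 2, 0, 1],
                    'apple': [0, 3, 7, 2],
                    'grapes': [7, 14, 6, 15]})
df2 = pd.DataFrame({'grapes': [13, 12, 10, 2, 55, 98],
                    'mango': [10, 13, 17, 2, 9, 76],
                    'banana': [20, 23, 27, 4, np.nan, np.nan]})
df2 = df2.drop(df2.index[2])  # same row removal as in the notebook

# join='inner' keeps only columns present in both frames ('grapes'), so no NaN padding
shared = pd.concat([df1, df2], join='inner', ignore_index=True)
print(shared)

# keys= builds a two-level index recording which frame each row came from
labelled = pd.concat([df1, df2], keys=['first', 'second'])
print(labelled.loc['second'])
```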

%%time
df = pd.DataFrame(columns=['A'])
for i in range(30):
    # DataFrame.append was removed in pandas 2.0; use concat to add rows instead
    df = pd.concat([df, pd.DataFrame([{'A': i*2}])], ignore_index=True)

CPU times: user 17.4 ms, sys: 0 ns, total: 17.4 ms
Wall time: 16.7 ms

%%time
df = pd.concat([pd.DataFrame([i*2], columns=['A']) for i in range(30)], ignore_index=True)

CPU times: user 11.4 ms, sys: 1.04 ms, total: 12.5 ms
Wall time: 39.6 ms
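
With only 30 rows, the two timed cells differ by a few milliseconds, and single `%%time` runs at this scale are noisy. A third variant, sketched here as an assumption rather than something the notebook measured, avoids `pd.concat` altogether by collecting plain Python values and constructing the DataFrame once. The name `values` is illustrative.

```python
import pandas as pd

# Collect plain Python values first, then build the DataFrame in a single call
values = [i * 2 for i in range(30)]
df = pd.DataFrame({'A': values})
print(df.head())
```

In a notebook, `%%timeit` repeats the cell many times and would likely give a steadier comparison than a single `%%time` run.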
