Pandas - DataFrame - Handling Missing NaN Values
Contents
1. NaN Value
2. Creating a DataFrame by loading csv file
3. isna() and isnull() method – Checking NaN values
4. notnull() method – Checking NaN values
5. Counting NaN values column-wise
6. dropna() method – Handling missing values
7. dropna(inplace = True) method – Handling missing values
8. fillna() method – Handling missing values
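1. NaN Value
NaN stands for Not a Number. pandas uses NaN to represent a missing value, such as an empty cell in a csv file.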
2. Creating a DataFrame by loading csv file
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
print(df1)
Output
3. isna() and isnull() method – Checking NaN values
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
df2 = df1.isna()
print(df1.head())
print()
print(df2.head())
Output
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
df2 = df1.isnull()
print(df1.head())
print()
print(df2.head())
Output
Make a note: isnull() is an alias of isna(); both methods return the same result.
4. notnull() method – Checking NaN values
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
df2 = df1.notnull()
print(df1.head())
print()
print(df2.head())
Output
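The contents of fruits1.csv are not shown here, so the following is a minimal self-contained sketch of the same checks. The DataFrame and its column names ("Apples", "Oranges") are hypothetical stand-ins for the csv file.

```python
import numpy as np
import pandas as pd

# Hypothetical data standing in for fruits1.csv
df = pd.DataFrame({
    "Apples": [10, np.nan, 30],
    "Oranges": [np.nan, 5, 15],
})

missing = df.isna()       # True where a value is NaN
present = df.notnull()    # True where a value is not NaN

print(missing)
print()
print(present)
print()
# isnull() is an alias of isna(), so both give the same result
print(df.isna().equals(df.isnull()))
```

notnull() is the exact opposite of isna(): every True in one is False in the other.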
5. Counting NaN values column-wise
import pandas as pd
df1 = pd.read_csv('fruits1.csv')
s = df1.isna().sum()
print(s)
Output
import pandas as pd
df1 = pd.read_csv('fruits1.csv')
s = df1.isna().sum()
per = (s * 100) / len(df1)
print(per)
Output
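To make the count and percentage easier to follow, here is a small sketch on made-up data instead of fruits1.csv. The column names and values are assumptions chosen so the arithmetic is easy to check by hand.

```python
import numpy as np
import pandas as pd

# Hypothetical data: 4 rows, with 2, 1 and 0 missing values per column
df = pd.DataFrame({
    "Apples": [10, np.nan, 30, np.nan],
    "Oranges": [np.nan, 5, 15, 20],
    "Bananas": [1, 2, 3, 4],
})

counts = df.isna().sum()            # NaN count per column
percent = counts * 100 / len(df)    # percentage of rows that are NaN, per column
total = int(counts.sum())           # NaN count in the whole DataFrame

print(counts)
print()
print(percent)
print()
print(total)
```

isna() gives True/False values, and sum() treats True as 1, which is why summing the result counts the NaN values in each column.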
6. dropna() method – Handling missing values
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
df2 = df1.dropna()
print(df2)
Output
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
df2 = df1.dropna()
s = df2.isna().sum()
print(s)
Output
import pandas as pd
df1 = pd.read_csv('fruits1.csv')
df2 = df1.dropna()
df3 = df2.astype(int)
print(df2.head())
print()
print(df3.head())
Output
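The programs above can be sketched on a small made-up DataFrame (the column names are hypothetical). It shows that dropna() returns a new DataFrame and leaves the original untouched, and that astype(int) works after the rows with NaN are gone.

```python
import numpy as np
import pandas as pd

# Hypothetical data: only the first row has no NaN
df = pd.DataFrame({
    "Apples": [10, np.nan, 30],
    "Oranges": [7, 5, np.nan],
})

clean = df.dropna()           # keeps only rows with no NaN; df is unchanged
as_int = clean.astype(int)    # no NaN left, so the values convert to integers

print(df.dtypes)    # float64, because an integer column cannot hold NaN
print()
print(clean)
print()
print(as_int)
```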
7. dropna(inplace = True) method – Handling missing values
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
df1.dropna(inplace = True)
print(df1)
Output
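A point worth checking with inplace = True is the return value. The sketch below uses hypothetical data to show that the call changes the DataFrame itself and returns None, so the result should not be assigned back to a variable.

```python
import numpy as np
import pandas as pd

# Hypothetical data: only the first row has no NaN
df = pd.DataFrame({
    "Apples": [10, np.nan, 30],
    "Oranges": [7, 5, np.nan],
})

result = df.dropna(inplace=True)   # modifies df itself

print(result)   # None
print(df)       # only the row without NaN remains
```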
8. fillna() method – Handling missing values
import pandas as pd
df1 = pd.read_csv("fruits1.csv")
df2 = df1.fillna(0)
print(df1.head())
print()
print(df2.head())
Output
import pandas as pd
import numpy as np
data = [
["Rajan", 26, 40000],
["Daniel", 16, 20000],
["Veeru", 45, 90000],
["Venkat", np.nan, 45000],
["Sumanth", 20, 95000],
["Shafi", np.nan, 97000]
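]
# Build the DataFrame (column names are assumed)
df1 = pd.DataFrame(data, columns=["Name", "Age", "Salary"])
# Assumed step: replace the NaN ages with 0
df2 = df1.fillna(0)
print(df1)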
print()
print(df2)
Output
import pandas as pd
import numpy as np
data = [
["Shahid", 26, 40000],
["Daniel", 16, 20000],
["Karteek", np.nan, 90000],
["Venkat", np.nan, 45000],
["Veeru", 24, 95000],
["Shafi", np.nan, 97000]
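]
# Build the DataFrame (the 'Name' and 'Salary' column names are assumed)
df1 = pd.DataFrame(data, columns=["Name", "Age", "Salary"])
print(df1)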
m = df1['Age'].mean()
df1['Age'] = df1['Age'].fillna(m)
print()
print(df1)
Output
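The mean used in the program above can be checked by hand. mean() skips NaN values by default, so only the three known ages are averaged. A minimal sketch using just the Age values from that program:

```python
import numpy as np
import pandas as pd

# The Age values from the program above, as a Series
ages = pd.Series([26, 16, np.nan, np.nan, 24, np.nan], name="Age")

m = ages.mean()            # NaN entries are skipped: (26 + 16 + 24) / 3
filled = ages.fillna(m)    # every NaN becomes the mean

print(m)
print(filled)
```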
Program: Creating a DataFrame and replacing NaN values with a specific value
Name: demo14.py
import pandas as pd
import numpy as np
data = [
['Shahid', np.nan, 40000],
['Daniel', 16, 20000],
['Veeru', 45, 90000],
['Sumanth', 20, 95000]
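]
# Build the DataFrame (column names are assumed)
df1 = pd.DataFrame(data, columns=['Name', 'Age', 'Salary'])
print(df1)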
df2 = df1.replace(np.nan, 0)
print()
print(df2)
Output
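For NaN values, replace(np.nan, 0) and fillna(0) should give the same result. The sketch below compares the two on the Name and Age values from demo14.py; the column names are assumed.

```python
import numpy as np
import pandas as pd

# Name and Age values from demo14.py (column names are assumed)
df = pd.DataFrame({
    "Name": ["Shahid", "Daniel", "Veeru", "Sumanth"],
    "Age": [np.nan, 16, 45, 20],
})

a = df.replace(np.nan, 0)   # replace every NaN with 0
b = df.fillna(0)            # fill every NaN with 0

print(a)
print()
print(a["Age"].equals(b["Age"]))
```

replace() is the more general method because it can swap any value for another, while fillna() only acts on missing values.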