Pandas - Dataframe - Handling Missing Nan Values

The document provides a comprehensive guide on handling missing or NaN values in Pandas, covering methods such as isna(), isnull(), dropna(), and fillna(). It explains how to create DataFrames from CSV files, check for missing values, count them, and handle them by dropping or filling with specific values. Various code examples illustrate the use of these methods for practical data manipulation.

Uploaded by

dheerajsai01

Data Science – Pandas – Handling Missing or NaN values

12. PANDAS – Handling missing or NaN values

Contents
1. NaN Value
2. Creating a DataFrame by loading a CSV file
3. isna() and isnull() methods – Checking NaN values
4. notnull() method – Checking NaN values
5. Counting NaN values column-wise
6. dropna() method – Handling missing values
7. dropna(inplace = True) method – Handling missing values
8. fillna() method – Handling missing values


1. NaN Value

• NaN stands for "Not a Number".
• In pandas, NaN represents missing values in data.
• NaN is a floating-point value, so its data type is float.
• When a CSV file is loaded, any missing entries in the file are read as NaN.
• These NaN values need to be handled during data analysis.
  o For example, some users in a survey may choose not to share their income and others may choose not to share their address, so those entries are missing from the dataset.

None and NaN

• None: a built-in Python object that represents the absence of a value.
• NaN: a special floating-point value (available as numpy.nan) that pandas uses to represent missing data.
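The difference between None and NaN can be checked directly. The following is a minimal sketch; the variable names and the small Series are illustrative and not part of the original programs.

```python
import math

import numpy as np
import pandas as pd

# NaN is an ordinary floating-point value.
nan_type = type(np.nan)

# NaN never compares equal to anything, including itself,
# so it has to be detected with isnan()/isna() rather than ==.
nan_equals_itself = (np.nan == np.nan)
nan_detected = math.isnan(np.nan)

# pandas treats both None and NaN as missing.
none_is_missing = pd.isna(None)
nan_is_missing = pd.isna(np.nan)

# In a numeric Series, None is stored as NaN and the dtype becomes float64.
ages = pd.Series([26, None, 45])

print(nan_type)
print(nan_equals_itself, nan_detected)
print(none_is_missing, nan_is_missing)
print(ages.dtype)
print(ages.isna().tolist())
```

Because NaN is a float, an integer column that receives a missing value is stored as float64, which is why a later program converts a column back to int after dropping the missing rows.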


2. Creating a DataFrame by loading a CSV file

• A DataFrame can be created by loading a CSV file.
• The fruits1.csv file used here has missing values.
• Observe the missing (NaN) values in the resulting DataFrame.
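The contents of fruits1.csv are not reproduced in this document, so the sketch below uses a small in-memory CSV instead. The column names and values are hypothetical; the point is only that empty fields come back as NaN.

```python
import io

import pandas as pd

# Hypothetical CSV text standing in for a file with missing entries.
csv_text = """fruit,price,quantity
apple,120,10
banana,,25
cherry,300,
"""

# read_csv accepts a file-like object as well as a file name.
df = pd.read_csv(io.StringIO(csv_text))

print(df)
print(df.dtypes)
```

The empty price for banana and the empty quantity for cherry are both loaded as NaN.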

Program: Loading the fruits CSV file
Name: demo1.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")

print(df1)


3. isna() and isnull() methods – Checking NaN values

• isna() and isnull() are predefined DataFrame methods.
• They are called on a DataFrame object.
• They check whether a DataFrame contains missing values.
• Both return a DataFrame of the same shape holding True where a value is missing and False otherwise.
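As a minimal sketch of the behaviour described above, the example below builds a small DataFrame in memory (hypothetical data, not the contents of fruits1.csv) and inspects the Boolean mask that isna() returns.

```python
import numpy as np
import pandas as pd

# Hypothetical data with one missing value in each column.
df = pd.DataFrame({
    "fruit": ["apple", "banana", None],
    "price": [120.0, np.nan, 300.0],
})

# Same shape as df: True where a value is missing, False otherwise.
mask = df.isna()
print(mask)

# isnull() gives the same result as isna().
same_result = df.isnull().equals(mask)

# any() reduces the mask: per column first, then over the whole DataFrame.
missing_per_column = mask.any()
anything_missing = bool(mask.any().any())

print(same_result)
print(missing_per_column)
print(anything_missing)
```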

Program: isna() method
Name: demo2.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")
df2 = df1.isna()

print(df1.head())
print()
print(df2.head())


Program: isnull() method
Name: demo3.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")
df2 = df1.isnull()

print(df1.head())
print()
print(df2.head())

Make a note

• isnull() and isna() work in the same way; isnull() is an alias of isna().


4. notnull() method – Checking NaN values

• notnull() is a predefined DataFrame method.
• It is called on a DataFrame object.
• It checks which values in a DataFrame are present (not missing).
• It returns False where a value is missing and True otherwise, the inverse of isna().
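The sketch below, using hypothetical in-memory data, shows that notnull() is the inverse of isna() and how the resulting mask can be used to keep only the rows where a particular column has a value.

```python
import numpy as np
import pandas as pd

# Hypothetical data with one missing price.
df = pd.DataFrame({
    "fruit": ["apple", "banana", "cherry"],
    "price": [120.0, np.nan, 300.0],
})

# True where a value is present, False where it is missing.
present = df.notnull()
print(present)

# notnull() is the element-wise negation of isna().
is_inverse = present.equals(~df.isna())

# Boolean indexing: keep only the rows whose price is present.
priced = df[df["price"].notnull()]
print(priced)
```

notna() is another name for the same method and can be used interchangeably.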

Program: notnull() method
Name: demo4.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")
df2 = df1.notnull()

print(df1.head())
print()
print(df2.head())


5. Counting NaN values column-wise

• The number of missing values in a DataFrame can be counted.
• Chaining isna() and sum() gives the number of missing values in each column.
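The counting works because True is treated as 1 and False as 0 when summed. The sketch below uses hypothetical in-memory data to show the per-column count together with a few related totals; the variable names are illustrative.

```python
import numpy as np
import pandas as pd

# Hypothetical data: two missing prices and one missing quantity.
df = pd.DataFrame({
    "fruit": ["apple", "banana", "cherry", "mango"],
    "price": [120.0, np.nan, 300.0, np.nan],
    "quantity": [10.0, 25.0, np.nan, 40.0],
})

# Missing values per column.
per_column = df.isna().sum()

# Missing values in the whole DataFrame.
total = int(df.isna().sum().sum())

# Number of rows that have at least one missing value.
rows_with_missing = int(df.isna().any(axis=1).sum())

# Percentage of missing values per column: the mean of a Boolean mask
# is the fraction of True values.
percent = df.isna().mean() * 100

print(per_column)
print(total, rows_with_missing)
print(percent)
```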

Program: Counting the missing values in each column
Name: demo5.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv('fruits1.csv')
s = df1.isna().sum()

print(s)


Program: Counting the missing values in each column as a percentage
Name: demo6.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv('fruits1.csv')
s = df1.isna().sum()
per = (s * 100) / len(df1)

print(per)


6. dropna() method – Handling missing values

• dropna() is a predefined DataFrame method.
• It is called on a DataFrame object.
• By default, it returns a new DataFrame without the rows in which at least one value is missing; the original DataFrame is unchanged.
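Dropping every row with any missing value can discard a lot of data. As a sketch of the default behaviour and of some less aggressive alternatives, the example below uses the how, subset and thresh parameters of dropna() on hypothetical in-memory data.

```python
import numpy as np
import pandas as pd

# Hypothetical data: row 0 is complete, row 2 is entirely missing.
df = pd.DataFrame({
    "fruit": ["apple", "banana", None, "mango"],
    "price": [120.0, np.nan, np.nan, 80.0],
    "quantity": [10.0, 25.0, np.nan, np.nan],
})

# Default (how="any"): drop rows with at least one missing value.
any_missing = df.dropna()

# how="all": drop only the rows in which every value is missing.
all_missing = df.dropna(how="all")

# subset: look for missing values only in the listed columns.
need_price = df.dropna(subset=["price"])

# thresh: keep rows that have at least this many non-missing values.
at_least_two = df.dropna(thresh=2)

print(any_missing)
print(all_missing)
print(need_price)
print(at_least_two)
```

In each case the original df is left untouched and the surviving rows keep their original index labels.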

Program: Dropping rows that contain NaN values
Name: demo7.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")
df2 = df1.dropna()

print(df2)


Program: Dropping rows that contain NaN values and counting what remains
Name: demo8.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")
df2 = df1.dropna()
s = df2.isna().sum()

print(s)


Program: Converting float columns to the int data type
Name: demo9.py
Input file: fruits1.csv

import pandas as pd

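df1 = pd.read_csv('fruits1.csv')
# Numeric columns that contain NaN are loaded as float; drop those rows first
df2 = df1.dropna()
# astype(int) converts every column, so all columns must be numeric
df3 = df2.astype(int)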

print(df2.head())
print()
print(df3.head())


7. dropna(inplace = True) method – Handling missing values

• inplace is a parameter of the DataFrame dropna() method.
• It is called on a DataFrame object as dropna(inplace = True).
• With inplace = True, the rows with missing values are dropped from the existing DataFrame itself, and the method returns None instead of a new DataFrame.
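A common mistake is to assign the result of an in-place call back to the variable, which replaces the DataFrame with None. The sketch below, on hypothetical in-memory data, shows what the in-place call returns and the equivalent reassignment form.

```python
import numpy as np
import pandas as pd

# Hypothetical data with one missing price.
df = pd.DataFrame({
    "fruit": ["apple", "banana", "cherry"],
    "price": [120.0, np.nan, 300.0],
})

# The in-place call changes df itself and returns None,
# so writing df = df.dropna(inplace=True) would lose the data.
result = df.dropna(inplace=True)
print(result)
print(df)

# Equivalent form without inplace: reassign the returned DataFrame.
df2 = pd.DataFrame({
    "fruit": ["apple", "banana", "cherry"],
    "price": [120.0, np.nan, 300.0],
})
df2 = df2.dropna()
print(df2.equals(df))
```

Both forms leave the same two rows, and both keep the original index labels 0 and 2.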

Program: Dropping NaN values by using the inplace parameter
Name: demo10.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")
df1.dropna(inplace = True)

print(df1)


8. fillna() method – Handling missing values

• fillna() is a predefined DataFrame method.
• It is called on a DataFrame object.
• It fills missing (NaN) values with a specified value.
  o fillna(0) -> fills every NaN with zero
  o fillna(number) -> fills every NaN with the given number
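A single number passed to fillna() is used for every column, which is rarely right when columns mean different things. As a sketch of one alternative, fillna() also accepts a dictionary that maps column names to fill values. The data and the choice of fill values below are illustrative assumptions.

```python
import numpy as np
import pandas as pd

# Hypothetical data with one missing age and one missing salary.
df = pd.DataFrame({
    "Name": ["Rajan", "Venkat", "Veeru"],
    "Age": [26.0, np.nan, 45.0],
    "Salary": [40000.0, 45000.0, np.nan],
})

# One value for every column: both the age and the salary become 0.
filled_all = df.fillna(0)

# A different value per column: the median age, and 0 for the salary.
filled_per_col = df.fillna({"Age": df["Age"].median(), "Salary": 0})

print(filled_all)
print(filled_per_col)

# fillna() returns a new DataFrame; df still has its missing values.
print(df.isna().sum())
```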

Program: Filling NaN values with zero
Name: demo11.py
Input file: fruits1.csv

import pandas as pd

df1 = pd.read_csv("fruits1.csv")
df2 = df1.fillna(0)

print(df1.head())
print()
print(df2.head())


Program: Filling NaN values with a specific value
Name: demo12.py

import pandas as pd
import numpy as np

data = [
["Rajan", 26, 40000],
["Daniel", 16, 20000],
["Veeru", 45, 90000],
["Venkat", np.nan, 45000],
["Sumanth", 20, 95000],
["Shafi", np.nan, 97000]
]

df1 = pd.DataFrame(data, columns = ['Name', 'Age', 'Salary'])

df2 = df1.fillna(22)

print(df1)
print()
print(df2)


Program: Filling NaN values with the mean value
Name: demo13.py

import pandas as pd
import numpy as np

data = [
["Shahid", 26, 40000],
["Daniel", 16, 20000],
["Karteek", np.nan, 90000],
["Venkat", np.nan, 45000],
["Veeru", 24, 95000],
["Shafi", np.nan, 97000]
]

df1 = pd.DataFrame(data, columns = ['Name', 'Age', 'Salary'])

print(df1)
m = df1['Age'].mean()
df1['Age'] = df1['Age'].fillna(m)
print()
print(df1)


Program: Creating a DataFrame and replacing NaN values with a specific value
Name: demo14.py

import pandas as pd
import numpy as np

data = [
['Shahid', np.nan, 40000],
['Daniel', 16, 20000],
['Veeru', 45, 90000],
['Sumanth', 20, 95000]
]

df1 = pd.DataFrame(data, columns = ['Name', 'Age', 'Salary'])

print(df1)

df2 = df1.replace(np.nan, 0)

print()
print(df2)

