Practical NO4abol

The document outlines a practical exercise for data cleansing using Excel and Weka tools. It details steps for identifying and handling missing or incorrect data, removing duplicates, and replacing zero values with averages in Excel, as well as loading datasets and managing missing values in Weka. The aim is to apply data cleansing techniques on two datasets effectively.

Uploaded by

shrutimanval104
Practical No. 4
Name: Aboli Pramod Korale

Enroll No.: 2306099

Div: H2

Aim: Apply data cleansing on any two datasets.

(Excel)

Step 1: Identify Missing or Incorrect Data.

Step 2: Replace or Remove Missing Values


• Press Ctrl+H to open the "Find and Replace" window.
• Replace each ? with "Unknown". In the "Find what" box, type ~? because a bare ? is a wildcard that matches any single character.
• Replace the blank cells with N/A.
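
The same two replacements can be sketched outside Excel. This is only a minimal Python illustration, assuming the sheet has already been read into a list of rows and that a cell counts as missing only when its whole content is ? or empty. The sample rows and the helper name fill_missing are hypothetical and not part of the practical.

```python
def fill_missing(rows, unknown_marker="?", unknown_text="Unknown", blank_text="N/A"):
    """Return a copy of rows with '?' cells and blank cells replaced."""
    cleaned = []
    for row in rows:
        new_row = []
        for cell in row:
            text = cell.strip()
            if text == unknown_marker:
                # Whole-cell '?' is treated as an unknown value.
                new_row.append(unknown_text)
            elif text == "":
                # Empty (or whitespace-only) cells are treated as blank.
                new_row.append(blank_text)
            else:
                new_row.append(cell)
        cleaned.append(new_row)
    return cleaned


# Hypothetical sample data standing in for the worksheet.
sample_rows = [
    ["Asha", "?", "52000"],
    ["Ravi", "Pune", ""],
]
cleaned_rows = fill_missing(sample_rows)
```

Returning a new list leaves the original rows untouched, so the result can be compared with the input before it is saved.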

Step 3: Go to the Data tab and click on Remove Duplicates


• Go to the Data tab and select "Remove Duplicates" to eliminate identical entries.
• To remove duplicates based on all columns, choose "Select All" so that every column is checked, then click "OK".
• Preview the results.
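
As a rough illustration of what removing duplicates across all columns means, the sketch below keeps the first occurrence of each row and drops later identical rows. It assumes rows are lists of strings and compares cell text exactly. The sample data and the function name remove_duplicates are hypothetical.

```python
def remove_duplicates(rows):
    """Keep the first occurrence of each row, comparing every column."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row)  # lists are not hashable, tuples are
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical sample data: the third row repeats the first.
sample_rows = [
    ["Asha", "Pune", "52000"],
    ["Ravi", "Mumbai", "48000"],
    ["Asha", "Pune", "52000"],
]
unique_rows = remove_duplicates(sample_rows)
```

A row is dropped only when every column matches an earlier row, which corresponds to checking all columns in the dialog.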

Step 4: Replace 0 with the average value.


• Click on an empty cell and enter the formula:
=AVERAGEIF(Range,Criteria,Average_range)
=AVERAGEIF(E2:E10,"<>0",E2:E10)
• Replace each 0 in the column with the average returned by the formula.
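
The calculation behind this step can be sketched in a few lines of Python: average only the non-zero entries, as the "<>0" criterion does, then substitute that average for each zero. The sample values and the function name replace_zeros_with_average are hypothetical.

```python
def replace_zeros_with_average(values):
    """Replace each 0 with the mean of the non-zero values."""
    non_zero = [v for v in values if v != 0]
    if not non_zero:
        # Nothing to average; return the values unchanged.
        return list(values)
    average = sum(non_zero) / len(non_zero)
    return [average if v == 0 else v for v in values]


# Hypothetical column values standing in for E2:E10.
sample_values = [10, 0, 20, 30, 0]
filled_values = replace_zeros_with_average(sample_values)
```

Excluding the zeros from the average matters: including them would pull the replacement value down toward 0.
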
Step 5: Final Result.

(Weka Tool)

Step 1: Load the Dataset into Weka

• Open Weka.
• Click on Explorer.
• Click "Open file", then select your dataset.
• Weka will display all attributes (columns) in the Preprocess tab.
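
To get a feel for what the Preprocess tab reports, the sketch below lists the attributes of a dataset and counts the missing values per attribute. It assumes the dataset is a simple ARFF file, where ? marks a missing value, with no quoted values containing commas and no spaces in attribute names. The sample text and the function name summarize_arff are hypothetical, and this is not how Weka itself is implemented.

```python
def summarize_arff(text):
    """Map each attribute name to its count of '?' (missing) values."""
    names = []
    missing = []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and ARFF comments
        lower = line.lower()
        if in_data:
            cells = [c.strip() for c in line.split(",")]
            for i, cell in enumerate(cells[:len(names)]):
                if cell == "?":
                    missing[i] += 1
        elif lower.startswith("@attribute"):
            names.append(line.split()[1])
            missing.append(0)
        elif lower.startswith("@data"):
            in_data = True
    return dict(zip(names, missing))


# Hypothetical ARFF content standing in for the loaded dataset.
sample_arff = """@relation people
@attribute age numeric
@attribute city {Pune,Mumbai}
@data
25,Pune
?,Mumbai
31,?
"""
summary = summarize_arff(sample_arff)
```

Attributes with a non-zero count are the ones that need attention in the next step.
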
Step 2: Handle Missing Values

• Use the Edit button to enter the missing values manually.

Step 3: Select the attribute, open its options menu, and choose one of the options.

Step 4: Preview result.
