Basic Data Cleaning With Microsoft Excel v1.1
Basic Data Cleaning With Microsoft Excel v1.1
1. With the help of Filter feature, you can easily get rid of the blank cells from any
column.
2. The Filter option is available in the Home tab.
3. So, you need to navigate as go the Home tab→ Editing group → Sort & Filter →
Filter.
Select Filter.
Fill or Delete
4. Once you have filtered for blanks, you can choose to delete the rows containing
blanks or fill the blanks with appropriate values.
Let's say the missing values for the following cells are as follows:
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 1
You may click on the desired cell and fill in the missing values.
5. If the missing values are unknown, you can right-click on the selected rows and
choose Delete Row to remove the entire rows.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 2
Go To Special
1. With the help of Go To feature, you also can easily select blank cells from any
column.
2. Select the table or column or row where you need to select the blank cells.
3. Press F5 or Ctrl + G to display the Go To dialog box.
4. Click on the Special button and then press OK.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 3
Select Blanks button
Press OK button
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 4
#Issue 2: Inaccurate Data
Spelling Checkers
1. Many software applications, including word processors and spreadsheet tools like
3. While spell checkers are effective at catching common spelling errors, they have
limitations and may not identify all types of mistakes. This is especially true for
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 5
Two more misspellings were not detected
by the spell checker.
4. It is generally a good practice to review or manually inspect the text after using a
spell checker.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 6
#Issue 3: Inconsistent Data
1. Ensure that data such as date and currency formats are consistent within each
column.
2. Look out for variations such as different date formats (MM/DD/YYYY,
DD/MM/YYYY) or currency symbols ($, €, £, RM).
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 7
#Issue 4: Duplicate Data
Remove Duplicate
1. Microsoft Excel has a built-in feature to remove duplicate values from tables called
Remove Duplicate.
2. This tool is found under the Data tab in the menu bar.
3. To use this tool, firstly you need to select either row, column, or table from where
we need to remove the duplicate data.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 8
In the Remove Duplicates dialog box, check the
box My data has headers if your data includes
headers.
Press OK to proceed.
After removing duplicates in Excel, you will get a message showing how
many duplicates were removed and how many unique values are left.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 9
#Issue 5: Unstructured Data
Text to Columns
1. With the help of Excel's Text to Columns feature, you can separate text into
3. The first option, which this tutorial will focus on, is to separate the text using a
delimiter. A delimiter is a specific character that marks the boundaries
4. The second method for Text to Columns involves using a predefined fixed width to
separate text into adjacent columns.
5. To access the Text to Columns feature, go to the Data tab ➞ Data Tools group →
Text to Columns.
Select Data.
Go to the Data tab, find the Data Tools
group, and click on Text to Columns.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 10
Select Destination cell.
6. After the Text to Columns operation is complete, you may need to adjust the column
width to ensure that the data is displayed correctly.
7. Examine the headers of the newly created columns to ensure they reflect the
content accurately. If needed, modify, or add headers to provide clarity.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 11
After modifying process
8. Carefully review the entire worksheet to ensure the split data is accurate and well-
organized.
9. Save your work.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 12
A messy dataset with spaces
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 13
TRIM
1. With the help of the TRIM function in Excel, you can remove all unnecessary
spaces from text, including leading and trailing spaces, except for single spaces
between words.
2. The TRIM function cleans up text by removing unnecessary spaces, including spaces
at the beginning (leading) and end (trailing), but keeps single spaces between
words. This helps make the data more readable and easier to work with.
3. The syntax of the TRIM function is:
=TRIM(text)
or
=TRIM(cell)
The syntax parameters are as follows:
text: Text refers to any text or string from which you need to remove leading,
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 14
CLEAN
1. With the help of the CLEAN function in Excel, we can remove line breaks and all
characters from a string, making it easier to read and work with the data.
4. The syntax of the TRIM function is:
=CLEAN(text)
or
=CLEAN(cell)
The syntax parameters are as follows:
text: Text refers to any text or string from which you need to remove line breaks
and all non-printable characters from a string.
cell: Cell refers to a specific location (cell reference or cell address) where you
need to remove line breaks and all non-printable characters from a string.
shown or printed.
6. Common examples of no-nprintable characters include:
• Line Breaks: Characters that indicate the end of a line and the beginning of
a new one. They are necessary for formatting text but are not visible on the
printed page.
• Tab Characters: Used to create horizontal spacing or indentation in a
document.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 15
• Control Characters: Various characters with ASCII values below 32 (such as
control characters like ESC, BEL, etc.), which are used for controlling hardware
devices or communication protocols.
Basic Data Cleaning with Microsoft Excel Department of ICT, CFS IIUM 16