Data Cleansing
Data Cleansing
The data cleansing process typically begins after data has been collected
from various sources, such as surveys, databases, sensors, or web
scraping. Before any cleaning takes place, the data is thoroughly
inspected. This inspection involves:
Errors and inconsistencies can arise from various sources, such as human
input errors, measurement inaccuracies, or system glitches. Data
cleansing involves:
4. Removing Duplicates:
5. Data Transformation:
7. Quality Assurance:
After cleansing the data, quality assurance checks are performed to verify
that the dataset now adheres to the defined data quality criteria. This
ensures that the data is ready for analysis, modeling, or other data-driven
tasks.