Towards an end-to-end human-centric data cleaning framework
Data Cleaning refers to the process of detecting and fixing errors in the data. Human
involvement is instrumental at several stages of this process such as providing rules or
validating computed repairs. There is a plethora of data cleaning algorithms addressing a
wide range of data errors (eg, detecting duplicates, violations of integrity constraints, and
missing values). Many of these algorithms involve a human in the loop, however, this latter is
usually coupled to the underlying cleaning algorithms. In a real data cleaning pipeline …
involvement is instrumental at several stages of this process such as providing rules or
validating computed repairs. There is a plethora of data cleaning algorithms addressing a
wide range of data errors (eg, detecting duplicates, violations of integrity constraints, and
missing values). Many of these algorithms involve a human in the loop, however, this latter is
usually coupled to the underlying cleaning algorithms. In a real data cleaning pipeline …
Towards an end-to-end human-centric data cleaning framework
M Ouzzani, AK Elmagarmid… - … on Human-In-the …, 2019 - researchportal.hbku.edu.qa
Data Cleaning refers to the process of detecting and fixing errors in the data. Human
involvement is instrumental at several stages of this process such as providing rules or
validating computed repairs. There is a plethora of data cleaning algorithms addressing a
wide range of data errors (eg, detecting duplicates, violations of integrity constraints, and
missing values). Many of these algorithms involve a human in the loop, however, this latter is
usually coupled to the underlying cleaning algorithms. In a real data cleaning pipeline …
involvement is instrumental at several stages of this process such as providing rules or
validating computed repairs. There is a plethora of data cleaning algorithms addressing a
wide range of data errors (eg, detecting duplicates, violations of integrity constraints, and
missing values). Many of these algorithms involve a human in the loop, however, this latter is
usually coupled to the underlying cleaning algorithms. In a real data cleaning pipeline …
Showing the best results for this search. See all results