International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 76 - Number 7 |
Year of Publication: 2013 |
Authors: Kofi Adu-manu Sarpong, John Kingsley Arthur |
10.5120/13258-0736 |
Kofi Adu-manu Sarpong, John Kingsley Arthur . Analysis of Data Cleansing Approaches regarding Dirty Data – A Comparative Study. International Journal of Computer Applications. 76, 7 ( August 2013), 14-18. DOI=10.5120/13258-0736
Data Cleansing is an activity involving a process of detecting and correcting the errors and inconsistencies in data warehouse. It deals with identification of corrupt and duplicate data inherent in the data sets of a data warehouse to enhance the quality of data. The research was directed at investigating some existing approaches and frameworks to data cleansing. That attempted to solve the data cleansing problem and came up with their strengths and weaknesses which led to the identification of gabs in those frameworks and approaches. A comparative analysis of the four frameworks was conducted and by using standard testing parameters a proposed feature was discussed to fit in the gaps.