Data Decontamination: Challenges and Current Approaches

Volume 12  Issue 2    2018

Download

Author(s): Zubair Afridi
Abstract Data cleaning is especially required while organizing heterogeneous data sources and should be tended to together with example related data changes. In data circulation focuses, data cleaning is a critical part of the asserted ETL process. In this paper, the author discuss current instrument support for data Cleansing data from dirtying impacts is an essential bit of data taking care of and upkeep cleaning. This has lead to the change of a wide extent of strategies intending to update the precision and usability of existing data. This paper shows an investigation of data cleansing issues, approaches, and methodologies. The author arrange the diverse sorts of anomalies occurrence in data that must be wiped out, and describe the game plan of worth criteria that totally washed down data needs to perform. In light of this course of action, the author evaluate and differentiate existing techniques for data refining and respect to the sorts of inconsistencies dealt with and wiped out by them. In like of manner depict , when all is said in done the assorted steps in data filtering and show the methods of used inside the cleansing strategy and give a viewpoint to research headings that supplement the momentum structures.
Keywords Data cleaning, ETL, process, strategies.
Year 2018
Volume 12
Issue 2
Type Research paper, manuscript, article
Journal Name Journal of Information & Communication Technology
Publisher Name ILMA University
Jel Classification -
DOI -
ISSN no (E, Electronic) 2075-7239
ISSN no (P, Print) 2415-0169
Country Pakistan
City Karachi
Institution Type University
Journal Type Open Access
Manuscript Processing Blind Peer Reviewed
Format PDF
Paper Link https://jict.ilmauniversity.edu.pk/journal/jict/12.2/2.pdf
Page 4-12