International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 108 - Number 6 |
Year of Publication: 2014 |
Authors: Warke Yamini, Arti Mohanpurkar |
10.5120/18916-0243 |
Warke Yamini, Arti Mohanpurkar . Review on Record LINKAGE and Deduplication based on Suffix Array Indexing. International Journal of Computer Applications. 108, 6 ( December 2014), 28-30. DOI=10.5120/18916-0243
Record linkage is a momentous process in data soundness which is used in combining, matching and duplicate removal from more than two databases that refer to the same entities. Deduplication is the process of taking off duplicate records in a united database. Now a day, data cleaning and standardization becomes a pompous process. Due to yielding capacity of today's database, discovering matching records in united database is a crucial one. Indexing technique specifically suffix array is used to efficiently implement record linkage and deduplication.