Improving database quality through eliminating duplicate records
Redundant or duplicate data are among the most troublesome problems in database management and applications. Approximate field matching is a key approach to resolving this problem: it identifies semantically equivalent string values that appear in syntactically different representations. This paper considers token-base...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: | Ubiquity Press, 2006-11-01 |
Series: | Data Science Journal |
Subjects: | |
Online Access: | http://datascience.codata.org/articles/475 |