Improving database quality through eliminating duplicate records

Redundant or duplicate data are among the most troublesome problems in database management and applications. Approximate field matching is a key technique for resolving the problem: it identifies string values that are semantically equivalent but syntactically different. This paper considers token-base...
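
The abstract is truncated in this record, so the paper's own algorithm is not shown here. As a generic illustration only, the sketch below shows one common way to match fields approximately by tokens: split each value into lowercase word tokens and compare the token sets with Jaccard similarity. The function names, the tokenization rule, and the 0.6 threshold are all assumptions made for this example and are not taken from the paper.

```python
import re


def tokens(value):
    """Lowercase a field value and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", value.lower()))


def jaccard(a, b):
    """Jaccard similarity of the token sets of two field values (0.0 to 1.0)."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        # Two empty values are treated as identical (an assumption).
        return 1.0
    return len(ta & tb) / len(ta | tb)


def is_probable_duplicate(a, b, threshold=0.6):
    """Flag two values as likely duplicates; the threshold is illustrative."""
    return jaccard(a, b) >= threshold


if __name__ == "__main__":
    # Same tokens in a different order and punctuation: similarity is 1.0.
    print(jaccard("Sung, Andrew H.", "Andrew H Sung"))
    # One extra token ("of") out of four distinct tokens: similarity is 0.75.
    print(jaccard("Data Science Journal", "Journal of Data Science"))
```

Because the comparison ignores token order and punctuation, values such as "Sung, Andrew H." and "Andrew H Sung" are treated as equivalent. Misspellings inside a token are not caught by this sketch and would need a character-level measure in addition.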


Bibliographic Details
Main Authors: Mingzhen Wei, Andrew H Sung, Martha E Cather
Format: Article
Language: English
Published: Ubiquity Press 2006-11-01
Series: Data Science Journal
Online Access: http://datascience.codata.org/articles/475