Data De-Duplication through Active Learning
Data de-duplication concerns the identification and eventual elimination of records, in a particular dataset, that refer to the same entity without necessarily having the same attribute values, nor the same identifying values. Machine Learning techniques have been used to handle data de-duplication....
Main Author: | Muhivuwomunda, Divine |
---|---|
Format: | Others |
Language: | en |
Published: |
University of Ottawa (Canada)
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10393/28859 http://dx.doi.org/10.20381/ruor-19478 |
Similar Items
-
Space and time scalability of duplicate detection in graph data
by: Herschel, Melanie, et al.
Published: (2008) -
Active duplicate detection with Bayesian nonparametric models
by: Matsakis, Nicholas E. (Nicholas Elias), 1976-
Published: (2010) -
Cloud De-Duplication Cost Model
by: Hocker, Christopher
Published: (2012) -
Learning from multirelational data through multiple views
by: Guo, Hongyu
Published: (2013) -
Visualizing and Understanding Code Duplication in Large Software Systems
by: Jiang, Zhen Ming
Published: (2006)