Methodology for fuzzy duplicate record identification based on the semantic-syntactic information of similarity
Several methodologies exist for identifying fuzzy duplicate records during data cleaning for data warehousing and data mining. They can be classified into three groups: blocking methods, windowing methods, and semantic methods. Th...
Main Authors: | Djulaga Hadzic, Nermin Sarajlic |
---|---|
Format: | Article |
Language: | English |
Published: | Elsevier 2020-01-01 |
Series: | Journal of King Saud University: Computer and Information Sciences |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1319157817304512 |
Similar Items
- Exploiting Syntactic and Semantic Information for Textual Similarity Estimation
  by: Jiajia Luo, et al.
  Published: (2021-01-01)
- Calculation of Sentence Semantic Similarity Based on Syntactic Structure
  by: Xiao Li, et al.
  Published: (2015-01-01)
- A Semantic and Syntactic Similarity Measure for Political Tweets
  by: Claire Little, et al.
  Published: (2020-01-01)
- Semantic similarity to high-frequency verbs affects syntactic frame selection
  by: Koenig, J.-P, et al.
  Published: (2019)
- SYNTACTIC/SEMANTIC ACCEPTABILITY AND SEMANTIC SIMILARITY OF ORAL READING ERRORS AS FUNCTIONS OF VARIATION IN ATTAINED COMPREHENSION
  by: Thomas, Keith John, 1943-
  Published: (1975)