New approaches to the ontology alignment and identity resolution problems
This paper describes approaches to the vocabulary normalization and identity resolution problems arising during the use of the LOD datasets to populate the content of scholarly knowledge bases. We have proposed new heuristics, using additional information extracted from full text sources of data. Th...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova
2014-11-01
|
Series: | Computer Science Journal of Moldova |
Subjects: | |
Online Access: | http://www.math.md/files/csjm/v22-n3/v22-n3-(pp405-422).pdf |
Summary: | This paper describes approaches to the vocabulary normalization and identity resolution problems arising during the use of the LOD datasets to populate the content of scholarly knowledge bases. We have proposed new heuristics, using additional information extracted from full text sources of data. The first heuristics uses the full record track of a person and the second one uses self-citation networks. The dataset of the Open Archive of the Russian Academy of Sciences and several bibliographic datasets are used as test
examples. |
---|---|
ISSN: | 1561-4042 |