New approaches to the ontology alignment and identity resolution problems

Bibliographic Details
Main Authors: Zinaida Apanovich, Alexander Marchuk
Format: Article
Language: English
Published: Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova 2014-11-01
Series: Computer Science Journal of Moldova
Subjects:
Online Access: http://www.math.md/files/csjm/v22-n3/v22-n3-(pp405-422).pdf
Description
Summary: This paper describes approaches to the vocabulary normalization and identity resolution problems that arise when Linked Open Data (LOD) datasets are used to populate scholarly knowledge bases. We propose new heuristics that use additional information extracted from full-text data sources. The first heuristic uses the full record track of a person, and the second uses self-citation networks. The dataset of the Open Archive of the Russian Academy of Sciences and several bibliographic datasets are used as test examples.
ISSN: 1561-4042