Personal named entity linking based on simple partial tree matching and context free grammar

Bibliographic Details
Main Author: Buatongkue, Sirisuda
Other Authors: Georgieva, Lilia
Published: Heriot-Watt University 2017
Online Access: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.739333
id ndltd-bl.uk-oai-ethos.bl.uk-739333
record_format oai_dc
datestamp 2019-01-08T03:26:30Z
identifier http://hdl.handle.net/10399/3265
type Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
description Personal name disambiguation, also known as named entity linking (NEL), is the task of linking a personal name to a unique, comparable entry in the real world. Algorithms for NEL consist of three main components: an extractor, a searcher, and a disambiguator. Existing approaches to NEL use exact-match look-up over the surface form to generate a set of candidate entities for each mentioned name. Exact-match look-up is inadequate for generating candidate entities because personal names within a web page lack a uniform representation. In addition, the performance of a disambiguator in ranking candidate entities is limited by context similarity, which is an inflexible feature for personal name disambiguation because natural language is highly variable. We propose a new approach that can be used to both identify and disambiguate personal names mentioned on a web page. Our NEL algorithm uses a control flow graph and AlchemyAPI as the extractor; Personal Name Transformation Modules (PNTM), based on a context-free grammar and the Jaro-Winkler text similarity metric, as the searcher; and, as the disambiguator, the entity coherence method: the Occupation Architecture for Personal Name Disambiguation (OAPnDis), personal name concepts, and Simple Partial Tree Matching (SPTM). Experimental results, evaluated on real-world data sets, show that the accuracy of our NEL is 92%, which is higher than the accuracy of previously used methods.
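The abstract names the Jaro-Winkler metric as part of the searcher but does not say how it is applied. The following is a minimal Python sketch of the standard Jaro-Winkler similarity, used here to rank candidate names that an exact-match look-up would miss. The function `rank_candidates`, the threshold of 0.9, and the sample names are illustrative assumptions, not taken from the thesis.

```python
def jaro(s1: str, s2: str) -> float:
    """Standard Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters count as matching only within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order // 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1: str, s2: str, prefix_scale: float = 0.1,
                 max_prefix: int = 4) -> float:
    """Jaro similarity boosted for a shared prefix of up to max_prefix chars."""
    j = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1, s2):
        if a != b or prefix == max_prefix:
            break
        prefix += 1
    return j + prefix * prefix_scale * (1 - j)


def rank_candidates(mention, candidates, threshold=0.9):
    """Hypothetical helper: keep candidates whose case-folded similarity to
    the mention reaches the (assumed) threshold, best match first."""
    scored = [(jaro_winkler(mention.lower(), c.lower()), c) for c in candidates]
    return sorted((p for p in scored if p[0] >= threshold), reverse=True)


if __name__ == "__main__":
    # "Jon Smith" has no exact match, yet "John Smith" is still retrieved.
    for score, name in rank_candidates(
            "Jon Smith", ["John Smith", "Jane Smyth", "Alan Turing"]):
        print(f"{name}: {score:.3f}")
```

A fuzzy metric of this kind tolerates small spelling differences between a mention and a knowledge-base entry, which is the weakness of exact-match look-up that the abstract describes.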