Personal named entity linking based on simple partial tree matching and context free grammar
Personal name disambiguation is the task of linking a personal name to a unique comparable entry in the real world, also known as named entity linking (NEL). Algorithms for NEL consist of three main components: extractor, searcher, and disambiguator. Existing approaches for NEL use exact-matched loo...
Main Author: | |
---|---|
Other Authors: | |
Published: |
Heriot-Watt University
2017
|
Online Access: | https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.739333 |
id |
ndltd-bl.uk-oai-ethos.bl.uk-739333 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-bl.uk-oai-ethos.bl.uk-7393332019-01-08T03:26:30ZPersonal named entity linking based on simple partial tree matching and context free grammarBuatongkue, SirisudaGeorgieva, Lilia2017Personal name disambiguation is the task of linking a personal name to a unique comparable entry in the real world, also known as named entity linking (NEL). Algorithms for NEL consist of three main components: extractor, searcher, and disambiguator. Existing approaches for NEL use exact-matched look-up over the surface form to generate a set of candidate entities in each of the mentioned names. The exact-matched look-up is wholly inadequate to generate a candidate entity due to the fact that the personal names within a web page lack uniform representation. In addition, the performance of a disambiguator in ranking candidate entities is limited by context similarity. Context similarity is an inflexible feature for personal disambiguation because natural language is highly variable. We propose a new approach that can be used to both identify and disambiguate personal names mentioned on a web page. Our NEL algorithm uses: as an extractor: a control flow graph; AlchemyAPI, as a searcher: Personal Name Transformation Modules (PNTM) based on Context Free Grammar and the Jaro-Winkler text similarity metric and as a disambiguator: the entity coherence method: the Occupation Architecture for Personal Name Disambiguation (OAPnDis), personal name concepts and Simple Partial Tree Matching (SPTM). Experimental results, evaluated on real-world data sets, show that the accuracy of our NEL is 92%, which is higher than the accuracy of previously used methods.Heriot-Watt Universityhttps://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.739333http://hdl.handle.net/10399/3265Electronic Thesis or Dissertation |
collection |
NDLTD |
sources |
NDLTD |
description |
Personal name disambiguation is the task of linking a personal name to a unique comparable entry in the real world, also known as named entity linking (NEL). Algorithms for NEL consist of three main components: extractor, searcher, and disambiguator. Existing approaches for NEL use exact-matched look-up over the surface form to generate a set of candidate entities in each of the mentioned names. The exact-matched look-up is wholly inadequate to generate a candidate entity due to the fact that the personal names within a web page lack uniform representation. In addition, the performance of a disambiguator in ranking candidate entities is limited by context similarity. Context similarity is an inflexible feature for personal disambiguation because natural language is highly variable. We propose a new approach that can be used to both identify and disambiguate personal names mentioned on a web page. Our NEL algorithm uses: as an extractor: a control flow graph; AlchemyAPI, as a searcher: Personal Name Transformation Modules (PNTM) based on Context Free Grammar and the Jaro-Winkler text similarity metric and as a disambiguator: the entity coherence method: the Occupation Architecture for Personal Name Disambiguation (OAPnDis), personal name concepts and Simple Partial Tree Matching (SPTM). Experimental results, evaluated on real-world data sets, show that the accuracy of our NEL is 92%, which is higher than the accuracy of previously used methods. |
author2 |
Georgieva, Lilia |
author_facet |
Georgieva, Lilia Buatongkue, Sirisuda |
author |
Buatongkue, Sirisuda |
spellingShingle |
Buatongkue, Sirisuda Personal named entity linking based on simple partial tree matching and context free grammar |
author_sort |
Buatongkue, Sirisuda |
title |
Personal named entity linking based on simple partial tree matching and context free grammar |
title_short |
Personal named entity linking based on simple partial tree matching and context free grammar |
title_full |
Personal named entity linking based on simple partial tree matching and context free grammar |
title_fullStr |
Personal named entity linking based on simple partial tree matching and context free grammar |
title_full_unstemmed |
Personal named entity linking based on simple partial tree matching and context free grammar |
title_sort |
personal named entity linking based on simple partial tree matching and context free grammar |
publisher |
Heriot-Watt University |
publishDate |
2017 |
url |
https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.739333 |
work_keys_str_mv |
AT buatongkuesirisuda personalnamedentitylinkingbasedonsimplepartialtreematchingandcontextfreegrammar |
_version_ |
1718807586163130368 |