Detecting proper nouns in indonesian-language translation of the quran using a guided method

Proper nouns (often abbreviated PN or NNP) are a class of words important in labelling and subsequent text processing, especially in natural language processing (NLP). Name entity recognition (NER) is one study that requires PN. The lack of labeled data for Indonesian text, especially the PN label,...

Full description

Bibliographic Details
Main Authors: Suwanto Raharjo, Retantyo Wardoyo, Agfianto Eko Putra
Format: Article
Language:English
Published: Elsevier 2020-06-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157818300971
Description
Summary:Proper nouns (often abbreviated PN or NNP) are a class of words important in labelling and subsequent text processing, especially in natural language processing (NLP). Name entity recognition (NER) is one study that requires PN. The lack of labeled data for Indonesian text, especially the PN label, may be attributed for the lack of NER research in the Indonesian language. This study aims to detect PN in Indonesian-language translations of the Quran guided by deriving location information from the Quran as its source text. In the Indonesian language, PN are written using initial capital letters, which are used to determine and guide PN location. This article proposes that PN in Indonesian-language translations of the Quran can be determined based on PoS Information of Quranic text by developing a certain production rule. The results of this research showed that the proposed method has promising results for further research.
ISSN:1319-1578