Detecting proper nouns in indonesian-language translation of the quran using a guided method

Proper nouns (often abbreviated PN or NNP) are a class of words important in labelling and subsequent text processing, especially in natural language processing (NLP). Name entity recognition (NER) is one study that requires PN. The lack of labeled data for Indonesian text, especially the PN label,...

Full description

Bibliographic Details
Main Authors: Suwanto Raharjo, Retantyo Wardoyo, Agfianto Eko Putra
Format: Article
Language:English
Published: Elsevier 2020-06-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157818300971
id doaj-7915d502a14c41029c8f188b71b53a83
record_format Article
spelling doaj-7915d502a14c41029c8f188b71b53a832020-11-25T03:16:55ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782020-06-01325583591Detecting proper nouns in indonesian-language translation of the quran using a guided methodSuwanto Raharjo0Retantyo Wardoyo1Agfianto Eko Putra2Doctoral Program of Computer Science, Department of Computer Science and Electronics, Universitas Gadjah Mada, Yogyakarta, Indonesia; Corresponding author.Department of Computer Science and Electronics, Universitas Gadjah Mada, Yogyakarta, IndonesiaDepartment of Computer Science and Electronics, Universitas Gadjah Mada, Yogyakarta, IndonesiaProper nouns (often abbreviated PN or NNP) are a class of words important in labelling and subsequent text processing, especially in natural language processing (NLP). Name entity recognition (NER) is one study that requires PN. The lack of labeled data for Indonesian text, especially the PN label, may be attributed for the lack of NER research in the Indonesian language. This study aims to detect PN in Indonesian-language translations of the Quran guided by deriving location information from the Quran as its source text. In the Indonesian language, PN are written using initial capital letters, which are used to determine and guide PN location. This article proposes that PN in Indonesian-language translations of the Quran can be determined based on PoS Information of Quranic text by developing a certain production rule. The results of this research showed that the proposed method has promising results for further research.http://www.sciencedirect.com/science/article/pii/S1319157818300971Part of speechTaggingProper nounName entityRecognitionIndonesian
collection DOAJ
language English
format Article
sources DOAJ
author Suwanto Raharjo
Retantyo Wardoyo
Agfianto Eko Putra
spellingShingle Suwanto Raharjo
Retantyo Wardoyo
Agfianto Eko Putra
Detecting proper nouns in indonesian-language translation of the quran using a guided method
Journal of King Saud University: Computer and Information Sciences
Part of speech
Tagging
Proper noun
Name entity
Recognition
Indonesian
author_facet Suwanto Raharjo
Retantyo Wardoyo
Agfianto Eko Putra
author_sort Suwanto Raharjo
title Detecting proper nouns in indonesian-language translation of the quran using a guided method
title_short Detecting proper nouns in indonesian-language translation of the quran using a guided method
title_full Detecting proper nouns in indonesian-language translation of the quran using a guided method
title_fullStr Detecting proper nouns in indonesian-language translation of the quran using a guided method
title_full_unstemmed Detecting proper nouns in indonesian-language translation of the quran using a guided method
title_sort detecting proper nouns in indonesian-language translation of the quran using a guided method
publisher Elsevier
series Journal of King Saud University: Computer and Information Sciences
issn 1319-1578
publishDate 2020-06-01
description Proper nouns (often abbreviated PN or NNP) are a class of words important in labelling and subsequent text processing, especially in natural language processing (NLP). Name entity recognition (NER) is one study that requires PN. The lack of labeled data for Indonesian text, especially the PN label, may be attributed for the lack of NER research in the Indonesian language. This study aims to detect PN in Indonesian-language translations of the Quran guided by deriving location information from the Quran as its source text. In the Indonesian language, PN are written using initial capital letters, which are used to determine and guide PN location. This article proposes that PN in Indonesian-language translations of the Quran can be determined based on PoS Information of Quranic text by developing a certain production rule. The results of this research showed that the proposed method has promising results for further research.
topic Part of speech
Tagging
Proper noun
Name entity
Recognition
Indonesian
url http://www.sciencedirect.com/science/article/pii/S1319157818300971
work_keys_str_mv AT suwantoraharjo detectingpropernounsinindonesianlanguagetranslationofthequranusingaguidedmethod
AT retantyowardoyo detectingpropernounsinindonesianlanguagetranslationofthequranusingaguidedmethod
AT agfiantoekoputra detectingpropernounsinindonesianlanguagetranslationofthequranusingaguidedmethod
_version_ 1724634222930427904