Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging

Present study introduces a machine-based approach for Word Sense Disambiguation (WSD). In Persian, a morphologically complex language, lots of homographs are made; one way for doing WSD is allocating the right Part Of Speech (POS) tags to words, prior to WSD. Since the frequency of noun and adjectiv...

Full description

Bibliographic Details
Main Author:	Elham Alayiaboozar
Format:	Article
Language:	fas
Published:	Iranian Research Institute for Information and Technology 2018-06-01
Series:	Iranian Journal of Information Processing & Management
Subjects:	homographs Word Sense Disambiguation Part Of Speech tagging disambiguation of Persian nouns and adjective homographs Lesk algorithm
Online Access:	http://jipm.irandoc.ac.ir/browse.php?a_code=A-10-3228-2&slc_lang=en&sid=1

id	doaj-7b00f74ad1a742b3be9ce35f5e2eeb0f
record_format	Article
spelling	doaj-7b00f74ad1a742b3be9ce35f5e2eeb0f2020-11-24T21:46:26ZfasIranian Research Institute for Information and TechnologyIranian Journal of Information Processing & Management2251-82232251-82312018-06-0133311651182Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech taggingElham Alayiaboozar0 Iranian Research Institute for Information Science and Technology(IranDoc) Present study introduces a machine-based approach for Word Sense Disambiguation (WSD). In Persian, a morphologically complex language, lots of homographs are made; one way for doing WSD is allocating the right Part Of Speech (POS) tags to words, prior to WSD. Since the frequency of noun and adjective homographs in different Persian text corpuses is high, POS disambiguation of such homographs seems to be necessary for WSD. This paper introduces an approach in which first POS tagging is done, then the output, which is tagged sentences, enters the next step which is POS disambiguation of Persian nouns and adjective homographs; then the output of this step enters the final step which is applying the Lesk algorithm(a kind of unsupervised learning) for WSD. The proposed approach speeds up the WSD procedure by filtering the only relevant glosses (exist in dictionary) and increases the accuracy of the WSD procedure as well.http://jipm.irandoc.ac.ir/browse.php?a_code=A-10-3228-2&slc_lang=en&sid=1homographs Word Sense Disambiguation Part Of Speech tagging disambiguation of Persian nouns and adjective homographs Lesk algorithm
collection	DOAJ
language	fas
format	Article
sources	DOAJ
author	Elham Alayiaboozar
spellingShingle	Elham Alayiaboozar Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging Iranian Journal of Information Processing & Management homographs Word Sense Disambiguation Part Of Speech tagging disambiguation of Persian nouns and adjective homographs Lesk algorithm
author_facet	Elham Alayiaboozar
author_sort	Elham Alayiaboozar
title	Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging
title_short	Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging
title_full	Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging
title_fullStr	Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging
title_full_unstemmed	Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging
title_sort	introducing a machine-based approach for word sense disambiguation: using lesk algorithm and part of speech tagging
publisher	Iranian Research Institute for Information and Technology
series	Iranian Journal of Information Processing & Management
issn	2251-8223 2251-8231
publishDate	2018-06-01
description	Present study introduces a machine-based approach for Word Sense Disambiguation (WSD). In Persian, a morphologically complex language, lots of homographs are made; one way for doing WSD is allocating the right Part Of Speech (POS) tags to words, prior to WSD. Since the frequency of noun and adjective homographs in different Persian text corpuses is high, POS disambiguation of such homographs seems to be necessary for WSD. This paper introduces an approach in which first POS tagging is done, then the output, which is tagged sentences, enters the next step which is POS disambiguation of Persian nouns and adjective homographs; then the output of this step enters the final step which is applying the Lesk algorithm(a kind of unsupervised learning) for WSD. The proposed approach speeds up the WSD procedure by filtering the only relevant glosses (exist in dictionary) and increases the accuracy of the WSD procedure as well.
topic	homographs Word Sense Disambiguation Part Of Speech tagging disambiguation of Persian nouns and adjective homographs Lesk algorithm
url	http://jipm.irandoc.ac.ir/browse.php?a_code=A-10-3228-2&slc_lang=en&sid=1
work_keys_str_mv	AT elhamalayiaboozar introducingamachinebasedapproachforwordsensedisambiguationusingleskalgorithmandpartofspeechtagging
_version_	1725902142296817664

Introducing a machine-based approach for Word Sense Disambiguation: using Lesk algorithm and Part Of Speech tagging

Similar Items