A Learning-Based Approach for Biomedical Word Sense Disambiguation

In the biomedical domain, word sense ambiguity is a widely spread problem with bioinformatics research effort devoted to it being not commensurate and allowing for more development. This paper presents and evaluates a learning-based approach for sense disambiguation within the biomedical domain. The...

Full description

Bibliographic Details
Main Authors: Hisham Al-Mubaid, Sandeep Gungu
Format: Article
Language:English
Published: Hindawi Limited 2012-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1100/2012/949247
id doaj-7b3b26c50b0f4e638d9ea717a5266916
record_format Article
spelling doaj-7b3b26c50b0f4e638d9ea717a52669162020-11-24T21:04:44ZengHindawi LimitedThe Scientific World Journal1537-744X2012-01-01201210.1100/2012/949247949247A Learning-Based Approach for Biomedical Word Sense DisambiguationHisham Al-Mubaid0Sandeep Gungu1University of Houston-Clear Lake, Houston, TX 77058, USAUniversity of Houston-Clear Lake, Houston, TX 77058, USAIn the biomedical domain, word sense ambiguity is a widely spread problem with bioinformatics research effort devoted to it being not commensurate and allowing for more development. This paper presents and evaluates a learning-based approach for sense disambiguation within the biomedical domain. The main limitation with supervised methods is the need for a corpus of manually disambiguated instances of the ambiguous words. However, the advances in automatic text annotation and tagging techniques with the help of the plethora of knowledge sources like ontologies and text literature in the biomedical domain will help lessen this limitation. The proposed method utilizes the interaction model (mutual information) between the context words and the senses of the target word to induce reliable learning models for sense disambiguation. The method has been evaluated with the benchmark dataset NLM-WSD with various settings and in biomedical entity species disambiguation. The evaluation results showed that the approach is very competitive and outperforms recently reported results of other published techniques.http://dx.doi.org/10.1100/2012/949247
collection DOAJ
language English
format Article
sources DOAJ
author Hisham Al-Mubaid
Sandeep Gungu
spellingShingle Hisham Al-Mubaid
Sandeep Gungu
A Learning-Based Approach for Biomedical Word Sense Disambiguation
The Scientific World Journal
author_facet Hisham Al-Mubaid
Sandeep Gungu
author_sort Hisham Al-Mubaid
title A Learning-Based Approach for Biomedical Word Sense Disambiguation
title_short A Learning-Based Approach for Biomedical Word Sense Disambiguation
title_full A Learning-Based Approach for Biomedical Word Sense Disambiguation
title_fullStr A Learning-Based Approach for Biomedical Word Sense Disambiguation
title_full_unstemmed A Learning-Based Approach for Biomedical Word Sense Disambiguation
title_sort learning-based approach for biomedical word sense disambiguation
publisher Hindawi Limited
series The Scientific World Journal
issn 1537-744X
publishDate 2012-01-01
description In the biomedical domain, word sense ambiguity is a widely spread problem with bioinformatics research effort devoted to it being not commensurate and allowing for more development. This paper presents and evaluates a learning-based approach for sense disambiguation within the biomedical domain. The main limitation with supervised methods is the need for a corpus of manually disambiguated instances of the ambiguous words. However, the advances in automatic text annotation and tagging techniques with the help of the plethora of knowledge sources like ontologies and text literature in the biomedical domain will help lessen this limitation. The proposed method utilizes the interaction model (mutual information) between the context words and the senses of the target word to induce reliable learning models for sense disambiguation. The method has been evaluated with the benchmark dataset NLM-WSD with various settings and in biomedical entity species disambiguation. The evaluation results showed that the approach is very competitive and outperforms recently reported results of other published techniques.
url http://dx.doi.org/10.1100/2012/949247
work_keys_str_mv AT hishamalmubaid alearningbasedapproachforbiomedicalwordsensedisambiguation
AT sandeepgungu alearningbasedapproachforbiomedicalwordsensedisambiguation
AT hishamalmubaid learningbasedapproachforbiomedicalwordsensedisambiguation
AT sandeepgungu learningbasedapproachforbiomedicalwordsensedisambiguation
_version_ 1716770017409761280