Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature

<p>Abstract</p> <p>Background</p> <p>This paper presents a novel approach to the problem of <it>hedge detection</it>, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task...

Full description

Bibliographic Details
Main Author: Velldal Erik
Format: Article
Language:English
Published: BMC 2011-10-01
Series:Journal of Biomedical Semantics
Online Access:http://www.jbiomedsem.com/content/2/S5/S7
id doaj-d7dbd5ec02ff4eaca36936fdbfda4ec8
record_format Article
spelling doaj-d7dbd5ec02ff4eaca36936fdbfda4ec82020-11-24T21:33:53ZengBMCJournal of Biomedical Semantics2041-14802011-10-012Suppl 5S710.1186/2041-1480-2-S5-S7Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literatureVelldal Erik<p>Abstract</p> <p>Background</p> <p>This paper presents a novel approach to the problem of <it>hedge detection</it>, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical domain. We here propose to view hedge detection as a simple disambiguation problem, restricted to words that have previously been observed as hedge cues. As the feature space for the classifier is still very large, we also perform experiments with dimensionality reduction using the method of <it>random indexing</it>.</p> <p>Results</p> <p>The SVM-based classifiers developed in this paper achieves the best published results so far for sentence-level uncertainty prediction on the CoNLL-2010 Shared Task test data. We also show that the technique of random indexing can be successfully applied for reducing the dimensionality of the original feature space by several orders of magnitude, without sacrificing classifier performance.</p> <p>Conclusions</p> <p>This paper introduces a simplified approach to detecting speculation or uncertainty in text, focusing on the biomedical domain. Evaluated at the sentence-level, our SVM-based classifiers achieve the best published results so far. We also show that the feature space can be aggressively compressed using random indexing while still maintaining comparable classifier performance.</p> http://www.jbiomedsem.com/content/2/S5/S7
collection DOAJ
language English
format Article
sources DOAJ
author Velldal Erik
spellingShingle Velldal Erik
Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
Journal of Biomedical Semantics
author_facet Velldal Erik
author_sort Velldal Erik
title Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_short Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_full Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_fullStr Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_full_unstemmed Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_sort predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
publisher BMC
series Journal of Biomedical Semantics
issn 2041-1480
publishDate 2011-10-01
description <p>Abstract</p> <p>Background</p> <p>This paper presents a novel approach to the problem of <it>hedge detection</it>, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical domain. We here propose to view hedge detection as a simple disambiguation problem, restricted to words that have previously been observed as hedge cues. As the feature space for the classifier is still very large, we also perform experiments with dimensionality reduction using the method of <it>random indexing</it>.</p> <p>Results</p> <p>The SVM-based classifiers developed in this paper achieves the best published results so far for sentence-level uncertainty prediction on the CoNLL-2010 Shared Task test data. We also show that the technique of random indexing can be successfully applied for reducing the dimensionality of the original feature space by several orders of magnitude, without sacrificing classifier performance.</p> <p>Conclusions</p> <p>This paper introduces a simplified approach to detecting speculation or uncertainty in text, focusing on the biomedical domain. Evaluated at the sentence-level, our SVM-based classifiers achieve the best published results so far. We also show that the feature space can be aggressively compressed using random indexing while still maintaining comparable classifier performance.</p>
url http://www.jbiomedsem.com/content/2/S5/S7
work_keys_str_mv AT velldalerik predictingspeculationasimpledisambiguationapproachtohedgedetectioninbiomedicalliterature
_version_ 1725951473705025536