PathBinder – text empirics and automatic extraction of biomolecular interactions
<p>Abstract</p> <p>Motivation</p> <p>The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2009-10-01
|
Series: | BMC Bioinformatics |
id |
doaj-f36b2fbeaeab4c6bac59152f6ceffe79 |
---|---|
record_format |
Article |
spelling |
doaj-f36b2fbeaeab4c6bac59152f6ceffe792020-11-25T02:51:14ZengBMCBMC Bioinformatics1471-21052009-10-0110Suppl 11S1810.1186/1471-2105-10-S11-S18PathBinder – text empirics and automatic extraction of biomolecular interactionsDing JingBerleant DanielZhang LifengCao TuanSyrkin Wurtele Eve<p>Abstract</p> <p>Motivation</p> <p>The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not reported in the literature explicitly. By investigating specific properties of biological text passages in this paper, we aim to facilitate an alternative strategy, the use of <it>text empirics</it>, to support mining of biomedical texts for biomolecular interactions. We report on our application of this approach, and also report some empirical findings about an important class of passages. These may be useful to others who may also wish to use the empirical properties we describe.</p> <p>Results</p> <p>We manually analyzed syntactic and semantic properties of sentences likely to describe interactions between biomolecules. The resulting empirical data were used to design an algorithm for the PathBinder system to extract biomolecular interactions from texts. PathBinder searches PubMed for sentences describing interactions between two given biomolecules. PathBinder then uses probabilistic methods to combine evidence from multiple relevant sentences in PubMed to assess the relative likelihood of interaction between two arbitrary biomolecules. A biomolecular interaction network was constructed based on those likelihoods.</p> <p>Conclusion</p> <p>The text empirics approach used here supports computationally friendly, performance competitive, automatic extraction of biomolecular interactions from texts.</p> <p>Availability</p> <p><url>http://www.metnetdb.org/pathbinder</url>.</p> |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ding Jing Berleant Daniel Zhang Lifeng Cao Tuan Syrkin Wurtele Eve |
spellingShingle |
Ding Jing Berleant Daniel Zhang Lifeng Cao Tuan Syrkin Wurtele Eve PathBinder – text empirics and automatic extraction of biomolecular interactions BMC Bioinformatics |
author_facet |
Ding Jing Berleant Daniel Zhang Lifeng Cao Tuan Syrkin Wurtele Eve |
author_sort |
Ding Jing |
title |
PathBinder – text empirics and automatic extraction of biomolecular interactions |
title_short |
PathBinder – text empirics and automatic extraction of biomolecular interactions |
title_full |
PathBinder – text empirics and automatic extraction of biomolecular interactions |
title_fullStr |
PathBinder – text empirics and automatic extraction of biomolecular interactions |
title_full_unstemmed |
PathBinder – text empirics and automatic extraction of biomolecular interactions |
title_sort |
pathbinder – text empirics and automatic extraction of biomolecular interactions |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2009-10-01 |
description |
<p>Abstract</p> <p>Motivation</p> <p>The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not reported in the literature explicitly. By investigating specific properties of biological text passages in this paper, we aim to facilitate an alternative strategy, the use of <it>text empirics</it>, to support mining of biomedical texts for biomolecular interactions. We report on our application of this approach, and also report some empirical findings about an important class of passages. These may be useful to others who may also wish to use the empirical properties we describe.</p> <p>Results</p> <p>We manually analyzed syntactic and semantic properties of sentences likely to describe interactions between biomolecules. The resulting empirical data were used to design an algorithm for the PathBinder system to extract biomolecular interactions from texts. PathBinder searches PubMed for sentences describing interactions between two given biomolecules. PathBinder then uses probabilistic methods to combine evidence from multiple relevant sentences in PubMed to assess the relative likelihood of interaction between two arbitrary biomolecules. A biomolecular interaction network was constructed based on those likelihoods.</p> <p>Conclusion</p> <p>The text empirics approach used here supports computationally friendly, performance competitive, automatic extraction of biomolecular interactions from texts.</p> <p>Availability</p> <p><url>http://www.metnetdb.org/pathbinder</url>.</p> |
work_keys_str_mv |
AT dingjing pathbindertextempiricsandautomaticextractionofbiomolecularinteractions AT berleantdaniel pathbindertextempiricsandautomaticextractionofbiomolecularinteractions AT zhanglifeng pathbindertextempiricsandautomaticextractionofbiomolecularinteractions AT caotuan pathbindertextempiricsandautomaticextractionofbiomolecularinteractions AT syrkinwurteleeve pathbindertextempiricsandautomaticextractionofbiomolecularinteractions |
_version_ |
1724735649333903360 |