PathBinder – text empirics and automatic extraction of biomolecular interactions

<p>Abstract</p> <p>Motivation</p> <p>The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit...

Full description

Bibliographic Details
Main Authors: Ding Jing, Berleant Daniel, Zhang Lifeng, Cao Tuan, Syrkin Wurtele Eve
Format: Article
Language:English
Published: BMC 2009-10-01
Series:BMC Bioinformatics
id doaj-f36b2fbeaeab4c6bac59152f6ceffe79
record_format Article
spelling doaj-f36b2fbeaeab4c6bac59152f6ceffe792020-11-25T02:51:14ZengBMCBMC Bioinformatics1471-21052009-10-0110Suppl 11S1810.1186/1471-2105-10-S11-S18PathBinder – text empirics and automatic extraction of biomolecular interactionsDing JingBerleant DanielZhang LifengCao TuanSyrkin Wurtele Eve<p>Abstract</p> <p>Motivation</p> <p>The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not reported in the literature explicitly. By investigating specific properties of biological text passages in this paper, we aim to facilitate an alternative strategy, the use of <it>text empirics</it>, to support mining of biomedical texts for biomolecular interactions. We report on our application of this approach, and also report some empirical findings about an important class of passages. These may be useful to others who may also wish to use the empirical properties we describe.</p> <p>Results</p> <p>We manually analyzed syntactic and semantic properties of sentences likely to describe interactions between biomolecules. The resulting empirical data were used to design an algorithm for the PathBinder system to extract biomolecular interactions from texts. PathBinder searches PubMed for sentences describing interactions between two given biomolecules. PathBinder then uses probabilistic methods to combine evidence from multiple relevant sentences in PubMed to assess the relative likelihood of interaction between two arbitrary biomolecules. A biomolecular interaction network was constructed based on those likelihoods.</p> <p>Conclusion</p> <p>The text empirics approach used here supports computationally friendly, performance competitive, automatic extraction of biomolecular interactions from texts.</p> <p>Availability</p> <p><url>http://www.metnetdb.org/pathbinder</url>.</p>
collection DOAJ
language English
format Article
sources DOAJ
author Ding Jing
Berleant Daniel
Zhang Lifeng
Cao Tuan
Syrkin Wurtele Eve
spellingShingle Ding Jing
Berleant Daniel
Zhang Lifeng
Cao Tuan
Syrkin Wurtele Eve
PathBinder – text empirics and automatic extraction of biomolecular interactions
BMC Bioinformatics
author_facet Ding Jing
Berleant Daniel
Zhang Lifeng
Cao Tuan
Syrkin Wurtele Eve
author_sort Ding Jing
title PathBinder – text empirics and automatic extraction of biomolecular interactions
title_short PathBinder – text empirics and automatic extraction of biomolecular interactions
title_full PathBinder – text empirics and automatic extraction of biomolecular interactions
title_fullStr PathBinder – text empirics and automatic extraction of biomolecular interactions
title_full_unstemmed PathBinder – text empirics and automatic extraction of biomolecular interactions
title_sort pathbinder – text empirics and automatic extraction of biomolecular interactions
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2009-10-01
description <p>Abstract</p> <p>Motivation</p> <p>The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not reported in the literature explicitly. By investigating specific properties of biological text passages in this paper, we aim to facilitate an alternative strategy, the use of <it>text empirics</it>, to support mining of biomedical texts for biomolecular interactions. We report on our application of this approach, and also report some empirical findings about an important class of passages. These may be useful to others who may also wish to use the empirical properties we describe.</p> <p>Results</p> <p>We manually analyzed syntactic and semantic properties of sentences likely to describe interactions between biomolecules. The resulting empirical data were used to design an algorithm for the PathBinder system to extract biomolecular interactions from texts. PathBinder searches PubMed for sentences describing interactions between two given biomolecules. PathBinder then uses probabilistic methods to combine evidence from multiple relevant sentences in PubMed to assess the relative likelihood of interaction between two arbitrary biomolecules. A biomolecular interaction network was constructed based on those likelihoods.</p> <p>Conclusion</p> <p>The text empirics approach used here supports computationally friendly, performance competitive, automatic extraction of biomolecular interactions from texts.</p> <p>Availability</p> <p><url>http://www.metnetdb.org/pathbinder</url>.</p>
work_keys_str_mv AT dingjing pathbindertextempiricsandautomaticextractionofbiomolecularinteractions
AT berleantdaniel pathbindertextempiricsandautomaticextractionofbiomolecularinteractions
AT zhanglifeng pathbindertextempiricsandautomaticextractionofbiomolecularinteractions
AT caotuan pathbindertextempiricsandautomaticextractionofbiomolecularinteractions
AT syrkinwurteleeve pathbindertextempiricsandautomaticextractionofbiomolecularinteractions
_version_ 1724735649333903360