Summary: | <p>Abstract</p> <p>Background</p> <p>Non-coding RNAs gain more attention as their diverse roles in many cellular processes are discovered. At the same time, the need for efficient computational prediction of ncRNAs increases with the pace of sequencing technology. Existing tools are based on various approaches and techniques, but none of them provides a reliable ncRNA detector yet. Consequently, a natural approach is to combine existing tools. Due to a lack of standard input and output formats combination and comparison of existing tools is difficult. Also, for genomic scans they often need to be incorporated in detection workflows using custom scripts, which decreases transparency and reproducibility.</p> <p>Results</p> <p>We developed a Java-based framework to integrate existing tools and methods for ncRNA detection. This framework enables users to construct transparent detection workflows and to combine and compare different methods efficiently. We demonstrate the effectiveness of combining detection methods in case studies with the small genomes of <it>Escherichia coli</it>, <it>Listeria monocytogenes </it>and <it>Streptococcus pyogenes</it>. With the combined method, we gained 10% to 20% precision for sensitivities from 30% to 80%. Further, we investigated <it>Streptococcus pyogenes </it>for novel ncRNAs. Using multiple methods--integrated by our framework--we determined four highly probable candidates. We verified all four candidates experimentally using RT-PCR.</p> <p>Conclusions</p> <p>We have created an extensible framework for practical, transparent and reproducible combination and comparison of ncRNA detection methods. We have proven the effectiveness of this approach in tests and by guiding experiments to find new ncRNAs. The software is freely available under the GNU General Public License (GPL), version 3 at <url>http://www.sbi.uni-rostock.de/moses</url> along with source code, screen shots, examples and tutorial material.</p>
|