Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>

Abstract Background Transcriptome sequences provide a complement to structural genomic information and provide snapshots of an organism's transcriptional profile. Such sequences also represent an alternative method for characterizing neglected spec...

Full description

Bibliographic Details
Main Authors:	Togawa Roberto C, Arrial Roberto T, Brigido Marcelo
Format:	Article
Language:	English
Published:	BMC 2009-08-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/10/239

id	doaj-c5df65cc47ac4675a8706746ba782258
record_format	Article
spelling	doaj-c5df65cc47ac4675a8706746ba7822582020-11-24T21:15:58ZengBMCBMC Bioinformatics1471-21052009-08-0110123910.1186/1471-2105-10-239Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>Togawa Roberto CArrial Roberto TBrigido Marcelo<p>Abstract</p> <p>Background</p> <p>Transcriptome sequences provide a complement to structural genomic information and provide snapshots of an organism's transcriptional profile. Such sequences also represent an alternative method for characterizing neglected species that are not expected to undergo whole-genome sequencing. One difficulty for transcriptome sequencing of these organisms is the low quality of reads and incomplete coverage of transcripts, both of which compromise further bioinformatics analyses. Another complicating factor is the lack of known protein homologs, which frustrates searches against established protein databases. This lack of homologs may be caused by divergence from well-characterized and over-represented model organisms. Another explanation is that non-coding RNAs (ncRNAs) may be caught during sequencing. NcRNAs are RNA sequences that, unlike messenger RNAs, do not code for protein products and instead perform unique functions by folding into higher order structural conformations. There is ncRNA screening software available that is specific for transcriptome sequences, but their analyses are optimized for those transcriptomes that are well represented in protein databases, and also assume that input ESTs are full-length and high quality.</p> <p>Results</p> <p>We propose an algorithm called PORTRAIT, which is suitable for ncRNA analysis of transcriptomes from poorly characterized species. Sequences are translated by software that is resistant to sequencing errors, and the predicted putative proteins, along with their source transcripts, are evaluated for coding potential by a support vector machine (SVM). Either of two SVM models may be employed: if a putative protein is found, a protein-dependent SVM model is used; if it is not found, a protein-independent SVM model is used instead. Only <it>ab initio </it>features are extracted, so that no homology information is needed. We illustrate the use of PORTRAIT by predicting ncRNAs from the transcriptome of the pathogenic fungus <it>Paracoccidoides brasiliensis </it>and five other related fungi.</p> <p>Conclusion</p> <p>PORTRAIT can be integrated into pipelines, and provides a low computational cost solution for ncRNA detection in transcriptome sequencing projects.</p> http://www.biomedcentral.com/1471-2105/10/239
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Togawa Roberto C Arrial Roberto T Brigido Marcelo
spellingShingle	Togawa Roberto C Arrial Roberto T Brigido Marcelo Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it> BMC Bioinformatics
author_facet	Togawa Roberto C Arrial Roberto T Brigido Marcelo
author_sort	Togawa Roberto C
title	Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>
title_short	Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>
title_full	Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>
title_fullStr	Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>
title_full_unstemmed	Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>
title_sort	screening non-coding rnas in transcriptomes from neglected species using portrait: case study of the pathogenic fungus <it>paracoccidioides brasiliensis</it>
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2009-08-01
description	<p>Abstract</p> <p>Background</p> <p>Transcriptome sequences provide a complement to structural genomic information and provide snapshots of an organism's transcriptional profile. Such sequences also represent an alternative method for characterizing neglected species that are not expected to undergo whole-genome sequencing. One difficulty for transcriptome sequencing of these organisms is the low quality of reads and incomplete coverage of transcripts, both of which compromise further bioinformatics analyses. Another complicating factor is the lack of known protein homologs, which frustrates searches against established protein databases. This lack of homologs may be caused by divergence from well-characterized and over-represented model organisms. Another explanation is that non-coding RNAs (ncRNAs) may be caught during sequencing. NcRNAs are RNA sequences that, unlike messenger RNAs, do not code for protein products and instead perform unique functions by folding into higher order structural conformations. There is ncRNA screening software available that is specific for transcriptome sequences, but their analyses are optimized for those transcriptomes that are well represented in protein databases, and also assume that input ESTs are full-length and high quality.</p> <p>Results</p> <p>We propose an algorithm called PORTRAIT, which is suitable for ncRNA analysis of transcriptomes from poorly characterized species. Sequences are translated by software that is resistant to sequencing errors, and the predicted putative proteins, along with their source transcripts, are evaluated for coding potential by a support vector machine (SVM). Either of two SVM models may be employed: if a putative protein is found, a protein-dependent SVM model is used; if it is not found, a protein-independent SVM model is used instead. Only <it>ab initio </it>features are extracted, so that no homology information is needed. We illustrate the use of PORTRAIT by predicting ncRNAs from the transcriptome of the pathogenic fungus <it>Paracoccidoides brasiliensis </it>and five other related fungi.</p> <p>Conclusion</p> <p>PORTRAIT can be integrated into pipelines, and provides a low computational cost solution for ncRNA detection in transcriptome sequencing projects.</p>
url	http://www.biomedcentral.com/1471-2105/10/239
work_keys_str_mv	AT togawarobertoc screeningnoncodingrnasintranscriptomesfromneglectedspeciesusingportraitcasestudyofthepathogenicfungusitparacoccidioidesbrasiliensisit AT arrialrobertot screeningnoncodingrnasintranscriptomesfromneglectedspeciesusingportraitcasestudyofthepathogenicfungusitparacoccidioidesbrasiliensisit AT brigidomarcelo screeningnoncodingrnasintranscriptomesfromneglectedspeciesusingportraitcasestudyofthepathogenicfungusitparacoccidioidesbrasiliensisit
_version_	1716743948391677952

Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus <it>Paracoccidioides brasiliensis</it>

Similar Items