Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.

BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and...

Full description

Bibliographic Details
Main Authors: Peter E Larsen, Geetika Trivedi, Avinash Sreedasyam, Vincent Lu, Gopi K Podila, Frank R Collart
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2010-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC2897884?pdf=render
id doaj-14d5fb1a4e3f4064a721887ca475db31
record_format Article
spelling doaj-14d5fb1a4e3f4064a721887ca475db312020-11-25T02:20:05ZengPublic Library of Science (PLoS)PLoS ONE1932-62032010-01-0157e978010.1371/journal.pone.0009780Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.Peter E LarsenGeetika TrivediAvinash SreedasyamVincent LuGopi K PodilaFrank R CollartBACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. METHODOLOGY: We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. CONCLUSIONS: 69% of expressed mycorrhizal JGI "best" gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided that there is a sequenced genome and a set of gene models.http://europepmc.org/articles/PMC2897884?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Peter E Larsen
Geetika Trivedi
Avinash Sreedasyam
Vincent Lu
Gopi K Podila
Frank R Collart
spellingShingle Peter E Larsen
Geetika Trivedi
Avinash Sreedasyam
Vincent Lu
Gopi K Podila
Frank R Collart
Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.
PLoS ONE
author_facet Peter E Larsen
Geetika Trivedi
Avinash Sreedasyam
Vincent Lu
Gopi K Podila
Frank R Collart
author_sort Peter E Larsen
title Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.
title_short Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.
title_full Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.
title_fullStr Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.
title_full_unstemmed Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.
title_sort using deep rna sequencing for the structural annotation of the laccaria bicolor mycorrhizal transcriptome.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2010-01-01
description BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. METHODOLOGY: We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. CONCLUSIONS: 69% of expressed mycorrhizal JGI "best" gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided that there is a sequenced genome and a set of gene models.
url http://europepmc.org/articles/PMC2897884?pdf=render
work_keys_str_mv AT peterelarsen usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT geetikatrivedi usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT avinashsreedasyam usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT vincentlu usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT gopikpodila usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
AT frankrcollart usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome
_version_ 1724873601616707584