Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.
BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2010-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC2897884?pdf=render |
id |
doaj-14d5fb1a4e3f4064a721887ca475db31 |
---|---|
record_format |
Article |
spelling |
doaj-14d5fb1a4e3f4064a721887ca475db312020-11-25T02:20:05ZengPublic Library of Science (PLoS)PLoS ONE1932-62032010-01-0157e978010.1371/journal.pone.0009780Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.Peter E LarsenGeetika TrivediAvinash SreedasyamVincent LuGopi K PodilaFrank R CollartBACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. METHODOLOGY: We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. CONCLUSIONS: 69% of expressed mycorrhizal JGI "best" gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided that there is a sequenced genome and a set of gene models.http://europepmc.org/articles/PMC2897884?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Peter E Larsen Geetika Trivedi Avinash Sreedasyam Vincent Lu Gopi K Podila Frank R Collart |
spellingShingle |
Peter E Larsen Geetika Trivedi Avinash Sreedasyam Vincent Lu Gopi K Podila Frank R Collart Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome. PLoS ONE |
author_facet |
Peter E Larsen Geetika Trivedi Avinash Sreedasyam Vincent Lu Gopi K Podila Frank R Collart |
author_sort |
Peter E Larsen |
title |
Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome. |
title_short |
Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome. |
title_full |
Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome. |
title_fullStr |
Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome. |
title_full_unstemmed |
Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome. |
title_sort |
using deep rna sequencing for the structural annotation of the laccaria bicolor mycorrhizal transcriptome. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2010-01-01 |
description |
BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. METHODOLOGY: We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. CONCLUSIONS: 69% of expressed mycorrhizal JGI "best" gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided that there is a sequenced genome and a set of gene models. |
url |
http://europepmc.org/articles/PMC2897884?pdf=render |
work_keys_str_mv |
AT peterelarsen usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome AT geetikatrivedi usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome AT avinashsreedasyam usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome AT vincentlu usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome AT gopikpodila usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome AT frankrcollart usingdeeprnasequencingforthestructuralannotationofthelaccariabicolormycorrhizaltranscriptome |
_version_ |
1724873601616707584 |