Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation database

<p>Abstract</p> <p>Background</p> <p>The genus <it>Silene </it>is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources f...

Full description

Bibliographic Details
Main Authors: Marais Gabriel AB, Oger-Desfeux Christine, Charif Delphine, Blavet Nicolas, Widmer Alex
Format: Article
Language:English
Published: BMC 2011-07-01
Series:BMC Genomics
Subjects:
EST
SNP
Online Access:http://www.biomedcentral.com/1471-2164/12/376
id doaj-5b4bf20cc8dd4ab589711ca3c5be1176
record_format Article
spelling doaj-5b4bf20cc8dd4ab589711ca3c5be11762020-11-25T02:47:35ZengBMCBMC Genomics1471-21642011-07-0112137610.1186/1471-2164-12-376Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation databaseMarais Gabriel ABOger-Desfeux ChristineCharif DelphineBlavet NicolasWidmer Alex<p>Abstract</p> <p>Background</p> <p>The genus <it>Silene </it>is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four <it>Silene </it>and one <it>Dianthus </it>species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the <it>Silene </it>EST annotation database.</p> <p>Results</p> <p>A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the <it>A. thaliana </it>proteome. The SiESTa database is accessible at <url>http://www.siesta.ethz.ch</url>.</p> <p>Conclusion</p> <p>The sequence collections established in the present study provide an important genomic resource for four <it>Silene </it>and one <it>Dianthus </it>species and will help to further develop <it>Silene </it>as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms.</p> http://www.biomedcentral.com/1471-2164/12/376cDNA librarydatabaseESTSNP<it>Silene</it>
collection DOAJ
language English
format Article
sources DOAJ
author Marais Gabriel AB
Oger-Desfeux Christine
Charif Delphine
Blavet Nicolas
Widmer Alex
spellingShingle Marais Gabriel AB
Oger-Desfeux Christine
Charif Delphine
Blavet Nicolas
Widmer Alex
Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation database
BMC Genomics
cDNA library
database
EST
SNP
<it>Silene</it>
author_facet Marais Gabriel AB
Oger-Desfeux Christine
Charif Delphine
Blavet Nicolas
Widmer Alex
author_sort Marais Gabriel AB
title Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation database
title_short Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation database
title_full Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation database
title_fullStr Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation database
title_full_unstemmed Comparative high-throughput transcriptome sequencing and development of SiESTa, the <it>Silene </it>EST annotation database
title_sort comparative high-throughput transcriptome sequencing and development of siesta, the <it>silene </it>est annotation database
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2011-07-01
description <p>Abstract</p> <p>Background</p> <p>The genus <it>Silene </it>is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four <it>Silene </it>and one <it>Dianthus </it>species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the <it>Silene </it>EST annotation database.</p> <p>Results</p> <p>A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the <it>A. thaliana </it>proteome. The SiESTa database is accessible at <url>http://www.siesta.ethz.ch</url>.</p> <p>Conclusion</p> <p>The sequence collections established in the present study provide an important genomic resource for four <it>Silene </it>and one <it>Dianthus </it>species and will help to further develop <it>Silene </it>as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms.</p>
topic cDNA library
database
EST
SNP
<it>Silene</it>
url http://www.biomedcentral.com/1471-2164/12/376
work_keys_str_mv AT maraisgabrielab comparativehighthroughputtranscriptomesequencinganddevelopmentofsiestatheitsileneitestannotationdatabase
AT ogerdesfeuxchristine comparativehighthroughputtranscriptomesequencinganddevelopmentofsiestatheitsileneitestannotationdatabase
AT charifdelphine comparativehighthroughputtranscriptomesequencinganddevelopmentofsiestatheitsileneitestannotationdatabase
AT blavetnicolas comparativehighthroughputtranscriptomesequencinganddevelopmentofsiestatheitsileneitestannotationdatabase
AT widmeralex comparativehighthroughputtranscriptomesequencinganddevelopmentofsiestatheitsileneitestannotationdatabase
_version_ 1724752718762868736