Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
Abstract Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have u...
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2018-03-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-018-23106-x |
id |
doaj-ae73ac7d9a9049ecb9f15bbfa637e359 |
---|---|
record_format |
Article |
spelling |
doaj-ae73ac7d9a9049ecb9f15bbfa637e3592020-12-08T03:35:50ZengNature Publishing GroupScientific Reports2045-23222018-03-018111410.1038/s41598-018-23106-xIntegrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatomsAchal Rastogi0Uma Maheswari1Richard G. Dorrell2Fabio Rocha Jimenez Vieira3Florian Maumus4Adam Kustka5James McCarthy6Andy E. Allen7Paul Kersey8Chris Bowler9Leila Tirichine10Institut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéEMBL-EBI, Wellcome Trust Genome CampusInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéURGI, INRA, Université Paris-SaclayEarth and Environmental Sciences, Rutgers University, 101 Warren StreetJ. Craig Venter Institute, 10355 Science Center DriveJ. Craig Venter Institute, 10355 Science Center DriveEMBL-EBI, Wellcome Trust Genome CampusInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéAbstract Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology.https://doi.org/10.1038/s41598-018-23106-x |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Achal Rastogi Uma Maheswari Richard G. Dorrell Fabio Rocha Jimenez Vieira Florian Maumus Adam Kustka James McCarthy Andy E. Allen Paul Kersey Chris Bowler Leila Tirichine |
spellingShingle |
Achal Rastogi Uma Maheswari Richard G. Dorrell Fabio Rocha Jimenez Vieira Florian Maumus Adam Kustka James McCarthy Andy E. Allen Paul Kersey Chris Bowler Leila Tirichine Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms Scientific Reports |
author_facet |
Achal Rastogi Uma Maheswari Richard G. Dorrell Fabio Rocha Jimenez Vieira Florian Maumus Adam Kustka James McCarthy Andy E. Allen Paul Kersey Chris Bowler Leila Tirichine |
author_sort |
Achal Rastogi |
title |
Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms |
title_short |
Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms |
title_full |
Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms |
title_fullStr |
Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms |
title_full_unstemmed |
Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms |
title_sort |
integrative analysis of large scale transcriptome data draws a comprehensive landscape of phaeodactylum tricornutum genome and evolutionary origin of diatoms |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2018-03-01 |
description |
Abstract Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology. |
url |
https://doi.org/10.1038/s41598-018-23106-x |
work_keys_str_mv |
AT achalrastogi integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT umamaheswari integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT richardgdorrell integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT fabiorochajimenezvieira integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT florianmaumus integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT adamkustka integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT jamesmccarthy integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT andyeallen integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT paulkersey integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT chrisbowler integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms AT leilatirichine integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms |
_version_ |
1724392563619659776 |