Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms

Abstract Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have u...

Full description

Bibliographic Details
Main Authors: Achal Rastogi, Uma Maheswari, Richard G. Dorrell, Fabio Rocha Jimenez Vieira, Florian Maumus, Adam Kustka, James McCarthy, Andy E. Allen, Paul Kersey, Chris Bowler, Leila Tirichine
Format: Article
Language:English
Published: Nature Publishing Group 2018-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-018-23106-x
id doaj-ae73ac7d9a9049ecb9f15bbfa637e359
record_format Article
spelling doaj-ae73ac7d9a9049ecb9f15bbfa637e3592020-12-08T03:35:50ZengNature Publishing GroupScientific Reports2045-23222018-03-018111410.1038/s41598-018-23106-xIntegrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatomsAchal Rastogi0Uma Maheswari1Richard G. Dorrell2Fabio Rocha Jimenez Vieira3Florian Maumus4Adam Kustka5James McCarthy6Andy E. Allen7Paul Kersey8Chris Bowler9Leila Tirichine10Institut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéEMBL-EBI, Wellcome Trust Genome CampusInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéURGI, INRA, Université Paris-SaclayEarth and Environmental Sciences, Rutgers University, 101 Warren StreetJ. Craig Venter Institute, 10355 Science Center DriveJ. Craig Venter Institute, 10355 Science Center DriveEMBL-EBI, Wellcome Trust Genome CampusInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéInstitut de biologie de l’Ecole normale supérieure (IBENS), Ecole normale supérieure, CNRS, INSERM, PSL UniversitéAbstract Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology.https://doi.org/10.1038/s41598-018-23106-x
collection DOAJ
language English
format Article
sources DOAJ
author Achal Rastogi
Uma Maheswari
Richard G. Dorrell
Fabio Rocha Jimenez Vieira
Florian Maumus
Adam Kustka
James McCarthy
Andy E. Allen
Paul Kersey
Chris Bowler
Leila Tirichine
spellingShingle Achal Rastogi
Uma Maheswari
Richard G. Dorrell
Fabio Rocha Jimenez Vieira
Florian Maumus
Adam Kustka
James McCarthy
Andy E. Allen
Paul Kersey
Chris Bowler
Leila Tirichine
Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
Scientific Reports
author_facet Achal Rastogi
Uma Maheswari
Richard G. Dorrell
Fabio Rocha Jimenez Vieira
Florian Maumus
Adam Kustka
James McCarthy
Andy E. Allen
Paul Kersey
Chris Bowler
Leila Tirichine
author_sort Achal Rastogi
title Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_short Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_full Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_fullStr Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_full_unstemmed Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_sort integrative analysis of large scale transcriptome data draws a comprehensive landscape of phaeodactylum tricornutum genome and evolutionary origin of diatoms
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2018-03-01
description Abstract Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology.
url https://doi.org/10.1038/s41598-018-23106-x
work_keys_str_mv AT achalrastogi integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT umamaheswari integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT richardgdorrell integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT fabiorochajimenezvieira integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT florianmaumus integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT adamkustka integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT jamesmccarthy integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT andyeallen integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT paulkersey integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT chrisbowler integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT leilatirichine integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
_version_ 1724392563619659776