Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications

Next-generation sequencing techniques have considerably increased the number of completely sequenced eukaryotic genomes. These genomes are mostly automatically annotated, and ab initio gene prediction is commonly combined with homology-based search approaches and often supported by transcriptomic da...

Full description

Bibliographic Details
Main Authors: B. Blank-Landeshammer, I. Teichert, R. Märker, M. Nowrousian, U. Kück, A. Sickmann
Format: Article
Language:English
Published: American Society for Microbiology 2019-10-01
Series:mBio
Subjects:
Online Access:https://doi.org/10.1128/mBio.02367-19
id doaj-d9ab8af6b3e74779bf905b781b8d6955
record_format Article
spelling doaj-d9ab8af6b3e74779bf905b781b8d69552021-07-02T14:07:01ZengAmerican Society for MicrobiologymBio2150-75112019-10-01105e02367-1910.1128/mBio.02367-19Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional ModificationsB. Blank-LandeshammerI. TeichertR. MärkerM. NowrousianU. KückA. SickmannNext-generation sequencing techniques have considerably increased the number of completely sequenced eukaryotic genomes. These genomes are mostly automatically annotated, and ab initio gene prediction is commonly combined with homology-based search approaches and often supported by transcriptomic data. The latter in particular improve the prediction of intron splice sites and untranslated regions. However, correct prediction of translation initiation sites (TIS), alternative splice junctions, and protein-coding potential remains challenging. Here, we present an advanced proteogenomics approach, namely, the combination of proteogenomics and de novo peptide sequencing analysis, in conjunction with Blast2GO and phylostratigraphy. Using the model fungus Sordaria macrospora as an example, we provide a comprehensive view of the proteome that not only increases the functional understanding of this multicellular organism at different developmental stages but also immensely enhances the genome annotation quality.Proteogenomics combines proteomics, genomics, and transcriptomics and has considerably improved genome annotation in poorly investigated phylogenetic groups for which homology information is lacking. Furthermore, it can be advantageous when reinvestigating well-annotated genomes. Here, we applied an advanced proteogenomics approach, combining standard proteogenomics with peptide de novo sequencing, to refine annotation of the well-studied model fungus Sordaria macrospora. We investigated samples from different developmental and physiological conditions, resulting in the detection of 104 so-far hidden proteins and annotation changes in 575 genes, including 389 splice site refinements. Significantly, our approach provides peptide-level evidence for 113 single-amino-acid variations and 15 C-terminal protein elongations originating from A-to-I RNA editing, a phenomenon recently detected in fungi. Coexpression and phylostratigraphic analysis of the refined proteome suggest that new functions in evolutionarily young genes correlate with distinct developmental stages. In conclusion, our advanced proteogenomics approach supports and promotes functional studies of fungal model systems.https://doi.org/10.1128/mBio.02367-19proteogenomicspeptide de novo sequencingrna editingalternative splicingphylostratigraphygene ontologyalternative splice sitesfungal genomegenomicsproteomics
collection DOAJ
language English
format Article
sources DOAJ
author B. Blank-Landeshammer
I. Teichert
R. Märker
M. Nowrousian
U. Kück
A. Sickmann
spellingShingle B. Blank-Landeshammer
I. Teichert
R. Märker
M. Nowrousian
U. Kück
A. Sickmann
Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications
mBio
proteogenomics
peptide de novo sequencing
rna editing
alternative splicing
phylostratigraphy
gene ontology
alternative splice sites
fungal genome
genomics
proteomics
author_facet B. Blank-Landeshammer
I. Teichert
R. Märker
M. Nowrousian
U. Kück
A. Sickmann
author_sort B. Blank-Landeshammer
title Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications
title_short Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications
title_full Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications
title_fullStr Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications
title_full_unstemmed Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications
title_sort combination of proteogenomics with peptide de novo sequencing identifies new genes and hidden posttranscriptional modifications
publisher American Society for Microbiology
series mBio
issn 2150-7511
publishDate 2019-10-01
description Next-generation sequencing techniques have considerably increased the number of completely sequenced eukaryotic genomes. These genomes are mostly automatically annotated, and ab initio gene prediction is commonly combined with homology-based search approaches and often supported by transcriptomic data. The latter in particular improve the prediction of intron splice sites and untranslated regions. However, correct prediction of translation initiation sites (TIS), alternative splice junctions, and protein-coding potential remains challenging. Here, we present an advanced proteogenomics approach, namely, the combination of proteogenomics and de novo peptide sequencing analysis, in conjunction with Blast2GO and phylostratigraphy. Using the model fungus Sordaria macrospora as an example, we provide a comprehensive view of the proteome that not only increases the functional understanding of this multicellular organism at different developmental stages but also immensely enhances the genome annotation quality.Proteogenomics combines proteomics, genomics, and transcriptomics and has considerably improved genome annotation in poorly investigated phylogenetic groups for which homology information is lacking. Furthermore, it can be advantageous when reinvestigating well-annotated genomes. Here, we applied an advanced proteogenomics approach, combining standard proteogenomics with peptide de novo sequencing, to refine annotation of the well-studied model fungus Sordaria macrospora. We investigated samples from different developmental and physiological conditions, resulting in the detection of 104 so-far hidden proteins and annotation changes in 575 genes, including 389 splice site refinements. Significantly, our approach provides peptide-level evidence for 113 single-amino-acid variations and 15 C-terminal protein elongations originating from A-to-I RNA editing, a phenomenon recently detected in fungi. Coexpression and phylostratigraphic analysis of the refined proteome suggest that new functions in evolutionarily young genes correlate with distinct developmental stages. In conclusion, our advanced proteogenomics approach supports and promotes functional studies of fungal model systems.
topic proteogenomics
peptide de novo sequencing
rna editing
alternative splicing
phylostratigraphy
gene ontology
alternative splice sites
fungal genome
genomics
proteomics
url https://doi.org/10.1128/mBio.02367-19
work_keys_str_mv AT bblanklandeshammer combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications
AT iteichert combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications
AT rmarker combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications
AT mnowrousian combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications
AT ukuck combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications
AT asickmann combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications
_version_ 1721328349548118016