Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications
Next-generation sequencing techniques have considerably increased the number of completely sequenced eukaryotic genomes. These genomes are mostly automatically annotated, and ab initio gene prediction is commonly combined with homology-based search approaches and often supported by transcriptomic da...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
American Society for Microbiology
2019-10-01
|
Series: | mBio |
Subjects: | |
Online Access: | https://doi.org/10.1128/mBio.02367-19 |
id |
doaj-d9ab8af6b3e74779bf905b781b8d6955 |
---|---|
record_format |
Article |
spelling |
doaj-d9ab8af6b3e74779bf905b781b8d69552021-07-02T14:07:01ZengAmerican Society for MicrobiologymBio2150-75112019-10-01105e02367-1910.1128/mBio.02367-19Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional ModificationsB. Blank-LandeshammerI. TeichertR. MärkerM. NowrousianU. KückA. SickmannNext-generation sequencing techniques have considerably increased the number of completely sequenced eukaryotic genomes. These genomes are mostly automatically annotated, and ab initio gene prediction is commonly combined with homology-based search approaches and often supported by transcriptomic data. The latter in particular improve the prediction of intron splice sites and untranslated regions. However, correct prediction of translation initiation sites (TIS), alternative splice junctions, and protein-coding potential remains challenging. Here, we present an advanced proteogenomics approach, namely, the combination of proteogenomics and de novo peptide sequencing analysis, in conjunction with Blast2GO and phylostratigraphy. Using the model fungus Sordaria macrospora as an example, we provide a comprehensive view of the proteome that not only increases the functional understanding of this multicellular organism at different developmental stages but also immensely enhances the genome annotation quality.Proteogenomics combines proteomics, genomics, and transcriptomics and has considerably improved genome annotation in poorly investigated phylogenetic groups for which homology information is lacking. Furthermore, it can be advantageous when reinvestigating well-annotated genomes. Here, we applied an advanced proteogenomics approach, combining standard proteogenomics with peptide de novo sequencing, to refine annotation of the well-studied model fungus Sordaria macrospora. We investigated samples from different developmental and physiological conditions, resulting in the detection of 104 so-far hidden proteins and annotation changes in 575 genes, including 389 splice site refinements. Significantly, our approach provides peptide-level evidence for 113 single-amino-acid variations and 15 C-terminal protein elongations originating from A-to-I RNA editing, a phenomenon recently detected in fungi. Coexpression and phylostratigraphic analysis of the refined proteome suggest that new functions in evolutionarily young genes correlate with distinct developmental stages. In conclusion, our advanced proteogenomics approach supports and promotes functional studies of fungal model systems.https://doi.org/10.1128/mBio.02367-19proteogenomicspeptide de novo sequencingrna editingalternative splicingphylostratigraphygene ontologyalternative splice sitesfungal genomegenomicsproteomics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
B. Blank-Landeshammer I. Teichert R. Märker M. Nowrousian U. Kück A. Sickmann |
spellingShingle |
B. Blank-Landeshammer I. Teichert R. Märker M. Nowrousian U. Kück A. Sickmann Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications mBio proteogenomics peptide de novo sequencing rna editing alternative splicing phylostratigraphy gene ontology alternative splice sites fungal genome genomics proteomics |
author_facet |
B. Blank-Landeshammer I. Teichert R. Märker M. Nowrousian U. Kück A. Sickmann |
author_sort |
B. Blank-Landeshammer |
title |
Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications |
title_short |
Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications |
title_full |
Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications |
title_fullStr |
Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications |
title_full_unstemmed |
Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications |
title_sort |
combination of proteogenomics with peptide de novo sequencing identifies new genes and hidden posttranscriptional modifications |
publisher |
American Society for Microbiology |
series |
mBio |
issn |
2150-7511 |
publishDate |
2019-10-01 |
description |
Next-generation sequencing techniques have considerably increased the number of completely sequenced eukaryotic genomes. These genomes are mostly automatically annotated, and ab initio gene prediction is commonly combined with homology-based search approaches and often supported by transcriptomic data. The latter in particular improve the prediction of intron splice sites and untranslated regions. However, correct prediction of translation initiation sites (TIS), alternative splice junctions, and protein-coding potential remains challenging. Here, we present an advanced proteogenomics approach, namely, the combination of proteogenomics and de novo peptide sequencing analysis, in conjunction with Blast2GO and phylostratigraphy. Using the model fungus Sordaria macrospora as an example, we provide a comprehensive view of the proteome that not only increases the functional understanding of this multicellular organism at different developmental stages but also immensely enhances the genome annotation quality.Proteogenomics combines proteomics, genomics, and transcriptomics and has considerably improved genome annotation in poorly investigated phylogenetic groups for which homology information is lacking. Furthermore, it can be advantageous when reinvestigating well-annotated genomes. Here, we applied an advanced proteogenomics approach, combining standard proteogenomics with peptide de novo sequencing, to refine annotation of the well-studied model fungus Sordaria macrospora. We investigated samples from different developmental and physiological conditions, resulting in the detection of 104 so-far hidden proteins and annotation changes in 575 genes, including 389 splice site refinements. Significantly, our approach provides peptide-level evidence for 113 single-amino-acid variations and 15 C-terminal protein elongations originating from A-to-I RNA editing, a phenomenon recently detected in fungi. Coexpression and phylostratigraphic analysis of the refined proteome suggest that new functions in evolutionarily young genes correlate with distinct developmental stages. In conclusion, our advanced proteogenomics approach supports and promotes functional studies of fungal model systems. |
topic |
proteogenomics peptide de novo sequencing rna editing alternative splicing phylostratigraphy gene ontology alternative splice sites fungal genome genomics proteomics |
url |
https://doi.org/10.1128/mBio.02367-19 |
work_keys_str_mv |
AT bblanklandeshammer combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications AT iteichert combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications AT rmarker combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications AT mnowrousian combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications AT ukuck combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications AT asickmann combinationofproteogenomicswithpeptidedenovosequencingidentifiesnewgenesandhiddenposttranscriptionalmodifications |
_version_ |
1721328349548118016 |