Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.

Small proteins play essential roles in bacterial physiology and virulence, however, automated algorithms for genome annotation are often not yet able to accurately predict the corresponding genes. The accuracy and reliability of genome annotations, particularly for small open reading frames (sORFs),...

Full description

Bibliographic Details
Main Authors: Stephan Fuchs, Martin Kucklick, Erik Lehmann, Alexander Beckmann, Maya Wilkens, Baban Kolte, Ayten Mustafayeva, Tobias Ludwig, Maurice Diwo, Josef Wissing, Lothar Jänsch, Christian H Ahrens, Zoya Ignatova, Susanne Engelmann
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-06-01
Series:PLoS Genetics
Online Access:https://doi.org/10.1371/journal.pgen.1009585
id doaj-00f37bb424e54b35991ac9f080fef8f5
record_format Article
spelling doaj-00f37bb424e54b35991ac9f080fef8f52021-06-25T04:30:30ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042021-06-01176e100958510.1371/journal.pgen.1009585Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.Stephan FuchsMartin KucklickErik LehmannAlexander BeckmannMaya WilkensBaban KolteAyten MustafayevaTobias LudwigMaurice DiwoJosef WissingLothar JänschChristian H AhrensZoya IgnatovaSusanne EngelmannSmall proteins play essential roles in bacterial physiology and virulence, however, automated algorithms for genome annotation are often not yet able to accurately predict the corresponding genes. The accuracy and reliability of genome annotations, particularly for small open reading frames (sORFs), can be significantly improved by integrating protein evidence from experimental approaches. Here we present a highly optimized and flexible bioinformatics workflow for bacterial proteogenomics covering all steps from (i) generation of protein databases, (ii) database searches and (iii) peptide-to-genome mapping to (iv) visualization of results. We used the workflow to identify high quality peptide spectrum matches (PSMs) for small proteins (≤ 100 aa, SP100) in Staphylococcus aureus Newman. Protein extracts from S. aureus were subjected to different experimental workflows for protein digestion and prefractionation and measured with highly sensitive mass spectrometers. In total, 175 proteins with up to 100 aa (SP100) were identified. Out of these 24 (ranging from 9 to 99 aa) were novel and not contained in the used genome annotation.144 SP100 are highly conserved and were found in at least 50% of the publicly available S. aureus genomes, while 127 are additionally conserved in other staphylococci. Almost half of the identified SP100 were basic, suggesting a role in binding to more acidic molecules such as nucleic acids or phospholipids.https://doi.org/10.1371/journal.pgen.1009585
collection DOAJ
language English
format Article
sources DOAJ
author Stephan Fuchs
Martin Kucklick
Erik Lehmann
Alexander Beckmann
Maya Wilkens
Baban Kolte
Ayten Mustafayeva
Tobias Ludwig
Maurice Diwo
Josef Wissing
Lothar Jänsch
Christian H Ahrens
Zoya Ignatova
Susanne Engelmann
spellingShingle Stephan Fuchs
Martin Kucklick
Erik Lehmann
Alexander Beckmann
Maya Wilkens
Baban Kolte
Ayten Mustafayeva
Tobias Ludwig
Maurice Diwo
Josef Wissing
Lothar Jänsch
Christian H Ahrens
Zoya Ignatova
Susanne Engelmann
Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.
PLoS Genetics
author_facet Stephan Fuchs
Martin Kucklick
Erik Lehmann
Alexander Beckmann
Maya Wilkens
Baban Kolte
Ayten Mustafayeva
Tobias Ludwig
Maurice Diwo
Josef Wissing
Lothar Jänsch
Christian H Ahrens
Zoya Ignatova
Susanne Engelmann
author_sort Stephan Fuchs
title Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.
title_short Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.
title_full Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.
title_fullStr Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.
title_full_unstemmed Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach.
title_sort towards the characterization of the hidden world of small proteins in staphylococcus aureus, a proteogenomics approach.
publisher Public Library of Science (PLoS)
series PLoS Genetics
issn 1553-7390
1553-7404
publishDate 2021-06-01
description Small proteins play essential roles in bacterial physiology and virulence, however, automated algorithms for genome annotation are often not yet able to accurately predict the corresponding genes. The accuracy and reliability of genome annotations, particularly for small open reading frames (sORFs), can be significantly improved by integrating protein evidence from experimental approaches. Here we present a highly optimized and flexible bioinformatics workflow for bacterial proteogenomics covering all steps from (i) generation of protein databases, (ii) database searches and (iii) peptide-to-genome mapping to (iv) visualization of results. We used the workflow to identify high quality peptide spectrum matches (PSMs) for small proteins (≤ 100 aa, SP100) in Staphylococcus aureus Newman. Protein extracts from S. aureus were subjected to different experimental workflows for protein digestion and prefractionation and measured with highly sensitive mass spectrometers. In total, 175 proteins with up to 100 aa (SP100) were identified. Out of these 24 (ranging from 9 to 99 aa) were novel and not contained in the used genome annotation.144 SP100 are highly conserved and were found in at least 50% of the publicly available S. aureus genomes, while 127 are additionally conserved in other staphylococci. Almost half of the identified SP100 were basic, suggesting a role in binding to more acidic molecules such as nucleic acids or phospholipids.
url https://doi.org/10.1371/journal.pgen.1009585
work_keys_str_mv AT stephanfuchs towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT martinkucklick towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT eriklehmann towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT alexanderbeckmann towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT mayawilkens towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT babankolte towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT aytenmustafayeva towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT tobiasludwig towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT mauricediwo towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT josefwissing towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT lotharjansch towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT christianhahrens towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT zoyaignatova towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
AT susanneengelmann towardsthecharacterizationofthehiddenworldofsmallproteinsinstaphylococcusaureusaproteogenomicsapproach
_version_ 1721360817809522688