A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines

Liquid chromatography-tandem mass spectrometry (LC-MS/MS) has become the most commonly used technique in explorative proteomic research. A variety of open-source tools for peptide-spectrum matching have become available. Most analyses of explorative MS data are performed using conventional settings,...

Full description

Bibliographic Details
Main Authors: Matthias Fahrner, Lucas Kook, Klemens Fröhlich, Martin L. Biniossek, Oliver Schilling
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Proteomes
Subjects:
Online Access:https://www.mdpi.com/2227-7382/9/2/26
id doaj-bef804db7f3c420a9bfbe3cf3e6296b9
record_format Article
spelling doaj-bef804db7f3c420a9bfbe3cf3e6296b92021-06-01T01:06:46ZengMDPI AGProteomes2227-73822021-05-019262610.3390/proteomes9020026A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell LinesMatthias Fahrner0Lucas Kook1Klemens Fröhlich2Martin L. Biniossek3Oliver Schilling4Institute for Surgical Pathology, Medical Center–University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, GermanyEpidemiology, Biostatistics & Prevention Institute, University of Zurich, 8001 Zurich, SwitzerlandInstitute for Surgical Pathology, Medical Center–University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, GermanyInstitute for Molecular Medicine and Cell Research, University of Freiburg, 79104 Freiburg, GermanyInstitute for Surgical Pathology, Medical Center–University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, GermanyLiquid chromatography-tandem mass spectrometry (LC-MS/MS) has become the most commonly used technique in explorative proteomic research. A variety of open-source tools for peptide-spectrum matching have become available. Most analyses of explorative MS data are performed using conventional settings, such as fully specific enzymatic constraints. Here we evaluated the impact of the fragment mass tolerance in combination with the enzymatic constraints on the performance of three search engines. Three open-source search engines (Myrimatch, X! Tandem, and MSGF+) were evaluated concerning the suitability in semi- and unspecific searches as well as the importance of accurate fragment mass spectra in non-specific peptide searches. We then performed a semispecific reanalysis of the published NCI-60 deep proteome data applying the most suited parameters. Semi- and unspecific LC-MS/MS data analyses particularly benefit from accurate fragment mass spectra while this effect is less pronounced for conventional, fully specific peptide-spectrum matching. Search speed differed notably between the three search engines for semi- and non-specific peptide-spectrum matching. Semispecific reanalysis of NCI-60 proteome data revealed hundreds of previously undescribed N-terminal peptides, including cases of proteolytic processing or likely alternative translation start sites, some of which were ubiquitously present in all cell lines of the reanalyzed panel. Highly accurate MS2 fragment data in combination with modern open-source search algorithms enable the confident identification of semispecific peptides from large proteomic datasets. The identification of previously undescribed N-terminal peptides in published studies highlights the potential of future reanalysis and data mining in proteomic datasets.https://www.mdpi.com/2227-7382/9/2/26endogenous proteolysisfragment mass tolerancemass spectrometryNCI-60 reanalysissemispecific peptide search
collection DOAJ
language English
format Article
sources DOAJ
author Matthias Fahrner
Lucas Kook
Klemens Fröhlich
Martin L. Biniossek
Oliver Schilling
spellingShingle Matthias Fahrner
Lucas Kook
Klemens Fröhlich
Martin L. Biniossek
Oliver Schilling
A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines
Proteomes
endogenous proteolysis
fragment mass tolerance
mass spectrometry
NCI-60 reanalysis
semispecific peptide search
author_facet Matthias Fahrner
Lucas Kook
Klemens Fröhlich
Martin L. Biniossek
Oliver Schilling
author_sort Matthias Fahrner
title A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines
title_short A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines
title_full A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines
title_fullStr A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines
title_full_unstemmed A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines
title_sort systematic evaluation of semispecific peptide search parameter enables identification of previously undescribed n-terminal peptides and conserved proteolytic processing in cancer cell lines
publisher MDPI AG
series Proteomes
issn 2227-7382
publishDate 2021-05-01
description Liquid chromatography-tandem mass spectrometry (LC-MS/MS) has become the most commonly used technique in explorative proteomic research. A variety of open-source tools for peptide-spectrum matching have become available. Most analyses of explorative MS data are performed using conventional settings, such as fully specific enzymatic constraints. Here we evaluated the impact of the fragment mass tolerance in combination with the enzymatic constraints on the performance of three search engines. Three open-source search engines (Myrimatch, X! Tandem, and MSGF+) were evaluated concerning the suitability in semi- and unspecific searches as well as the importance of accurate fragment mass spectra in non-specific peptide searches. We then performed a semispecific reanalysis of the published NCI-60 deep proteome data applying the most suited parameters. Semi- and unspecific LC-MS/MS data analyses particularly benefit from accurate fragment mass spectra while this effect is less pronounced for conventional, fully specific peptide-spectrum matching. Search speed differed notably between the three search engines for semi- and non-specific peptide-spectrum matching. Semispecific reanalysis of NCI-60 proteome data revealed hundreds of previously undescribed N-terminal peptides, including cases of proteolytic processing or likely alternative translation start sites, some of which were ubiquitously present in all cell lines of the reanalyzed panel. Highly accurate MS2 fragment data in combination with modern open-source search algorithms enable the confident identification of semispecific peptides from large proteomic datasets. The identification of previously undescribed N-terminal peptides in published studies highlights the potential of future reanalysis and data mining in proteomic datasets.
topic endogenous proteolysis
fragment mass tolerance
mass spectrometry
NCI-60 reanalysis
semispecific peptide search
url https://www.mdpi.com/2227-7382/9/2/26
work_keys_str_mv AT matthiasfahrner asystematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT lucaskook asystematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT klemensfrohlich asystematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT martinlbiniossek asystematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT oliverschilling asystematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT matthiasfahrner systematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT lucaskook systematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT klemensfrohlich systematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT martinlbiniossek systematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
AT oliverschilling systematicevaluationofsemispecificpeptidesearchparameterenablesidentificationofpreviouslyundescribednterminalpeptidesandconservedproteolyticprocessingincancercelllines
_version_ 1721413102956707840