ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
<p>Abstract</p> <p>Background</p> <p>The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2011-08-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/12/346 |
id |
doaj-e6abc25bdf0b4855b780e6c6afb18fdc |
---|---|
record_format |
Article |
spelling |
doaj-e6abc25bdf0b4855b780e6c6afb18fdc2020-11-25T01:18:03ZengBMCBMC Bioinformatics1471-21052011-08-0112134610.1186/1471-2105-12-346ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensityZhang HongBu DongboWang YaojunZhang ShenghuiSun Shiwei<p>Abstract</p> <p>Background</p> <p>The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to have high intensity and is accompanied by derivative peaks, including isotopic peaks, neutral loss peaks, and complementary peaks. Existing models for peak selection ignore the dependence between the existence of the derivative peaks and the intensity of the primary peaks. Simple models for peak selection assume that these two attributes are independent; however, this assumption is contrary to real data and prone to error.</p> <p>Results</p> <p>In this paper, we present a statistical model to quantitatively measure the dependence of the derivative peak's existence on the primary peak's intensity. Here, we propose a statistical model, named ProbPS, to capture the dependence in a quantitative manner and describe a statistical model for peak selection. Our results show that the quantitative understanding can successfully guide the peak selection process. By comparing ProbPS with AuDeNS we demonstrate the advantages of our method in both filtering out noise peaks and in improving <it>de novo </it>identification. In addition, we present a tag identification approach based on our peak selection method. Our results, using a test data set, suggest that our tag identification method (876 correct tags in 1000 spectra) outperforms PepNovoTag (790 correct tags in 1000 spectra).</p> <p>Conclusions</p> <p>We have shown that ProbPS improves the accuracy of peak selection which further enhances the performance of de novo sequencing and tag identification. Thus, our model saves valuable computation time and improving the accuracy of the results.</p> http://www.biomedcentral.com/1471-2105/12/346 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhang Hong Bu Dongbo Wang Yaojun Zhang Shenghui Sun Shiwei |
spellingShingle |
Zhang Hong Bu Dongbo Wang Yaojun Zhang Shenghui Sun Shiwei ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity BMC Bioinformatics |
author_facet |
Zhang Hong Bu Dongbo Wang Yaojun Zhang Shenghui Sun Shiwei |
author_sort |
Zhang Hong |
title |
ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity |
title_short |
ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity |
title_full |
ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity |
title_fullStr |
ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity |
title_full_unstemmed |
ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity |
title_sort |
probps: a new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2011-08-01 |
description |
<p>Abstract</p> <p>Background</p> <p>The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to have high intensity and is accompanied by derivative peaks, including isotopic peaks, neutral loss peaks, and complementary peaks. Existing models for peak selection ignore the dependence between the existence of the derivative peaks and the intensity of the primary peaks. Simple models for peak selection assume that these two attributes are independent; however, this assumption is contrary to real data and prone to error.</p> <p>Results</p> <p>In this paper, we present a statistical model to quantitatively measure the dependence of the derivative peak's existence on the primary peak's intensity. Here, we propose a statistical model, named ProbPS, to capture the dependence in a quantitative manner and describe a statistical model for peak selection. Our results show that the quantitative understanding can successfully guide the peak selection process. By comparing ProbPS with AuDeNS we demonstrate the advantages of our method in both filtering out noise peaks and in improving <it>de novo </it>identification. In addition, we present a tag identification approach based on our peak selection method. Our results, using a test data set, suggest that our tag identification method (876 correct tags in 1000 spectra) outperforms PepNovoTag (790 correct tags in 1000 spectra).</p> <p>Conclusions</p> <p>We have shown that ProbPS improves the accuracy of peak selection which further enhances the performance of de novo sequencing and tag identification. Thus, our model saves valuable computation time and improving the accuracy of the results.</p> |
url |
http://www.biomedcentral.com/1471-2105/12/346 |
work_keys_str_mv |
AT zhanghong probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity AT budongbo probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity AT wangyaojun probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity AT zhangshenghui probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity AT sunshiwei probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity |
_version_ |
1725144094790909952 |