ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity

<p>Abstract</p> <p>Background</p> <p>The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It...

Full description

Bibliographic Details
Main Authors: Zhang Hong, Bu Dongbo, Wang Yaojun, Zhang Shenghui, Sun Shiwei
Format: Article
Language:English
Published: BMC 2011-08-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/346
id doaj-e6abc25bdf0b4855b780e6c6afb18fdc
record_format Article
spelling doaj-e6abc25bdf0b4855b780e6c6afb18fdc2020-11-25T01:18:03ZengBMCBMC Bioinformatics1471-21052011-08-0112134610.1186/1471-2105-12-346ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensityZhang HongBu DongboWang YaojunZhang ShenghuiSun Shiwei<p>Abstract</p> <p>Background</p> <p>The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to have high intensity and is accompanied by derivative peaks, including isotopic peaks, neutral loss peaks, and complementary peaks. Existing models for peak selection ignore the dependence between the existence of the derivative peaks and the intensity of the primary peaks. Simple models for peak selection assume that these two attributes are independent; however, this assumption is contrary to real data and prone to error.</p> <p>Results</p> <p>In this paper, we present a statistical model to quantitatively measure the dependence of the derivative peak's existence on the primary peak's intensity. Here, we propose a statistical model, named ProbPS, to capture the dependence in a quantitative manner and describe a statistical model for peak selection. Our results show that the quantitative understanding can successfully guide the peak selection process. By comparing ProbPS with AuDeNS we demonstrate the advantages of our method in both filtering out noise peaks and in improving <it>de novo </it>identification. In addition, we present a tag identification approach based on our peak selection method. Our results, using a test data set, suggest that our tag identification method (876 correct tags in 1000 spectra) outperforms PepNovoTag (790 correct tags in 1000 spectra).</p> <p>Conclusions</p> <p>We have shown that ProbPS improves the accuracy of peak selection which further enhances the performance of de novo sequencing and tag identification. Thus, our model saves valuable computation time and improving the accuracy of the results.</p> http://www.biomedcentral.com/1471-2105/12/346
collection DOAJ
language English
format Article
sources DOAJ
author Zhang Hong
Bu Dongbo
Wang Yaojun
Zhang Shenghui
Sun Shiwei
spellingShingle Zhang Hong
Bu Dongbo
Wang Yaojun
Zhang Shenghui
Sun Shiwei
ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
BMC Bioinformatics
author_facet Zhang Hong
Bu Dongbo
Wang Yaojun
Zhang Shenghui
Sun Shiwei
author_sort Zhang Hong
title ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_short ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_full ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_fullStr ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_full_unstemmed ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_sort probps: a new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-08-01
description <p>Abstract</p> <p>Background</p> <p>The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to have high intensity and is accompanied by derivative peaks, including isotopic peaks, neutral loss peaks, and complementary peaks. Existing models for peak selection ignore the dependence between the existence of the derivative peaks and the intensity of the primary peaks. Simple models for peak selection assume that these two attributes are independent; however, this assumption is contrary to real data and prone to error.</p> <p>Results</p> <p>In this paper, we present a statistical model to quantitatively measure the dependence of the derivative peak's existence on the primary peak's intensity. Here, we propose a statistical model, named ProbPS, to capture the dependence in a quantitative manner and describe a statistical model for peak selection. Our results show that the quantitative understanding can successfully guide the peak selection process. By comparing ProbPS with AuDeNS we demonstrate the advantages of our method in both filtering out noise peaks and in improving <it>de novo </it>identification. In addition, we present a tag identification approach based on our peak selection method. Our results, using a test data set, suggest that our tag identification method (876 correct tags in 1000 spectra) outperforms PepNovoTag (790 correct tags in 1000 spectra).</p> <p>Conclusions</p> <p>We have shown that ProbPS improves the accuracy of peak selection which further enhances the performance of de novo sequencing and tag identification. Thus, our model saves valuable computation time and improving the accuracy of the results.</p>
url http://www.biomedcentral.com/1471-2105/12/346
work_keys_str_mv AT zhanghong probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT budongbo probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT wangyaojun probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT zhangshenghui probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT sunshiwei probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
_version_ 1725144094790909952