Identification of biomarkers from mass spectrometry data using a "common" peak approach

<p>Abstract</p> <p>Background</p> <p>Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem.</p> &...

Full description

Bibliographic Details
Main Authors: Eguchi Shinto, Fujisawa Hironori, Fushiki Tadayoshi
Format: Article
Language:English
Published: BMC 2006-07-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/358
Description
Summary:<p>Abstract</p> <p>Background</p> <p>Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem.</p> <p>Results</p> <p>This paper proposes the use of "common" peaks in data as biomarkers. Analysis is conducted as follows: data preprocessing, identification of biomarkers, and application of AdaBoost to construct a classification function. Informative "common" peaks are selected by AdaBoost. AsymBoost is also examined to balance false negatives and false positives. The effectiveness of the approach is demonstrated using an ovarian cancer dataset.</p> <p>Conclusion</p> <p>Continuous covariates and discrete covariates can be used in the present approach. The difference between the result for the continuous covariates and that for the discrete covariates was investigated in detail. In the example considered here, both covariates provide a good prediction, but it seems that they provide different kinds of information. We can obtain more information on the structure of the data by integrating both results.</p>
ISSN:1471-2105