Computational protein biomarker prediction: a case study for prostate cancer

<p>Abstract</p> <p>Background</p> <p>Recent technological advances in mass spectrometry pose challenges in computational mathematics and statistics to process the mass spectral data into predictive models with clinical and biological significance. We discuss several cla...

Full description

Bibliographic Details
Main Authors: Adam Bao-Ling, Devineni Raghu, Kasukurti Srinivas, Pothen Alex, Naik Dayanand N, Wagner Michael, Semmes O John, Wright George L
Format: Article
Language:English
Published: BMC 2004-03-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2105/5/26
id doaj-454f3639d40f4a6e97fcf2233fe1baad
record_format Article
spelling doaj-454f3639d40f4a6e97fcf2233fe1baad2020-11-25T00:44:11ZengBMCBMC Bioinformatics1471-21052004-03-01512610.1186/1471-2105-5-26Computational protein biomarker prediction: a case study for prostate cancerAdam Bao-LingDevineni RaghuKasukurti SrinivasPothen AlexNaik Dayanand NWagner MichaelSemmes O JohnWright George L<p>Abstract</p> <p>Background</p> <p>Recent technological advances in mass spectrometry pose challenges in computational mathematics and statistics to process the mass spectral data into predictive models with clinical and biological significance. We discuss several classification-based approaches to finding protein biomarker candidates using protein profiles obtained via mass spectrometry, and we assess their statistical significance. Our overall goal is to implicate peaks that have a high likelihood of being biologically linked to a given disease state, and thus to narrow the search for biomarker candidates.</p> <p>Results</p> <p>Thorough cross-validation studies and randomization tests are performed on a prostate cancer dataset with over 300 patients, obtained at the Eastern Virginia Medical School using SELDI-TOF mass spectrometry. We obtain average classification accuracies of 87% on a four-group classification problem using a two-stage linear SVM-based procedure and just 13 peaks, with other methods performing comparably.</p> <p>Conclusions</p> <p>Modern feature selection and classification methods are powerful techniques for both the identification of biomarker candidates and the related problem of building predictive models from protein mass spectrometric profiles. Cross-validation and randomization are essential tools that must be performed carefully in order not to bias the results unfairly. However, only a biological validation and identification of the underlying proteins will ultimately confirm the actual value and power of any computational predictions.</p> http://www.biomedcentral.com/1471-2105/5/26ANOVA F-statisticB/W ratiobiomarker discoveryfeature (peak) selectionprostate cancerSELD1-TOF mass spectrometrystatistical discrimination methodssupport vector machine.
collection DOAJ
language English
format Article
sources DOAJ
author Adam Bao-Ling
Devineni Raghu
Kasukurti Srinivas
Pothen Alex
Naik Dayanand N
Wagner Michael
Semmes O John
Wright George L
spellingShingle Adam Bao-Ling
Devineni Raghu
Kasukurti Srinivas
Pothen Alex
Naik Dayanand N
Wagner Michael
Semmes O John
Wright George L
Computational protein biomarker prediction: a case study for prostate cancer
BMC Bioinformatics
ANOVA F-statistic
B/W ratio
biomarker discovery
feature (peak) selection
prostate cancer
SELD1-TOF mass spectrometry
statistical discrimination methods
support vector machine.
author_facet Adam Bao-Ling
Devineni Raghu
Kasukurti Srinivas
Pothen Alex
Naik Dayanand N
Wagner Michael
Semmes O John
Wright George L
author_sort Adam Bao-Ling
title Computational protein biomarker prediction: a case study for prostate cancer
title_short Computational protein biomarker prediction: a case study for prostate cancer
title_full Computational protein biomarker prediction: a case study for prostate cancer
title_fullStr Computational protein biomarker prediction: a case study for prostate cancer
title_full_unstemmed Computational protein biomarker prediction: a case study for prostate cancer
title_sort computational protein biomarker prediction: a case study for prostate cancer
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2004-03-01
description <p>Abstract</p> <p>Background</p> <p>Recent technological advances in mass spectrometry pose challenges in computational mathematics and statistics to process the mass spectral data into predictive models with clinical and biological significance. We discuss several classification-based approaches to finding protein biomarker candidates using protein profiles obtained via mass spectrometry, and we assess their statistical significance. Our overall goal is to implicate peaks that have a high likelihood of being biologically linked to a given disease state, and thus to narrow the search for biomarker candidates.</p> <p>Results</p> <p>Thorough cross-validation studies and randomization tests are performed on a prostate cancer dataset with over 300 patients, obtained at the Eastern Virginia Medical School using SELDI-TOF mass spectrometry. We obtain average classification accuracies of 87% on a four-group classification problem using a two-stage linear SVM-based procedure and just 13 peaks, with other methods performing comparably.</p> <p>Conclusions</p> <p>Modern feature selection and classification methods are powerful techniques for both the identification of biomarker candidates and the related problem of building predictive models from protein mass spectrometric profiles. Cross-validation and randomization are essential tools that must be performed carefully in order not to bias the results unfairly. However, only a biological validation and identification of the underlying proteins will ultimately confirm the actual value and power of any computational predictions.</p>
topic ANOVA F-statistic
B/W ratio
biomarker discovery
feature (peak) selection
prostate cancer
SELD1-TOF mass spectrometry
statistical discrimination methods
support vector machine.
url http://www.biomedcentral.com/1471-2105/5/26
work_keys_str_mv AT adambaoling computationalproteinbiomarkerpredictionacasestudyforprostatecancer
AT devineniraghu computationalproteinbiomarkerpredictionacasestudyforprostatecancer
AT kasukurtisrinivas computationalproteinbiomarkerpredictionacasestudyforprostatecancer
AT pothenalex computationalproteinbiomarkerpredictionacasestudyforprostatecancer
AT naikdayanandn computationalproteinbiomarkerpredictionacasestudyforprostatecancer
AT wagnermichael computationalproteinbiomarkerpredictionacasestudyforprostatecancer
AT semmesojohn computationalproteinbiomarkerpredictionacasestudyforprostatecancer
AT wrightgeorgel computationalproteinbiomarkerpredictionacasestudyforprostatecancer
_version_ 1725275903495241728