Prediction of B-cell epitopes using evolutionary information and propensity scales
<p>Abstract</p> <p>Background</p> <p>Development of computational tools that can accurately predict presence and location of B-cell epitopes on pathogenic proteins has a valuable application to the field of vaccinology. Because of the highly variable yet enigmatic natur...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2013-01-01
|
Series: | BMC Bioinformatics |
id |
doaj-5694e207da31431cb7e033ba117de6e0 |
---|---|
record_format |
Article |
spelling |
doaj-5694e207da31431cb7e033ba117de6e02020-11-24T22:43:28ZengBMCBMC Bioinformatics1471-21052013-01-0114Suppl 2S1010.1186/1471-2105-14-S2-S10Prediction of B-cell epitopes using evolutionary information and propensity scalesLin ScottCheng Cheng-WeiSu Emily<p>Abstract</p> <p>Background</p> <p>Development of computational tools that can accurately predict presence and location of B-cell epitopes on pathogenic proteins has a valuable application to the field of vaccinology. Because of the highly variable yet enigmatic nature of B-cell epitopes, their prediction presents a great challenge to computational immunologists.</p> <p>Methods</p> <p>We propose a method, BEEPro (B-cell epitope prediction by evolutionary information and propensity scales), which adapts a linear averaging scheme on 16 properties using a support vector machine model to predict both linear and conformational B-cell epitopes. These 16 properties include position specific scoring matrix (PSSM), an amino acid ratio scale, and a set of 14 physicochemical scales obtained via a feature selection process. Finally, a three-way data split procedure is used during the validation process to prevent over-estimation of prediction performance and avoid bias in our experiment results.</p> <p>Results</p> <p>In our experiment, first we use a non-redundant linear B-cell epitope dataset curated by Sollner <it>et al. </it>for feature selection and parameter optimization. Evaluated by a three-way data split procedure, BEEPro achieves significant improvement with the area under the receiver operating curve (AUC) = 0.9987, accuracy = 99.29%, mathew's correlation coefficient (MCC) = 0.9281, sensitivity = 0.9604, specificity = 0.9946, positive predictive value (PPV) = 0.9042 for the Sollner dataset. In addition, the same parameters are used to evaluate performance on other independent linear B-cell epitope test datasets, BEEPro attains an AUC which ranges from 0.9874 to 0.9950 and an accuracy which ranges from 93.73% to 97.31%. Moreover, five-fold cross-validation on one benchmark conformational B-cell epitope dataset yields an accuracy of 92.14% and AUC of 0.9066.</p> <p>Conclusions</p> <p>Compared with other current models, our method achieves a significant improvement with respect to AUC, accuracy, MCC, sensitivity, specificity, and PPV. Thus, we have shown that an appropriate combination of evolutionary information and propensity scales with a support vector machine model can significantly enhance the prediction performance of both linear and conformational B-cell epitopes.</p> |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lin Scott Cheng Cheng-Wei Su Emily |
spellingShingle |
Lin Scott Cheng Cheng-Wei Su Emily Prediction of B-cell epitopes using evolutionary information and propensity scales BMC Bioinformatics |
author_facet |
Lin Scott Cheng Cheng-Wei Su Emily |
author_sort |
Lin Scott |
title |
Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_short |
Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_full |
Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_fullStr |
Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_full_unstemmed |
Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_sort |
prediction of b-cell epitopes using evolutionary information and propensity scales |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2013-01-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Development of computational tools that can accurately predict presence and location of B-cell epitopes on pathogenic proteins has a valuable application to the field of vaccinology. Because of the highly variable yet enigmatic nature of B-cell epitopes, their prediction presents a great challenge to computational immunologists.</p> <p>Methods</p> <p>We propose a method, BEEPro (B-cell epitope prediction by evolutionary information and propensity scales), which adapts a linear averaging scheme on 16 properties using a support vector machine model to predict both linear and conformational B-cell epitopes. These 16 properties include position specific scoring matrix (PSSM), an amino acid ratio scale, and a set of 14 physicochemical scales obtained via a feature selection process. Finally, a three-way data split procedure is used during the validation process to prevent over-estimation of prediction performance and avoid bias in our experiment results.</p> <p>Results</p> <p>In our experiment, first we use a non-redundant linear B-cell epitope dataset curated by Sollner <it>et al. </it>for feature selection and parameter optimization. Evaluated by a three-way data split procedure, BEEPro achieves significant improvement with the area under the receiver operating curve (AUC) = 0.9987, accuracy = 99.29%, mathew's correlation coefficient (MCC) = 0.9281, sensitivity = 0.9604, specificity = 0.9946, positive predictive value (PPV) = 0.9042 for the Sollner dataset. In addition, the same parameters are used to evaluate performance on other independent linear B-cell epitope test datasets, BEEPro attains an AUC which ranges from 0.9874 to 0.9950 and an accuracy which ranges from 93.73% to 97.31%. Moreover, five-fold cross-validation on one benchmark conformational B-cell epitope dataset yields an accuracy of 92.14% and AUC of 0.9066.</p> <p>Conclusions</p> <p>Compared with other current models, our method achieves a significant improvement with respect to AUC, accuracy, MCC, sensitivity, specificity, and PPV. Thus, we have shown that an appropriate combination of evolutionary information and propensity scales with a support vector machine model can significantly enhance the prediction performance of both linear and conformational B-cell epitopes.</p> |
work_keys_str_mv |
AT linscott predictionofbcellepitopesusingevolutionaryinformationandpropensityscales AT chengchengwei predictionofbcellepitopesusingevolutionaryinformationandpropensityscales AT suemily predictionofbcellepitopesusingevolutionaryinformationandpropensityscales |
_version_ |
1725695719079149568 |