Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification

Epitopes are antigenic determinants that are useful because they induce B-cell antibody production and stimulate T-cell activation. Bioinformatics can enable rapid, efficient prediction of potential epitopes. Here, we designed a novel B-cell linear epitope prediction system called LEPS, Linear Epito...

Full description

Bibliographic Details
Main Authors: Hsin-Wei Wang, Ya-Chi Lin, Tun-Wen Pai, Hao-Teng Chang
Format: Article
Language:English
Published: Hindawi Limited 2011-01-01
Series:Journal of Biomedicine and Biotechnology
Online Access:http://dx.doi.org/10.1155/2011/432830
id doaj-3f02292a1d21482b8a8c2e0ac61106c7
record_format Article
spelling doaj-3f02292a1d21482b8a8c2e0ac61106c72020-11-25T02:19:30ZengHindawi LimitedJournal of Biomedicine and Biotechnology1110-72431110-72512011-01-01201110.1155/2011/432830432830Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity IdentificationHsin-Wei Wang0Ya-Chi Lin1Tun-Wen Pai2Hao-Teng Chang3Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung 20224, TaiwanDepartment of Computer Science and Engineering, National Taiwan Ocean University, Keelung 20224, TaiwanDepartment of Computer Science and Engineering, National Taiwan Ocean University, Keelung 20224, TaiwanGraduate Institute of Molecular Systems Biomedicine, China Medical University, Taichung 40402, TaiwanEpitopes are antigenic determinants that are useful because they induce B-cell antibody production and stimulate T-cell activation. Bioinformatics can enable rapid, efficient prediction of potential epitopes. Here, we designed a novel B-cell linear epitope prediction system called LEPS, Linear Epitope Prediction by Propensities and Support Vector Machine, that combined physico-chemical propensity identification and support vector machine (SVM) classification. We tested the LEPS on four datasets: AntiJen, HIV, a newly generated PC, and AHP, a combination of these three datasets. Peptides with globally or locally high physicochemical propensities were first identified as primitive linear epitope (LE) candidates. Then, candidates were classified with the SVM based on the unique features of amino acid segments. This reduced the number of predicted epitopes and enhanced the positive prediction value (PPV). Compared to four other well-known LE prediction systems, the LEPS achieved the highest accuracy (72.52%), specificity (84.22%), PPV (32.07%), and Matthews' correlation coefficient (10.36%).http://dx.doi.org/10.1155/2011/432830
collection DOAJ
language English
format Article
sources DOAJ
author Hsin-Wei Wang
Ya-Chi Lin
Tun-Wen Pai
Hao-Teng Chang
spellingShingle Hsin-Wei Wang
Ya-Chi Lin
Tun-Wen Pai
Hao-Teng Chang
Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification
Journal of Biomedicine and Biotechnology
author_facet Hsin-Wei Wang
Ya-Chi Lin
Tun-Wen Pai
Hao-Teng Chang
author_sort Hsin-Wei Wang
title Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification
title_short Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification
title_full Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification
title_fullStr Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification
title_full_unstemmed Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification
title_sort prediction of b-cell linear epitopes with a combination of support vector machine classification and amino acid propensity identification
publisher Hindawi Limited
series Journal of Biomedicine and Biotechnology
issn 1110-7243
1110-7251
publishDate 2011-01-01
description Epitopes are antigenic determinants that are useful because they induce B-cell antibody production and stimulate T-cell activation. Bioinformatics can enable rapid, efficient prediction of potential epitopes. Here, we designed a novel B-cell linear epitope prediction system called LEPS, Linear Epitope Prediction by Propensities and Support Vector Machine, that combined physico-chemical propensity identification and support vector machine (SVM) classification. We tested the LEPS on four datasets: AntiJen, HIV, a newly generated PC, and AHP, a combination of these three datasets. Peptides with globally or locally high physicochemical propensities were first identified as primitive linear epitope (LE) candidates. Then, candidates were classified with the SVM based on the unique features of amino acid segments. This reduced the number of predicted epitopes and enhanced the positive prediction value (PPV). Compared to four other well-known LE prediction systems, the LEPS achieved the highest accuracy (72.52%), specificity (84.22%), PPV (32.07%), and Matthews' correlation coefficient (10.36%).
url http://dx.doi.org/10.1155/2011/432830
work_keys_str_mv AT hsinweiwang predictionofbcelllinearepitopeswithacombinationofsupportvectormachineclassificationandaminoacidpropensityidentification
AT yachilin predictionofbcelllinearepitopeswithacombinationofsupportvectormachineclassificationandaminoacidpropensityidentification
AT tunwenpai predictionofbcelllinearepitopeswithacombinationofsupportvectormachineclassificationandaminoacidpropensityidentification
AT haotengchang predictionofbcelllinearepitopeswithacombinationofsupportvectormachineclassificationandaminoacidpropensityidentification
_version_ 1724876461014253568