NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
<p>Abstract</p> <p>Background</p> <p>Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted pr...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2011-01-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/12/21 |
id |
doaj-d1332a74495c47ae9349729aac6327ca |
---|---|
record_format |
Article |
spelling |
doaj-d1332a74495c47ae9349729aac6327ca2020-11-25T00:24:59ZengBMCBMC Bioinformatics1471-21052011-01-011212110.1186/1471-2105-12-21NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteinsPino CamiloRestrepo-Montoya DanielNino Luis FPatarroyo Manuel EPatarroyo Manuel A<p>Abstract</p> <p>Background</p> <p>Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes the implementation of a sequence-based classifier, denoted as NClassG+, for identifying non-classically secreted Gram-positive bacterial proteins.</p> <p>Results</p> <p>Several feature-based classifiers were trained using different sequence transformation vectors (frequencies, dipeptides, physicochemical factors and PSSM) and Support Vector Machines (SVMs) with Linear, Polynomial and Gaussian kernel functions. Nested <it>k</it>-fold cross-validation (CV) was applied to select the best models, using the inner CV loop to tune the model parameters and the outer CV group to compute the error. The parameters and Kernel functions and the combinations between all possible feature vectors were optimized using grid search.</p> <p>Conclusions</p> <p>The final model was tested against an independent set not previously seen by the model, obtaining better predictive performance compared to SecretomeP V2.0 and SecretPV2.0 for the identification of non-classically secreted proteins. NClassG+ is freely available on the web at <url>http://www.biolisi.unal.edu.co/web-servers/nclassgpositive/</url></p> http://www.biomedcentral.com/1471-2105/12/21 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Pino Camilo Restrepo-Montoya Daniel Nino Luis F Patarroyo Manuel E Patarroyo Manuel A |
spellingShingle |
Pino Camilo Restrepo-Montoya Daniel Nino Luis F Patarroyo Manuel E Patarroyo Manuel A NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins BMC Bioinformatics |
author_facet |
Pino Camilo Restrepo-Montoya Daniel Nino Luis F Patarroyo Manuel E Patarroyo Manuel A |
author_sort |
Pino Camilo |
title |
NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins |
title_short |
NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins |
title_full |
NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins |
title_fullStr |
NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins |
title_full_unstemmed |
NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins |
title_sort |
nclassg+: a classifier for non-classically secreted gram-positive bacterial proteins |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2011-01-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes the implementation of a sequence-based classifier, denoted as NClassG+, for identifying non-classically secreted Gram-positive bacterial proteins.</p> <p>Results</p> <p>Several feature-based classifiers were trained using different sequence transformation vectors (frequencies, dipeptides, physicochemical factors and PSSM) and Support Vector Machines (SVMs) with Linear, Polynomial and Gaussian kernel functions. Nested <it>k</it>-fold cross-validation (CV) was applied to select the best models, using the inner CV loop to tune the model parameters and the outer CV group to compute the error. The parameters and Kernel functions and the combinations between all possible feature vectors were optimized using grid search.</p> <p>Conclusions</p> <p>The final model was tested against an independent set not previously seen by the model, obtaining better predictive performance compared to SecretomeP V2.0 and SecretPV2.0 for the identification of non-classically secreted proteins. NClassG+ is freely available on the web at <url>http://www.biolisi.unal.edu.co/web-servers/nclassgpositive/</url></p> |
url |
http://www.biomedcentral.com/1471-2105/12/21 |
work_keys_str_mv |
AT pinocamilo nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins AT restrepomontoyadaniel nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins AT ninoluisf nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins AT patarroyomanuele nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins AT patarroyomanuela nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins |
_version_ |
1725350511298740224 |