NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins

<p>Abstract</p> <p>Background</p> <p>Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted pr...

Full description

Bibliographic Details
Main Authors: Pino Camilo, Restrepo-Montoya Daniel, Nino Luis F, Patarroyo Manuel E, Patarroyo Manuel A
Format: Article
Language:English
Published: BMC 2011-01-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/21
id doaj-d1332a74495c47ae9349729aac6327ca
record_format Article
spelling doaj-d1332a74495c47ae9349729aac6327ca2020-11-25T00:24:59ZengBMCBMC Bioinformatics1471-21052011-01-011212110.1186/1471-2105-12-21NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteinsPino CamiloRestrepo-Montoya DanielNino Luis FPatarroyo Manuel EPatarroyo Manuel A<p>Abstract</p> <p>Background</p> <p>Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes the implementation of a sequence-based classifier, denoted as NClassG+, for identifying non-classically secreted Gram-positive bacterial proteins.</p> <p>Results</p> <p>Several feature-based classifiers were trained using different sequence transformation vectors (frequencies, dipeptides, physicochemical factors and PSSM) and Support Vector Machines (SVMs) with Linear, Polynomial and Gaussian kernel functions. Nested <it>k</it>-fold cross-validation (CV) was applied to select the best models, using the inner CV loop to tune the model parameters and the outer CV group to compute the error. The parameters and Kernel functions and the combinations between all possible feature vectors were optimized using grid search.</p> <p>Conclusions</p> <p>The final model was tested against an independent set not previously seen by the model, obtaining better predictive performance compared to SecretomeP V2.0 and SecretPV2.0 for the identification of non-classically secreted proteins. NClassG+ is freely available on the web at <url>http://www.biolisi.unal.edu.co/web-servers/nclassgpositive/</url></p> http://www.biomedcentral.com/1471-2105/12/21
collection DOAJ
language English
format Article
sources DOAJ
author Pino Camilo
Restrepo-Montoya Daniel
Nino Luis F
Patarroyo Manuel E
Patarroyo Manuel A
spellingShingle Pino Camilo
Restrepo-Montoya Daniel
Nino Luis F
Patarroyo Manuel E
Patarroyo Manuel A
NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
BMC Bioinformatics
author_facet Pino Camilo
Restrepo-Montoya Daniel
Nino Luis F
Patarroyo Manuel E
Patarroyo Manuel A
author_sort Pino Camilo
title NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_short NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_full NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_fullStr NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_full_unstemmed NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_sort nclassg+: a classifier for non-classically secreted gram-positive bacterial proteins
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-01-01
description <p>Abstract</p> <p>Background</p> <p>Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes the implementation of a sequence-based classifier, denoted as NClassG+, for identifying non-classically secreted Gram-positive bacterial proteins.</p> <p>Results</p> <p>Several feature-based classifiers were trained using different sequence transformation vectors (frequencies, dipeptides, physicochemical factors and PSSM) and Support Vector Machines (SVMs) with Linear, Polynomial and Gaussian kernel functions. Nested <it>k</it>-fold cross-validation (CV) was applied to select the best models, using the inner CV loop to tune the model parameters and the outer CV group to compute the error. The parameters and Kernel functions and the combinations between all possible feature vectors were optimized using grid search.</p> <p>Conclusions</p> <p>The final model was tested against an independent set not previously seen by the model, obtaining better predictive performance compared to SecretomeP V2.0 and SecretPV2.0 for the identification of non-classically secreted proteins. NClassG+ is freely available on the web at <url>http://www.biolisi.unal.edu.co/web-servers/nclassgpositive/</url></p>
url http://www.biomedcentral.com/1471-2105/12/21
work_keys_str_mv AT pinocamilo nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT restrepomontoyadaniel nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT ninoluisf nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT patarroyomanuele nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT patarroyomanuela nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
_version_ 1725350511298740224