Regression applied to protein binding site prediction and comparison with classification

<p>Abstract</p> <p>Background</p> <p>The structural genomics centers provide hundreds of protein structures of unknown function. Therefore, developing methods enabling the determination of a protein function automatically is imperative. The determination of a protein fu...

Full description

Bibliographic Details
Main Authors: Gala Jean-Luc, Ambroise Jérôme, Giard Joachim, Macq Benoît
Format: Article
Language:English
Published: BMC 2009-09-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/10/276
id doaj-442b9bbd375c483dbe028c84ccd902d8
record_format Article
spelling doaj-442b9bbd375c483dbe028c84ccd902d82020-11-25T00:42:05ZengBMCBMC Bioinformatics1471-21052009-09-0110127610.1186/1471-2105-10-276Regression applied to protein binding site prediction and comparison with classificationGala Jean-LucAmbroise JérômeGiard JoachimMacq Benoît<p>Abstract</p> <p>Background</p> <p>The structural genomics centers provide hundreds of protein structures of unknown function. Therefore, developing methods enabling the determination of a protein function automatically is imperative. The determination of a protein function can be achieved by studying the network of its physical interactions. In this context, identifying a potential binding site between proteins is of primary interest. In the literature, methods for predicting a potential binding site location generally are based on classification tools. The aim of this paper is to show that regression tools are more efficient than classification tools for patches based binding site predictors. For this purpose, we developed a patches based binding site localization method usable with either regression or classification tools.</p> <p>Results</p> <p>We compared predictive performances of regression tools with performances of machine learning classifiers. Using leave-one-out cross-validation, we showed that regression tools provide better predictions than classification ones. Among regression tools, Multilayer Perceptron ranked highest in the quality of predictions. We compared also the predictive performance of our patches based method using Multilayer Perceptron with the performance of three other methods usable through a web server. Our method performed similarly to the other methods.</p> <p>Conclusion</p> <p>Regression is more efficient than classification when applied to our binding site localization method. When it is possible, using regression instead of classification for other existing binding site predictors will probably improve results. Furthermore, the method presented in this work is flexible because the size of the predicted binding site is adjustable. This adaptability is useful when either false positive or negative rates have to be limited.</p> http://www.biomedcentral.com/1471-2105/10/276
collection DOAJ
language English
format Article
sources DOAJ
author Gala Jean-Luc
Ambroise Jérôme
Giard Joachim
Macq Benoît
spellingShingle Gala Jean-Luc
Ambroise Jérôme
Giard Joachim
Macq Benoît
Regression applied to protein binding site prediction and comparison with classification
BMC Bioinformatics
author_facet Gala Jean-Luc
Ambroise Jérôme
Giard Joachim
Macq Benoît
author_sort Gala Jean-Luc
title Regression applied to protein binding site prediction and comparison with classification
title_short Regression applied to protein binding site prediction and comparison with classification
title_full Regression applied to protein binding site prediction and comparison with classification
title_fullStr Regression applied to protein binding site prediction and comparison with classification
title_full_unstemmed Regression applied to protein binding site prediction and comparison with classification
title_sort regression applied to protein binding site prediction and comparison with classification
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2009-09-01
description <p>Abstract</p> <p>Background</p> <p>The structural genomics centers provide hundreds of protein structures of unknown function. Therefore, developing methods enabling the determination of a protein function automatically is imperative. The determination of a protein function can be achieved by studying the network of its physical interactions. In this context, identifying a potential binding site between proteins is of primary interest. In the literature, methods for predicting a potential binding site location generally are based on classification tools. The aim of this paper is to show that regression tools are more efficient than classification tools for patches based binding site predictors. For this purpose, we developed a patches based binding site localization method usable with either regression or classification tools.</p> <p>Results</p> <p>We compared predictive performances of regression tools with performances of machine learning classifiers. Using leave-one-out cross-validation, we showed that regression tools provide better predictions than classification ones. Among regression tools, Multilayer Perceptron ranked highest in the quality of predictions. We compared also the predictive performance of our patches based method using Multilayer Perceptron with the performance of three other methods usable through a web server. Our method performed similarly to the other methods.</p> <p>Conclusion</p> <p>Regression is more efficient than classification when applied to our binding site localization method. When it is possible, using regression instead of classification for other existing binding site predictors will probably improve results. Furthermore, the method presented in this work is flexible because the size of the predicted binding site is adjustable. This adaptability is useful when either false positive or negative rates have to be limited.</p>
url http://www.biomedcentral.com/1471-2105/10/276
work_keys_str_mv AT galajeanluc regressionappliedtoproteinbindingsitepredictionandcomparisonwithclassification
AT ambroisejerome regressionappliedtoproteinbindingsitepredictionandcomparisonwithclassification
AT giardjoachim regressionappliedtoproteinbindingsitepredictionandcomparisonwithclassification
AT macqbenoit regressionappliedtoproteinbindingsitepredictionandcomparisonwithclassification
_version_ 1725284031166152704