HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information

<p>Abstract</p> <p>Background</p> <p>Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical ro...

Full description

Bibliographic Details
Main Authors: Hu Jianjun, Liu Rong
Format: Article
Language:English
Published: BMC 2011-05-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/207
id doaj-6ed3b167750043b5b90ae0ca34433080
record_format Article
spelling doaj-6ed3b167750043b5b90ae0ca344330802020-11-24T21:37:56ZengBMCBMC Bioinformatics1471-21052011-05-0112120710.1186/1471-2105-12-207HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence informationHu JianjunLiu Rong<p>Abstract</p> <p>Background</p> <p>Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical roles in electron transfer, catalysis, signal transduction and gene expression. Although much effort has been devoted to the development of various generic algorithms for ligand binding site prediction over the last decade, no algorithm has been specifically designed to complement experimental techniques for identification of heme binding residues. Consequently, an urgent need is to develop a computational method for recognizing these important residues.</p> <p>Results</p> <p>Here we introduced an efficient algorithm HemeBIND for predicting heme binding residues by integrating structural and sequence information. We systematically investigated the characteristics of binding interfaces based on a non-redundant dataset of heme-protein complexes. It was found that several sequence and structural attributes such as evolutionary conservation, solvent accessibility, depth and protrusion clearly illustrate the differences between heme binding and non-binding residues. These features can then be separately used or combined to build the structure-based classifiers using support vector machine (SVM). The results showed that the information contained in these features is largely complementary and their combination achieved the best performance. To further improve the performance, an attempt has been made to develop a post-processing procedure to reduce the number of false positives. In addition, we built a sequence-based classifier based on SVM and sequence profile as an alternative when only sequence information can be used. Finally, we employed a voting method to combine the outputs of structure-based and sequence-based classifiers, which demonstrated remarkably better performance than the individual classifier alone.</p> <p>Conclusions</p> <p>HemeBIND is the first specialized algorithm used to predict binding residues in protein structures for heme ligands. Extensive experiments indicated that both the structure-based and sequence-based methods have effectively identified heme binding residues while the complementary relationship between them can result in a significant improvement in prediction performance. The value of our method is highlighted through the development of HemeBIND web server that is freely accessible at <url>http://mleg.cse.sc.edu/hemeBIND/</url>.</p> http://www.biomedcentral.com/1471-2105/12/207
collection DOAJ
language English
format Article
sources DOAJ
author Hu Jianjun
Liu Rong
spellingShingle Hu Jianjun
Liu Rong
HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
BMC Bioinformatics
author_facet Hu Jianjun
Liu Rong
author_sort Hu Jianjun
title HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_short HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_full HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_fullStr HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_full_unstemmed HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_sort hemebind: a novel method for heme binding residue prediction by combining structural and sequence information
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-05-01
description <p>Abstract</p> <p>Background</p> <p>Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical roles in electron transfer, catalysis, signal transduction and gene expression. Although much effort has been devoted to the development of various generic algorithms for ligand binding site prediction over the last decade, no algorithm has been specifically designed to complement experimental techniques for identification of heme binding residues. Consequently, an urgent need is to develop a computational method for recognizing these important residues.</p> <p>Results</p> <p>Here we introduced an efficient algorithm HemeBIND for predicting heme binding residues by integrating structural and sequence information. We systematically investigated the characteristics of binding interfaces based on a non-redundant dataset of heme-protein complexes. It was found that several sequence and structural attributes such as evolutionary conservation, solvent accessibility, depth and protrusion clearly illustrate the differences between heme binding and non-binding residues. These features can then be separately used or combined to build the structure-based classifiers using support vector machine (SVM). The results showed that the information contained in these features is largely complementary and their combination achieved the best performance. To further improve the performance, an attempt has been made to develop a post-processing procedure to reduce the number of false positives. In addition, we built a sequence-based classifier based on SVM and sequence profile as an alternative when only sequence information can be used. Finally, we employed a voting method to combine the outputs of structure-based and sequence-based classifiers, which demonstrated remarkably better performance than the individual classifier alone.</p> <p>Conclusions</p> <p>HemeBIND is the first specialized algorithm used to predict binding residues in protein structures for heme ligands. Extensive experiments indicated that both the structure-based and sequence-based methods have effectively identified heme binding residues while the complementary relationship between them can result in a significant improvement in prediction performance. The value of our method is highlighted through the development of HemeBIND web server that is freely accessible at <url>http://mleg.cse.sc.edu/hemeBIND/</url>.</p>
url http://www.biomedcentral.com/1471-2105/12/207
work_keys_str_mv AT hujianjun hemebindanovelmethodforhemebindingresiduepredictionbycombiningstructuralandsequenceinformation
AT liurong hemebindanovelmethodforhemebindingresiduepredictionbycombiningstructuralandsequenceinformation
_version_ 1725936309464203264