HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information

Abstract Background Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical ro...

Full description

Bibliographic Details
Main Authors:	Hu Jianjun, Liu Rong
Format:	Article
Language:	English
Published:	BMC 2011-05-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/12/207

id	doaj-6ed3b167750043b5b90ae0ca34433080
record_format	Article
spelling	doaj-6ed3b167750043b5b90ae0ca344330802020-11-24T21:37:56ZengBMCBMC Bioinformatics1471-21052011-05-0112120710.1186/1471-2105-12-207HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence informationHu JianjunLiu Rong<p>Abstract</p> <p>Background</p> <p>Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical roles in electron transfer, catalysis, signal transduction and gene expression. Although much effort has been devoted to the development of various generic algorithms for ligand binding site prediction over the last decade, no algorithm has been specifically designed to complement experimental techniques for identification of heme binding residues. Consequently, an urgent need is to develop a computational method for recognizing these important residues.</p> <p>Results</p> <p>Here we introduced an efficient algorithm HemeBIND for predicting heme binding residues by integrating structural and sequence information. We systematically investigated the characteristics of binding interfaces based on a non-redundant dataset of heme-protein complexes. It was found that several sequence and structural attributes such as evolutionary conservation, solvent accessibility, depth and protrusion clearly illustrate the differences between heme binding and non-binding residues. These features can then be separately used or combined to build the structure-based classifiers using support vector machine (SVM). The results showed that the information contained in these features is largely complementary and their combination achieved the best performance. To further improve the performance, an attempt has been made to develop a post-processing procedure to reduce the number of false positives. In addition, we built a sequence-based classifier based on SVM and sequence profile as an alternative when only sequence information can be used. Finally, we employed a voting method to combine the outputs of structure-based and sequence-based classifiers, which demonstrated remarkably better performance than the individual classifier alone.</p> <p>Conclusions</p> <p>HemeBIND is the first specialized algorithm used to predict binding residues in protein structures for heme ligands. Extensive experiments indicated that both the structure-based and sequence-based methods have effectively identified heme binding residues while the complementary relationship between them can result in a significant improvement in prediction performance. The value of our method is highlighted through the development of HemeBIND web server that is freely accessible at <url>http://mleg.cse.sc.edu/hemeBIND/</url>.</p> http://www.biomedcentral.com/1471-2105/12/207
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Hu Jianjun Liu Rong
spellingShingle	Hu Jianjun Liu Rong HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information BMC Bioinformatics
author_facet	Hu Jianjun Liu Rong
author_sort	Hu Jianjun
title	HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_short	HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_full	HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_fullStr	HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_full_unstemmed	HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information
title_sort	hemebind: a novel method for heme binding residue prediction by combining structural and sequence information
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2011-05-01
description	<p>Abstract</p> <p>Background</p> <p>Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical roles in electron transfer, catalysis, signal transduction and gene expression. Although much effort has been devoted to the development of various generic algorithms for ligand binding site prediction over the last decade, no algorithm has been specifically designed to complement experimental techniques for identification of heme binding residues. Consequently, an urgent need is to develop a computational method for recognizing these important residues.</p> <p>Results</p> <p>Here we introduced an efficient algorithm HemeBIND for predicting heme binding residues by integrating structural and sequence information. We systematically investigated the characteristics of binding interfaces based on a non-redundant dataset of heme-protein complexes. It was found that several sequence and structural attributes such as evolutionary conservation, solvent accessibility, depth and protrusion clearly illustrate the differences between heme binding and non-binding residues. These features can then be separately used or combined to build the structure-based classifiers using support vector machine (SVM). The results showed that the information contained in these features is largely complementary and their combination achieved the best performance. To further improve the performance, an attempt has been made to develop a post-processing procedure to reduce the number of false positives. In addition, we built a sequence-based classifier based on SVM and sequence profile as an alternative when only sequence information can be used. Finally, we employed a voting method to combine the outputs of structure-based and sequence-based classifiers, which demonstrated remarkably better performance than the individual classifier alone.</p> <p>Conclusions</p> <p>HemeBIND is the first specialized algorithm used to predict binding residues in protein structures for heme ligands. Extensive experiments indicated that both the structure-based and sequence-based methods have effectively identified heme binding residues while the complementary relationship between them can result in a significant improvement in prediction performance. The value of our method is highlighted through the development of HemeBIND web server that is freely accessible at <url>http://mleg.cse.sc.edu/hemeBIND/</url>.</p>
url	http://www.biomedcentral.com/1471-2105/12/207
work_keys_str_mv	AT hujianjun hemebindanovelmethodforhemebindingresiduepredictionbycombiningstructuralandsequenceinformation AT liurong hemebindanovelmethodforhemebindingresiduepredictionbycombiningstructuralandsequenceinformation
_version_	1725936309464203264

HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information

Similar Items