Planning combinatorial disulfide cross-links for protein fold determination

Abstract Background Fold recognition techniques take advantage of the limited number of overall structural organizations, and have become increasingly effective at identifying the fold of a given target sequence. However, in the absence of sufficient se...

Full description

Bibliographic Details
Main Authors:	Xiong Fei, Friedman Alan M, Bailey-Kellogg Chris
Format:	Article
Language:	English
Published:	BMC 2011-11-01
Series:	BMC Bioinformatics

id	doaj-abff5c62d46c4c529a81a88c9a8ac56d
record_format	Article
spelling	doaj-abff5c62d46c4c529a81a88c9a8ac56d2020-11-25T02:56:37ZengBMCBMC Bioinformatics1471-21052011-11-0112Suppl 12S510.1186/1471-2105-12-S12-S5Planning combinatorial disulfide cross-links for protein fold determinationXiong FeiFriedman Alan MBailey-Kellogg Chris<p>Abstract</p> <p>Background</p> <p>Fold recognition techniques take advantage of the limited number of overall structural organizations, and have become increasingly effective at identifying the fold of a given target sequence. However, in the absence of sufficient sequence identity, it remains difficult for fold recognition methods to always select the correct model. While a native-like model is often among a pool of highly ranked models, it is not necessarily the highest-ranked one, and the model rankings depend sensitively on the scoring function used. <it>Structure elucidation</it> methods can then be employed to decide among the models based on relatively rapid biochemical/biophysical experiments.</p> <p>Results</p> <p>This paper presents an integrated computational-experimental method to determine the fold of a target protein by probing it with a set of planned disulfide cross-links. We start with predicted structural models obtained by standard fold recognition techniques. In a first stage, we characterize the fold-level differences between the models in terms of topological (contact) patterns of secondary structure elements (SSEs), and select a small set of SSE pairs that differentiate the folds. In a second stage, we determine a set of residue-level cross-links to probe the selected SSE pairs. Each stage employs an information-theoretic planning algorithm to maximize information gain while minimizing experimental complexity, along with a Bayes error plan assessment framework to characterize the probability of making a correct decision once data for the plan are collected. By focusing on overall topological differences and planning cross-linking experiments to probe them, our <it>fold determination</it> approach is robust to noise and uncertainty in the models (e.g., threading misalignment) and in the actual structure (e.g., flexibility). We demonstrate the effectiveness of our approach in case studies for a number of CASP targets, showing that the optimized plans have low risk of error while testing only a small portion of the quadratic number of possible cross-link candidates. Simulation studies with these plans further show that they do a very good job of selecting the correct model, according to cross-links simulated from the actual crystal structures.</p> <p>Conclusions</p> <p>Fold determination can overcome scoring limitations in purely computational fold recognition methods, while requiring less experimental effort than traditional protein structure determination approaches.</p>
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Xiong Fei Friedman Alan M Bailey-Kellogg Chris
spellingShingle	Xiong Fei Friedman Alan M Bailey-Kellogg Chris Planning combinatorial disulfide cross-links for protein fold determination BMC Bioinformatics
author_facet	Xiong Fei Friedman Alan M Bailey-Kellogg Chris
author_sort	Xiong Fei
title	Planning combinatorial disulfide cross-links for protein fold determination
title_short	Planning combinatorial disulfide cross-links for protein fold determination
title_full	Planning combinatorial disulfide cross-links for protein fold determination
title_fullStr	Planning combinatorial disulfide cross-links for protein fold determination
title_full_unstemmed	Planning combinatorial disulfide cross-links for protein fold determination
title_sort	planning combinatorial disulfide cross-links for protein fold determination
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2011-11-01
description	<p>Abstract</p> <p>Background</p> <p>Fold recognition techniques take advantage of the limited number of overall structural organizations, and have become increasingly effective at identifying the fold of a given target sequence. However, in the absence of sufficient sequence identity, it remains difficult for fold recognition methods to always select the correct model. While a native-like model is often among a pool of highly ranked models, it is not necessarily the highest-ranked one, and the model rankings depend sensitively on the scoring function used. <it>Structure elucidation</it> methods can then be employed to decide among the models based on relatively rapid biochemical/biophysical experiments.</p> <p>Results</p> <p>This paper presents an integrated computational-experimental method to determine the fold of a target protein by probing it with a set of planned disulfide cross-links. We start with predicted structural models obtained by standard fold recognition techniques. In a first stage, we characterize the fold-level differences between the models in terms of topological (contact) patterns of secondary structure elements (SSEs), and select a small set of SSE pairs that differentiate the folds. In a second stage, we determine a set of residue-level cross-links to probe the selected SSE pairs. Each stage employs an information-theoretic planning algorithm to maximize information gain while minimizing experimental complexity, along with a Bayes error plan assessment framework to characterize the probability of making a correct decision once data for the plan are collected. By focusing on overall topological differences and planning cross-linking experiments to probe them, our <it>fold determination</it> approach is robust to noise and uncertainty in the models (e.g., threading misalignment) and in the actual structure (e.g., flexibility). We demonstrate the effectiveness of our approach in case studies for a number of CASP targets, showing that the optimized plans have low risk of error while testing only a small portion of the quadratic number of possible cross-link candidates. Simulation studies with these plans further show that they do a very good job of selecting the correct model, according to cross-links simulated from the actual crystal structures.</p> <p>Conclusions</p> <p>Fold determination can overcome scoring limitations in purely computational fold recognition methods, while requiring less experimental effort than traditional protein structure determination approaches.</p>
work_keys_str_mv	AT xiongfei planningcombinatorialdisulfidecrosslinksforproteinfolddetermination AT friedmanalanm planningcombinatorialdisulfidecrosslinksforproteinfolddetermination AT baileykelloggchris planningcombinatorialdisulfidecrosslinksforproteinfolddetermination
_version_	1724713066735599616

Planning combinatorial disulfide cross-links for protein fold determination

Similar Items