CONFOLD2: improved contact-driven ab initio protein structure modeling

Abstract Background Contact-guided protein structure prediction methods are becoming more and more successful because of the latest advances in residue-residue contact prediction. To support contact-driven structure prediction, effective tools that can quickly build tertiary structural models of goo...

Full description

Bibliographic Details
Main Authors: Badri Adhikari, Jianlin Cheng
Format: Article
Language:English
Published: BMC 2018-01-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-018-2032-6
id doaj-3c97a3da49e84894b752b3183bc8d1d2
record_format Article
spelling doaj-3c97a3da49e84894b752b3183bc8d1d22020-11-25T02:49:36ZengBMCBMC Bioinformatics1471-21052018-01-011911510.1186/s12859-018-2032-6CONFOLD2: improved contact-driven ab initio protein structure modelingBadri Adhikari0Jianlin Cheng1Department of Mathematics and Computer Science, University of Missouri-St. LouisDepartment of Electrical Engineering and Computer Science, University of MissouriAbstract Background Contact-guided protein structure prediction methods are becoming more and more successful because of the latest advances in residue-residue contact prediction. To support contact-driven structure prediction, effective tools that can quickly build tertiary structural models of good quality from predicted contacts need to be developed. Results We develop an improved contact-driven protein modelling method, CONFOLD2, and study how it may be effectively used for ab initio protein structure prediction with predicted contacts as input. It builds models using various subsets of input contacts to explore the fold space under the guidance of a soft square energy function, and then clusters the models to obtain the top five models. CONFOLD2 obtains an average reconstruction accuracy of 0.57 TM-score for the 150 proteins in the PSICOV contact prediction dataset. When benchmarked on the CASP11 contacts predicted using CONSIP2 and CASP12 contacts predicted using Raptor-X, CONFOLD2 achieves a mean TM-score of 0.41 on both datasets. Conclusion CONFOLD2 allows to quickly generate top five structural models for a protein sequence when its secondary structures and contacts predictions at hand. The source code of CONFOLD2 is publicly available at https://github.com/multicom-toolbox/CONFOLD2/ .http://link.springer.com/article/10.1186/s12859-018-2032-6ContactsProtein foldingCONFOLDModel selection
collection DOAJ
language English
format Article
sources DOAJ
author Badri Adhikari
Jianlin Cheng
spellingShingle Badri Adhikari
Jianlin Cheng
CONFOLD2: improved contact-driven ab initio protein structure modeling
BMC Bioinformatics
Contacts
Protein folding
CONFOLD
Model selection
author_facet Badri Adhikari
Jianlin Cheng
author_sort Badri Adhikari
title CONFOLD2: improved contact-driven ab initio protein structure modeling
title_short CONFOLD2: improved contact-driven ab initio protein structure modeling
title_full CONFOLD2: improved contact-driven ab initio protein structure modeling
title_fullStr CONFOLD2: improved contact-driven ab initio protein structure modeling
title_full_unstemmed CONFOLD2: improved contact-driven ab initio protein structure modeling
title_sort confold2: improved contact-driven ab initio protein structure modeling
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2018-01-01
description Abstract Background Contact-guided protein structure prediction methods are becoming more and more successful because of the latest advances in residue-residue contact prediction. To support contact-driven structure prediction, effective tools that can quickly build tertiary structural models of good quality from predicted contacts need to be developed. Results We develop an improved contact-driven protein modelling method, CONFOLD2, and study how it may be effectively used for ab initio protein structure prediction with predicted contacts as input. It builds models using various subsets of input contacts to explore the fold space under the guidance of a soft square energy function, and then clusters the models to obtain the top five models. CONFOLD2 obtains an average reconstruction accuracy of 0.57 TM-score for the 150 proteins in the PSICOV contact prediction dataset. When benchmarked on the CASP11 contacts predicted using CONSIP2 and CASP12 contacts predicted using Raptor-X, CONFOLD2 achieves a mean TM-score of 0.41 on both datasets. Conclusion CONFOLD2 allows to quickly generate top five structural models for a protein sequence when its secondary structures and contacts predictions at hand. The source code of CONFOLD2 is publicly available at https://github.com/multicom-toolbox/CONFOLD2/ .
topic Contacts
Protein folding
CONFOLD
Model selection
url http://link.springer.com/article/10.1186/s12859-018-2032-6
work_keys_str_mv AT badriadhikari confold2improvedcontactdrivenabinitioproteinstructuremodeling
AT jianlincheng confold2improvedcontactdrivenabinitioproteinstructuremodeling
_version_ 1724742495822151680