Protein structural model selection by combining consensus and single scoring methods.

Quality assessment (QA) for predicted protein structural models is an important and challenging research problem in protein structure prediction. Consensus Global Distance Test (CGDT) methods assess each decoy (predicted structural model) based on its structural similarity to all others in a decoy s...

Full description

Bibliographic Details
Main Authors: Zhiquan He, Meshari Alazmi, Jingfen Zhang, Dong Xu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3759460?pdf=render
id doaj-001ce7ad7f7249caa457832c77361a74
record_format Article
spelling doaj-001ce7ad7f7249caa457832c77361a742020-11-25T01:55:54ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0189e7400610.1371/journal.pone.0074006Protein structural model selection by combining consensus and single scoring methods.Zhiquan HeMeshari AlazmiJingfen ZhangDong XuQuality assessment (QA) for predicted protein structural models is an important and challenging research problem in protein structure prediction. Consensus Global Distance Test (CGDT) methods assess each decoy (predicted structural model) based on its structural similarity to all others in a decoy set and has been proved to work well when good decoys are in a majority cluster. Scoring functions evaluate each single decoy based on its structural properties. Both methods have their merits and limitations. In this paper, we present a novel method called PWCom, which consists of two neural networks sequentially to combine CGDT and single model scoring methods such as RW, DDFire and OPUS-Ca. Specifically, for every pair of decoys, the difference of the corresponding feature vectors is input to the first neural network which enables one to predict whether the decoy-pair are significantly different in terms of their GDT scores to the native. If yes, the second neural network is used to decide which one of the two is closer to the native structure. The quality score for each decoy in the pool is based on the number of winning times during the pairwise comparisons. Test results on three benchmark datasets from different model generation methods showed that PWCom significantly improves over consensus GDT and single scoring methods. The QA server (MUFOLD-Server) applying this method in CASP 10 QA category was ranked the second place in terms of Pearson and Spearman correlation performance.http://europepmc.org/articles/PMC3759460?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Zhiquan He
Meshari Alazmi
Jingfen Zhang
Dong Xu
spellingShingle Zhiquan He
Meshari Alazmi
Jingfen Zhang
Dong Xu
Protein structural model selection by combining consensus and single scoring methods.
PLoS ONE
author_facet Zhiquan He
Meshari Alazmi
Jingfen Zhang
Dong Xu
author_sort Zhiquan He
title Protein structural model selection by combining consensus and single scoring methods.
title_short Protein structural model selection by combining consensus and single scoring methods.
title_full Protein structural model selection by combining consensus and single scoring methods.
title_fullStr Protein structural model selection by combining consensus and single scoring methods.
title_full_unstemmed Protein structural model selection by combining consensus and single scoring methods.
title_sort protein structural model selection by combining consensus and single scoring methods.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description Quality assessment (QA) for predicted protein structural models is an important and challenging research problem in protein structure prediction. Consensus Global Distance Test (CGDT) methods assess each decoy (predicted structural model) based on its structural similarity to all others in a decoy set and has been proved to work well when good decoys are in a majority cluster. Scoring functions evaluate each single decoy based on its structural properties. Both methods have their merits and limitations. In this paper, we present a novel method called PWCom, which consists of two neural networks sequentially to combine CGDT and single model scoring methods such as RW, DDFire and OPUS-Ca. Specifically, for every pair of decoys, the difference of the corresponding feature vectors is input to the first neural network which enables one to predict whether the decoy-pair are significantly different in terms of their GDT scores to the native. If yes, the second neural network is used to decide which one of the two is closer to the native structure. The quality score for each decoy in the pool is based on the number of winning times during the pairwise comparisons. Test results on three benchmark datasets from different model generation methods showed that PWCom significantly improves over consensus GDT and single scoring methods. The QA server (MUFOLD-Server) applying this method in CASP 10 QA category was ranked the second place in terms of Pearson and Spearman correlation performance.
url http://europepmc.org/articles/PMC3759460?pdf=render
work_keys_str_mv AT zhiquanhe proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods
AT mesharialazmi proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods
AT jingfenzhang proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods
AT dongxu proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods
_version_ 1724982725198217216