Protein structural model selection by combining consensus and single scoring methods.
Quality assessment (QA) for predicted protein structural models is an important and challenging research problem in protein structure prediction. Consensus Global Distance Test (CGDT) methods assess each decoy (predicted structural model) based on its structural similarity to all others in a decoy s...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2013-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3759460?pdf=render |
id |
doaj-001ce7ad7f7249caa457832c77361a74 |
---|---|
record_format |
Article |
spelling |
doaj-001ce7ad7f7249caa457832c77361a742020-11-25T01:55:54ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0189e7400610.1371/journal.pone.0074006Protein structural model selection by combining consensus and single scoring methods.Zhiquan HeMeshari AlazmiJingfen ZhangDong XuQuality assessment (QA) for predicted protein structural models is an important and challenging research problem in protein structure prediction. Consensus Global Distance Test (CGDT) methods assess each decoy (predicted structural model) based on its structural similarity to all others in a decoy set and has been proved to work well when good decoys are in a majority cluster. Scoring functions evaluate each single decoy based on its structural properties. Both methods have their merits and limitations. In this paper, we present a novel method called PWCom, which consists of two neural networks sequentially to combine CGDT and single model scoring methods such as RW, DDFire and OPUS-Ca. Specifically, for every pair of decoys, the difference of the corresponding feature vectors is input to the first neural network which enables one to predict whether the decoy-pair are significantly different in terms of their GDT scores to the native. If yes, the second neural network is used to decide which one of the two is closer to the native structure. The quality score for each decoy in the pool is based on the number of winning times during the pairwise comparisons. Test results on three benchmark datasets from different model generation methods showed that PWCom significantly improves over consensus GDT and single scoring methods. The QA server (MUFOLD-Server) applying this method in CASP 10 QA category was ranked the second place in terms of Pearson and Spearman correlation performance.http://europepmc.org/articles/PMC3759460?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhiquan He Meshari Alazmi Jingfen Zhang Dong Xu |
spellingShingle |
Zhiquan He Meshari Alazmi Jingfen Zhang Dong Xu Protein structural model selection by combining consensus and single scoring methods. PLoS ONE |
author_facet |
Zhiquan He Meshari Alazmi Jingfen Zhang Dong Xu |
author_sort |
Zhiquan He |
title |
Protein structural model selection by combining consensus and single scoring methods. |
title_short |
Protein structural model selection by combining consensus and single scoring methods. |
title_full |
Protein structural model selection by combining consensus and single scoring methods. |
title_fullStr |
Protein structural model selection by combining consensus and single scoring methods. |
title_full_unstemmed |
Protein structural model selection by combining consensus and single scoring methods. |
title_sort |
protein structural model selection by combining consensus and single scoring methods. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2013-01-01 |
description |
Quality assessment (QA) for predicted protein structural models is an important and challenging research problem in protein structure prediction. Consensus Global Distance Test (CGDT) methods assess each decoy (predicted structural model) based on its structural similarity to all others in a decoy set and has been proved to work well when good decoys are in a majority cluster. Scoring functions evaluate each single decoy based on its structural properties. Both methods have their merits and limitations. In this paper, we present a novel method called PWCom, which consists of two neural networks sequentially to combine CGDT and single model scoring methods such as RW, DDFire and OPUS-Ca. Specifically, for every pair of decoys, the difference of the corresponding feature vectors is input to the first neural network which enables one to predict whether the decoy-pair are significantly different in terms of their GDT scores to the native. If yes, the second neural network is used to decide which one of the two is closer to the native structure. The quality score for each decoy in the pool is based on the number of winning times during the pairwise comparisons. Test results on three benchmark datasets from different model generation methods showed that PWCom significantly improves over consensus GDT and single scoring methods. The QA server (MUFOLD-Server) applying this method in CASP 10 QA category was ranked the second place in terms of Pearson and Spearman correlation performance. |
url |
http://europepmc.org/articles/PMC3759460?pdf=render |
work_keys_str_mv |
AT zhiquanhe proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods AT mesharialazmi proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods AT jingfenzhang proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods AT dongxu proteinstructuralmodelselectionbycombiningconsensusandsinglescoringmethods |
_version_ |
1724982725198217216 |