Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.

Homology modeling predicts the 3D structure of a query protein based on the sequence alignment with one or more template proteins of known structure. Its great importance for biological research is owed to its speed, simplicity, reliability and wide applicability, covering more than half of the resi...

Full description

Bibliographic Details
Main Authors:	Armin Meier, Johannes Söding
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2015-10-01
Series:	PLoS Computational Biology
Online Access:	http://europepmc.org/articles/PMC4619893?pdf=render

id	doaj-a07df9d11d7d4691b9e37057c55d0d27
record_format	Article
spelling	doaj-a07df9d11d7d4691b9e37057c55d0d272020-11-25T01:57:43ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582015-10-011110e100434310.1371/journal.pcbi.1004343Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.Armin MeierJohannes SödingHomology modeling predicts the 3D structure of a query protein based on the sequence alignment with one or more template proteins of known structure. Its great importance for biological research is owed to its speed, simplicity, reliability and wide applicability, covering more than half of the residues in protein sequence space. Although multiple templates have been shown to generally increase model quality over single templates, the information from multiple templates has so far been combined using empirically motivated, heuristic approaches. We present here a rigorous statistical framework for multi-template homology modeling. First, we find that the query proteins' atomic distance restraints can be accurately described by two-component Gaussian mixtures. This insight allowed us to apply the standard laws of probability theory to combine restraints from multiple templates. Second, we derive theoretically optimal weights to correct for the redundancy among related templates. Third, a heuristic template selection strategy is proposed. We improve the average GDT-ha model quality score by 11% over single template modeling and by 6.5% over a conventional multi-template approach on a set of 1000 query proteins. Robustness with respect to wrong constraints is likewise improved. We have integrated our multi-template modeling approach with the popular MODELLER homology modeling software in our free HHpred server http://toolkit.tuebingen.mpg.de/hhpred and also offer open source software for running MODELLER with the new restraints at https://bitbucket.org/soedinglab/hh-suite.http://europepmc.org/articles/PMC4619893?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Armin Meier Johannes Söding
spellingShingle	Armin Meier Johannes Söding Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling. PLoS Computational Biology
author_facet	Armin Meier Johannes Söding
author_sort	Armin Meier
title	Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.
title_short	Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.
title_full	Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.
title_fullStr	Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.
title_full_unstemmed	Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.
title_sort	automatic prediction of protein 3d structures by probabilistic multi-template homology modeling.
publisher	Public Library of Science (PLoS)
series	PLoS Computational Biology
issn	1553-734X 1553-7358
publishDate	2015-10-01
description	Homology modeling predicts the 3D structure of a query protein based on the sequence alignment with one or more template proteins of known structure. Its great importance for biological research is owed to its speed, simplicity, reliability and wide applicability, covering more than half of the residues in protein sequence space. Although multiple templates have been shown to generally increase model quality over single templates, the information from multiple templates has so far been combined using empirically motivated, heuristic approaches. We present here a rigorous statistical framework for multi-template homology modeling. First, we find that the query proteins' atomic distance restraints can be accurately described by two-component Gaussian mixtures. This insight allowed us to apply the standard laws of probability theory to combine restraints from multiple templates. Second, we derive theoretically optimal weights to correct for the redundancy among related templates. Third, a heuristic template selection strategy is proposed. We improve the average GDT-ha model quality score by 11% over single template modeling and by 6.5% over a conventional multi-template approach on a set of 1000 query proteins. Robustness with respect to wrong constraints is likewise improved. We have integrated our multi-template modeling approach with the popular MODELLER homology modeling software in our free HHpred server http://toolkit.tuebingen.mpg.de/hhpred and also offer open source software for running MODELLER with the new restraints at https://bitbucket.org/soedinglab/hh-suite.
url	http://europepmc.org/articles/PMC4619893?pdf=render
work_keys_str_mv	AT arminmeier automaticpredictionofprotein3dstructuresbyprobabilisticmultitemplatehomologymodeling AT johannessoding automaticpredictionofprotein3dstructuresbyprobabilisticmultitemplatehomologymodeling
_version_	1724972901245911040

Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.

Similar Items