Accuracy of protein-protein binding sites in high-throughput template-based modeling.

The accuracy of protein structures, particularly their binding sites, is essential for the success of modeling protein complexes. Computationally inexpensive methodology is required for genome-wide modeling of such structures. For systematic evaluation of potential accuracy in high-throughput modeli...

Full description

Bibliographic Details
Main Authors: Petras J Kundrotas, Ilya A Vakser
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2010-04-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC2848539?pdf=render
id doaj-39d7305f1afd4bd1be3d984bb6ae30a6
record_format Article
spelling doaj-39d7305f1afd4bd1be3d984bb6ae30a62020-11-25T01:12:25ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582010-04-0164e100072710.1371/journal.pcbi.1000727Accuracy of protein-protein binding sites in high-throughput template-based modeling.Petras J KundrotasIlya A VakserThe accuracy of protein structures, particularly their binding sites, is essential for the success of modeling protein complexes. Computationally inexpensive methodology is required for genome-wide modeling of such structures. For systematic evaluation of potential accuracy in high-throughput modeling of binding sites, a statistical analysis of target-template sequence alignments was performed for a representative set of protein complexes. For most of the complexes, alignments containing all residues of the interface were found. The full interface alignments were obtained even in the case of poor alignments where a relatively small part of the target sequence (as low as 40%) aligned to the template sequence, with a low overall alignment identity (<30%). Although such poor overall alignments might be considered inadequate for modeling of whole proteins, the alignment of the interfaces was strong enough for docking. In the set of homology models built on these alignments, one third of those ranked 1 by a simple sequence identity criteria had RMSD<5 A, the accuracy suitable for low-resolution template free docking. Such models corresponded to multi-domain target proteins, whereas for single-domain proteins the best models had 5 A<RMSD<10 A, the accuracy suitable for less sensitive structure-alignment methods. Overall, approximately 50% of complexes with the interfaces modeled by high-throughput techniques had accuracy suitable for meaningful docking experiments. This percentage will grow with the increasing availability of co-crystallized protein-protein complexes.http://europepmc.org/articles/PMC2848539?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Petras J Kundrotas
Ilya A Vakser
spellingShingle Petras J Kundrotas
Ilya A Vakser
Accuracy of protein-protein binding sites in high-throughput template-based modeling.
PLoS Computational Biology
author_facet Petras J Kundrotas
Ilya A Vakser
author_sort Petras J Kundrotas
title Accuracy of protein-protein binding sites in high-throughput template-based modeling.
title_short Accuracy of protein-protein binding sites in high-throughput template-based modeling.
title_full Accuracy of protein-protein binding sites in high-throughput template-based modeling.
title_fullStr Accuracy of protein-protein binding sites in high-throughput template-based modeling.
title_full_unstemmed Accuracy of protein-protein binding sites in high-throughput template-based modeling.
title_sort accuracy of protein-protein binding sites in high-throughput template-based modeling.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2010-04-01
description The accuracy of protein structures, particularly their binding sites, is essential for the success of modeling protein complexes. Computationally inexpensive methodology is required for genome-wide modeling of such structures. For systematic evaluation of potential accuracy in high-throughput modeling of binding sites, a statistical analysis of target-template sequence alignments was performed for a representative set of protein complexes. For most of the complexes, alignments containing all residues of the interface were found. The full interface alignments were obtained even in the case of poor alignments where a relatively small part of the target sequence (as low as 40%) aligned to the template sequence, with a low overall alignment identity (<30%). Although such poor overall alignments might be considered inadequate for modeling of whole proteins, the alignment of the interfaces was strong enough for docking. In the set of homology models built on these alignments, one third of those ranked 1 by a simple sequence identity criteria had RMSD<5 A, the accuracy suitable for low-resolution template free docking. Such models corresponded to multi-domain target proteins, whereas for single-domain proteins the best models had 5 A<RMSD<10 A, the accuracy suitable for less sensitive structure-alignment methods. Overall, approximately 50% of complexes with the interfaces modeled by high-throughput techniques had accuracy suitable for meaningful docking experiments. This percentage will grow with the increasing availability of co-crystallized protein-protein complexes.
url http://europepmc.org/articles/PMC2848539?pdf=render
work_keys_str_mv AT petrasjkundrotas accuracyofproteinproteinbindingsitesinhighthroughputtemplatebasedmodeling
AT ilyaavakser accuracyofproteinproteinbindingsitesinhighthroughputtemplatebasedmodeling
_version_ 1725166401929347072