Amino-acid site variability among natural and designed proteins

Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural p...

Full description

Bibliographic Details
Main Authors: Eleisha L. Jackson, Noah Ollikainen, Arthur W. Covert III, Tanja Kortemme, Claus O. Wilke
Format: Article
Language:English
Published: PeerJ Inc. 2013-11-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/211.pdf
id doaj-7b16924589a542e7a96736ca4fbd8f7b
record_format Article
spelling doaj-7b16924589a542e7a96736ca4fbd8f7b2020-11-25T00:05:45ZengPeerJ Inc.PeerJ2167-83592013-11-011e21110.7717/peerj.211211Amino-acid site variability among natural and designed proteinsEleisha L. Jackson0Noah Ollikainen1Arthur W. Covert III2Tanja Kortemme3Claus O. Wilke4Institute of Cellular and Molecular Biology, Center for Computational Biology and Bioinformatics, and Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USAGraduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USAInstitute of Cellular and Molecular Biology, Center for Computational Biology and Bioinformatics, and Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USAGraduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USAInstitute of Cellular and Molecular Biology, Center for Computational Biology and Bioinformatics, and Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USAComputational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.https://peerj.com/articles/211.pdfProtein designFixed-backbone designFlexible-backbone designSequence alignmentsRelative solvent accessibilitySite variability
collection DOAJ
language English
format Article
sources DOAJ
author Eleisha L. Jackson
Noah Ollikainen
Arthur W. Covert III
Tanja Kortemme
Claus O. Wilke
spellingShingle Eleisha L. Jackson
Noah Ollikainen
Arthur W. Covert III
Tanja Kortemme
Claus O. Wilke
Amino-acid site variability among natural and designed proteins
PeerJ
Protein design
Fixed-backbone design
Flexible-backbone design
Sequence alignments
Relative solvent accessibility
Site variability
author_facet Eleisha L. Jackson
Noah Ollikainen
Arthur W. Covert III
Tanja Kortemme
Claus O. Wilke
author_sort Eleisha L. Jackson
title Amino-acid site variability among natural and designed proteins
title_short Amino-acid site variability among natural and designed proteins
title_full Amino-acid site variability among natural and designed proteins
title_fullStr Amino-acid site variability among natural and designed proteins
title_full_unstemmed Amino-acid site variability among natural and designed proteins
title_sort amino-acid site variability among natural and designed proteins
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2013-11-01
description Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.
topic Protein design
Fixed-backbone design
Flexible-backbone design
Sequence alignments
Relative solvent accessibility
Site variability
url https://peerj.com/articles/211.pdf
work_keys_str_mv AT eleishaljackson aminoacidsitevariabilityamongnaturalanddesignedproteins
AT noahollikainen aminoacidsitevariabilityamongnaturalanddesignedproteins
AT arthurwcovertiii aminoacidsitevariabilityamongnaturalanddesignedproteins
AT tanjakortemme aminoacidsitevariabilityamongnaturalanddesignedproteins
AT clausowilke aminoacidsitevariabilityamongnaturalanddesignedproteins
_version_ 1725423544059297792