Statistical Analysis of Crystallization Database Links Protein Physico-Chemical Features with Crystallization Mechanisms

X-ray crystallography is the predominant method for obtaining atomic-scale information about biological macromolecules. Despite the success of the technique, obtaining well diffracting crystals still critically limits going from protein to structure. In practice, the crystallization process proceeds...

Full description

Bibliographic Details
Main Authors: Fusco, Diana (Author), Barnum, Timothy James (Contributor), Bruno, Andrew E. (Author), Luft, Joseph R. (Author), Snell, Edward H. (Author), Mukherjee, Sayan (Author), Charbonneau, Patrick (Author)
Other Authors: Massachusetts Institute of Technology. Department of Chemistry (Contributor)
Format: Article
Language:English
Published: Public Library of Science, 2014-09-09T18:49:53Z.
Subjects:
Online Access:Get fulltext
LEADER 02840 am a22002893u 4500
001 89398
042 |a dc 
100 1 0 |a Fusco, Diana  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Chemistry  |e contributor 
100 1 0 |a Barnum, Timothy James  |e contributor 
700 1 0 |a Barnum, Timothy James  |e author 
700 1 0 |a Bruno, Andrew E.  |e author 
700 1 0 |a Luft, Joseph R.  |e author 
700 1 0 |a Snell, Edward H.  |e author 
700 1 0 |a Mukherjee, Sayan  |e author 
700 1 0 |a Charbonneau, Patrick  |e author 
245 0 0 |a Statistical Analysis of Crystallization Database Links Protein Physico-Chemical Features with Crystallization Mechanisms 
260 |b Public Library of Science,   |c 2014-09-09T18:49:53Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/89398 
520 |a X-ray crystallography is the predominant method for obtaining atomic-scale information about biological macromolecules. Despite the success of the technique, obtaining well diffracting crystals still critically limits going from protein to structure. In practice, the crystallization process proceeds through knowledge-informed empiricism. Better physico-chemical understanding remains elusive because of the large number of variables involved, hence little guidance is available to systematically identify solution conditions that promote crystallization. To help determine relationships between macromolecular properties and their crystallization propensity, we have trained statistical models on samples for 182 proteins supplied by the Northeast Structural Genomics consortium. Gaussian processes, which capture trends beyond the reach of linear statistical models, distinguish between two main physico-chemical mechanisms driving crystallization. One is characterized by low levels of side chain entropy and has been extensively reported in the literature. The other identifies specific electrostatic interactions not previously described in the crystallization context. Because evidence for two distinct mechanisms can be gleaned both from crystal contacts and from solution conditions leading to successful crystallization, the model offers future avenues for optimizing crystallization screens based on partial structural information. The availability of crystallization data coupled with structural outcomes analyzed through state-of-the-art statistical models may thus guide macromolecular crystallization toward a more rational basis. 
520 |a National Institutes of Health (U.S.) (Protein Structure Initiative, NIGMS grant U54 GM094597) 
520 |a National Institutes of Health (U.S.) (grant NIH R01GM088396) 
520 |a National Science Foundation (U.S.) (Grant NSF CHE-1062607) 
520 |a National Science Foundation (U.S.) (Grant No. NSF DMR-1055586) 
546 |a en_US 
655 7 |a Article 
773 |t PLoS ONE