research papers
Significant reduction in errors associated with nonbonded contacts in protein crystal structures: automated all-atom PrimeX
withaSchrödinger, 120 West 45th Street, 17th Floor, New York, NY 10036, USA
*Correspondence e-mail: ramy.farid@schrodinger.com
All-atom models are essential for many applications in molecular modeling and computational chemistry. Nonbonded atomic contacts much closer than the sum of the van der Waals radii of the two atoms (clashes) are commonly observed in such models derived from protein crystal structures. A set of 94 recently deposited protein structures in the resolution range 1.5–2.8 Å were analyzed for clashes by the addition of all H atoms to the models followed by optimization and energy minimization of the positions of just these H atoms. The results were compared with the same set of structures after automated all-atom PrimeX and with nonbonded contacts in protein crystal structures at a resolution equal to or better than 0.9 Å. The additional PrimeX produced structures with reasonable summary geometric statistics and similar Rfree values to the original structures. The frequency of clashes at less than 0.8 times the sum of van der Waals radii was reduced over fourfold compared with that found in the original structures, to a level approaching that found in the ultrahigh-resolution structures. Moreover, severe clashes at less than or equal to 0.7 times the sum of atomic radii were reduced 15-fold. All-atom with PrimeX produced improved models with respect to nonbonded contacts and yielded changes in structural details that dramatically impacted on the interpretation of some protein–ligand interactions.
withKeywords: H atoms; van der Waals radii; restraints; nonbonded contacts; clashes; molecular geometry; model quality; force fields; refinement; riding H atoms; electrostatics; hydrogen bonds.
1. Introduction
The majority of protein crystal structures are solved in the resolution range 1.7–2.8 Å, a resolution range in which the diffraction experiment does not present sufficient information to accurately place individual atoms without additional chemical information. Electron-density peaks specifically for H atoms are not observed in this resolution range owing to a low signal-to-noise ratio. Therefore, H atoms are usually not explicitly included in molecular models of protein crystal structures. A molecular model without explicit coordinates for H atoms is denoted as an united-atom model, in contrast to an all-atom model. United-atom models are frequently insufficient for molecular modeling and computational chemistry applications (such as structure-based virtual screening or lead optimization). How is the gap bridged between current best crystallographic practices and the requirements of these other disciplines for all-atom structures that include hydrogen coordinates?
A brief history of the use of H atoms and chemical restraints in protein et al., 1973) first demonstrated that moderate-resolution protein crystal structures could benefit from the reciprocal-space techniques developed for use with crystal structures of small molecules at atomic resolution. They recognized the necessity of using additional chemical information combined with reciprocal-space to accurately determine atomic positions in this situation.
is useful before answering this question. Jensen and coworkers (WatenpaughA complete system of geometric restraints was devised for the first widely used protein reciprocal-space PROLSQ (Konnert, 1976; Konnert & Hendrickson, 1980; Hendrickson, 1985). H atoms were not explicitly considered in this system.
program,The introduction of simulated-annealing X-PLOR (Brünger, 1992). This program featured geometric restraints based on the CHARMM force field (Brünger et al., 1986, 1989; Brünger & Karplus, 1988). Originally, use of this force field required an all-atom model. CHARMM-based restraints evolved in a way that removed the requirements for hydrogen coordinates. This change was associated with an alteration in the representation of nonbonded contacts from a Lennard–Jones potential to a much simpler repulsive function and the elimination of the use of electrostatic potentials. These modifications were partially motivated by electrostatic artifacts that were introduced into the structural results owing to the lack of an implicit solvent model. In addition, the long time required for computation of the complete set of nonbonded interactions was a significant impediment to the of large crystal structures (Nilges et al., 1988; Weis et al., 1990). By the time that X-PLOR was superseded by the program CNS (Brünger et al., 1998), any requirement for explicit H-atom coordinates for protein crystallographic had been eliminated. However, the capability to apply an electrostatic model and more complete nonbonded interactions in an all-atom model remained an essential part of CNS for the determination of structures from NMR data (Linge et al., 2003).
led to the widespread adoption of the programEngh & Huber (1991) brought important additional information to the definition of the geometry for protein crystal structures. Their survey of bond lengths and angles observed in small peptide crystal structures at high resolution has been uniformly adopted as a standard against which protein models are judged. It has also become the basis for the restraint system in all of the major programs.
Recent developments indicate an interest among crystallographers in the application of more complex descriptions of molecular geometry in REFMAC (Murshudov et al., 2011) and PHENIX (Afonine et al., 2005) may be employed with `riding H atoms', even though the ultimate result to be deposited is a united-atom model. (Riding H atoms are those H atoms whose positions can be determined unambiguously from the positions of the non-H atoms; for example, the H atom attached to the Oγ of a serine residue is not a riding H atom since its position depends on the torsion angle of the Cβ—Oγ bond, while the H atom on the Cα atom of an amino acid is a riding H atom, since all torsion angles affecting its position are determined by non-H atom coordinates.) The advantages of a restraint scheme in which geometric target values for a residue depend on the torsion-angle conformation of the residue backbone have recently been demonstrated (Tronrud et al., 2010). Brunger and coworkers (Fenn et al., 2010, 2011; Schnieders et al., 2011) have combined the all-atom force field AMOEBA with a new scheme and have described the advantages of a more complex molecular description that includes the calculation of electrostatic interactions between protein atoms. Additional recent innovations in the use of geometric information in include the use of deformable elastic network (Schröder et al., 2010), hydropathic force-field terms (Koparde et al., 2011) and jelly-body restraints (Murshudov et al., 2011).
to aid in producing better models. The programsStructure-validation tools for protein geometry, partially based on the Engh & Huber standard, are available in several widely used computer programs, most notably PROCHECK (Laskowski et al., 1993), WHAT_CHECK (Hooft, Vriend et al., 1996), NUCheck (Feng et al., 1998) and SFCHECK (Vaguine et al., 1999). These programs address close nonbonded contacts largely from a united-atom perspective. More recently, the structure-validation programs Reduce and MolProbity (Davis et al., 2007; Chen, Arendall et al., 2010) have become important and popular additions to the toolkit of protein crystallographers. They are based on the concept that better judgments can be made as to the correct positioning of certain groups in the model after the addition of H atoms to a united-atom protein and after observing their interactions. Within their software system, interpenetration of van der Waals molecular surfaces by 0.4 Å or more constitutes a clash. The authors flatly state that
Such large overlaps cannot occur in the actual molecule, but mean that at least one of the two atoms is modeled incorrectly
At this point, the question of the source of all-atom models needed for computational work can be addressed more clearly. Currently, such all-atom models are produced by adding H atoms to the united-atom models produced by crystallography. For water molecules and for protein H atoms whose position is subject to some degree of freedom, i.e. non-riding H atoms, either a force-field-dependent or a rule-based method is employed to determine the positions of these H atoms in order to avoid close nonbonded contacts and to form hydrogen bonds as appropriate. Nevertheless, when H atoms are added in this way to a very large majority of protein crystal structures deposited in the Protein Data Bank (Berman et al., 2000), multiple close nonbonded contacts between atoms are observed. One goal of this work is to document this observation and to try to understand why such interactions occur, the recent focus on protein structure validation with H atoms present notwithstanding.
The usual remedy in computational chemistry to these high-energy close contacts is to minimize the coordinates of the all-atom model against a force field, with non-H atoms restrained to their positions in the crystallography-derived model so that they do not deviate too far from their experimentally determined positions. This solution is less than ideal, because the method produces no feedback as to whether the all-atom model is still consistent with the experimental data. In other words, one does not know how far is too far. This procedure could be especially dangerous if the original clashes were caused by atoms that were significantly misplaced.
The PrimeX was implemented partially in response to these issues. It applies well established methods of protein (Bell et al., 2012) combined with the all-atom OPLS force field (Jorgensen et al., 1996; Kaminski et al., 2001; Banks et al., 2005) for geometric restraints. Aside from the presence or absence of H atoms in the model, these OPLS-based restraints differ in two specific respects from what have become the traditional restraint systems: (i) a Lennard–Jones description of both the attractive and repulsive components of van der Waals interactions replaces the simpler repulsive term of most Engh and Huber-based restraints and (ii) electrostatic interactions are treated, including a Surface Generalized Born model to account for implicit solvent effects (Ghosh et al., 1998; Gallicchio et al., 2002; Zhu et al., 2007; Li, Abel et al., 2011). The net effect of these differences is very significant. In a simpler restraint system, the bond-length targets are each a function of a single parameter according to the atom types involved in the bond. A similar situation occurs for bond angles. However, the bond-length and bond-angle targets specified by OPLS are a function of several parameters that can all affect a single bond length or bond angle. In other words, the restraint target for a particular bond length (or angle) is contingent on the local environment of the atoms involved. Touw & Vriend (2010) have shown that at least one type of protein bond angle is a complex function of the local environment and is not well described by a single Engh & Huber (1991) target angle. The target geometric values in the well characterized restraint system of Karplus and coworkers depend on the local backbone conformation of the protein (Tronrud et al., 2010). That any particular force field can reproduce all such dependencies remains to be demonstrated, but potentially a force-field-based restraint system can more effectively adapt to local environments than current protein crystallography restraint systems.
programRefinement of protein crystal structures with an all-atom model and a complete force field does much more than avoid errors whose remediation may seriously degrade the accuracy of the coordinates. The more detailed accounting for nonbonded interactions within the protein used in PrimeX can also produce a direct positive effect during While even small changes in the structure near a ligand-binding site can be critical for structure-based drug discovery, examples are presented to show how with an all-atom model can result in large coordinate improvements at such sites.
2. Methods
2.1. Data-set selection
The members of the moderate-resolution protein data set used in this study were selected from the Protein Data Bank (Berman et al., 2000). Candidate structures were limited to those deposited in 2010 to ensure that followed modern practices and that sufficient time had passed for structures to be withdrawn if found to contain gross errors. Reflection data were required to have been deposited with the coordinates. Each entry was restricted to contain one or more protein chains but no DNA or RNA. The reported Rfree values were limited to 0.28 or lower. Structures were required to have a high-resolution limit between 1.5 and 2.8 Å. The molecular mass of protein within the was limited to be between 10 and 300 kDa. Proteins with homologous sequences were removed at 30% identity.
In addition, the PrimeX-calculated Rfree was required to exceed the R value by at least 0.008. This requirement ensured that the test set deposited in the PDB was likely to be the one that was actually used in the of the deposited coordinates. (The Rfree calculated using the deposited test set was found to actually be lower than the calculated R factor in several cases, strongly indicating that the deposited test set was not used in the final of the deposited coordinates.) Others have made similar observations about test-set entries in the PDB (Joosten et al., 2009; Afonine et al., 2010). A meaningful comparison of Rfree values was not possible without using a single consistent test set.
The high-resolution reference data set used in this study was selected from the PDB with the following restrictions: (i) the deposition of diffraction data was required, (ii) coordinate sets were selected from entries containing protein but no DNA or RNA and with a high-resolution limit of 0.9 Å or better, (iii) proteins with homologous sequences were removed at 30% identity and (iv) only proteins refined with coordinates for H atoms were included in this set.
2.2. PrimeX crystallographic calculations and refinement
The key features of PrimeX have been described in some detail elsewhere (Bell et al., 2012). Only details relevant to this work are described below.
2.2.1. Restrained reciprocal-space minimization
Reciprocal-space coordinate minimization is applied in PrimeX with a target, using the formulation of Pannu & Read (1996). The PrimeX implementation follows the general concepts developed by Brünger and coworkers (Brünger, 1989; Brünger et al., 1998). A target has been shown to improve the convergence of and to reduce the effects of model bias (Murshudov et al., 1997).
OPLS 2005 (Jorgensen et al., 1996; Kaminski et al., 2001; Banks et al., 2005) is a general-purpose force field for modeling proteins, and small molecules. PrimeX applies this consistent molecular description as geometric restraints during the of the atomic positions for all molecular components of large biological crystal structures. Restrained isotropic B-factor is applied in PrimeX using an approach similar to that used in the program CNS (Brünger et al., 1989).
Because of its dependence on the OPLS force field, PrimeX operates on all-atom models at all stages of H-atom coordinates do not participate in crystallographic calculations and are not influenced directly by the diffraction data. The advantage of this approach is that the underdetermined nature of the crystallographic calculation is not made worse by the many more parameters for H-atom coordinates and B factors. H-atom positions are a function of the force field acting on all atoms, while the positions of non-H atoms are refined under the joint influence of crystallographic and force-field gradients. Thus, the H-atom coordinates are not biased towards the centre of mass of the electron-density distribution, as may occur in some forms of all-atom (Coulson & Thomas, 1971).
For H atoms bonded to more electronegative atoms, electrostatic forces are a major determinant of nonbonded interactions and they must be evaluated during the calculation of geometric gradients in the PrimeX employs the complete molecular-mechanics description of atomic interactions embodied in OPLS, including electrostatic terms (Jorgensen et al., 1996; Kaminski et al., 2001; Banks et al., 2005). Also included in the current PrimeX calculations was an optional implicit solvation term (Ghosh et al., 1998; Gallicchio et al., 2002; Zhu et al., 2007).
Thus,An overview of the OPLS force field, and a description of the details of the second-generation Surface Generalized Born model used to implicitly account for solvation effects, have been provided by Li, Abel et al. (2011). Both the electrostatic and solvation calculations employ residue-based cutoffs of 15 Å for long-range interactions between neutral residues, of 30 Å between charged residues and of 20 Å between mixed charged and neutral residues. Such approximations and model features should be considered in the context of a trade-off between computational time and rigorous calculations, as discussed by Moulinier et al. (2003), who pioneered the use of the Generalized Born approach in and by Fenn et al. (2011), who have advocated the use of an alternate electrostatic model in incorporating a complete electrostatic description has been shown to lead to lower Rfree values compared with excluding these interactions (Knight et al., 2008; Fenn et al., 2010, 2011; Schnieders et al., 2011).
2.2.2. Simulated-annealing refinement
Simulated-annealing PrimeX is implemented through the general-purpose molecular-modeling package IMPACT (Banks et al., 2005), employing concepts for simulated-annealing validated in the program CNS (Adams et al., 1997). PrimeX simulated annealing provides two alternative energy models for dynamic simulation In the complete energy model, all molecular-mechanics terms are evaluated during the simulation. In the approximate method, the electrostatic and implicit solvation terms are not evaluated, a method similar to that employed in CNS (Adams et al., 1997). All calculations in this work involved the complete energy model.
within2.2.3. Electron-density map calculations
Map calculations in PrimeX are based on the SIGMAA weighting scheme of Read (1986), a data treatment that has been shown to decrease the bias in electron-density maps.
2.2.4. Hydrogen-bond network optimization
An additional hydrogen-bond optimization tool in PrimeX analyzes the locations of hydrogen-bond donors and acceptors to define clusters of such sites that might be connected through hydrogen bonds. Within each cluster, hydrogen bonding is evaluated using a rule-based method to find an optimal combination of the variable components of these systems. The structural features adjusted during hydrogen-bond optimization are (i) alcoholic H-atom positions; (ii) sulfhydryl H-atom positions; (iii) phenolic H-atom positions; (iv) charge and tautomeric states of aspartic acid and glutamic acid side chains; (v) charge states, tautomeric states and orientation (flip) of histidine side chains; (vi) orientation (flip) of asparagine and glutamine side chains; and (vii) positions of H atoms in water molecules. The goal of this procedure is to minimize the energy of the system by maximizing the number of hydrogen bonds while avoiding close high-energy nonbonded interactions. This is accomplished by enumerating plausible orientations for each rotatable hydrogen and water molecule by identifying nearby hydrogen-bond donors and acceptors. Initial solutions for the overall local hydrogen-bond network are then generated by iteratively choosing the optimal state for each species in turn until convergence, starting from a variety of random starting conditions. These initial solutions are then recombined with each other and further optimized via simulated annealing. The best solution obtained overall is then chosen. The hydrogen-bond optimization tool in PrimeX performs essentially the same tasks as a number of other hydrogen-bond optimization tools such as NETWORK (Hooft, Sander et al., 1996) and Reduce (Word et al., 1999).
2.2.5. The polish workflow
The all-atom structures produced by PrimeX as described in Table 4, were the result of application of the `polish' workflow to the united-atom models obtained from the PDB without any human intervention during the process. The refined coordinates that were the result of this process are archived at http://www.schrodinger.com/primex .
The purpose of the polish workflow is to produce the best all-atom model possible that is consistent with the diffraction data, starting with an already well refined
The workflow applies reciprocal-space optimization of coordinates and thermal factors, simulated-annealing and hydrogen-bond optimization in an automated manner as described below. It does not have as a purpose the remediation of more serious errors in fitting such as the choice of the wrong side-chain rotamer, mis-identification of protein electron density as part of the solvent model or the rebuilding of misplaced side chains, which would require the application of additional fitting functions.Bond orders are first assigned throughout the structure and H atoms are added. Initial analysis of the input structure provides basic crystallographic statistics for the structure using the bulk water correction in PrimeX (the flat model of Jiang & Brünger, 1994) and overall anisotropic scaling. A detailed analysis of close nonbonded contacts for the input structure is also provided.
As a next step, reciprocal-space minimization is applied at increasing weight on the X-ray terms (wA) in order to optimize this weight for subsequent The value selected corresponds to the weight employed when the minimum Rfree value is observed. The weights selected for this set of proteins ranged between 0.25 and 1.73. Issues surrounding the selection of restraint weights when using an all-atom force field have been discussed by Fenn & Schnieders (2011). B-factor restraint weights (wB) are estimated from the high-resolution limit (r) of the diffraction data according to the equation
The functional form of this equation was chosen based on the known B-factor restraint-weight requirements in PrimeX at the bounds of the usual resolution range for For a low-resolution structure of 2.7 Å or worse, a weight of at least ten times wA is required. For at high resolution (better than 1.7 Å), a very low B-factor restraint weight (<0.1 times wA) is required. The values of the two constants in the equation were varied while observing the R factors from in a broad resolution range. The current equation was observed to be as effective within PrimeX as stepwise optimization of wB as described for wA above and required much less computational time. Continued development of this method, such as the exploration of any effect of restraints, will be reported in future work. Although this equation is the default method for assigning B-factor restraint weights in the polish workflow, stepwise optimization of this value is provided as an option. Also, minimization performed during weight optimization may optionally be applied towards progress in the of the structure.
An initial optimization of hydrogen-bond orientation is applied and is followed by separate coordinate and B-factor reciprocal-space minimization steps. The model is then refined with a defined set of operations comprised of reciprocal-space coordinate minimization, hydrogen-bond optimization, reciprocal-space coordinate/B-factor minimization, simulated annealing and a final reciprocal-space coordinate/B-factor minimization. The optimization of X-ray and B-factor restraint weights as described above is repeated after this first round. The same defined set of procedures is then repeated twice more but without simulated annealing.
2.3. Direct generation of all-atom models from selected structures
To generate the all-atom models described in Table 3, H atoms were added to united-atom models from the PDB and the positions of the H atoms were optimized using the hydrogen-bond network optimization function described above (§2.2.4) as implemented in the Protein Preparation Wizard (Maestro v.9.2; Schrödinger LLC). The positions of all H atoms were then also optimized through energy minimization against the OPLS force field, with the positions of all heavier atoms held fixed.
2.4. Structure-validation calculations
2.4.1. Clash detection and van der Waals radii
The Rowland & Taylor (1996) compilation of van der Waals radii was employed in this study. It was based on an analysis of 28 403 structures in the Cambridge Structural Database (Allen, 2002). Their results agreed well with the frequently cited van der Waals radii derived by Bondi (1964) when the available solid-state structural data were not nearly so extensive. The largest difference between the two studies was that the radius of the H atom was determined to be 1.1 Å rather than 1.2 Å as in the older work.
As observed by Rowland & Taylor (1996), atoms may at times have nonbonded interactions somewhat less than the sum of their van der Waals radii. For the purposes of this work, a center-to-center distance of less than or equal to 0.8 times the sum of the van der Waals radii was defined as a `clash'. A reasonable conclusion from the selection of data presented by Rowland and Taylor is that interatomic distances of less than or equal to 0.7 times the sum of van der Waals radii are rare. Such interactions were denoted as `severe clashes' in this work.
The current work focuses specifically on interactions among atoms within the proteins since this issue is the central concern for computational chemistry applications. Close interactions with water and solvent molecules will be the focus of future work. Clashes generated from symmetry considerations were not counted for observations on the moderate-resolution data set since they are not optimized by the version of the Protein Preparation Wizard used in this study.
Interatomic contacts were calculated after removing hydrogen bonds from consideration. Because of the lack of certainty regarding the positions of H atoms, all donor–acceptor atom pairs that could potentially be involved in a hydrogen bond were also excluded from the list of close contacts, even if an H atom was not found directly between them. This conservative approach avoided over-reporting as clashes any interactions that might actually be hydrogen bonds. Where alternate conformations were found, only the conformation with the higher occupancy was considered for the calculation of clashes.
The definition of a clash most commonly used in protein crystallography derives from the work of Jane Richardson, David Richardson and coworkers (Word et al., 1999; Davis et al., 2007; Chen, Arendall et al., 2010). It is simply the overlap of two van der Waals surfaces by 0.4 Å or more. It is employed with an atomic radius of 1.00 Å for polar and aromatic H atoms and a radius of 1.17 Å for all other H atoms, resulting in clashes between two H atoms with the same radii at separations of 1.60 and 1.94 Å, respectively. Clashes between two H atoms in the current work occur at a separation of 0.8 times the sum of their van der Waals radii, i.e. 1.76 Å. For most other atoms the definition applied here is less strict than that applied by the Richardson group. The one exception is for O atoms, which have a smaller radius in the Richardson system, resulting in clashes between two O atoms at 2.40 Å separation compared with 2.53 Å in the current work. The Richardson atomic parameters were designed from various theoretical and practical considerations (Word et al., 1999) to yield a system in which all observed clashes were exceptional. The approach in the current work was to use the values for atomic radii (Rowland & Taylor, 1996) unadjusted and to observe from ultrahigh-resolution protein structures how frequently clashes might reasonably be expected to occur owing to the local molecular environment.
3. Results
3.1. Ultrahigh-resolution structures
Before exploring the close contacts (clashes and severe clashes as defined above) and structural geometry at moderate resolution, some perspective can be obtained on summary geometric statistics and the occurrence of clashes from ultra-high-resolution protein structures.
3.1.1. Clashes and severe clashes
Table 1 shows observations from 18 X-ray crystal structures with at least 0.9 Å resolution which were refined (by the authors of the respective structures) with H atoms present. (At this resolution hydrogen positions may have been guided by electron density, although electron density need not have been observed for all H atoms.) Over the entire set of structures, `clashes' and `severe clashes' were observed with a frequency of 1.5 and 0.6 occurrences per 100 residues, respectively.
‡A severe clash occurs when two atoms approach to within less than or equal to 0.7 times the sum of their van der Waals radii. |
All of the observed clashes were examined individually to determine their origin. The results are presented in Table 2. In 37% of the clashes a chemically implausible interaction was observed in which a hydroxyl or sulfhydryl H atom pointed directly at another H atom. The positions of these H atoms were not supported in any obvious way by the observed electron density. The most likely explanation is that these hydrogen positions were oversights in the model-building process. In addition, 21% of the close interactions identified occurred at positions where the heavy atoms to which the H atoms were attached did not fit the electron density well and appeared to be incorrectly positioned. Clashes were included in this category only if the electron-density map provided reasonable doubt as to the correctness of the structure and suggested a more attractive alternate position. A further 12% of close contacts could be removed by flipping or changing the tautomer of an asparagine, glutamine or histidine side chain.
|
Ultimately, only 30% of the observed clashes withstood critical examination and avoided being included in Table 2. In Table 1, the `corrected' columns offer a better estimate of the close contacts actually present in the structures. Thus, the frequency of bona fide `clashes' and `severe clashes' was 0.6 and 0.03 per 100 residues, respectively. Note that the second value is based on just a single observation (one severe clash in 2xu3 ).
3.1.2. Summary geometry statistics
The average r.m.s. deviations (r.m.s.d.s) of bond lengths and angles with respect to the Engh & Huber (1991) standard were 0.021 Å and 2.5°, respectively (Table 1). The significance of the former number might be questioned since at this resolution the refined bond lengths are likely to reflect the effects of restraints. However, atomic positions are observed accurately enough at this resolution such that bond angles can be precisely determined from the crystallographic results. Thus, the latter value may be significant. However, two observations must be considered in interpreting this bond-angle value. Firstly, the program SHELX (Sheldrick & Schneider, 1997) may apply bond-angle restraints (as 1,3-atom distance restraints) during although published information rarely allows one to deduce the effect of these possible restraints on bond angles. Normally, one might expect the net effect of restraints would be to narrow the distribution of values observed. At the same time, a critical observation is that high-resolution protein crystal structures very frequently have r.m.s. Z scores (Spronk et al., 2004) greater than 1, i.e. the standard deviation of bond angles for these structures are greater than what would be predicted from the work of Engh & Huber (1991). This observation (Joosten et al., 2009) is generally interpreted to mean that the bond angles are too widely distributed in very high resolution structures, possibly because these restraints are faulty owing to variations in bond lengths at high resolution. The possibility that the Engh and Huber parameters predict too narrow a distribution owing to biases in the small-molecule structures from which the parameters were derived is generally discounted.
The average r.m.s.d. from planarity for side-chain groups was 0.011 Å and the average standard deviation of the ω angle was 6.4°. For similar reasons, these two values may be considered to be reference points for the geometry of models at moderate resolution.
3.2. Moderate-resolution crystal structures as deposited
94 crystal structures with a broad range of sizes from several different ). Their most important common characteristics were recent deposition in the PDB and falling into the most highly populated resolution range typical for protein crystal structures (see §2 for further details on the selection of this data set).
programs were examined (Table 3
‡A clash occurs when two atoms approach to within less than or equal to 0.8 times the sum of their van der Waals radii but greater than 0.7 times that sum. §A severe clash occurs when two atoms approach to within less than or equal to 0.7 times the sum of their van der Waals radii. ¶L.Tresaugues, M. Welin, C. H. Arrowsmith, H. Berglund, C. Bountra, R. Collins, A. M. Edwards, S. Flodin, A. Flores, S. Graslund, M. Hammarstrom, I. Johansson, T. Karlberg, S. Kol, T. Kotenyova, E. Kouznetsova, M. Moche, T. Nyman, C. Persson, H. Schuler, P. Schutz, M. I. Siponen, A. G. Thorsell, S. Van der Berg, E. Wahlberg, J. Weigelt & P. Nordlund (unpublished work). |
3.2.1. Clashes and severe clashes
The occurrence of close contacts was enumerated after addition of H atoms and after careful optimization of H-atom positions without changing the coordinates of any non-H atoms. Table 3 shows the frequency of clashes and severe clashes for each protein. Their rates of occurrence per 100 residues were observed in a very broad range from 20.4 (3lpf ) to 0.0 (three instances) and from 2.9 (3lpf ) to 0.0 (35 instances) for clashes and severe clashes, respectively. Overall, clashes in the moderate-resolution structure set were observed at a frequency of 4.0 per 100 residues, over six times the rate of bona fide clashes in the ultrahigh-resolution set. Severe clashes were observed at a frequency of 0.5 per 100 residues, compared with 0.03 for bona fide severe clashes for the reference ultrahigh-resolution data set (Table 1).
3.2.2. Summary geometry statistics
As shown in Table 3, the bond-length r.m.s.d.s for the set of proteins varied over a wide range, from 0.004 Å for 3phe to 0.031 Å for 3lje , with an average of 0.014 Å. Bond-angle r.m.s.d.s varied from a minimum of 0.6° (3ni0 ) to a maximum of 2.5° (3nof ), with an average of 1.4°. The average r.m.s.d. for side-chain group planarity was 0.005 Å and the average peptide torsion-angle standard deviation was 5.1°. A more detailed examination and comparison of these summary statistics follows in §3.3.2.
3.3. All-atom with PrimeX
3.3.1. Clashes and severe clashes
The additional all-atom PrimeX applied to the moderate-resolution data set produced the structures characterized in Table 4. The frequency of regular clashes overall was 0.9 per 100 residues, which is well below the frequency originally observed for the ultrahigh-resolution set (1.5 per 100 residues; Table 1), but somewhat higher than the corrected value of 0.6 per 100 residues. The frequency of clashes overall was decreased more than fourfold from all-atom models derived from the coordinates as originally deposited. The frequency of severe clashes overall was 0.03 per 100 residues, the same value as obtained for the corrected ultrahigh-resolution structures (Table 1) and 17-fold lower than the frequency in the otherwise remediated moderate-resolution structures (Table 3). Seven of the 94 structures had neither type of clashes after all-atom All structures without clashes were solved at 2.2 Å resolution or better. 39 of the 94 structures had both no severe clashes and a lower frequency of clashes than the corrected ultrahigh-resolution structures.
in
|
Each of the residual clashes in the PrimeX-refined set was inspected with reference to a 2Fo − Fc composite OMIT map. Clear evidence of a better alternate interpretation of the electron density was present for 14% of the close contacts, owing to either large problems with the main-chain fit or to the need for a substantially different side-chain rotamer. Subtracting the number of clashes attributable to these issues from the total clashes provided an estimate of the frequency of bona fide regular clashes as 0.7 clashes per 100 residues, approaching the corrected frequency found in the ultrahigh-resolution structure set (0.6 clashes per 100 residues). Within this structural survey, many situations were observed to be ambiguous and were not counted. Thus, the level of clashes owing to model errors might actually have been somewhat higher. A very time-intensive comprehensive re-refinement of the structures would be required to confirm this suspicion, which is beyond the scope of the present work.
Clash frequencies derived from Tables 1, 3 and 4 are compared in Table 5, as well as with respect to the various programs. Unfortunately, results from BUSTER (Bricogne et al., 2011) were found to be relatively rare and only two instances were found in our data set. However, the statistics for these two proteins suggest improved results from this program regarding close contacts. None of the other programs even approached the values of 0.6–0.7 clashes per 100 residues that might be considered a reasonable target considering the results above.
‡Corrected for clashes owing to errors in the structures (see text); the numbers of uncorrected clashes and severe clashes per 100 residues are 0.9 and 0.03, respectively. |
3.3.2. Summary molecular geometry and statistics
Table 5 also provides a summary of the measures of molecular geometry over the three data sets in this study. The average bond-length r.m.s.d. for the PrimeX-refined proteins was 0.019 Å, compared with 0.015 Å for the original data set. The r.m.s. Z scores (Spronk et al., 2004) for bond lengths changed from an average value of 0.56 (0.15–1.27) as deposited to an average value of 0.89 (0.65–1.24) after PrimeX The ultrahigh-resolution set had a bond-length r.m.s.d. of 0.021 Å and a mean bond-length r.m.s. Z score of 0.87 (0.49–1.23), values that are very similar to those of the PrimeX-refined structures. Individual bond-length r.m.s. Z scores are available as Supplementary Material1.
Table 5 also allows comparison among the four programs originally used to refine the moderate-resolution data set. PHENIX and CNS clearly restrained bond lengths more tightly than did PrimeX. The bond-length r.m.s.d. for the REFMAC-refined set was not very different from the PrimeX-refined set.
The average bond-angle r.m.s.d. for PrimeX was 2.2°, which is somewhat larger than the average value of 1.4° over the original data set. The r.m.s. Z scores for bond angles changed from an average value of 0.71 (0.37–1.25) as deposited to an average value of 1.17 (1.00–1.49) after PrimeX The average bond-angle r.m.s.d. in the ultrahigh-resolution set was 2.5°, which is greater than that produced by any of the other programs, but closest to the value for PrimeX. The r.m.s. Z score for bond angles in the ultrahigh-resolution set was 1.12 (0.79–1.46), which is also similar to that of the PrimeX-refined structures. Individual bond-angle r.m.s. Z scores are available in the Supplementary Material1.
The average side-chain group planarity r.m.s.d. for PrimeX was 0.006 Å, which is a fairly typical value for this quantity among the programs. Side-chain planarity deviations were all small and not very different between these two data sets, nor did they differ much by program (Table 5).
The average ω-angle standard deviation for PrimeX, 7.1°, was larger than for any of the programs used to produce the original moderate-resolution data set, but compared well with the value of 6.4° obtained for the ultrahigh-resolution set (Table 5). The values among all the programs could be described as a range of values from 4.8° to 5.7°, with two outliers near 2° for BUSTER and CNS/CNX. The number of examples of BUSTER-refined proteins was too small to draw a conclusion. However, CNS and CNX clearly often restrain the ω angle very tightly. This issue was originally observed by Priestle (2003). Note that the overall average for the ω angles did not represent the situation well, since two of the 12 structures in the CNS/CNX had standard deviations in the normal range (Table 3). These two uncharacteristic CNS/CNX structures indicate that at least a few users of CNS/CNX have taken steps to loosen these peptide-bond planarity restraints. Ten of the 12 members of the CNS/CNX had standard deviations for ω of 1.5° or less, implying flattened peptide bonds throughout these crystal structures. The ω-angle standard deviation did not vary much among the other programs. The average value for REFMAC did not stand out as the one from CNS/CNX does. At the same time, some users of REFMAC did very tightly restrain peptide bonds. The lowest standard deviation for ω was not from among the CNS/CNX-refined structures, but instead was produced by REFMAC (3pgj ; 0.9°).
Changes in overall structure quality owing to PrimeX as judged by the Ramachandran Z scores (Spronk et al., 2004), were generally small. Only one change was noted as significant by the program WHAT_CHECK. This change was from an original value of −3.16 for the protein 3nl6 as originally deposited to a value of −2.11 after PrimeX The mean Ramachandran Z score changed from −0.37 (range −3.16 to +3.69) as originally deposited to −0.51 (range −2.56 to +3.05) after PrimeX Individual Ramachandran Z scores before and after PrimeX are shown in the Supplementary Material1.
The average Rfree value over the moderate-resolution set was the same with or without the additional PrimeX all-atom (0.243 versus 0.242; Tables 2 and 3). The average for all working R values was somewhat lower for the PrimeX-refined structures (0.193) versus the average from the original structures (0.207).
3.4. Additional benefits from all-atom refinement
The advantages of all-atom PrimeX refinements in this study illustrate how a detailed description of nonbonded contacts influenced and improved the results of refinement.
of structures at moderate resolution extend well beyond the prevention and remediation of clashes. A few examples from the3.4.1. Repositioning of a methionine methyl group
Fig. 1 provides an example in which all-atom used in PrimeX led to a significant improvement in the structural model. In PDB entry 3phe , clashes of the C∊ and associated H atoms of MetC187 with atoms from LeuC293 and TyrC296 suggest that at least one of these residues is in the wrong position. PrimeX using the `polish' workflow moved the methyl group as shown in Fig. 1 without manual intervention. The 2Fo − Fc electron-density map as shown did not give any clear indication of the correct position for this methyl group. However, the position as deposited was unfavourable and unlikely to be correct as judged from the observed clashes. The new position for the methionine methyl group relieved all close contacts and was confirmed by a small pair of negative and positive difference features in an Fo − Fc map (result not shown). The program CNX did not correct this situation. A reasonable hypothesis for why it did not do so is that the interactions between the methionine methyl group and the other two residues, as represented through a united-atom model in CNX, were not unfavorable enough to cause a change in the positions of these atoms.
3.4.2. Backbone change to relieve clash leads to additional ligand hydrogen bonds
In PDB entry 3nl6 , atoms in the side chain of ValC209 clash with the side chain of ValC15 (Fig. 2). In producing the all-atom model derived from this structure through energy minimization, these interactions were sufficiently repulsive that the bond angles around Cβ of ValC15 were distorted rather than allowing atoms to overlap to such an extreme extent. The close contact was relieved during PrimeX using the `polish' workflow by motion of residues C209 and C210 away from residue C15 and towards the bound thiamine phosphate (TPS), as shown in Fig. 2. This side-chain motion occurred with a change in the conformation of the main chain for residue C209. This change in the backbone position and a few other more subtle atomic shifts provided multiple additional hydrogen-bond interactions between the protein and the TPS molecule, a difference that has potentially major implications for the understanding of TPS binding. This large structural change during was probably related to the resolution of the strain of close contacts in the model, but electrostatic gradients or other influences during could also play a role. Assuming that the program was used as intended and in the absence of any indication to the contrary (Paul et al., 2010), phenix.refine seems to have tolerated these severe implied all-atom clashes during refinement.
3.4.3. of two side-chain positions provides a new view of ADP binding
In its original position in PDB entry 3pdt , as refined in REFMAC, a clash occurred between GlnA758 and PheA720 in the all-atom structure (Fig. 3). The change in structure after PrimeX using the `polish' workflow is hypothesized to have occurred through the following chain of events. The movement of the GlnA758 side chain was first driven by relief of this clash. Concurrently, the LysA722 side chain was moved towards the phosphate group of the ADP molecule under the influence of both the force field and electron-density gradients, which also required the motion of the glutamine to avoid the formation of a clash with the lysine. Whatever the causes, the result was a large coordinated movement of the lysine and glutamine side chains which was dramatic both in terms of the extent of the motion of the glutamine side chain and in the difference in the key interactions observed for the binding of ADP to this protein.
The molecular model as originally deposited contains several side chains, including GlnA758, that are misfitted and thus this structure might be considered by some to be a poor candidate for automated , residue GlnA758 is positioned outside of the anticipated radius of convergence for However, one conclusion is clear: REFMAC was tolerant of the implied clash as described above either because it was designed to behave so or because a decision by the users (Crawley et al., 2011) caused REFMAC to behave in this way. PrimeX all-atom is not tolerant of such interactions because of the highly unfavorable energetics calculated for such an interaction and it does not allow users to modify its behavior to tolerate such interactions without extraordinary efforts. Even when considered in this context, the ability of the automated PrimeX polish workflow to improve the model in the manner described in Fig. 3 is encouraging.
In this alternate view of the situation shown in Fig. 34. Discussion
4.1. Summary geometry statistics for PrimeX and other programs
While this study was primarily focused on close nonbonded contacts and
using an all-atom model, other issues regarding molecular geometry were also of interest and might best be discussed first. Only moderate-resolution structures deposited and released in 2010 were used in this study to ensure that the results reflected current practices in protein crystallography, especially with respect to geometric restraints.Use of the OPLS all-atom force field in PrimeX produced reasonable results with respect to summary geometry that were in line with other programs in terms of bond-length deviation and side-chain group planarity (Table 5). The results from the two other summary geometry descriptors monitored here deserve additional comment.
The average of the bond-angle r.m.s.d.s for PrimeX (2.2°) is greater than for any of the other programs that created the original moderate-resolution structure set (range 1.1–1.7°; Table 5). However, the observation from the ultrahigh-resolution data set of an average bond-angle r.m.s.d. of 2.5° (range 1.4–3.1°; Table 1) clearly suggests that this r.m.s.d. is reasonable.
Over-restraint of ω angles in CNS/CNX has been recognized for several years (Priestle, 2003). Considering the time that has passed since this publication, the number of structures from BUSTER, CNS/CNX and REFMAC observed with very low deviation of ω angles is hard to understand. While over-restraint is easy to recognize, the correct degree of variability is less easy to define. MacArthur & Thornton (1996) suggested from their study of proteins and small that a standard deviation of 6° is appropriate.
Forcing bond angles or torsion angles toward idealized values does carry a risk. If an interaction such as a nonbonded repulsion has driven a particular torsion or bond angle away from the idealized value, restraining it to be closer to the idealized value must make that other interaction more unfavorable. Thus, the result of strictly enforcing these ideal values could be an increase in the number or severity of clashes.
That the same Rfree was obtained with our force-field-based restraints as with Engh and Huber restraints suggests that these restraints are reasonably consistent with protein crystal structures. However, taken together, the decrease in the average R factor (Rwork), the relatively large r.m.s.d. for bond angles compared with the deposited structures and the somewhat larger standard deviation for the ω angle above the optimal value conceived by MacArthur & Thornton (1996) could be interpreted as evidence that the restraints employed may require further tuning to decrease the risk of overfitting. This consideration will be examined in future publications.
4.2. Advantages of all-atom with PrimeX
All-atom PrimeX resulted in a more than fourfold decrease in the number of clashes and a 17-fold decrease in the number of severe clashes. This improvement in model quality was achieved without sacrificing the goodness of fit to the X-ray data as judged by the average Rfree values. Importantly, these models also display good summary statistics, so that the protein models comply with reasonable molecular-geometry expectations.
of moderate-resolution protein crystal structures withAll-atom PrimeX to fix errors that other programs missed and provided a more accurate picture of critical protein features such as protein–ligand interactions, as illustrated in Figs. 1, 2 and 3. Resolution of clashes during can help to `push' the structure into the correct conformation, producing potentially remarkably large changes in conformation. They may also serve the role of preventing the structure from entering nonproductive conformations that are otherwise allowed in a less restrictive all-atom model.
with a force field allowedThese results also contain an indication of the limits of usefulness of the polish workflow. Fully 85% of the structures that entered the workflow with 50 or more total clashes (Table 3) resulted in an increase in Rfree (Tables 3 and 4). A large number of clashes is a warning sign that the structure may contain errors that could have negative consequences after the application of this process.
4.3. Reducing clashes in deposited X-ray models
The REFMAC and phenix.refine both have the capability to use riding H atoms during One might reasonably expect that the use of this feature would have a positive impact on the issue of clashes in all-atom models. Unfortunately, the extent to which the riding H atom option is actually employed in is impossible to determine in many cases, even after consulting both the primary literature references and the PDB entry. The lack of definitive information on this issue makes it nearly impossible to determine from these experiments whether these programs are capable of reducing clashes to the levels deduced to be reasonable goals from the ultrahigh-resolution structures or from the PrimeX-refined structures. Comparing the frequency of clashes in Tables 3 and 4, one can only conclude that either (i) the riding H atom models and nonbonded contact restraints do not make as much difference as one might expect, (ii) the riding H atom option is very rarely used in these two programs or (iii) both are true. At the very least, one may safely conclude that some attribute of these programs or the way that they are being used must change before either of these programs can be considered to be part of the solution to this problem.
programsA role for CNS in curbing clashes is also currently available. CNS can be employed with a more complex energy model than is routinely used by crystallographers. As well as deploying an Engh and Huber-based restraint system, CNS is distributed with a force field that includes Lennard–Jones and electrostatic terms and that is regularly used for the determination of NMR structures (Linge et al., 2003). This force field has been employed to produce some very high quality NMR structures (see, for example, Nozinovic et al., 2010).
What else can be done to reduce the number of clashes in deposited structures? Perhaps the answer to this question resides in the standards for structure deposition in the PDB. A committee of the PDB is currently working on structure-validation tools for use associated with the deposition of coordinates (Read et al., 2011). A reason for optimism is that the work of the Richardson group was included in the report of the committee. From the point of view of many users of protein structures, the deposition of all-atom models derived from protein crystal structures should be required. Clashes determined from an all-atom model should be, at the very least, measured and documented for all protein models that are deposited, just as other outliers to molecular-geometry standards are now listed in the entry header.
To achieve a higher standard for deposited protein structures, additional tools that are sensitive to close contacts could help. PrimeX can contribute to this goal, and the automated polish workflow presented here was designed to achieve this goal with the minimum of human intervention. However, the workflow was designed with the assumption that the coordinates on which it would operate would be essentially free of errors in the main-chain tracing or side-chain rotamer selection. The prevalence of such errors in the data set examined here established the need for additional automated structure tools with the capability of making large changes in side-chain torsion angles or chain trace. Design of these workflows is in progress based on the tool set in the PrimeX package (Bell et al., 2012). Although similar automated workflows exist for phenix.refine (Afonine et al., 2005) and indirectly for REFMAC (Murshudov et al., 2011) through the program SideAide in the PDB_REDO pipeline (Joosten et al., 2011), the frequency of clashes in structures refined by phenix.refine and REFMAC (Tables 3 and 5) raises the question whether these automated workflows can address the issue of all-atom clashes, no matter how capable and thorough these workflows are intended to be. While the program MolProbity (Davis et al., 2007; Chen, Arendall et al., 2010) is aimed at solving the right problem, these same results show that it is not being adequately employed to deal with the problem at hand.
The results presented here also highlighted a lack of attention to detail during
in the ultrahigh-resolution protein data set. H atoms should not be placed in chemically impossible positions when, by all appearances, convincing electron density at those positions is lacking. In addition, clashes highlighted several clear errors in the coordinates of non-H atoms.5. Conclusion
This study documents the existence of numerous unnecessary close contacts, including many severe ones, implicit in united-atom models deposited in the PDB. Many of these close contacts can be readily removed, and doing so need not damage the agreement of the model with the observed X-ray diffraction data. Furthermore, attention to close contacts can bring to light errors in the placement of non-H atoms in protein et al., 1999, Davis et al., 2007; Chen, Arendall et al., 2010).
models. This latter point has also been made abundantly clear by over a decade of work by Jane Richardson, David Richardson and coworkers (WordClashes are detrimental to advances in the various branches of science that depend on protein et al., 2009) is partially an outgrowth of this problem and an indication that this anticipated consequence is already becoming a reality.
models, such as protein design and drug discovery. Normally, scientists working in these areas are not in a position to evaluate the reliability of each protein nor are they able to judge whether the effects of remediation of crystal structures might result in different sorts of errors. If crystallographers, who are of course in the best position to do so, do not address these issues, then eventually other scientists will. The result will be that protein crystallographers will have less control over the form in which their experimental results are archived and deployed. The advent of remediated database alternatives to the PDB (JoostenBoth the expected bond-length and bond-angle parameters of Engh & Huber (1991) and the parameterization of van der Waals radii by Rowland & Taylor (1996) are equally well grounded in high-resolution small-molecule crystallographic results. In protein crystallography the former geometric statistics are very strictly applied, while the latter receive much less attention. An understandable explanation for this contrast is the absence of H-atom coordinates in classic protein models. However, if the requirements for the highest quality protein models possible are to be met, the consideration of nonbonded contacts in all-atom models must become more prominent.
Acknowledgements
Expert assistance in workflow scripting by Shawn Watts and Dave Giesen, and helpful discussions with Tyler Day are gratefully acknowledged.
References
Abu Tarboush, N., Jensen, L. M., Feng, M., Tachikawa, H., Wilmot, C. M. & Davidson, V. L. (2010). Biochemistry, 49, 9783–9791. Web of Science CrossRef CAS PubMed Google Scholar
Adams, P. D., Pannu, N. S., Read, R. J. & Brünger, A. T. (1997). Proc. Natl Acad. Sci. USA, 94, 5018–5023. CrossRef CAS PubMed Web of Science Google Scholar
Adams, C. J., Pike, A. C., Maniam, S., Sharpe, T. D., Coutts, A. S., Knapp, S., La Thangue, N. B. & Bullock, A. N. (2012). Proc. Natl. Acad. Sci. USA, 109, 3778–3783. Web of Science CrossRef CAS PubMed Google Scholar
Afonine, P. V., Grosse-Kunstleve, R. W. & Adams, P. D. (2005). CCP4 Newsl. 42, contribution 8. Google Scholar
Afonine, P. V., Grosse-Kunstleve, R. W., Chen, V. B., Headd, J. J., Moriarty, N. W., Richardson, J. S., Richardson, D. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2010). J. Appl. Cryst. 43, 669–676. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ali, A., Reddy, G. S., Nalam, M. N., Anjum, S. G., Cao, H., Schiffer, C. A. & Rana, T. M. (2010). J. Med. Chem. 53, 7699–7708. Web of Science CrossRef CAS PubMed Google Scholar
Allen, F. H. (2002). Acta Cryst. B58, 380–388. Web of Science CrossRef CAS IUCr Journals Google Scholar
Artz, J. D., Wernimont, A. K., Dunford, J. E., Schapira, M., Dong, A., Zhao, Y., Lew, J., Russell, R. G., Ebetino, F. H., Oppermann, U. & Hui, R. (2011). J. Biol. Chem. 286, 3315–3322. Web of Science CrossRef CAS PubMed Google Scholar
Attolino, E., Calderone, V., Dragoni, E., Fragai, M., Richichi, B., Luchinat, C. & Nativi, C. (2010). Eur. J. Med. Chem. 45, 5919–5925. Web of Science CrossRef CAS PubMed Google Scholar
Banks, J. L. et al. (2005). J. Comput. Chem. 26, 1752–1780. Web of Science CrossRef PubMed CAS Google Scholar
Begley, D. W., Hartley, R. C., Davies, D. R., Edwards, T. E., Leonard, J. T., Abendroth, J., Burris, C. A., Bhandari, J., Myler, P. J., Staker, B. L. & Stewart, L. J. (2011). J. Struct. Funct. Genomics, 12, 63–76. CrossRef CAS PubMed Google Scholar
Bell, J. A., Cao, Y., Gunn, J. R., Day, T., Gallicchio, E., Zhou, Z., Levy, R. & Farid, R. (2012). International Tables for Crystallography, Vol. F, edited by E. Arnold, D. M. Himmel & M. G. Rossmann, pp. 534–538. Chichester: John Wiley & Sons. Google Scholar
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucleic Acids Res. 28, 235–242. Web of Science CrossRef PubMed CAS Google Scholar
Biswas, T., Small, J., Vandal, O., Odaira, T., Deng, H., Ehrt, S. & Tsodikov, O. V. (2010). Structure, 18, 1353–1363. Web of Science CrossRef CAS PubMed Google Scholar
Blaszczyk, J., Li, Y., Shi, G., Yan, H. & Ji, X. (2003). Biochemistry, 42, 1573–1580. Web of Science CrossRef PubMed CAS Google Scholar
Bondi, A. (1964). J. Phys. Chem. 68, 441–451. CrossRef CAS Web of Science Google Scholar
Bönisch, H., Schmidt, C. L., Bianco, P. & Ladenstein, R. (2005). Acta Cryst. D61, 990–1004. Web of Science CrossRef IUCr Journals Google Scholar
Borshchevskiy, V., Molseeva, E., Kuklin, A., Büldt, G., Hato, M. & Gordeliy, V. (2010). J. Cryst. Growth, 312, 3326–3330. Web of Science CrossRef CAS Google Scholar
Bosch, D. E., Kimple, A. J., Sammond, D. W., Muller, R. E., Miley, M. J., Machius, M., Kuhlman, B., Willard, F. S. & Siderovski, D. P. (2011). J. Biol. Chem. 286, 3351–3358. Web of Science CrossRef CAS PubMed Google Scholar
Bricogne, G., Blanc, E., Brandl, M., Flensburg, C., Keller, P., Paciorek, W., Roversi, P., Sharff, A., Smart, O. S., Vonrhein, C. & Womack, T. O. (2011). BUSTER. Cambridge: Global Phasing Ltd. Google Scholar
Brünger, A. T. (1989). Acta Cryst. A45, 42–50. CrossRef Web of Science IUCr Journals Google Scholar
Brünger, A. T. (1992). X-PLOR Version 3.1. A System for X-ray Crystallography and NMR. New Haven: Yale University Press. Google Scholar
Brünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J.-S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T. & Warren, G. L. (1998). Acta Cryst. D54, 905–921. Web of Science CrossRef IUCr Journals Google Scholar
Brünger, A. T., Clore, G. M., Gronenborn, A. M. & Karplus, M. (1986). Proc. Natl Acad. Sci. USA, 83, 3801–3805. PubMed Web of Science Google Scholar
Brünger, A. T. & Karplus, M. (1988). Proteins, 4, 148–156. CrossRef CAS PubMed Web of Science Google Scholar
Brünger, A. T., Karplus, M. & Petsko, G. A. (1989). Acta Cryst. A45, 50–61. CrossRef IUCr Journals Google Scholar
Čapkauskaitė, E., Baranauskienė, L., Golovenko, D., Manakova, E., Gražulis, S., Tumkevičius, S. & Matulis, D. (2010). Bioorg. Med. Chem. 18, 7357–7364. Web of Science PubMed Google Scholar
Carrillo, D. R., Parthier, C., Jänckel, N., Grandke, J., Stelter, M., Schilling, S., Boehme, M., Neumann, P., Wolf, R., Demuth, H. U., Stubbs, M. T. & Rahfeld, J. U. (2010). J. Biol. Chem. 391, 1419–1428. CAS Google Scholar
Chatwell, L., Holla, A., Kaufer, B. B. & Skerra, A. (2008). Mol. Immunol. 45, 1981–1994. Web of Science CrossRef PubMed CAS Google Scholar
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12–21. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chen, Z., Pelc, L. A. & De Cera, E. (2010). Proc. Natl Acad. Sci. USA, 107, 19278–19283. Web of Science CrossRef CAS PubMed Google Scholar
Chen, B., Sysoeva, T. A., Chowdhury, S., Guo, L., De Carlo, S., Hanson, J. A., Yang, H. & Nixon, B. T. (2010). Structure, 18, 1420–1430. Web of Science CrossRef CAS PubMed Google Scholar
Chufán, E. E., Prigge, S. T., Siebert, X., Eipper, B. A., Mains, R. E. & Amzel, L. M. (2010). J. Am. Chem. Soc. 132, 15565–15572. Web of Science PubMed Google Scholar
Clasquin, M. F., Melamud, E., Singer, A., Gooding, J. R., Xu, X., Dong, A., Cui, H., Campagna, S. R., Savchenko, A., Yakunin, A. F., Rabinowitz, J. D. & Caudy, A. A. (2011). Cell, 145, 969–980. Web of Science CrossRef CAS PubMed Google Scholar
Compton, J. R., Legler, P. M., Clingan, B. V., Olson, M. A. & Millard, C. B. (2011). Proteins, 79, 1048–1060. Web of Science CrossRef CAS PubMed Google Scholar
Cook, W. J., Smith, C. D., Senkovich, O., Holder, A. A. & Chattopadhyay, D. (2010). Acta Cryst. F66, 1426–1431. Web of Science CrossRef IUCr Journals Google Scholar
Corminboeuf, O., Bezençon, O., Grisostomi, C., Remeň, L., Richard-Bildstein, S., Bur, D., Prade, L., Hess, P., Strickner, P., Fischli, W., Steiner, B. & Treiber, A. (2010). Bioorg. Med. Chem. Lett. 20, 6286–6290. Web of Science CrossRef CAS PubMed Google Scholar
Coulson, C. A. & Thomas, M. W. (1971). Acta Cryst. B27, 1354–1359. CrossRef CAS IUCr Journals Web of Science Google Scholar
Crawley, S. W., Gharaei, M. S., Ye, Q., Yang, Y., Raveh, B., London, N., Schueler-Furman, O., Jia, Z. & Côté, G. P. (2011). J. Biol. Chem. 286, 2607–2616. Web of Science CrossRef CAS PubMed Google Scholar
Dasgupta, J., Blenkowska-Haba, M., Ortega, M. E., Patel, H. D., Bodevin, S., Spillmann, D., Bishop, B., Sapp, M. & Chen, X. S. (2011). J. Biol. Chem. 286, 2617–2624. Web of Science CrossRef CAS PubMed Google Scholar
Davis, I. W., Leaver-Fay, A., Chen, V. B., Block, J. N., Kapral, G. J., Wang, X., Murray, L. W., Arendall, W. B., Snoeyink, J., Richardson, J. S. & Richardson, D. C. (2007). Nucleic Acid Res. 35, W375–W383. Web of Science CrossRef PubMed Google Scholar
Diebold, M. L., Koch, M., Loeliger, E., Cura, V., Winston, F., Cavarelli, J. & Romier, C. (2010). EMBO J. 29, 3979–3991. Web of Science CrossRef CAS PubMed Google Scholar
Eidam, O., Romagnoli, C., Caselli, E., Babaoglu, K., Pohlhaus, D. T., Karpiak, J., Bonnet, R., Shoichet, B. K. & Prati, F. (2010). J. Med. Chem. 53, 7852–7863. Web of Science CrossRef CAS PubMed Google Scholar
Empadinhas, N., Pereira, P. J., Albuquerque, L., Costa, J., Sá-Moura, B., Marques, A. T., Macedo-Ribeiro, S. & da Costa, M. S. (2011). Mol. Microbiol. 79, 76–93. Web of Science CrossRef CAS PubMed Google Scholar
Engh, R. A. & Huber, R. (1991). Acta Cryst. A47, 392–400. CrossRef CAS Web of Science IUCr Journals Google Scholar
Esposito, L., Vitagliano, L., Sica, F., Sorrentino, G., Zagari, A. & Mazzarella, L. (2000). J. Mol. Biol. 297, 713–732. Web of Science CrossRef PubMed CAS Google Scholar
Feng, Z., Westbrook, J. & Berman, H. M. (1998). NUCheck. Report NDB-407, Rutgers University, New Brunswick, New Jersey, USA. Google Scholar
Fenn, T. D., Ringe, D. & Petsko, G. A. (2004). Biochemistry, 43, 6464–6474. Web of Science CrossRef PubMed CAS Google Scholar
Fenn, T. D. & Schnieders, M. J. (2011). Acta Cryst. D67, 957–965. Web of Science CrossRef CAS IUCr Journals Google Scholar
Fenn, T. D., Schnieders, M. J., Brunger, A. T. & Pande, V. S. (2010). Biophys. J. 98, 2984–2992. Web of Science CrossRef CAS PubMed Google Scholar
Fenn, T. D., Schnieders, M. J., Mustyakimov, M., Wu, C., Langan, P., Pande, V. S. & Brunger, A. T. (2011). Structure, 19, 523–533. Web of Science CrossRef CAS PubMed Google Scholar
Fokina, O., Chellamuthu, V.-R., Forchhammer, K. & Zeth, K. (2010). Proc. Natl Acad. Sci. USA, 107, 19760–19765. Web of Science CrossRef CAS PubMed Google Scholar
Fuhrmann, C. N., Daugherty, M. D. & Agard, D. A. (2006). J. Am. Chem. Soc. 128, 9086–9102. Web of Science CrossRef PubMed CAS Google Scholar
Gallicchio, E., Zhang, L. Y. & Levy, R. M. (2002). J. Comput. Chem. 23, 517–529. Web of Science CrossRef PubMed CAS Google Scholar
Ghosh, A., Rapp, C. S. & Friesner, R. A. (1998). J. Phys. Chem. B, 102, 10983–10990. Web of Science CrossRef CAS Google Scholar
Hakanpää, J., Linder, M., Popov, A., Schmidt, A. & Rouvinen, J. (2006). Acta Cryst. D62, 356–367. Web of Science CrossRef IUCr Journals Google Scholar
Hall, G., Bradshaw, T. D., Laughton, C. A., Stevens, M. F. & Emsley, J. (2011). Protein Sci. 20, 210–215. Web of Science CrossRef CAS PubMed Google Scholar
Han, L., Monné, M., Okumura, H., Schwend, T., Cherry, A. L., Flot, D., Matsuda, T. & Jovine, L. (2010). Cell, 143, 404–415. Web of Science CrossRef CAS PubMed Google Scholar
Hardegger, L. A., Kuhn, B., Spinnier, B., Anselm, L., Ecabert, R., Stihle, M., Gsell, B., Thoma, R., Diez, J., Benz, J., Plancher, J. M., Hartmann, G., Banner, D. W., Haap, W. & Diederich, F. (2011). Agnew. Chem. Int. Ed. Engl. 50, 314–318. CrossRef CAS Google Scholar
Harmat, V., Domokos, K., Menyhárd, D. K., Palló, A., Szeltner, Z., Szamosi, I., Beke-Somfai, T., Náray-Szabó, G. & Polgár, L. (2011). J. Biol. Chem. 286, 1987–1998. Web of Science CrossRef CAS PubMed Google Scholar
Hee, C. S., Gao, S., Loll, B., Miller, M. M., Uchanska-Ziegler, B., Daumke, O. & Ziegler, A. (2010). PLoS Biol. 8, e1000557. Web of Science CrossRef PubMed Google Scholar
Heldman, N., Vonshak, O., Sevier, C. S., Vitu, E., Mehlman, T. & Fass, D. (2010). Protein Sci. 19, 1863–1876. Web of Science CrossRef CAS PubMed Google Scholar
Hendrickson, W. A. (1985). Methods Enzymol. 115, 252–270. CrossRef CAS PubMed Google Scholar
Hooft, R. W. W., Sander, C. & Vriend, G. (1996). Proteins, 26, 363–376. CrossRef CAS PubMed Google Scholar
Hooft, R. W. W., Vriend, G., Sander, C. & Abola, E. E. (1996). Nature (London), 381, 272. CrossRef PubMed Web of Science Google Scholar
Husain, N., Obranic, S., Koscinski, L., Seetharaman, J., Babic, F., Bujnicki, J. M., Maravic Vlahovicek, G. & Sivaraman, J. (2011). Nucleic Acids Res. 39, 1903–1918. Web of Science CrossRef CAS PubMed Google Scholar
Jacques, D. A., Langley, D. B., Hynson, R. M., Whitten, A. E., Kwan, A., Guss, J. M. & Trewhella, J. (2011). J. Mol. Biol. 405, 214–226. Web of Science CrossRef CAS PubMed Google Scholar
Jiang, J.-S. & Brünger, A. T. (1994). J. Mol. Biol. 243, 100–115. CrossRef CAS PubMed Web of Science Google Scholar
Joosten, R. P. et al. (2009). J. Appl. Cryst. 42, 376–384. Web of Science CrossRef CAS IUCr Journals Google Scholar
Joosten, R. P., Joosten, K., Cohen, S. X., Vriend, G. & Perrakis, A. (2011). Bioinformatics, 27, 3392–3398. Web of Science CrossRef CAS PubMed Google Scholar
Jorgensen, W. L., Maxwell, D. S. & Tirado-Rives, J. (1996). J. Am. Chem. Soc. 118, 11225–11236. CrossRef CAS Web of Science Google Scholar
Kaminski, G., Friesner, R. A., Tirado-Rives, J. & Jorgensen, W. L. (2001). J. Phys. Chem. B, 105, 6474–6487. Web of Science CrossRef CAS Google Scholar
Ke, W., Sampson, J. M., Ori, C., Prati, F., Drawz, S. M., Bethel, C. R., Bonomo, R. A. & van den Akker, F. (2011). Antimicrob. Agents Chemother. 55, 174–183. Web of Science CrossRef CAS PubMed Google Scholar
Khan, H., Barna, T., Harris, R. J., Bruce, N. C., Barsukov, I., Munro, A. W., Moody, P. C. & Scrutton, N. S. (2004). J. Biol. Chem. 279, 30563–30572. Web of Science CrossRef PubMed CAS Google Scholar
Khan, M. Q., Sweeting, B., Mulligan, V. K., Arslan, P. E., Cashman, N. R., Pai, E. F. & Chakrabartty, A. (2010). Proc. Natl Acad. Sci. USA, 107, 19808–19813. Web of Science CrossRef CAS PubMed Google Scholar
Knight, J. L., Zhou, Z., Gallicchio, E., Himmel, D. M., Friesner, R. A., Arnold, E. & Levy, R. M. (2008). Acta Cryst. D64, 383–396. Web of Science CrossRef IUCr Journals Google Scholar
Ko, T.-P., Robinson, H., Gao, Y.-G., Cheng, C.-H., DeVries, A. L. & Wang, A. H.-J. (2003). Biophys. J. 84, 1228–1237. CrossRef PubMed CAS Google Scholar
Konnert, J. H. (1976). Acta Cryst. A32, 614–617. CrossRef CAS IUCr Journals Web of Science Google Scholar
Konnert, J. H. & Hendrickson, W. A. (1980). Acta Cryst. A36, 344–350. CrossRef CAS IUCr Journals Google Scholar
Koparde, V. N., Scarsdale, J. N. & Kellogg, G. E. (2011). PLoS One, 6, e15920. Web of Science CrossRef PubMed Google Scholar
Kumar, J. & Mayer, M. L. (2010). J. Mol. Biol. 404, 680–696. Web of Science CrossRef CAS PubMed Google Scholar
Kumar, D. V., Rai, R., Brameld, K. A., Somoza, J. R., Rajagopalan, R., Janc, J. W., Xia, Y. M., Ton, T. L., Shaghafi, M. B., Hu, H., Lehoux, I., To, N., Young, W. B. & Green, M. J. (2011). Bioorg. Med. Chem. Lett. 21, 82–87. Web of Science CrossRef CAS PubMed Google Scholar
Laskowski, R. A., MacArthur, M. W., Moss, D. S. & Thornton, J. M. (1993). J. Appl. Cryst. 26, 283–291. CrossRef CAS Web of Science IUCr Journals Google Scholar
Lee, Y. T., Glazer, E. C., Wilson, R. F., Stout, C. D. & Goodin, D. B. (2011). Biochemistry, 50, 693–703. Web of Science CrossRef CAS PubMed Google Scholar
Lejon, S., Thong, S. Y., Murthy, A., AlQarni, S., Murzina, N. V., Blobel, G. A., Laue, E. D. & Mackay, J. P. (2011). J. Biol. Chem. 286, 1196–1203. Web of Science CrossRef CAS PubMed Google Scholar
Li, J., Abel, R., Zhu, K., Cao, Y., Zhao, S. & Friesner, R. A. (2011). Proteins, 79, 2794–2812. Web of Science CrossRef CAS PubMed Google Scholar
Li, H., Tong, S., Li, X., Shi, H., Ying, Z., Gao, Y., Ge, H., Niu, L. & Teng, M. (2011). Cell Res. 21, 1039–1051. Web of Science CrossRef CAS PubMed Google Scholar
Liebschner, D., Elias, M., Moniot, S., Fournier, B., Scott, K., Jelsch, C., Guillot, B., Lecomte, C. & Chabrière, E. (2009). J. Am. Chem. Soc. 131, 7879–7886. Web of Science CrossRef PubMed CAS Google Scholar
Lin, F.-Y., Liu, C.-I., Liu, Y.-L., Zhang, Y., Wang, K., Jeng, W. Y., Ko, T.-P., Cao, R., Wang, A. H.-J. & Oldfield, E. (2010). Proc. Natl Acad. Sci. USA, 107, 21337–21342. Web of Science CrossRef CAS PubMed Google Scholar
Linge, J. P., Williams, M. A., Spronk, C. A., Bonvin, A. M. & Nilges, M. (2003). Proteins, 50, 496–506. Web of Science CrossRef PubMed CAS Google Scholar
MacArthur, M. W. & Thornton, J. M. (1996). J. Mol. Biol. 264, 1180–1195. CrossRef CAS PubMed Web of Science Google Scholar
Matsumoto, K., Okamoto, K., Ashizawa, N. & Nishino, T. (2010). J. Pharmacol. Exp. Ther. 336, 95–103. Web of Science CrossRef PubMed Google Scholar
Minasov, G., Wang, X. & Shoichet, B. K. (2002). J. Am. Chem. Soc. 124, 5333–5340. Web of Science CrossRef PubMed CAS Google Scholar
Morin, A., Kaufmann, K. W., Fortenberry, C., Harp, J. M., Mizoue, L. S. & Meiler, J. (2011). Protein Eng. Des. Sel. 24, 503–516. Web of Science CrossRef CAS PubMed Google Scholar
Morita, H., Wanibuchi, K., Nii, H., Kato, R., Sugio, S. & Abe, I. (2010). Proc. Natl Acad. Sci. USA, 107, 19778–19783. Web of Science CrossRef CAS PubMed Google Scholar
Moulinier, L., Case, D. A. & Simonson, T. (2003). Acta Cryst. D59, 2094–2103. Web of Science CrossRef CAS IUCr Journals Google Scholar
Murakawa, T., Machida, Y. & Hayashi, H. (2011). J. Biol. Chem. 286, 2774–2784. Web of Science CrossRef CAS PubMed Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Murshudov, G. N., Vagin, A. A. & Dodson, E. J. (1997). Acta Cryst. D53, 240–255. CrossRef CAS Web of Science IUCr Journals Google Scholar
Natesh, R., Manikandan, K., Bhanumoorthy, P., Viswamitra, M. A. & Ramakumar, S. (2003). Acta Cryst. D59, 105–117. Web of Science CrossRef CAS IUCr Journals Google Scholar
Neu, U., Maginnis, M. S., Palma, A. S., Ströh, L. J., Nelson, C. D., Feizi, T., Atwood, W. J. & Stehle, T. (2010). Cell Host Microbe, 8, 309–319. Web of Science CrossRef CAS PubMed Google Scholar
Nicodeme, E. et al. (2010). Nature (London), 468, 1119–1123. Web of Science CrossRef CAS PubMed Google Scholar
Nilges, M., Gronenborn, A. M., Brunger, A. T. & Clore, G. M. (1988). Protein Eng. 2, 27–38. CrossRef CAS PubMed Web of Science Google Scholar
Niu, W., Shu, Q., Chen, Z., Mathews, S., Di Cera, E. & Frieden, C. (2010). J. Phys. Chem. B, 114, 16156–16165. Web of Science CrossRef CAS PubMed Google Scholar
Nozinovic, S., Fürtig, B., Jonker, H. R., Richter, C. & Schwalbe, H. (2010). Nucleic Acids Res. 38, 683–694. Web of Science CrossRef PubMed CAS Google Scholar
Orita, I., Kita, A., Yurimoto, H., Kato, N., Sakai, Y. & Miki, K. (2010). Proteins, 78, 3488–3492. Web of Science CrossRef CAS PubMed Google Scholar
Ouellet, H., Kells, P. M., Ortiz de Montellano, P. R. & Produst, L. M. (2011). Bioorg. Med. Chem. Lett. 21, 332–337. Web of Science CrossRef CAS PubMed Google Scholar
Pannu, N. S. & Read, R. J. (1996). Acta Cryst. A52, 659–668. CrossRef CAS Web of Science IUCr Journals Google Scholar
Patterson, A., Price, N. C. & Nairn, J. (2010). Acta Cryst. F66, 1415–1420. Web of Science CrossRef CAS IUCr Journals Google Scholar
Paul, D., Chatterjee, A., Begley, T. P. & Ealick, S. E. (2010). Biochemistry, 49, 9922–9934. Web of Science CrossRef CAS PubMed Google Scholar
Paz, S., Tizón, L., Otero, J. M., Llamas-Saiz, A. L., Fox, G. C., van Raaij, M. J., Lamb, H., Hawkins, A. R., Lapthorn, A. J., Castedo, L. & González-Bello, C. (2011). ChemMedChem, 6, 266–272. Web of Science CSD CrossRef CAS PubMed Google Scholar
Peters, C., Brejc, K., Belmont, L., Bodey, A. J., Lee, Y., Yu, M., Guo, J., Sakowicz, R., Hartman, J. & Moores, C. A. (2010). EMBO J. 29, 3437–3447. Web of Science CrossRef CAS PubMed Google Scholar
Pizzo, E., Merlino, A., Turano, M., Russo Krauss, I., Coscia, F., Zanfardino, A., Varcamonti, M., Furia, A., Giancola, C., Mazzarella, L., Sica, F. & D'Alessio, G. (2010). Biochem. J. 433, 345–355. Web of Science CrossRef Google Scholar
Priestle, J. P. (2003). J. Appl. Cryst. 36, 34–42. Web of Science CrossRef CAS IUCr Journals Google Scholar
Privé, G. G., Anderson, D. H., Wesson, L., Cascio, D. & Eisenberg, D. (1999). Protein Sci. 8, 1400–1409. Web of Science PubMed CAS Google Scholar
Radford, R. J., Lawrenz, M., Nguyen, P. C., McCammon, J. A. & Akif Tezcan, F. (2011). Chem. Commun. 47, 313–315. Web of Science CrossRef CAS Google Scholar
Read, R. J. (1986). Acta Cryst. A42, 140–149. CrossRef CAS Web of Science IUCr Journals Google Scholar
Read, R. J. et al. (2011). Structure, 19, 1395–1412. Web of Science CrossRef CAS PubMed Google Scholar
Romano, K. P., Ali, A., Royer, W. E. & Schiffer, C. A. (2010). Proc. Natl Acad. Sci. USA, 107, 20986–20991. Web of Science CrossRef CAS PubMed Google Scholar
Roosild, T. P., Castronovo, S., Healy, J., Miller, S., Pliotas, C., Rasmussen, T., Bartlett, W., Conway, S. J. & Booth, I. R. (2010). Proc. Natl Acad. Sci. USA, 107, 19784–19789. Web of Science CrossRef CAS PubMed Google Scholar
Rowland, R. S. & Taylor, R. (1996). J. Phys. Chem. 100, 7384–7391. CrossRef CAS Web of Science Google Scholar
Ryan, A. J., Ghuman, J., Zunszain, P. A., Chung, C. W. & Curry, S. (2011). J. Struct. Biol. 174, 84–91. Web of Science CrossRef CAS PubMed Google Scholar
Sainsbury, L., Bird, L., Rao, V., Shepherd, S. M., Stuart, D. I., Hunter, W. N., Owens, R. J. & Ren, J. (2010). J. Mol. Biol. 405, 173–184. Web of Science CrossRef PubMed Google Scholar
Schmelz, S. (2010). PhD Thesis. University of St Andrews, Fife, Scotland. Google Scholar
Schnieders, M. J., Fenn, T. D. & Pande, V. S. (2011). J. Chem. Theory Comput. 7, 1141–1156. Web of Science CrossRef CAS Google Scholar
Schröder, G. F., Levitt, M. & Brunger, A. T. (2010). Nature (London), 464, 1218–1222. Web of Science PubMed Google Scholar
Sendovski, M., Kanteev, M., Shuster Ben-Yosef, V., Adir, N. & Fishman, A. (2010). Acta Cryst. F66, 1101–1103. Web of Science CrossRef IUCr Journals Google Scholar
Sheldrick, G. M. & Schneider, T. R. (1997). Methods Enzymol. 277, 319–343. CrossRef PubMed CAS Web of Science Google Scholar
Shinobu, A., Palm, G. J., Schierbeek, A. J. & Agmon, N. (2010). J. Am. Chem. Soc. 132, 11093–11102. Web of Science CrossRef CAS PubMed Google Scholar
Spronk, C. A. E. M., Nabuurs, S. B., Krieger, E., Vriend, G. & Vuister, G. W. (2004). Prog. Nucl. Magn. Reson. Spectrosc. 45, 315–337. Web of Science CrossRef CAS Google Scholar
Sun, M., Larivière, L., Dengl, S., Mayer, A. & Cramer, P. (2010). J. Biol. Chem. 285, 41597–41603. Web of Science CrossRef CAS PubMed Google Scholar
Sun, J. & Weis, W. I. (2011). J. Mol. Biol. 405, 519–530. Web of Science CrossRef CAS PubMed Google Scholar
Swiecki, M., Scheaffer, S. M., Allaire, M., Fremont, D. H., Colonna, M. & Brett, T. J. (2011). J. Biol. Chem. 286, 2987–2997. Web of Science CrossRef CAS PubMed Google Scholar
Takeda, K., Kusumoto, K., Hirano, Y. & Miki, K. (2010). J. Struct. Biol. 169, 135–144. Web of Science CrossRef PubMed CAS Google Scholar
Thompson, M. K., Davis, M. F., de Serrano, V., Nicoletti, F. P., Howes, B. D., Smulevich, G. & Franzen, S. (2010). Biophys. J. 99, 1586–1595. Web of Science CrossRef CAS PubMed Google Scholar
Tong, Y., Tempel, W., Wang, H., Yamada, K., Shen, L., Senisterra, G. A., MacKenzie, F., Chishti, A. H. & Park, H. W. (2010). Proc. Natl Acad. Sci. USA, 107, 20346–20351. Web of Science CrossRef CAS PubMed Google Scholar
Touw, W. G. & Vriend, G. (2010). Acta Cryst. D66, 1341–1350. Web of Science CrossRef CAS IUCr Journals Google Scholar
Tronrud, D. E., Berkholz, D. S. & Karplus, P. A. (2010). Acta Cryst. D66, 834–842. Web of Science CrossRef IUCr Journals Google Scholar
Vaguine, A. A., Richelle, J. & Wodak, S. J. (1999). Acta Cryst. D55, 191–205. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wallace, B. D., Wang, H., Lane, K. T., Scott, J. E., Orans, J., Koo, J. S., Venkatesh, M., Jobin, C., Yeh, L.-A., Mani, S. & Redinbo, M. R. (2010). Science, 330, 831–835. Web of Science CrossRef CAS PubMed Google Scholar
Wang, J., Dauter, M., Alkire, R., Joachimiak, A. & Dauter, Z. (2007). Acta Cryst. D63, 1254–1268. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wang, Y. & Guo, H.-C. (2010). J. Mol. Biol. 403, 120–130. Web of Science CrossRef CAS PubMed Google Scholar
Watenpaugh, K. D., Sieker, L. C., Herriott, J. R. & Jensen, L. H. (1973). Acta Cryst. B29, 943–956. CrossRef CAS IUCr Journals Web of Science Google Scholar
Weis, W. I., Brünger, A. T., Skehel, J. J. & Wiley, D. C. (1990). J. Mol. Biol. 212, 737–761. CrossRef CAS PubMed Web of Science Google Scholar
Word, J. M., Lovell, S. C., LaBean, T. H., Taylor, H. C., Zalis, M. E., Presley, B. K., Richardson, J. S. & Richardson, D. C. (1999). J. Mol. Biol. 285, 1711–1733. Web of Science CrossRef CAS PubMed Google Scholar
Xiang, Y., Huang, R.-H., Liu, X. Z., Zhang, Y. & Wang, D.-C. (2004). J. Struct. Biol. 148, 86–97. Web of Science CrossRef PubMed CAS Google Scholar
Yang, W., Bell, S. G., Wang, H., Zhou, W., Bartlam, M., Wong, L.-L. & Rao, Z. (2010). Biochem. J. 433, 85–93. Web of Science CrossRef Google Scholar
Yang, F., West, A. P. Jr & Bjorkman, P. J. (2011). J. Struct. Biol. 174, 239–244. Web of Science CrossRef CAS PubMed Google Scholar
Yoshida, H., Takeda, K., Izumori, K. & Kamitori, S. (2010). Protein Eng. Des. Sel. 23, 919–927. Web of Science CrossRef CAS PubMed Google Scholar
Zhou, H., Brock, J., Casarotto, M. G., Oakley, A. J. & Board, P. G. (2011). J. Biol. Chem. 286, 4271–4279. Web of Science CrossRef CAS PubMed Google Scholar
Zhu, K., Shirts, M. R. & Friesner, R. A. (2007). J. Chem. Theory Comput. 3, 2108–2119. Web of Science CrossRef CAS Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.