letters to the editor\(\def\hfill{\hskip 5em}\def\hfil{\hskip 3em}\def\eqno#1{\hfil {#1}}\)

Journal logoBIOLOGICAL
CRYSTALLOGRAPHY
ISSN: 1399-0047

A forward-looking suggestion for resolving the stereochemical restraints debate: ideal geometry functions

CROSSMARK_Color_square_no_text.svg

aDepartment of Biochemistry and Biophysics, Oregon State University, Corvallis, OR 97331, USA, and bInstitute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA 19111, USA
*Correspondence e-mail: karplusp@science.oregonstate.edu

(Received 19 January 2008; accepted 21 January 2008)

Jaskolski et al. (2007a[Jaskolski, M., Gilski, M., Dauter, Z. & Wlodawer, A. (2007a). Acta Cryst. D63, 611-620.]) initiated a very important discussion (Jaskolski et al., 2007b[Jaskolski, M., Gilski, M., Dauter, Z. & Wlodawer, A. (2007b). Acta Cryst. D63, 1282-1283.]; Stec, 2007[Stec, B. (2007). Acta Cryst. D63, 1113-1114.]; Tickle, 2007[Tickle, I. J. (2007). Acta Cryst. D63, 1274-1281.]) about the accuracy of ideal geometry targets and the appropriate stringency with which they should be obeyed at various resolutions. All of the discussants agree that protein structures determined at ultra-high resolution, which should be closer to the truth, tend to have larger r.m.s. deviations from ideality. This shows that real deviations from ideality do occur in true protein structures. Another point of agreement is that the real deviations from ideality are to some extent context dependent, so that the single target values used in current refinement programs are a simplification: `The N–Cα–C valence angle has a wide spread and may have a bimodal distribution correlated with secondary structure', Jaskolski et al. (2007a[Jaskolski, M., Gilski, M., Dauter, Z. & Wlodawer, A. (2007a). Acta Cryst. D63, 611-620.]); `Recent results suggest that protein stereochemistry is context-dependent', Stec (2007[Stec, B. (2007). Acta Cryst. D63, 1113-1114.]); `As they point out, the [deviations from ideality] will include real variations arising from the chemical environment', Tickle (2007[Tickle, I. J. (2007). Acta Cryst. D63, 1274-1281.]); `In such a situation, a single target, as used in the refinement programs, will not agree with any of the truly preferred values. This again suggests that the geometrical parameters of protein models should not be too tightly restrained to some predefined values', Jaskolski et al. (2007b[Jaskolski, M., Gilski, M., Dauter, Z. & Wlodawer, A. (2007b). Acta Cryst. D63, 1282-1283.]).

We point out here that much of the context dependence of stereochemistry can be transformed from a frustrating reality that limits the accuracy of protein modeling to a feature that can instead enhance modeling accuracy. This transformation is possible because much of the context-dependent variation is not random but varies systematically with conformation. This systematic dependence was well documented in the mid-1990s, especially for the N—Cα—C bond angle, for which the expected value was seen to vary over a range of ∼10° (Jiang et al.[Jiang, X., Yu, C., Ming, C., Newton, S. Q., Paulus, E. P. & Schafer, L. (1997). J. Mol. Struct. 403, 83-93.]; Karplus, 1996[Karplus, P. A. (1996). Protein Sci. 5, 1406-1420.]; Schafer et al., 1995[Schafer, L., Cao, M. & Meadows, M. J. (1995). Biopolymers, 35, 603-606.]). We are now updating these analyses using ultra-high resolution protein structures. These structures not only confirm the highly systematic variation of backbone bond angles with conformation (as illustrated for the N—Cα—C bond angle in Fig. 1[link]), but they also reveal that the standard deviations are very low. For the N—Cα—C bond angle, the standard deviations in individual φ,ψ regions vary from 1.0 to 1.7° (Fig. 1b[link]), much lower than the standard deviation of near 2.2° derived for the N—Cα—C bond angle in the population as a whole (see Table 3 of Jaskolski et al., 2007a[Jaskolski, M., Gilski, M., Dauter, Z. & Wlodawer, A. (2007a). Acta Cryst. D63, 611-620.]). Based on this result, it is clear that no single ideal target value can be appropriate for all of these conformations. However, it is also true that because each mean (or expected value) can be empirically discovered, this feature of protein structure can be accounted for by altering our refinement protocols to allow for ideal geometry targets that are dependent on context. In other words, we propose that more accurate refinement of protein structures at all resolution ranges will be obtained by moving beyond the `single ideal value' paradigm to an `ideal geometry function' paradigm (Schafer et al., 1986[Schafer, L., Ewbank, J. D., Klimkowski, V. J. & Siam, K. (1986). J. Mol. Struct. 135, 141-158.]) in which each restraint target value varies depending on the local conformation. We are now working to develop a set of empirical conformation-dependent `ideal geometry functions' for backbone bond angles and lengths that can be incorporated into crystallographic refinement software (Berkholz & Karplus, unpublished work).

[Figure 1]
Figure 1
N—Cα—C bond angles vary systematically with conformation. (a) Average N—Cα—C backbone bond angles for well populated regions of the Ramachandran plot (≥10 observations in a 5 × 5° bin) are displayed with coloring as indicated in the sidebar. The values were derived for general amino-acid residues (all but Gly, Pro, and residues preceding Pro) from a database of ca 19 000 residues in protein structures determined at 1.0 Å resolution or better. (b) The standard deviations of the distributions in each region of the Ramachandran plot are shown as in (a). The smoothed plots of the data were produced in Matlab using an adaptive kernel regression with a global κ of 25.

Acknowledgements

This work was partly supported by NSF grant MCB-9982727 (to PAK) and NIH grant P20-GM76222 (to RLD).

References

First citationJaskolski, M., Gilski, M., Dauter, Z. & Wlodawer, A. (2007a). Acta Cryst. D63, 611–620.  Web of Science CrossRef IUCr Journals Google Scholar
First citationJaskolski, M., Gilski, M., Dauter, Z. & Wlodawer, A. (2007b). Acta Cryst. D63, 1282–1283.  Web of Science CrossRef IUCr Journals Google Scholar
First citationJiang, X., Yu, C., Ming, C., Newton, S. Q., Paulus, E. P. & Schafer, L. (1997). J. Mol. Struct. 403, 83–93.  CrossRef CAS Web of Science Google Scholar
First citationKarplus, P. A. (1996). Protein Sci. 5, 1406–1420.  CrossRef CAS PubMed Web of Science Google Scholar
First citationSchafer, L., Cao, M. & Meadows, M. J. (1995). Biopolymers, 35, 603–606.  CrossRef Web of Science Google Scholar
First citationSchafer, L., Ewbank, J. D., Klimkowski, V. J. & Siam, K. (1986). J. Mol. Struct. 135, 141–158.  Google Scholar
First citationStec, B. (2007). Acta Cryst. D63, 1113–1114.  Web of Science CrossRef IUCr Journals Google Scholar
First citationTickle, I. J. (2007). Acta Cryst. D63, 1274–1281.  Web of Science CrossRef CAS IUCr Journals Google Scholar

© International Union of Crystallography. Prior permission is not required to reproduce short quotations, tables and figures from this article, provided the original authors and source are cited. For more information, click here.

Journal logoBIOLOGICAL
CRYSTALLOGRAPHY
ISSN: 1399-0047
Follow Acta Cryst. D
Sign up for e-alerts
Follow Acta Cryst. on Twitter
Follow us on facebook
Sign up for RSS feeds