research papers\(\def\hfill{\hskip 5em}\def\hfil{\hskip 3em}\def\eqno#1{\hfil {#1}}\)

Journal logoBIOLOGICAL
CRYSTALLOGRAPHY
ISSN: 1399-0047

Solution solution: using NMR models for molecular replacement

aCambridge University Chemical Laboratory and Centre for Protein Engineering, MRC Centre, Hills Road, Cambridge CB2 2QH, England
*Correspondence e-mail: ywc@mrc-lmb.cam.ac.uk

(Received 23 February 2001; accepted 26 June 2001)

NMR structures can serve as a good source of search models in crystal structure determination by molecular replacement. However, owing to the inherent problems of NMR models, this procedure is not always straightforward. Here, an updated overview is presented with particular emphasis on the preparation of NMR search models and the latest trends in methodology. An experimental protocol developed recently is described and results on its use in solving a new structure as well as its test against a difficult published case are presented.

1. Introduction

Nuclear magnetic resonance (NMR) spectroscopy is a powerful alternative to X-ray crystallography for determining structures of small macromolecules and contributes to a substantial fraction of the depositions in the Protein Data Bank. Brünger et al. (1987[Brünger, A. T., Campbell, R. L., Clore, G. M., Gronenborn, A. M., Karplus, M., Petsko, G. A. & Teeter, M. M. (1987). Science, 235, 1049-1053.]) first provided a proof-of-principle that NMR structures, many of them being compact protein structural modules, can serve as trial models to solve crystal structures by molecular replacement (MR). The first real application was reported ten years ago (Baldwin et al., 1991[Baldwin, E. T., Weber, I. T., St Charles, R., Xuan, J.-C., Appella, E., Yamada, M., Matsushima, K., Edwards, B. F. P., Clore, G. M., Gronenborn, A. M. & Wlodawer, A. (1991). Proc. Natl Acad. Sci. USA, 88, 502-506.]). Owing to the inherent deficiencies of NMR models, their use in MR is not always straightforward, even if the search model and the target crystal structure are of the same macromolecule. Nevertheless, the number of successful MR solutions has risen steadily in the past few years. Up to the time when this article was written, 26 cases (Table 1[link], cases 3–28) have been reported in the literature, demonstrating that the difficulties in using NMR search models could be readily overcome. The readers are referred to a comprehensive review on this topic (Chen et al., 2000[Chen, Y. W., Dodson, E. J. & Kleywegt, G. J. (2000). Structure, 8, R213-R220.]). Here, based on this recent review, I present a concise overview supplemented by new information appearing in the past few months and new test results of a recommended MR protocol.

Table 1
A list of structures surveyed in this study

The accuracy column shows r.m.s. deviation of backbone (unless otherwise stated) atomic positions between the crystal and NMR structures in the well defined core region of the respective proteins involved, usually with loops and temini excluded. If the protein is oligomeric, the comparison is for the oligomeric state of the search model used in MR. This list is arranged in chronological order of publication. Some articles reported more than one MR calculation (e.g. case 7) and the tabulated parameters in this table are those for a representative calculation.

  Year Protein Software Accuracy (Å) Model content Reference
1 1987 Crambin Other 1.1 Mean Brünger et al. (1987[Brünger, A. T., Campbell, R. L., Clore, G. M., Gronenborn, A. M., Karplus, M., Petsko, G. A. & Teeter, M. M. (1987). Science, 235, 1049-1053.])
2 1989 Tendamistat Other 1.03 Individual Braun et al. (1989[Braun, W., Epp, O., Wüthrich, K. & Huber, R. (1989). J. Mol. Biol. 206, 669-676.])
3 1991 IL-8 Other 0.96 Mean Baldwin et al. (1991[Baldwin, E. T., Weber, I. T., St Charles, R., Xuan, J.-C., Appella, E., Yamada, M., Matsushima, K., Edwards, B. F. P., Clore, G. M., Gronenborn, A. M. & Wlodawer, A. (1991). Proc. Natl Acad. Sci. USA, 88, 502-506.])
4 1992 CD8α X-PLOR Composite Leahy et al. (1992[Leahy, D. J., Axel, R. & Hendrickson, W. A. (1992). Cell, 68, 1145-1162.])
5 1994 Endothelin Other 2.0 Individual Janes et al. (1994[Janes, R. W., Peapus, D. H. & Wallace, B. A. (1994). Nature Struct. Biol. 1, 311-319.])
6 1994 CRABP II AMoRe Composite Kleywegt et al. (1994[Kleywegt, G. J., Bergfors, T., Senn, H., Le Motte, P., Gsell, B., Shudo, K. & Jones, T. A. (1994). Structure, 2, 1241-1258.])
7 1995 hIL4 Others/X-PLOR 1.2–1.4 Cα Individual/ensemble Müller et al. (1995[Müller, T., Oehlenschläger, F. & Buehner, M. (1995). J. Mol. Biol. 247, 360-372.])
8 1995 PH AMoRe 1.6 Individual Hyvönen et al. (1995[Hyvönen, M., Macias, M. J., Nilges, M., Oschkinat, H., Saraste, M. & Wilmanns, M. (1995). EMBO J. 14, 4676-4685.]); (1996[Wilmanns, M. & Nilges, M. (1996). Acta Cryst. D52, 973-982.])
9 1995 Er-1 X-PLOR 1.17 all Cα Mean Weiss et al. (1995[Weiss, M. S., Anderson, D. H., Raffioni, S., Bradshaw, R. A., Ortenzi, C., Luporini, P. & Eisenberg, D. (1995). Proc. Natl. Acad. Sci. USA, 92, 10172-10176.]); Anderson et al. (1996[Anderson, D. H., Weiss, M. S. & Eisenberg, D. (1996). Acta Cryst. D52, 469-480.])
10 1996 ACBP AMoRe 1.9 Cα Ensemble Kleywegt (1996b[Kleywegt, G. J. (1996b). Acta Cryst. D52, 842-857.]); van Aalten et al. (in preparation)
11 1996 p53 TET X-PLOR 0.73 Cα Mean Miller et al. (1996[Miller, M., Lubkowski, J., Rao, J. K. M., Danishefsky, A. T., Omichinski, J. G., Sakaguchi, K., Sakamoto, H., Appella, E., Gronenborn, A. M. & Clore, G. M. (1996). FEBS Lett. 399, 166-170.])
12 1997 MCP-1 AMoRe 0.9/1.0 Cα Mean Lubkowski et al. (1997[Lubkowski, J., Bujacz, G., Boqué, L., Domaille, P. J., Handel, T. M. & Wlodawer, A. (1997). Nature Struct. Biol. 4, 64-69.])
13 1998 p53 TET AMoRe 1.4 Cα Ensemble Mittl et al. (1998[Mittl, P. R. E., Chène, P. & Grütter, M. G. (1998). Acta Cryst. D54, 86-89.])
14 1998 p14TCL1 AMoRe 1.1 Cα Mean Hoh et al. (1998[Hoh, F., Yang, Y.-S., Guignard, L., Padilla, A., Stern, M.-H., Lhoste, J. M. & van Tilbeurgh, H. (1998). Structure, 6, 147-155.])
15 1998 IM7 AMoRe 1.7 Cα Ensemble Dennis et al. (1998[Dennis, C. A., Videler, H., Pauptit, R. A., Wallis, R., James, R., Moore, G. R. & Kleanthous, C. (1998). Biochem. J. 333, 183-191.])
16 1998 RBI AMoRe 2.0 Mean Strobl et al. (1998[Strobl, S., Maskos, K., Wiegand, G., Huber, R., Gomis-Rüth, F. X. & Glockshuber, R. (1998). Structure, 6, 911-921.])
17 1998 p19 AMoRe 1.7 Cα Not reported Brotherton et al. (1998[Brotherton, D. H., Dhanaraj, V., Wick, S., Brizuela, L., Domaille, P. J., Volyanik, E., Xu, X., Parisini, E., Smith, B. O., Archer, S. J., Serrano, M., Brenner, S. L., Blundell, T. L. & Laue, E. D. (1998). Nature (London), 395, 244-250.])
18 1998 CHFI EPMR 1.61 Single calculated Behnke et al. (1998[Behnke, C. A., Yee, V. C., Trong, I. L., Pedersen, L. C., Stenkamp, R. E., Kim, S.-S., Reeck, G. R. & Teller, D. C. (1998). Biochemistry, 37, 15277-15288.])
19 1999 RATI AMoRe 2.5 all Cα Individual Gourinath et al. (1999[Gourinath, S., Srinivasan, A. & Singh, T. P. (1999). Acta Cryst. D55, 25-30.])
20 1999 NK1 AMoRe 0.4 Cα Mean Chirgadze et al. (1999[Chirgadze, D. Y., Hepple, J. P., Zhou, H., Byrd, R. A., Blundell, T. L. & Gherardi, E. (1999). Nature Struct. Biol. 6, 72-79.])
21 1999 CTLA-4 AMoRe 2.9 all Cα Mean Sheriff et al. (1999[Sheriff, S., Klei, H. E. & Davis, M. E. (1999). J. Appl. Cryst. 32, 98-101.])
22 1999 Protein S AMoRe 1.1 Mean Wenk et al. (1999[Wenk, M., Baumgartner, R., Holak, T. A., Huber, R., Jaenicke, R. & Mayr, E.-M. (1999). J. Mol. Biol. 286, 1533-1545.])
23 1999 CV-N AMoRe 0.55 Mean Yang et al. (1999[Yang, F., Bewley, C. A., Louis, J. M., Gustafson, K. R., Boyd, M. R., Gronenborn, A. M., Clore, G. M. & Wlodawer, A. (1999). J. Mol. Biol. 288, 403-412.])
24 1999 hCD2:hCD58 AMoRe 1.04 all Cα Mean Wang et al. (1999[Wang, J.-H., Smolyar, A., Tan, K., Liu, J.-H., Kim, M., Sun, Z. Y., Wagner, G. & Reinherz, E. L. (1999). Cell, 97, 791-803.])
25 1999 p85α SH2 AMoRe 0.92 Cα Ensemble Hoedemaeker et al. (1999[Hoedemaeker, F. J., Siegal, G., Roe, S. M., Driscoll, P. C. & Abrahams, J. P. (1999). J. Mol. Biol. 292, 763-770.])
26 2000 Met-Rantes AMoRe Not reported Not reported Hoover et al. (2000[Hoover, D. M., Shaw, J., Gryczynski, Z., Proudfoot, A. E. I., Wells, T. & Lubkowski, J. (2000). Protein Peptide Lett. 7, 73-82.])
27 2000 p73α SAM AMoRe 1.38 Ensemble Wang et al. (2000[Wang, W. K., Proctor, M. R., Buckle, A. M., Bycroft, M. & Chen, Y. W. (2000). Acta Cryst. D56, 769-771.], 2001[Wang, W. K., Bycroft, M., Foster, N. W., Buckle, A. M., Fersht, A. R. & Chen, Y. W. (2001). Acta Cryst. D57, 545-551.])
28 2001 Mss4 AMoRe 1.3 Cα Mean Zhu et al. (2001[Zhu, Z., Dumas, J. J., Lietzke, S. E. & Lambright, D. G. (2001). Biochemistry, 40, 3027-3036.])
29 2001 p85α SH2 AMoRe 1.0 Cα Ensemble Pauptit et al. (2001[Pauptit, R. A., Dennis, C. A., Derbyshire, D. J., Breeze, A. L., Weston, S. A., Rowsell, S. & Murshudov, G. N. (2001). Acta Cryst. D57, 1397-1404.])
30 2001 L30e AMoRe 1.62 Ensemble Chen & Wong (unpublished results)
†MR was performed on this component of a complex structure.

2. Overview

2.1. Problems of NMR models

The success of MR in macromolecular crystallography depends in general largely on the sequence (and structural) homology between the target structure and the search model. Using NMR search models in MR challenges this view: MR can often fail, even when the search model has 100% sequence identity to that of the target crystal structure. This can be attributed to three inherent problems of NMR models: (i) inaccuracies in NMR structures that are mainly based on short distance restraints, (ii) imprecision of search models as a result of limited data (low observation-to-parameter ratio) and (iii) the difficulty in representing the relative reliability of atomic positions in an NMR model.

2.1.1. Accuracy and precision

For the updated list in Table 1[link], when the well defined regions of a crystal structure and those of its NMR search model are compared, the r.m.s. deviations of the backbone atoms (Cα, N, C, O) have a mean value of 1.3 ± 0.2 Å (Chen et al., 2000[Chen, Y. W., Dodson, E. J. & Kleywegt, G. J. (2000). Structure, 8, R213-R220.]). The overall structures of the two states agree well and large structural differences are generally local and limited to flexible loops, the termini and exposed side chains. Advancements in NMR instrumentation and methodology have led to much improved agreement between NMR and crystal structures (see, for example, Kuszewski et al., 1999[Kuszewski, J., Gronenborn, A. M. & Clore, G. M. (1999). J. Am. Chem. Soc. 121, 2337-2338.]). Modern NMR structures show excellent internal consistency: the overall backbone r.m.s. deviation of individual conformers in well defined regions is usually less than 1 Å, with a mean value of 0.5 ± 0.1 Å for the cases listed in Table 1[link] (Chen et al., 2000[Chen, Y. W., Dodson, E. J. & Kleywegt, G. J. (2000). Structure, 8, R213-R220.]). In one study, the authors reported that only the best-refined final NMR model can lead to successful MR, while intermediate models that are lesser in precision all failed (case 7; Müller et al., 1995[Müller, T., Oehlenschläger, F. & Buehner, M. (1995). J. Mol. Biol. 247, 360-372.]). This is echoed by our results on one test study (see §3.2.2[link]). In many cases, modern NMR structures are good for MR calculations, provided that they are carefully prepared. The major task of model preparation is to remove (or down-weight) regions where large local structural variations are likely to occur and to use the best structural representation of the well defined regions in the MR calculations.

2.1.2. Reliability information

In MR calculations, crystallographic B factors are important in supplying weights to the atomic contributions of the scattering factors and reflecting the precision of atomic positions. An individual NMR conformer lacks this information. However, the equivalent information is embodied in an NMR ensemble — poorly defined regions supply fewer experimental restraints and these regions exhibit larger variations in atomic positions.

To exploit the reliability information in an NMR structure for use in MR, two approaches have been developed. The first method involves using a single model with artificially calculated B factors based on atomic r.m.s. deviations from the mean structure (Baldwin et al., 1991[Baldwin, E. T., Weber, I. T., St Charles, R., Xuan, J.-C., Appella, E., Yamada, M., Matsushima, K., Edwards, B. F. P., Clore, G. M., Gronenborn, A. M. & Wlodawer, A. (1991). Proc. Natl Acad. Sci. USA, 88, 502-506.]; Anderson et al., 1996[Anderson, D. H., Weiss, M. S. & Eisenberg, D. (1996). Acta Cryst. D52, 469-480.]; Wilmanns & Nilges, 1996[Wilmanns, M. & Nilges, M. (1996). Acta Cryst. D52, 973-982.]). The second approach simply involves using the whole ensemble as a composite model (Leahy et al., 1992[Leahy, D. J., Axel, R. & Hendrickson, W. A. (1992). Cell, 68, 1145-1162.]; Kleywegt et al., 1994[Kleywegt, G. J., Bergfors, T., Senn, H., Le Motte, P., Gsell, B., Shudo, K. & Jones, T. A. (1994). Structure, 2, 1241-1258.]; Müller et al., 1995[Müller, T., Oehlenschläger, F. & Buehner, M. (1995). J. Mol. Biol. 247, 360-372.]; Kleywegt, 1996a[Kleywegt, G. J. (1996a). CCP4/ESF-EACBM Newsl. Protein Crystallogr. 32, 32-36.]), with all atoms assigned uniform B factors. An ensemble supplies an inherent weighting scheme according to the mutual agreement of equivalent atomic positions.

Of the 22 cases in Table 1[link] where a crystal structure was solved with an NMR model, 13 were solved with single models and seven were solved with ensemble models; the remaining case 7 made use of both types, while the model used in case 26 was not detailed. Most (11) of the single model cases made use of a representative model, usually the restrained minimized average structure. Individual conformers rarely led to success (cases 5 and 19 only). Single models were often preferable because they are easier to handle and the calculations are faster to perform. However, some very difficult cases could only be solved using ensemble models (Dennis et al., 1998[Dennis, C. A., Videler, H., Pauptit, R. A., Wallis, R., James, R., Moore, G. R. & Kleanthous, C. (1998). Biochem. J. 333, 183-191.]; Mittl et al., 1998[Mittl, P. R. E., Chène, P. & Grütter, M. G. (1998). Acta Cryst. D54, 86-89.]; Hoedemaeker et al., 1999[Hoedemaeker, F. J., Siegal, G., Roe, S. M., Driscoll, P. C. & Abrahams, J. P. (1999). J. Mol. Biol. 292, 763-770.]; Chen & Clore, 2000[Chen, Y. W. & Clore, G. M. (2000). Acta Cryst. D56, 1535-1540.]; Wang et al., 2001[Wang, W. K., Bycroft, M., Foster, N. W., Buckle, A. M., Fersht, A. R. & Chen, Y. W. (2001). Acta Cryst. D57, 545-551.]). An ensemble is probably a more realistic representation of the `true' (time- and space-averaged) structure than a single model (Sutcliffe, 1993[Sutcliffe, M. J. (1993). Protein Sci. 2, 936-944.]). Use of ensemble models is becoming more popular.

2.2. Software

Since 1994, AMoRe (Navaza, 1994[Navaza, J. (1994). Acta Cryst. A50, 157-163.]; Navaza & Saludjian, 1997[Navaza, J. & Saludjian, P. (1997). Methods Enzymol. 276, 581-594.]) has led to the solution of more structures than any other program (Table 1[link]). The major advantages of this program are its speed and the fact that many potential solutions can be tested in a single run. It may not offer the best signal-to-noise discrimination at every stage, but a correct rotation-function solution that is buried among noise peaks is still useful in the subsequent translation search and rigid-body refinement.

Continued increases in computing power have brought about some new excitements. Recently developed partial six-dimensional searches (Chang & Lewis, 1997[Chang, G. & Lewis, M. (1997). Acta Cryst. D53, 279-289.]; Kissinger et al., 1999[Kissinger, C. R., Gehlhaar, D. K. & Fogel, D. B. (1999). Acta Cryst. D55, 484-491.]; Glykos & Kokkinidis, 2000[Glykos, N. M. & Kokkinidis, M. (2000). Acta Cryst. D56, 169-174.]) can be performed with a speed comparable to conventional two-step MR. Two of these programs are described in this issue (Glykos & Kokkinidis, 2001[Glykos, N. M. & Kokkinidis, M. (2001). Acta Cryst. D57, 1462-1473.]; Kissinger et al., 2001[Kissinger, C. R., Gehlhaar, D. K., Smith, B. A. & Bouzida, D. (2001). Acta Cryst. D57, 1475-1479.]).

A new approach in simultaneous search for multiple copies of a molecule in the unit cell by MR has been developed recently and implemented into the program MOLREP (Collaborative Computational Project, Number 4, 1994[Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760-763.]; Vagin & Teplyakov, 2000[Vagin, A. & Teplyakov, A. (2000). Acta Cryst. D56, 1622-1624.]). The authors reported success in a previously failed MR case searching with an NMR model.

Another new molecular-replacement program, Beast (Read, 2001[Read, R. (2001). Acta Cryst. D57, 1373-1382.]), has been designed to use multiple possible molecular-replacement models and is thus particularly suited to the use of ensemble NMR models in MR. We have performed preliminary trials with this program on a very difficult test case, that of CHFI (see §3.2.1[link]), and found it to be successful (Read & Chen, unpublished results).

3. Recommended protocol and case studies

3.1. Experimental

It is customary to delete unstructured residues in a search model. A set of tools to help with this task is described in Kleywegt (1996a[Kleywegt, G. J. (1996a). CCP4/ESF-EACBM Newsl. Protein Crystallogr. 32, 32-36.]) and is available at the URL http://xray.bmc.uu.se/usf/factory_6.html . Recent NMR structures usually contain around 20 conformers in a bundle and are generally good enough for MR (Chen & Clore, 2000[Chen, Y. W. & Clore, G. M. (2000). Acta Cryst. D56, 1535-1540.]). Obvious `outliers' can be removed from the set. A script called multi_probe (ftp://xray.bmc.uu.se/pub/gerard/omac/multi_probe ) was found to be most useful for preparing a set of three ensemble models, with varying extents of side-chain truncation. The script first aligns members of the ensemble and then prepares an all-atom model, a poly Ser/Ala/Gly (poly SAG; all non-glycine/alanine side chains changed to serine) model and a poly AG model. A detailed description of the procedure can be found on the internet (http://imsb.au.dk/~mok/o/ofaq/Q.879.html ). If a single NMR model is used, artificial B factors can be assigned using an empirical formula based on atomic r.m.s. deviations from the mean structure (Wilmanns & Nilges, 1996[Wilmanns, M. & Nilges, M. (1996). Acta Cryst. D52, 973-982.]). This procedure has been implemented in a Perl script and is available for download (http://www.mrc-cpe.cam.ac.uk/~ywc/rmsdB.html ).

The whole NMR ensemble, as prepared by the multi_probe script, was input into AMoRe as a single trial model. For most cases in Table 1[link], the high-resolution cut-off of the data used for searching falls in a narrow range from 3.5 to 4.5 Å; commonly used low-resolution limits are 10.0 and 15.0 Å. All the MR calculations in this work were performed with data in the resolution range 15–3.5 Å. This protocol has been tested on three difficult published cases (Table 1[link], cases 9, 13 and 18) and applied to solving two structures (Table 1[link], cases 27 and 30). I summarize previous findings and present new results in the following section.

3.2. Case studies

3.2.1. Three published cases tested

In two earlier articles, we have reported the successful application of this recommended protocol in re-solving two published problems: namely, the p53 TET domain and the Er-1 pheromone (Chen & Clore, 2000[Chen, Y. W. & Clore, G. M. (2000). Acta Cryst. D56, 1535-1540.]; Chen et al., 2000[Chen, Y. W., Dodson, E. J. & Kleywegt, G. J. (2000). Structure, 8, R213-R220.]). In both cases, a poly SAG model is found to be the most successful and results are very clear at every stage.

We also studied a third test case: the corn hageman factor inhibitor (CHFI). This structure was originally solved with EPMR (Kissinger et al., 1999[Kissinger, C. R., Gehlhaar, D. K. & Fogel, D. B. (1999). Acta Cryst. D55, 484-491.]), a program which implements six-dimensional searches with evolutionary programming. Neither X-PLOR nor AMoRe led to an MR solution (Behnke et al., 1998[Behnke, C. A., Yee, V. C., Trong, I. L., Pedersen, L. C., Stenkamp, R. E., Kim, S.-S., Reeck, G. R. & Teller, D. C. (1998). Biochemistry, 37, 15277-15288.]), making it particularly challenging. Our earlier attempt to solve the structure with this recommended protocol also failed (Chen et al., 2000[Chen, Y. W., Dodson, E. J. & Kleywegt, G. J. (2000). Structure, 8, R213-R220.]). Jorge Navaza, the author of AMoRe, kindly advised that a solution could be obtained with slight modification to the procedure. With the poly SAG ensemble model, the correct rotation solution is obtained without any problem as the top peak. Interestingly, the subsequent translation search only worked when the correlation coefficient (CC) in terms of intensities target and not the Crowther–Blow translation function (default in AMoRe) was used (Jorge Navaza, personal communications). The search results are noisy (Fig. 1[link]a): the correct translation-function peak only ranked fourth, with a CC of 29.5. After rigid-body refinement, the correct peak is promoted to the top (CC = 31.4), but is hardly distinguishable from the highest noise peak (CC = 31.2). Nevertheless, the top peak is associated with the lowest R factor and corresponds to the correct solution.

[Figure 1]
Figure 1
Results of MR calculations with NMR models. In each graph, (a) CHFI and (b) L30e, the top 10 peaks of the three MR steps: rotation function, translation function and rigid-body refinement are shown, in descending order. The correct solutions are coloured red and noise peaks are coloured black.
3.2.2. Two new cases solved

The recommended protocol was employed in solving the crystal structure of the human p73α SAM domain and has already been reported (Chen et al., 2000[Chen, Y. W., Dodson, E. J. & Kleywegt, G. J. (2000). Structure, 8, R213-R220.]; Wang et al., 2000[Wang, W. K., Proctor, M. R., Buckle, A. M., Bycroft, M. & Chen, Y. W. (2000). Acta Cryst. D56, 769-771.]).

Here, I present fresh results of the application of this protocol to the solution of a new structure: that of the ribosomal protein L30e (Wong et al., 2001[Wong, K. B., Wang, W. K., Proctor, M. R., Bycroft, M. & Chen, Y. W. (2001). Acta Cryst. D57, 865-866.]). Structure determination by NMR and by MR took place in parallel. MR using an NMR model of a homologous protein did not yield a solution and neither did NMR models of the same protein at the early stages of refinement give positive results. It was not until a good quality near-completion NMR model was available that we could solve the structure. This ensemble contains 25 conformers and has an internal precision (backbone r.m.s. deviation to the mean structure) of 0.6–0.7 Å. With this intermediate model, we generated the set of all atoms, poly AG and poly SAG models, but none of these gave an MR solution. We then generated an ensemble model preserving all the hydrophobic core side chains while the long surface side chains (Lys, Arg, Met, Gln, Glu, Asp and Asn) were changed to serine. Using this search model, a prominent peak appeared after the translation search that was clearly discriminated from the noise (Fig. 1[link]b). This solution has the highest CC of 50.7, which is well above the highest noise peak of 43.7 and has the lowest R factor of 0.46 among noise peaks having R factors ranging from 0.49 to 0.52. This solution is only obtained in P61 but not in the enantiomorphic space group. Checking backward, the correct orientation only ranked fourth in the rotation-function search (Fig. 1[link]b). Successful refinement confirmed that the MR solution is correct: the current R factor and free R factor are both below 0.3 (Chen & Wong, unpublished results).

4. Discussion

The recommended protocol was found to be successful in solving two new structures as well as offering improvements over published results for three difficult test cases. It is interesting to compare the structural differences in these cases. The r.m.s. deviation of well defined backbone atoms for the p53 TET domain is 0.4 Å, that for Er-1 is 1.2 Å and that for the p73α SAM domain is 1.4 Å, while that for CHFI and L30e is 1.6 Å. The clarity of the results is correlated somewhat with the structural differences between the search model and the target structure, i.e. the accuracy of the respective search models. During the re-examination of p53 TET domain, it was found that a more accurate structure led to substantially improved search results (Chen & Clore, 2000[Chen, Y. W. & Clore, G. M. (2000). Acta Cryst. D56, 1535-1540.]). Only a near-completely refined NMR models can lead to solution of the L30e. In the case of CHFI, one learned that MR calculations can be sensitive to the target translation function used.

Obtaining an MR solution is only halfway through the problem. Subsequent structure refinement and rebuilding can be very tedious and frustrating but these are outside the scope of this work. Some practical experiences can be found in another manuscript in this issue (Pauptit et al., 2001[Pauptit, R. A., Dennis, C. A., Derbyshire, D. J., Breeze, A. L., Weston, S. A., Rowsell, S. & Murshudov, G. N. (2001). Acta Cryst. D57, 1397-1404.]).

Acknowledgements

YWC is supported by Wellcome Trust Grant 061836. The author would like to thank Kambo Wong and Randy Read for allowing their unpublished work to be included here. David Teller is thanked for supplying the CHFI data for testing and Jorge Navaza for help in solving this test case.

References

First citationAnderson, D. H., Weiss, M. S. & Eisenberg, D. (1996). Acta Cryst. D52, 469–480. CrossRef CAS Web of Science IUCr Journals
First citationBaldwin, E. T., Weber, I. T., St Charles, R., Xuan, J.-C., Appella, E., Yamada, M., Matsushima, K., Edwards, B. F. P., Clore, G. M., Gronenborn, A. M. & Wlodawer, A. (1991). Proc. Natl Acad. Sci. USA, 88, 502–506. CrossRef PubMed CAS Web of Science
First citationBehnke, C. A., Yee, V. C., Trong, I. L., Pedersen, L. C., Stenkamp, R. E., Kim, S.-S., Reeck, G. R. & Teller, D. C. (1998). Biochemistry, 37, 15277–15288. Web of Science CrossRef CAS PubMed
First citationBraun, W., Epp, O., Wüthrich, K. & Huber, R. (1989). J. Mol. Biol. 206, 669–676. CrossRef CAS PubMed Web of Science
First citationBrotherton, D. H., Dhanaraj, V., Wick, S., Brizuela, L., Domaille, P. J., Volyanik, E., Xu, X., Parisini, E., Smith, B. O., Archer, S. J., Serrano, M., Brenner, S. L., Blundell, T. L. & Laue, E. D. (1998). Nature (London), 395, 244–250. Web of Science CAS PubMed
First citationBrünger, A. T., Campbell, R. L., Clore, G. M., Gronenborn, A. M., Karplus, M., Petsko, G. A. & Teeter, M. M. (1987). Science, 235, 1049–1053.  PubMed Web of Science
First citationChang, G. & Lewis, M. (1997). Acta Cryst. D53, 279–289. CrossRef CAS Web of Science IUCr Journals
First citationChen, Y. W. & Clore, G. M. (2000). Acta Cryst. D56, 1535–1540. Web of Science CrossRef CAS IUCr Journals
First citationChen, Y. W., Dodson, E. J. & Kleywegt, G. J. (2000). Structure, 8, R213–R220. Web of Science CrossRef PubMed CAS
First citationChirgadze, D. Y., Hepple, J. P., Zhou, H., Byrd, R. A., Blundell, T. L. & Gherardi, E. (1999). Nature Struct. Biol. 6, 72–79. Web of Science CAS PubMed
First citationCollaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760–763. CrossRef IUCr Journals
First citationDennis, C. A., Videler, H., Pauptit, R. A., Wallis, R., James, R., Moore, G. R. & Kleanthous, C. (1998). Biochem. J. 333, 183–191. Web of Science CAS PubMed
First citationGlykos, N. M. & Kokkinidis, M. (2000). Acta Cryst. D56, 169–174. Web of Science CrossRef CAS IUCr Journals
First citationGlykos, N. M. & Kokkinidis, M. (2001). Acta Cryst. D57, 1462–1473. CrossRef CAS IUCr Journals
First citationGourinath, S., Srinivasan, A. & Singh, T. P. (1999). Acta Cryst. D55, 25–30. Web of Science CrossRef CAS IUCr Journals
First citationHoedemaeker, F. J., Siegal, G., Roe, S. M., Driscoll, P. C. & Abrahams, J. P. (1999). J. Mol. Biol. 292, 763–770. Web of Science CrossRef PubMed CAS
First citationHoh, F., Yang, Y.-S., Guignard, L., Padilla, A., Stern, M.-H., Lhoste, J. M. & van Tilbeurgh, H. (1998). Structure, 6, 147–155. Web of Science CrossRef CAS PubMed
First citationHoover, D. M., Shaw, J., Gryczynski, Z., Proudfoot, A. E. I., Wells, T. & Lubkowski, J. (2000). Protein Peptide Lett. 7, 73–82. CAS
First citationHyvönen, M., Macias, M. J., Nilges, M., Oschkinat, H., Saraste, M. & Wilmanns, M. (1995). EMBO J. 14, 4676–4685. CAS PubMed Web of Science
First citationJanes, R. W., Peapus, D. H. & Wallace, B. A. (1994). Nature Struct. Biol. 1, 311–319. CrossRef CAS PubMed Web of Science
First citationKissinger, C. R., Gehlhaar, D. K. & Fogel, D. B. (1999). Acta Cryst. D55, 484–491. Web of Science CrossRef CAS IUCr Journals
First citationKissinger, C. R., Gehlhaar, D. K., Smith, B. A. & Bouzida, D. (2001). Acta Cryst. D57, 1475–1479. Web of Science CrossRef IUCr Journals
First citationKleywegt, G. J. (1996a). CCP4/ESF–EACBM Newsl. Protein Crystallogr. 32, 32–36.
First citationKleywegt, G. J. (1996b). Acta Cryst. D52, 842–857. CrossRef CAS Web of Science IUCr Journals
First citationKleywegt, G. J., Bergfors, T., Senn, H., Le Motte, P., Gsell, B., Shudo, K. & Jones, T. A. (1994). Structure, 2, 1241–1258. CrossRef CAS PubMed Web of Science
First citationKuszewski, J., Gronenborn, A. M. & Clore, G. M. (1999). J. Am. Chem. Soc. 121, 2337–2338. Web of Science CrossRef CAS
First citationLeahy, D. J., Axel, R. & Hendrickson, W. A. (1992). Cell, 68, 1145–1162. CrossRef PubMed CAS Web of Science
First citationLubkowski, J., Bujacz, G., Boqué, L., Domaille, P. J., Handel, T. M. & Wlodawer, A. (1997). Nature Struct. Biol. 4, 64–69. CrossRef CAS PubMed Web of Science
First citationMiller, M., Lubkowski, J., Rao, J. K. M., Danishefsky, A. T., Omichinski, J. G., Sakaguchi, K., Sakamoto, H., Appella, E., Gronenborn, A. M. & Clore, G. M. (1996). FEBS Lett. 399, 166–170. CrossRef CAS PubMed Web of Science
First citationMittl, P. R. E., Chène, P. & Grütter, M. G. (1998). Acta Cryst. D54, 86–89. Web of Science CrossRef CAS IUCr Journals
First citationMüller, T., Oehlenschläger, F. & Buehner, M. (1995). J. Mol. Biol. 247, 360–372. CrossRef CAS PubMed Web of Science
First citationNavaza, J. (1994). Acta Cryst. A50, 157–163. CrossRef CAS Web of Science IUCr Journals
First citationNavaza, J. & Saludjian, P. (1997). Methods Enzymol. 276, 581–594. CrossRef CAS Web of Science
First citationPauptit, R. A., Dennis, C. A., Derbyshire, D. J., Breeze, A. L., Weston, S. A., Rowsell, S. & Murshudov, G. N. (2001). Acta Cryst. D57, 1397–1404. Web of Science CrossRef CAS IUCr Journals
First citationRead, R. (2001). Acta Cryst. D57, 1373–1382. Web of Science CrossRef CAS IUCr Journals
First citationSheriff, S., Klei, H. E. & Davis, M. E. (1999). J. Appl. Cryst. 32, 98–101. Web of Science CrossRef CAS IUCr Journals
First citationStrobl, S., Maskos, K., Wiegand, G., Huber, R., Gomis-Rüth, F. X. & Glockshuber, R. (1998). Structure, 6, 911–921. Web of Science CrossRef CAS PubMed
First citationSutcliffe, M. J. (1993). Protein Sci. 2, 936–944. CrossRef CAS PubMed
First citationVagin, A. & Teplyakov, A. (2000). Acta Cryst. D56, 1622–1624. Web of Science CrossRef CAS IUCr Journals
First citationWang, J.-H., Smolyar, A., Tan, K., Liu, J.-H., Kim, M., Sun, Z. Y., Wagner, G. & Reinherz, E. L. (1999). Cell, 97, 791–803. Web of Science CrossRef PubMed CAS
First citationWang, W. K., Bycroft, M., Foster, N. W., Buckle, A. M., Fersht, A. R. & Chen, Y. W. (2001). Acta Cryst. D57, 545–551. Web of Science CrossRef CAS IUCr Journals
First citationWang, W. K., Proctor, M. R., Buckle, A. M., Bycroft, M. & Chen, Y. W. (2000). Acta Cryst. D56, 769–771. Web of Science CrossRef CAS IUCr Journals
First citationWeiss, M. S., Anderson, D. H., Raffioni, S., Bradshaw, R. A., Ortenzi, C., Luporini, P. & Eisenberg, D. (1995). Proc. Natl. Acad. Sci. USA, 92, 10172–10176. CrossRef CAS PubMed Web of Science
First citationWenk, M., Baumgartner, R., Holak, T. A., Huber, R., Jaenicke, R. & Mayr, E.-M. (1999). J. Mol. Biol. 286, 1533–1545. Web of Science CrossRef PubMed CAS
First citationWilmanns, M. & Nilges, M. (1996). Acta Cryst. D52, 973–982. CrossRef CAS Web of Science IUCr Journals
First citationWong, K. B., Wang, W. K., Proctor, M. R., Bycroft, M. & Chen, Y. W. (2001). Acta Cryst. D57, 865–866. Web of Science CrossRef CAS IUCr Journals
First citationYang, F., Bewley, C. A., Louis, J. M., Gustafson, K. R., Boyd, M. R., Gronenborn, A. M., Clore, G. M. & Wlodawer, A. (1999). J. Mol. Biol. 288, 403–412. Web of Science CrossRef PubMed CAS
First citationZhu, Z., Dumas, J. J., Lietzke, S. E. & Lambright, D. G. (2001). Biochemistry, 40, 3027–3036. Web of Science CrossRef PubMed CAS

© International Union of Crystallography. Prior permission is not required to reproduce short quotations, tables and figures from this article, provided the original authors and source are cited. For more information, click here.

Journal logoBIOLOGICAL
CRYSTALLOGRAPHY
ISSN: 1399-0047
Follow Acta Cryst. D
Sign up for e-alerts
Follow Acta Cryst. on Twitter
Follow us on facebook
Sign up for RSS feeds