Received 30 March 2010
Comparison of NMR and crystal structures for the proteins TM1112 and TM1367
Biswaranjan Mohanty,a,b+ Pedro Serrano,a,b++ Bill Pedrini,c Kristaps Jaudzems,a,b Michael Geralt,a,b Reto Horst,a,b Torsten Herrmann,d Marc-André Elsliger,a,b Ian A. Wilsona,b,e and Kurt Wüthricha,b,c,e*
aDepartment of Molecular Biology, The Scripps Research Institute, La Jolla, CA 92037, USA,bJoint Center for Structural Genomics, http://www.jcsg.org , USA,cInstitute of Molecular Biology and Biophysics, ETH Zürich, CH-8093 Zürich, Switzerland,dCentre Européen de RMN à Très Hauts Champs, Université de Lyon, F-69100 Villeurbanne, France, and eSkaggs Institute of Chemical Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
The NMR structures of the TM1112 and TM1367 proteins from Thermotoga maritima in solution at 298 K were determined following a new protocol which uses the software package UNIO for extensive automation. The results obtained with this novel procedure were evaluated by comparison with the crystal structures solved by the JCSG at 100 K to 1.83 and 1.90 Å resolution, respectively. In addition, the TM1112 solution structure was compared with an NMR structure solved by the NESG using a conventional largely interactive methodology. For both proteins, the newly determined NMR structure could be superimposed with the crystal structure with r.m.s.d. values of <1.0 Å for the backbone heavy atoms, which provided a starting platform to investigate local structure variations, which may arise from either the methods used or from the different chemical environments in solution and in the crystal. Thereby, these comparative studies were further explored with the use of reference NMR and crystal structures, which were computed using the NMR software with input of upper-limit distance constraints derived from the molecular models that represent the results of structure determination by NMR and by X-ray diffraction, respectively. The results thus obtained show that NMR structure calculations with the new automated UNIO software used by the JCSG compare favorably with those from a more labor-intensive and time-intensive interactive procedure. An intriguing observation is that the `bundles' of two TM1112 or three TM1367 molecules in the asymmetric unit of the crystal structures mimic the behavior of the bundles of 20 conformers used to represent the NMR solution structures when comparing global r.m.s.d. values calculated either for the polypeptide backbone, the core residues with solvent accessibility below 15% or all heavy atoms.
Crystal structures of Thermotoga maritima proteins TM1112 and TM1367 have previously been determined by the JCSG (McMullan et al., 2004; Jin et al., 2006). These two proteins were therefore used in an NMR methods development and assessment project to evaluate the NMR structures that were obtained following a novel protocol, which uses the UNIO software suite (Herrmann et al., 2002a,b; Volk et al., 2008; Fiorito et al., 2008; Herrmann et al., unpublished work) that supports extensive automation. These structures were incorporated into the present series of NMR and crystal structure comparisons because the crystal structures include two and three independent molecules in the asymmetric unit, respectively, and it seemed of interest to follow up on earlier investigations of possible correlations between variations among the individual molecules in the asymmetric unit of the crystal and the bundle of NMR conformers or between the crystal and NMR structures (see, for example, Wilson & Brunger, 2000; DePristo et al., 2004; Furnham et al., 2006; Levin et al., 2007; Kondrashov et al., 2008).
The NMR structure of TM1112, a protein of unknown function (DUF861; PF05899), was initially determined by the Northeast Structural Genomics (NESG) consortium (PDB code 1lkn ) and was subsequently used by the JCSG to solve the crystal structure by molecular replacement (http://www.topsan.org/Proteins/JCSG/1o5u ). Thus, we include the NESG structure in the following comparative studies. TM1367 also represents a domain of unknown function (DUF369; PF0412; http://www.topsan.org/Proteins/JCSG/2ka0 ). Its crystal structure (PDB code 1zx8 ) revealed an atypical cyclophilin-type (peptidylprolyl isomerase-type) fold (Jin et al., 2006).
Furthermore, to support the comparative studies, we continued to explore the use of reference NMR and crystal structures (Jaudzems et al., 2010), which were computed from sets of distance restraints measured in the crystal and solution NMR molecular models, respectively, using the same simulated-annealing protocol as for the experimental NMR structure determination.
The plasmid vector MH1 containing the TM1112 gene obtained from the JCSG Crystallomics Core was used as the template for PCR amplification with the primers 5'-CCGCATATGGAAGTGAAGATAGAAAAGCCCACACCC-3' and 5'-CGGAAGCTTCTAGAAGAGGTTGTAGTGCTTTCTGACCGGCTCTAAAAC-3', where the NdeI and HindIII restriction sites are shown in bold and the initiation and stop codons are italicized. The PCR product was digested with NdeI and HindIII and inserted into the vector pET-25b between the same restriction sites after treatment with calf intestinal alkaline phosphatase (CIP). The resulting plasmid pET-25b-TM1112 was used to transform Escherichia coli strain Rosetta (DE3) (Novagen) and the protein was expressed in M9 minimal medium containing 1 g l-1 15NH4Cl and 4 g l-1 [13C6]-D-glucose (Cambridge Isotope Laboratories) as the sole sources of nitrogen and carbon. After the addition of 100 mg l-1 ampicillin, the cells were grown at 310 K to an OD600 of 0.45, induced with 1 mM isopropyl -D-1-thiogalactopyranoside (IPTG) and grown for a further 3 h to a final OD600 of 1.10. The cells were harvested at 5000g and 277 K for 5 min and frozen at 253 K overnight. The next day, the cell pellet was thawed and resuspended in 30 ml buffer A (25 mM sodium phosphate pH 7.6, 25 mM NaCl, 2 mM DTT) containing one Complete EDTA-free protease-inhibitor cocktail tablet (Roche) and lysed by ultrasonication. The soluble fraction of the cell lysate was isolated by centrifugation for 30 min at 20 000g and 277 K, decanting and filtration through a 0.22 µm pore-size filter. The solution was then incubated in a water bath at 348 K for 20 min. Precipitated material was removed by centrifugation at 6000g for 20 min at 277 K. The supernatant was recovered and passed through the aforementioned filter before application onto a 5 ml HiTrap QHP column (GE Healthcare) pre-equilibrated in buffer A. TM1112 eluted in the flowthrough during sample injection. These fractions were pooled and concentrated to 12 ml by ultrafiltration using an Amicon ultracentrifugal filter device with 5 kDa molecular-weight cutoff (Millipore) and then applied onto a HiLoad 26/60 column of Superdex 75 gel-filtration resin (GE Healthcare) pre-equilibrated in NMR buffer A (25 mM sodium phosphate pH 6.8, 50 mM NaCl, 0.5 mM DTT). The fractions containing TM1112 were pooled and concentrated from 50 ml to 500 µl by ultrafiltration. All purification steps were monitored by SDS-PAGE. The yield of purified TM1112 was 30 mg per litre of culture.
NMR samples were prepared by adding 10%(v/v) D2O, 4.5 mM d10-DTT and 0.03%(w/v) NaN3 to 500 µl of a 1.3 mM solution of 15N,13C-labeled TM1112 in NMR buffer A.
The plasmid vector MH4a containing the TM1367 gene obtained from the JCSG Crystallomics Core was used as the template for PCR amplification with the primers 5'-CCGCATATGAGAGTTGAACTCCTCTTTGAAAGTGGAAAATGTG-3' and 5'-CGGAAGCTTCTATGAGGATGCAAATCTGACGGCG-3', where the NdeI and HindIII restriction sites are shown in bold and the initiation and stop codons are italicized. The expression and purification of this protein then followed the same protocol as described in §2.1 for TM1112, with the following modifications. The cells were resuspended in 24 ml buffer B (25 mM sodium phosphate pH 6.8, 25 mM NaCl, 1 mM DTT) containing half of a Complete EDTA-free protease-inhibitor cocktail tablet (Roche). Buffer B was also used to pre-equilibrate the 5 ml HiTrap QHP column (GE Healthcare) used in a subsequent purification step, which yielded TM1367 fractions that were pooled into two volumes of 12 ml and applied onto a HiLoad 26/60 column of Superdex 75 gel-filtration resin (GE Healthcare) pre-equilibrated in NMR buffer B (25 mM sodium phosphate pH 6.0, 50 mM NaCl, 0.5 mM DTT). The fractions containing TM1367 were again pooled and concentrated from 60 ml to 500 µl by ultrafiltration using an Amicon ultracentrifugal filter device with 5 kDa molecular-weight cutoff (Millipore). The yield of purified TM1367 was 32 mg per litre of culture.
NMR samples were prepared by adding 10%(v/v) D2O, 4.5 mM d10-DTT and 0.03%(w/v) NaN3 to 500 µl of a 1.3 mM solution of 15N,13C-labeled TM1367 in NMR buffer B.
NMR experiments were conducted at 298 K on Bruker Avance 600 and 800 MHz spectrometers equipped with TXI HCN z-gradient and xyz-gradient room-temperature probes, respectively. 4D APSY-HACANH, 5D APSY-HACACONH and 5D APSY-CBCACONH data sets were recorded with 16, 16 and 16 projections, respectively (Hiller et al., 2005, 2008). Three NOESY spectra were recorded with a mixing time of 60 ms: 3D [1H,1H]-NOESY-15N-HSQC, 3D [1H,1H]-NOESY-13C(ali)-HSQC and 3D [1H,1H]-NOESY-13C(aro)-HSQC. The 13C carrier frequencies were set at 25 and 122 p.p.m., respectively, for obtaining the aliphatic and aromatic spectral regions. Chemical shifts were referenced internally to the 2,2-dimethyl-2-silapentane-5-sulfonate (DSS) signal (Wishart & Sykes, 1994). The chemical shift of the solvent water resonance relative to DSS was 4.796 p.p.m.
The polypeptide-backbone resonance assignments were obtained from APSY-generated four- and five-dimensional peak lists, which were used as input for automated backbone assignment with v.2.2 of the program MATCH (Volk et al., 2008) in the UNIO software package. The backbone assignments were then interactively checked and completed. Side-chain resonance assignments were obtained with the automated routine of v.2.2 of the program ATNOS/ASCAN (Herrmann et al., 2002a; Fiorito et al., 2008) in the UNIO software package, using as input the aforementioned 3D 15N-resolved and 13C-resolved [1H,1H]-NOESY spectra. The automatic assignments were interactively checked and extended using the software CARA (Keller, 2004). NOE distance restraints were automatically collected using the same three NOESY data sets as for the side-chain assignments as input for v.2.2 of the ATNOS/CANDID programs (Herrmann et al., 2002a,b) in the UNIO software package. The structure calculation and energy refinement were performed as described in Jaudzems et al. (2010).
Structure validation was performed as described in Jaudzems et al. (2010). The chemical shifts have been deposited in the BioMagResBank (entry Nos. 16006 and 16007 for TM1112 and TM1367, respectively; http://www.bmrb.wisc.edu ). The atomic coordinates of the bundles of 20 NMR conformers have been deposited in the PDB (accession codes 2k9z for TM1112 and 2ka0 for TM1367; http://www.rcsb.org/pdb/ ).
We follow the strategy introduced in Jaudzems et al. (2010). To compute the reference crystal structure, the positions of the H atoms in the crystal were calculated using the standard residue geometry from the AMBER94 library in the software MOLMOL (Koradi et al., 1996). All intra- and inter-residual distances shorter than 5.0 Å between pairs of H atoms were then extracted and those involving labile protons with fast chemical exchange (Wüthrich, 1986) were eliminated from the resulting list. The input of upper-limit distance bounds for the structure calculation was generated by increasing these proton-proton distances by 15%. This `loosening' of the distance constraints is in line with the basic strategy of interpreting 1H-1H NOEs in terms of upper-limit distance bounds (Wüthrich, 1986). For the NMR reference structure, we followed a three-step protocol: (i) a list was prepared of all the 1H-1H distances shorter than 5.0 Å in the 20 conformers that represent the NMR structure; (ii) a new list was obtained that included the longest distance among the 20 conformers for each pair of H atoms in the list resulting from (i); and (iii) the input of upper-limit distance bounds contained all entries in list (ii) that were shorter than 5.75 Å [this value was empirically selected as the shortest cutoff that gave virtually identical results for the structure calculation as an input consisting of the complete list (ii)].
The techniques used here have been described in Jaudzems et al. (2010). The global per-residue displacements between structure bundles refer to the mean structures calculated after superposition for minimal r.m.s.d. of the backbone atoms of residues 2-89 for TM1112 and 2-123 for TM1367.
Comparison of the bundle of 20 conformers representing the NMR structure of TM1112 with the two molecules in the asymmetric unit (a.s.u.) of the crystal structure (denoted here as CrystA and CrystB following the corresponding chain designation; PDB code 1o5u ) reveals remarkable similarity (Fig. 1) and comparable results were obtained for the comparison between the bundle of 20 NMR conformers of TM1367 with the three molecules in the a.s.u. of the crystal structure (denoted here as CrystA, CrystB and CrystC following the corresponding chain designation; PDB code 1zx8 ; Fig. 2). For TM1112, the NMR and crystal structures superimpose with global backbone heavy-atom r.m.s.d. values below 1.0 Å for residues 2-89 (Fig. 3). For TM1367, global r.m.s.d values below 1.0 Å were calculated for the backbone heavy atoms of residues 2-123 (Fig. 4).
| || Figure 1 |
Amino-acid sequence and NMR structure of the protein TM1112 and comparison of the NMR structure with the crystal structure. (a) Stereo ribbon diagram of the NMR conformer closest to the mean coordinates of the bundle of conformers in (b). Color code: -strands, cyan; helices, red/yellow; nonregular secondary structure, gray. The individual regular secondary structures are identified and the two chain ends are marked N and C. (b) Stereoview of a superposition for best fit of the polypeptide-backbone heavy atoms of residues 2-89 of the two molecules in the crystal asymmetric unit, CrystA and CrystB (black lines), with the bundle of 20 conformers that represent the NMR structure (brown). In generating this picture, CrystB was superimposed for best fit with CrystA and then each one in the ensemble of 20 NMR conformers was superimposed for best fit of the polypeptide-backbone heavy atoms with CrystA. (c) Amino-acid sequence. The locations of regular secondary structure are indicated above the sequence using the same color code as in (a).
| || Figure 2 |
Amino-acid sequence and NMR structure of the protein TM1367 and comparison of the NMR structure with the crystal structure. The same presentation is used as in Fig. 1.
| || Figure 3 |
Analysis of the crystal structure, the NMR structure and the reference crystal and NMR structures of TM1112. (a) R.m.s.d. values describing the precision of the structure determinations by NMR in solution at 298 K and by X-ray diffraction in crystals at 100 K. The smaller boxes show the r.m.s.d. values for pairwise comparisons between the bundle of 20 NMR conformers and CrystA and CrystB. For the crystal structure, `global deviations' corresponding to r.m.s.d.s were computed from the experimental B values using equations (2)-(5) in Jaudzems et al. (2010). For the structure comparisons, r.m.s.d. values for residues 2-89 were computed between the atom coordinates of the indicated crystal structure molecule and those of the conformer closest to the mean atom coordinates of the ensemble of 20 NMR conformers. The atoms used for the comparisons are bb, the backbone atoms N, C and C'; co, core heavy atoms defined as having less than 15% solvent accessibility; ha, all heavy atoms. (b) Corresponding data as in (a) for the reference NMR structure, the reference crystal structure computed from input collected with CrystA and for pairwise comparisons with the experimental structures. Numbers framed by thick lines represent the precision of the experimental NMR and crystal structures and their comparisons, those framed by medium lines represent the precision of the reference NMR and reference crystal structures and their comparison and those framed by thin lines represent the comparisons between experimental and reference structures.
| || Figure 4 |
Analysis of the crystal structure, the NMR structure and the reference crystal and NMR structures of TM1367. The same presentation is used as in Fig. 3.
With this starting platform, we aimed here to evaluate the accuracy and precision of the independently determined NMR and crystal structures and to investigate differences that may be associated with the different chemical environments in solution and in the crystal. To this end, we continued to explore a recently introduced approach to reduce possible bias from the different software used for structure determination by the two techniques (Jaudzems et al., 2010), which is based on the use of reference NMR and crystal structures computed with the NMR software from distance constraints measured in the NMR and crystal structure models, respectively. We now describe the results of these investigations using two T. maritima proteins, TM1112 and TM1367, which also include comparisons with an NMR structure of TM1112 solved by the NESG (PDB code 1lkn ).
The TM1112 NMR structure was solved by the JCSG at 298 K in 20 mM sodium phosphate buffer pH 6.8, 50 mM sodium chloride, 5 mM DTT and 0.03%(w/v) sodium azide. The JCSG crystal structure was determined at 1.83 Å resolution at 100 K using a crystal obtained at 277 K from 20 mM Tris buffer pH 7.9 and 25.5%(w/v) PEG 4000 (McMullan et al., 2004). TM1112 comprises seven antiparallel -strands (residues 4-6, 23-26, 30-35, 39-52, 57-60, 65-68 and 72-88), an -helix (10-16) and a 310-helix (18-20). These regular secondary-structure elements are arranged in the sequential order 1-1-310-2-3-4-5-6-7 (Fig. 1). The -barrel consists of two connecting sheets, 1-6-4-7-2 and 5-4-7-3, which are coupled together by two highly twisted strands 4 and 7 that extend from one sheet to the other. The helical segment (residues 10-20) is inserted between 1 and 2 on opposite ends of the -barrel via two well defined loops (residues 7-9 and 21-24) and traverses one face of the -barrel. Statistics for the NMR structure determination are given in Table 1 and those for the crystal structure have been presented elsewhere (McMullan et al., 2004).
+Structure calculated with CYANA from conformational constraints derived from the molecular model representing the crystal structure (Jaudzems et al., 2010).
§Structure calculated with CYANA from conformational constraints derived from the bundle of 20 molecular models representing the NMR structure (Jaudzems et al., 2010).
##1 kcal = 4.186 kJ.
++The numbers in parentheses indicate the residues for which the r.m.s.d. was calculated.
§§As determined by PROCHECK (Laskowski et al., 1993). The crystal structure (1o5u ) deposited in the PDB has 96.8% of residues in favored regions, 3.2% additionally allowed, 0% generously allowed and 0% disallowed.
The reference NMR and crystal structures were calculated from a significantly larger number of upper-limit distance constraints than the experimental NMR structure. The main factors causing the numbers of constraints to be different (Table 1) are that owing to the limited resolution and sensitivity of the NMR measurements only a fraction of the short 1H-1H distances are collected in the experimental structure determination, whereas in the aforementioned molecular models all of the short contacts are evaluated. Furthermore, in the present reference structure calculations only the methyl groups were represented by pseudo-atoms (Wüthrich et al., 1983), whereas in the experimentally collected input the methylene groups and the pairs of symmetry-related ring protons of Phe and Tyr were also represented by pseudo-atoms.
Comparison of the two molecules in the crystal structure with the NMR conformer closest to the mean coordinates of the bundle of 20 NMR conformers (Fig. 3a) yielded backbone r.m.s.d. values of 0.90 and 0.87 Å and all-heavy-atom r.m.s.d.s of 1.73 and 1.63 Å. The crystal structure and the reference crystal structure exhibit closely similar r.m.s.d.s relative to the experimental NMR structure and the same holds for the relationships between the NMR and reference NMR structures relative to the crystal structure (Fig. 3b).
Regarding the precision with which the experimental structures and the reference structures are defined, the present study coincides with previous observations on the treatment of the crystal structure with the NMR software (Jaudzems et al., 2010). While the global r.m.s.d. for all heavy atoms in the crystal structure is nearly identical to the r.m.s.d. value obtained for the backbone heavy atoms, the r.m.s.d. values calculated for the corresponding selections of atoms in the reference crystal structure give values that differ by about twofold, which is similar to the corresponding ratio of the r.m.s.d. values for the NMR structure and the reference NMR structure (Fig. 3b). In addition, we have the new observation that the all-heavy-atom r.m.s.d. between the two molecules in the crystal asymmetric unit is more than twofold greater than the corresponding backbone r.m.s.d.s. In this regard, the `bundle' consisting of the two independent molecules in the crystal structure shows a similar behavior to the bundle of conformers that represent the NMR structure in solution (Fig. 3).
For comparisons at a resolution of individual amino-acid residues, we used the per-residue displacement, , for the NMR structure and the two reference structures and an empirical determination of values by a linear fit of the values for CrystA to the values of the reference crystal structure A (Fig. 5; Jaudzems et al., 2010). The values for CrystA and CrystB vary in a narrow range of about ±0.05 Å along the amino-acid sequence and the same holds for the values (Fig. 5b). The profile of per-residue displacements versus amino-acid sequence for the NMR structure shows larger variations than the crystal structure and is closely similar to that of the reference NMR structure. With the exception of helix 1, the regular secondary structures show lower values than the intervening linker peptides (Fig. 5c). The N-terminus of helix 1 (Thr9-Trp21) is involved in extensive crystal packing with the neighboring crystallographically related molecule, which could explain the shift versus the NMR structure. Overall, there is thus no indication in the data of Figs. 5(b) and 5(c) of any polypeptide segments with outstanding local structural differences, except possibly the apparent lower precision of the 1 helix in the NMR structure.
| || Figure 5 |
Per-residue values for the backbone heavy atoms in CrystA and CrystB of TM1112, per-residue global backbone and all-heavy-atom displacements between CrystA and CrystB and mean values of the per-residue pairwise backbone displacements among the bundles of 20 conformers representing the NMR structure and the reference NMR and crystal structures. (a) Linear least-squares fit of the values for the CrystA versus the corresponding displacements in the reference crystal structure, . The resulting representation of the values by is used for comparisons with the values for the other structures (Jaudzems et al., 2010). (b)-(d) Plots of per-residue polypeptide backbone displacements versus the sequence. (b) CrystA and CrystB and reference crystal structure A. For crystal structure B, the same relation between and was used as for CrystA. The locations of the regular secondary structures are indicated and asterisks identify the residues with solvent accessibility below 15% in the NMR structure. (c) NMR structure and reference NMR structure. (d) Backbone displacements between CrystA and CrystB in the crystal asymmetric unit. (e) All-heavy-atom displacements, , between CrystA and CrystB. Residues with large values are identified.
The close fit between the NMR and crystal structures of TM1112 manifested by the global r.m.s.d. values (Fig. 3) can be rationalized by comparison of their torsion angles (Fig. 6). The backbone dihedral angles in the NMR structure are defined with high precision, with only five residues, Met1, Glu2, Pro8, Thr9 and Ser17, showing a spread of the and/or values greater than ±100° among the 20 NMR conformers, where the crystal structure and dihedral angles are within the range covered by the 20 NMR conformers. Overall, 85% of the and dihedral angle values in the crystal structure lie within the ranges covered by the 20 NMR conformers. Deviations by more than 15° from the range covered by the NMR conformers are found only for 14 of the 89 residues, all of which are located in solvent-exposed loop regions of the protein (Fig. 6a). The corresponding data for the reference NMR structure (Fig. 6b) show qualitatively similar features, as seen in Fig. 6(a), and the reference crystal structure shows a very close coincidence with the crystal structure for the entire polypeptide chain; a spread greater than 50° in the reference crystal structure was observed only for the values of residues 1 and 63 (Fig. 6c). The 1 side-chain torsion angles in the NMR structure show a large spread among the 20 conformers for 29 of the 89 residues and 34 residues show a large spread for 2 (Fig. 6d), but only 15 1 and 13 2 values of the crystal structure (only molecule A is shown) do not fall within the range covered by the 20 NMR conformers. The data of Fig. 6(d) are faithfully reproduced by the comparison of the reference NMR structure with the crystal structure (Fig. 6e), with an apparent discrepancy seen only for the turn linking strands 2 and 3. The reference crystal structure shows a very close fit to the crystal structure (Fig. 6f), despite large spreads of 1 or 2 values for residues devoid of non-labile H atoms in the peripheral atom groups (Jaudzems et al., 2010). The profiles of the plots of the occluded surface packing (OSP; Pattabiraman et al., 1995; Fleming & Richards, 2000) show similar patterns for the four experimental and reference structures, except for a strictly localized difference near the turn between 2 and 3 (residues 24-28; Fig. 7a). OSP values of 0.4 are seen exclusively for residues with low solvent accessibility. Fig. 7(b) shows that the standard deviations for the bundle of 20 experimental NMR conformers are small when compared with the OSP variations along the sequence, which documents not only that the comparisons in Fig. 7(a) are meaningful, but that the variations of 1 or 2 in the bundle of NMR conformers (Fig. 6d) are confined to ranges that are compatible with high packing density.
| || Figure 6 |
Variation of the backbone dihedral angles and side-chain torsion angles in the bundles of 20 energy-refined conformers representing the NMR structure and the reference structures (Fig. 3) of TM1112 and comparisons with CrystA. The spread of the values for the backbone dihedral angles and among the 20 conformers representing the NMR structure (a), the reference NMR structure (b) and the reference crystal structure A (c) (Fig. 3) is represented by blue vertical bars; the red dots indicate the deviations of the crystal structure values from the corresponding mean values for the bundles of 20 conformers, which are at 0°. (d)-(f) The same presentation as in (a) to (c) for the side-chain torsion angles 1 and 2 in the NMR structure (d), the reference NMR structure (e) and the reference crystal structure A (f). The locations of the regular secondary structures are indicated and asterisks identify the residues with solvent accessibility below 15% in the NMR structure.
| || Figure 7 |
Occluded surface packing along the polypeptide chain of TM1112. (a) Plots versus the amino-acid sequence of the per-residue occluded surface packing (Pattabiraman et al., 1995) for the NMR structure (red), CrystA (blue), the reference NMR structure (green) and the reference crystal structure A (black). For the NMR structure and the two reference structures, the OSP values for the conformer closest to the mean atom coordinates are shown. At the top, the locations of the regular secondary structures are indicated; asterisks identify residues with solvent accessibility below 15% in the NMR structure. (b) Plot versus the amino-acid sequence of the mean per-residue OSP values in the bundle of NMR conformers and the standard deviations among the 20 NMR conformers.
In the context of the present structure comparisons, the displacements between the two crystal structure molecules A and B, (Figs. 5d and 5e), are of special interest since they indicate that the `bundle' of `conformers' A and B in the crystal mimics the bundle of 20 NMR conformers in solution. Thus, the values in helix 1 are paralleled by high values of and even more pronounced correlations are seen for the all-heavy-atom per-residue displacements (Fig. 5e). With the sole exceptions of Leu13 and Val82, large values are observed only for charged solvent-accessible residues, which coincides with the low precision of these side chains in the NMR structure. Likewise, high and values do not correlate with the values for the corresponding values in the individual molecules in the crystal structure (Fig. 5b) and reflect actual differences of surface residues in CrystA and CrystB.
We used the comparison of the NMR structures of TM1112 solved independently by the JCSG and the NESG as an additional criterion to evaluate the quality of the structure obtained with the new JCSG protocol, which uses UNIO (Herrmann et al., unpublished work) for extensive automation. It is of special interest that the NESG NMR structure was used for molecular replacement to solve the JCSG crystal structure, so that comparison of the three structures might further provide insight into any possible bias in the techniques used.
Fig. 8(a) shows that the two NMR structures have been determined with nearly identical precision and that the r.m.s.d.s for the structure comparisons exceed those of the two separate bundles by about threefold. Pairwise comparisons of the NMR structures with CrystA and CrystB show the closest fits between the JCSG NMR structure and the crystal structures. It is remarkable that although the NESG structure was used to determine the crystal structure by molecular replacement, the r.m.s.d.s with the crystal structure are significantly larger than those for the JCSG NMR structure and the crystal structure; it is notable that when side-chain atoms are included in the comparison the r.m.s.d. values increase significantly (bb versus co or ha in Fig. 8a). The local origins of the contributions to the global r.m.s.d.s of Fig. 8(a) are visualized by the structure superpositions in Figs. 8(b)-8(d), where the lower definition of the core residues in the NESG NMR structure is clearly illustrated by Fig. 8(d). We conclude that the new JCSG NMR structure-determination protocol with automation through the use of the UNIO interface (Herrmann et al., unpublished work) yielded a structure that compares favorably with a structure obtained using a conventional interactive method. Since the JCSG NMR structure exhibits a closer fit with the crystal structure coordinates obtained by molecular replacement with the NESG NMR structure, further evidence is provided that the crystal structure determination was not biased by the molecular-replacement model used.
| || Figure 8 |
Comparison of the JCSG NMR structure and the NESG NMR structure of TM1112 with the crystal structure. (a) Global r.m.s.d. values describing the precision of the structure determinations of TM1112 by NMR and X-ray crystallography and pairwise comparisons of the four structures, as in Fig. 3. (b) Stereoview of a superposition for best fit of the polypeptide backbone heavy atoms of residues 2-89 of the NMR structures solved by the JCSG (brown; PDB code 2k9z ) and the NESG (cyan; PDB code 1lkn ). When generating this picture, we first computed the mean atom coordinates of the 20 JCSG NMR conformers (Fig. 1b) and identified the conformer closest to the mean. Each of the other 19 JCSG NMR conformers and the ten NESG NMR conformers were then superimposed for best fit of the backbone heavy atoms with this conformer. (c) Stereoview of the aromatic side chains in the bundle of 20 conformers representing the JCSG NMR structure (brown) superimposed with CrystA and CrystB (black). (d) The same presentation as in (c) for the NESG bundle of ten NMR conformers, with the aromatics in cyan. For ease of orientation in (c) and (d), the backbone of the best NMR conformer in each bundle is indicated with a thin line. When generating the drawings (c) and (d), CrystB and the 20 JCSG NMR conformers or the ten NESG NMR conformers, respectively, were superimposed for best fit of the polypeptide-backbone heavy atoms with CrystA.
The NMR structure was solved at 298 K in 25 mM sodium phosphate buffer pH 6.0, 50 mM sodium chloride, 5 mM DTT and 0.03%(w/v) sodium azide. The crystal structure was determined to 1.90 Å resolution at 100 K using a crystal obtained at 277 K from 100 mM phosphate-citrate buffer pH 4.2, 200 mM sodium chloride and 50%(w/v) PEG 200 (Jin et al., 2006). The NMR structure of TM1367 comprises 11 -strands consisting of residues 2-7, 11-16, 32-34, 37-38, 42-44, 57-58, 65-69, 74-78, 95-96, 99-103, 117-120 and one -helix and one 310-helix consisting of residues 21-29 and 105-110, respectively, as illustrated in Fig. 2(c). The structure contains a nine-stranded antiparallel -barrel composed of strands 4-5-8-7-10-2-1-11-3 that connect to a two-stranded antiparallel sheet composed of 6 and 9 at 7 (Fig. 2a). The two helices cover the open ends of the -barrel. Since the -barrel is strongly twisted, the axes of the two helices are approximately perpendicular to each other. Statistics for the NMR structure determination of TM1367 are given in Table 2 and those for the crystal structure have been presented elsewhere (Jin et al., 2006). The comparison of the three molecules in the crystal structure of TM1367 with the bundles of 20 conformers representing the NMR structure and the reference structures yielded very similar results as observed for TM1112 (Figs. 2, 4, 9, 10 and 11). It is worth noticing that the value for residue Gly40 is not reproduced by the values in the crystal structure. The lower precision for Gly40 in the reference crystal structure is clearly related to the use of the NMR software with a low number of constraints for Gly, as indicated by the coincidence with the NMR structure and the NMR reference structure. Overall, this more complex structure was determined with similar precision and comparable coincidence with the crystal structure. While the complete data are presented in the aforementioned figures, we limit the following discussion to selected features that support key results from the investigation of TM1112 as well as from previous studies (Jaudzems et al., 2010).
+Structure calculated with CYANA from conformational constraints derived from the molecular model representing the crystal structure (Jaudzems et al., 2010).
§Structure calculated with CYANA from conformational constraints derived from the bundle of 20 molecular models representing the NMR structure (Jaudzems et al., 2010).
##1 kcal = 4.186 kJ.
++The numbers in parentheses indicate the residues for which the r.m.s.d. was calculated.
§§As determined by PROCHECK (Laskowski et al., 1993). The crystal structure (1zx8 ) deposited in the PDB has values of 98.4% of residues in favored regions, 1.6% additionally allowed, 0% generously allowed and 0% disallowed.
| || Figure 9 |
Per-residue values of the backbone heavy atoms in CrystA, CrystB and CrystC of TM1367, per-residue pairwise global displacements between the three structures in the asymmetric unit of the crystal and mean values of the per-residue pairwise global displacements among the bundles of 20 conformers representing the NMR and reference structures. The same presentation is used as in Fig. 5, except that no data are given (see text).
| || Figure 10 |
Variation of the backbone dihedral angles in the bundles of 20 energy-refined conformers representing the NMR structure and the reference structures (Fig. 4) of TM1367 and comparisons with CrystA. The same presentation is used as in Fig. 6.
| || Figure 11 |
Occluded surface packing along the polypeptide chain of TM1367. The same presentation is used as in Fig. 7. The crystal structure is represented by CrystA and RefCrystA.
In terms of the global r.m.s.d. values (Fig. 4), the bundle of three crystal `conformers' mimics the behavior of the bundle of 20 NMR conformers in that the all-heavy-atom r.m.s.d. values are about twofold larger than the corresponding backbone r.m.s.d.s and the same is observed for the reference crystal structure. The somewhat smaller heavy-atom r.m.s.d.s with respect to TM1112 are a consequence of the fact that some side chains of solvent-exposed residues in CrystB and CrystC have been truncated owing to a lack of interpretable electron density (Jin et al., 2006).
The residue-by-residue data on the precision of the individual structure determinations and the structure comparisons (Figs. 9, 10 and 11) document a close coincidence of the different structures, similar to TM1112. Furthermore, as in TM1112 the regions with the largest pairwise displacements among CrystA, CrystB and CrystC (Figs. 9d, 9e and 9f) correlate with similar increased variation among the NMR conformers (Fig. 9c); notwithstanding, the structure variations among the molecules in the crystal are more pronounced and these increased displacements are only seen for polypeptide segments with nonregular secondary structure. A particular case is the prominent peaks in Figs. 9(d), 9(e) and 9(f), which seem to correspond to regions with significantly different environments in the a.s.u., such as the loop 85-89, which is involved in a crystal contact in CrystA, is in a solvent channel in CrystB, and in CrystC contacts the His tag of CrystA. Another example is the large displacement observed for Glu106, which in CrystC interacts with the side chain of Glu62 of a symmetry-related molecule via a water molecule. In contrast to TM1112, the residues with the largest variations among molecules CrystA, CrystB and CrystC also have the largest values (this analysis was not extended to the all-heavy-atom displacements because of the aforementioned high percentage of truncated side chains in CrystB and CrystC; PDB entry 1zx8 ).
The present comparisons of the NMR and crystal structures of the proteins TM1112 and TM1367 provide additional support that the new JCSG NMR structure-determination protocol, which includes extensive automation through use of the UNIO software (Herrmann et al., 2002a,b; Volk et al., 2008; Fiorito et al., 2008; Herrmann et al., unpublished work), yields highly precise and accurate structures of small globular proteins which compare favorably with the results from more highly interactive and time-consuming conventional procedures. An interesting new insight has emerged from this analysis of the multiple molecules or `conformers' in the crystal asymmetric unit, since these `bundles of conformers' reproduced features that were observed in solution for the ensemble of NMR conformers both with regard to global r.m.s.d.s as well as to residue-by-residue comparisons along the amino-acid sequence. However, the situation in the crystal is not strictly analogous to that in solution as the different molecules in the crystal asymmetric unit may have different chemical environments and, hence, adopt slightly different structures depending on the environment. Furthermore, at ultrahigh resolution, truly different conformers may be able to be discerned and interpreted for each of the molecules in the crystal structure either at the side-chain or at the backbone level. Nevertheless, at the presently achieved resolution of about 1.8 Å, superposition of the different molecules in the asymmetric unit emulates the conformational polymorphisms seen by NMR in solution, as has been implicated by a wide range of previous studies (e.g. Wilson & Brunger, 2000; DePristo et al., 2004; Furnham et al., 2006; Levin et al., 2007; Kondrashov et al., 2008). Furthermore, the recently introduced strategy of `reference structures', which are obtained by treatment of the crystal structural data with the NMR software, validates and supports the information derived from the detailed structure comparisons in the crystal and in solution. In particular, the reference crystal structure clearly manifests the same features that are derived from comparison of the multiple independent structures in the crystal. Thus, this extensive analysis of the comparisons of the crystal and NMR structures suggests that either individual molecules or groups of two or several molecules in the asymmetric unit of the crystal can provide valuable indications about the conformational polymorphisms that are present in solution.
This work was supported by the NIH, National Institute of General Medical Sciences, Protein Structure Initiative grant U54 GM074898. PS was also supported by a fellowship from the Spanish Ministry of Science and Education and KJ by a fellowship from the Latvian Institute of Organic Synthesis and BM, KJ and PS received support from the Skaggs Institute of Chemical Biology. Kurt Wüthrich is the Cecil H. and Ida M. Green Professor of Structural Biology and Ian Wilson is the Hansen Professor of Structural Biology at TSRI. Portions of this research were carried out at the Stanford Synchrotron Radiation Lightsource (SSRL) and the Advanced Light Source (ALS), which are national user facilities operated by Stanford University on behalf of the US Department of Energy, Office of Basic Energy Sciences. The SSRL Structural Molecular Biology Program is supported by the Department of Energy, Office of Biological and Environmental Research and by the National Institutes of Health (National Center for Research Resources, Biomedical Technology Program and the National Institute of General Medical Sciences). Genomic DNA from T. maritima was obtained from the American Type Culture Collection (ATCC 43067). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the National Institute of General Medical Sciences or the National Institutes of Health.
DePristo, M. A., de Bakker, P. I. W. & Blundell, T. L. (2004). Structure, 12, 831-838.
Fiorito, F., Herrmann, T., Damberger, F. F. & Wüthrich, K. (2008). J. Biomol. NMR, 42, 23-33.
Fleming, P. J. & Richards, F. M. (2000). J. Mol. Biol. 299, 487-498.
Furnham, N., Blundell, T. L., DePristo, M. A. & Terwilliger, T. C. (2006). Nature Struct. Mol. Biol. 13, 184-185.
Herrmann, T., Güntert, P. & Wüthrich, K. (2002a). J. Biomol. NMR, 24, 171-189.
Herrmann, T., Güntert, P. & Wüthrich, K. (2002b). J. Mol. Biol. 319, 209-227.
Hiller, S., Fiorito, F., Wüthrich, K. & Wider, G. (2005). Proc. Natl Acad. Sci. USA, 102, 10876-10881.
Hiller, S., Wider, G. & Wüthrich, K. (2008). J. Biomol. NMR, 42, 179-195.
Jaudzems, K., Geralt, M., Serrano, P., Mohanty, B., Horst, R., Pedrini, B., Elsliger, M.-A., Wilson, I. A. & Wüthrich, K. (2010). Acta Cryst. F66, 1367-1380.
Jin, K. K. et al. (2006). Proteins, 63, 1112-1118.
Keller, R. (2004). Computer-Aided Resonance Assignment. http://cara.nmr.ch/ .
Kondrashov, D. A., Zhang, W., Aranda, R., Boguslaw, S. & Phillips, G. N. Jr (2008). Proteins, 70, 353-362.
Koradi, R., Billeter, M. & Wüthrich, K. (1996). J. Mol. Graph. 14, 51-55.
Laskowski, R. A., MacArthur, M. W., Moss, D. S. & Thornton, J. M. (1993). J. Appl. Cryst. 26, 283-291.
Levin, E. J., Kondrashov, D. A., Wesenberg, G. E. & Phillips, G. N. Jr (2007). Structure, 15, 1040-1052.
McMullan, D. et al. (2004). Proteins, 56, 615-618.
Pattabiraman, N., Ward, K. B. & Fleming, P. J. (1995). J. Mol. Recognit. 8, 334-344.
Volk, J., Herrmann, T. & Wüthrich, K. (2008). J. Biomol. NMR, 41, 127-138.
Wilson, M. A. & Brunger, A. T. (2000). J. Mol. Biol. 301, 1237-1256.
Wishart, D. & Sykes, B. (1994). J. Biomol. NMR, 4, 135-140.
Wüthrich, K. (1986). NMR of Proteins and Nucleic Acids. New York: Wiley-Interscience.
Wüthrich, K., Billeter, M. & Braun, W. (1983). J. Mol. Biol. 169, 949-961.