Preliminary X-ray crystallographic analysis of the secreted chorismate mutase from Mycobacterium tuberculosis: a tricky crystallization problem solved

A method is presented that allowed the diffraction limit of crystals of the secreted chorismate mutase from M. tuberculosis to be improved from approximately 3.5 to 1.3 Å. To obtain large well diffracting crystals, it was critical to initiate crystallization at higher precipitant concentration and then transfer the drops to lower precipitant concentrations within 5–15 min.


Introduction
The biosynthesis of amino acids is an essential process in living cells as it provides the building blocks for polypeptides and important primary and secondary metabolites. The shikimate pathway is the anabolic pathway employed by bacteria, fungi and plants to synthesize the aromatic amino acids phenylalanine (Phe), tyrosine (Tyr) and tryptophan (Trp) (Haslam, 1993). Chorismate mutase (CM; EC 5.4.99.5) is located at the branch point of the shikimate pathway and channels chorismate into the Tyr/Phe-specific branch. The enzyme catalyzes the conversion of chorismate to prephenate, which is formally a Claisen rearrangement. Chorismate mutases typically accelerate the uncatalyzed rearrangement by roughly one millionfold (Andrews et al., 1973). As one of the few enzymes known to catalyze a pericyclic process, it has generated considerable interest among enzymologists and bioorganic and computational chemists, specifically with respect to the origins of the catalytic efficiency (Andrews et al., 1973;Gö risch, 1978;Chook et al., 1994;Haynes et al., 1994;Wiest & Houk, 1995;Lee, Stewart et al., 1995;Kast, Asif-Ullah, Jiang et al., 1996;Ma et al., 1998;Mattei et al., 1999;Gustin et al., 1999;Kast et al., 2000;Kienhö fer et al., 2003;Guo et al., 2003;Guimarã es et al., 2003;S trajbl et al., 2003;Zhang et al., 2005).
Structures have been determined for chorismate mutases from Bacillus subtilis (Chook et al., 1993(Chook et al., , 1994Ladner et al., 2000), Escherichia coli (Lee, Karplus et al., 1995), Saccharomyces cerevisiae (Xue et al., 1994;Strä ter et al., 1996Strä ter et al., , 1997 and the catalytic antibody 1F7 with chorismate mutase activity (Haynes et al., 1994). Recently, coordinates have also been deposited in the PDB for chorismate mutases from Thermus thermophilus (PDB codes 1ode, 1ufy, 1ui9) and Clostridium thermocellum (PDB code 1xho; Xu et al., 2005). Interestingly, these examples comprise three completely unrelated folds: while the structurally characterized enzymes from B. subtilis, T. thermophilus and C. thermocellum have a trimeric pseudo-/barrel fold typical of the so-called AroH class of chorismate mutases (Chook et al., 1994;MacBeath et al., 1998), 1F7 exhibits the characteristic immunoglobulin -sheet topology. The chorismate mutases of the AroQ class adopt yet another fold, which is dominated by -helices. Its prototype, the chorismate mutase domain (AroQ p ) of the bifunctional E. coli chorismate mutase-prephenate dehydratase (also known as EcCM), is an intertwined homodimer that comprises three -helices per monomer (Lee, Karplus et al., 1995). This fold has recently been classified as AroQ (Ö kvist et al., 2006). The allosteric yeast chorismate mutase represents the AroQ subclass (Ö kvist et al., 2006), a more elaborate version of the basic AroQ fold, which shares weak sequence similarity with EcCM and possesses a structurally related active-site domain (Xue et al., 1994;Xue & Lipscomb, 1995;Strä ter et al., 1996Strä ter et al., , 1997. A third AroQ subclass, initially known as *AroQ (Xia et al., 1993;Calhoun et al., 2001;Sasso et al., 2005) and recently classified as AroQ (Ö kvist et al., 2006), comprises a group of chorismate mutases that exhibit convincing sequence similarity to other AroQ sequences only in the N-terminal moiety (Sasso et al., 2005;Ö kvist et al., 2006). These enzymes are about twice the size of the classical AroQ domain and feature a cleavable signal peptide responsible for the export of these chorismate mutases from the cytoplasm (Xia et al., 1993;Calhoun et al., 2001;Sasso et al., 2005). It has been speculated that AroQ enzymes may play some role in the pathogenicity of its producer organism (for a discussion and references, see Sasso et al., 2005). Primary structure comparisons suggested that AroQ proteins also have other structural peculiarities (MacBeath et al., 1998;Sasso et al., 2005). In particular, a critical amino-terminal segment carrying a conserved active-site arginine is lacking in AroQ chorismate mutases. Instead, a homologous helical segment is present in the middle of the polypeptide chain, indicating that the structure of the exported AroQ proteins must be topologically distinct from those of the other known AroQ prototypes (Sasso et al., 2005). To investigate this issue and to gain insights into the structure-function relationship of these enzymes, which have the potential to serve as formidable specific drug targets, we have set out to determine the high-resolution crystal structure of the secreted chorismate mutase from the model pathogen Mycobacterium tuberculosis (*MtCM; encoded by open reading frame Rv1885c in strain H37Rv), which represents the first characterized example of an AroQ fold (Ö kvist et al., 2006).
However, crystallization of this enzyme turned out to be nontrivial. While small well ordered crystals could readily be obtained within less than half an hour of setting up the drops, it seemed impossible to obtain (fewer and) larger crystals with better diffraction quality. Common strategies to increase crystal size are to lower the precipitant or protein concentration, thereby moving in the phase diagram from a condition of high supersaturation (where nucleation occurs) to a concentration of lower supersaturation (thus reducing the number of spontaneously formed nuclei, but still staying within the 'labile' zone where nucleation can occur; Mikol & Giegé, 1992). Another well known factor that can influence nucleation is the temperature, which has an impact both on the phase diagram itself and on equilibration kinetics. As an alternative to temperature variation, the equilibration time might be influenced more predictably by applying a layer of oil on top of the reservoir of a vapourdiffusion setup (Chayen, 1997). In this way, equilibration is slowed down, warranting controlled entry into the 'labile' nucleation zone.
All of these protocols work relatively well if the nucleation rate increases gradually upon increasing precipitant or protein concentration. However, if instead there is an abrupt increase in nucleation rate, it can be very difficult to hit the point optimal for both nucleation and crystal growth (if such a point exists at all). In these cases, the application of seeding techniques may be attempted (Stura & Wilson, 1992;Stura, 1999), which approach the problem from the opposite end by placing crystal nuclei in the form of small seeds into the metastable zone optimal for crystal growth. Finding conditions appropriate for seeding has its own inherent difficulties, however, as the metastable zone can be rather narrow and excursions to either side of the phase diagram will inevitably result in either extensive secondary nucleation or dissolution of the seeds. If only little protein is available, this adds to the challenge. A few years ago, two new approaches to the problem were described (Saridakis et al., 1994;Saridakis & Chayen, 2000). The first method involves diluting microbatch drops after incubating them for several hours under spontaneous nucleation conditions (Saridakis et al., 1994), while the second approach relies on transferring cover slips of hanging-drop vapour-diffusion setups from nucleation conditions to the metastable zone (Saridakis & Chayen, 2000). In the present report, we describe the crystal optimization of a protein target that crystallizes very rapidly and show that this problem can be solved by adapting the approach by Saridakis & Chayen (2000) to very short crystallization times.

Crystallization
Two different protein batches were used for crystallization. In both cases, recombinant *MtCM was prepared from E. coli strain KA29 as described in Sasso et al. (2005) and concentrated to 10-20 mg ml À1 . Immediately prior to crystallization setups, the protein solutions were centrifuged at 10 000 rev min À1 for 10 min at 277 K in order to spin down any floating particles and aggregated insoluble protein   dard initial crystallization tests were then performed by the hangingdrop vapour-diffusion method in 24-well tissue-culture plates (Gö teborgs Termometerfabrik, Sweden) using siliconized glass cover slips from Hampton Research (CA, USA).
For the first crystallization trials, we used a 175-residue version of *MtCM (encoded on plasmid pKTU3-HCT) devoid of the cleavable signal peptide but containing a C-terminal His tag (Sasso et al., 2005). The protein sample (12.6 mg ml À1 ) was dissolved in 20 mM potassium phosphate buffer pH 7.5. Structure Screen 1 (Molecular Dimensions Ltd, England) gave a few leads, which were optimized, with the best preliminary condition consisting only of 15% PEG 8000 (without buffer). In order to explore different conditions which were not limited by the use of phosphate as buffer, we then switched to protein (18.6 mg ml À1 ) buffered with 20 mM Tris-HCl pH 8.0. This time, the leaderless untagged 167-residue version of *MtCM encoded by plasmid pKTU3-HT was used (Sasso et al., 2005). The first screen at room temperature (293 K) gave a hit for condition No. 18 (0.2 M sodium acetate, cacodylate pH 6.5, 30% PEG 8000). A shower of small, well shaped and strongly birefringent single crystals appeared within 20 min of the first crystallization setup (Fig. 1a). These crystals diffracted to 3-4 Å resolution. Subsequently, an intense effort was invested into optimizing the crystallization conditions and particularly into slowing down crystallization in order to obtain fewer but larger crystals. However, none of the modifications, such as lowering the protein or precipitant concentration, changing the volume ratios and temperature, using additives or layering the reservoirs with various oils etc., proved successful: we either still obtained a shower of small crystals within 20 min of setting up the drops (using a reservoir concentration of !20% PEG) or we did not obtain any crystals at all. Even seeding did not help. The final breakthrough came when we recalled a publication by Saridakis & Chayen (2000) in which nucleation was initiated at a high precipitant concentration (which is more likely to produce nucleation centres) followed by transferring the drops before crystals became visible macroscopically to reservoirs with a lower precipitant concentration (which is favourable for crystal growth). In our case, crystallization times were extremely short, so the drops needed to be transferred within less than 20 min. After initial tests showed dramatic improvements upon implementing this strategy, we adopted a more systematic approach and devised a grid screen in which both the incubation period before transfer and the concentration of the final reservoir solution were varied. The best results were obtained when suspending the drops (1.5 ml protein + 1.5 ml reservoir solution) over a starting reservoir (1 ml) containing 20% PEG 4000 (in addition to 0.2 M sodium acetate and cacodylate buffer pH 6.5) and then transferring them after 5-15 min to reservoirs containing only 5-15% PEG 4000 (other ingredients constant). In this way, huge crystals ($1 mm in the longest dimension) appeared overnight which did not look perfect but diffracted to beyond 1.6 Å resolution (Fig. 1b). In general, lower reservoir concentrations and shorter incubation times coincided with fewer (and larger) crystals, with the reservoir concentration having the larger impact (particularly for concentrations of <10% PEG).
To our knowledge, this is the first time that the technique of Saridakis & Chayen (2000) has been applied to such short crystallization times. Obviously, in time spans as short as a few minutes, equilibration with the reservoir solution cannot occur. Nevertheless, it turned out to be critical for our experiments to initiate crystallization at a PEG concentration of 20% (before transferring the drop to lower concentrations). The reason for this is presumably a rather steep concentration gradient of PEG in the drop immediately after pipetting the reservoir solution onto the protein droplet. In the mixing zone at the interface between protein and precipitant solution, the PEG concentration will be very close to 20%, enabling the formation of crystal nuclei ('shock' nucleation). The locally high PEG concentration will subsequently decrease by diffusion, establishing conditions favorable for crystal growth, provided that the vapour pressure from the reservoir is sufficiently high to prevent rapid dehydration of the drop, thus avoiding further nucleation.

Preparation of a transition-state analogue inhibitor complex
In order to localize and investigate the active site of *MtCM, we performed soaks of the unliganded protein crystals in a solution containing an endo-oxabicyclic dicarboxylic acid (referred to here as 'TSA'; Bartlett & Johnson, 1985;Bartlett et al., 1988). This compound is known to be an excellent mimic of the geometry of the transition state of the chorismate mutase reaction and is currently the best inhibitor for a broad range of chorismate mutases (Mandal & Hilvert, 2003), including *MtCM (Sasso et al., 2005). Consequently, it has been used as a ligand in many structural studies of chorismate mutases (Chook et al., 1993(Chook et al., , 1994Haynes et al., 1994;Lee, Karplus et al., 1995;Strä ter et al., 1997).
Two different soaking techniques were tested. In order to minimize consumption of the precious compound, we first tried to dissolve a tiny amount of solid TSA directly in a drop containing *MtCM crystals. However, all crystals in the drop dissolved immediately. We then took 0.5 ml of this TSA-containing drop and added it to a 1.5 ml drop containing more *MtCM crystals. After 20 min soaking time, the crystals were cryoprotected by pipetting a drop of mother liquor containing 10% PEG 400 on top of the soak. Thereafter, the crystals were flash-frozen in liquid nitrogen and stored for data collection. Crystal structure analysis (Ö kvist et al., 2006) confirmed the presence of TSA in the active site of *MtCM.

X-ray diffraction analysis
X-ray data were collected and processed as reported by Ö kvist et al. (2006). *MtCM crystals diffracted to beyond 1.6 Å resolution (diffraction limit 1.3 Å resolution; Fig. 2 and Table 1) and belong to the monoclinic space group P2 1 , with unit-cell parameters a = 42.6, b = 72.6, c = 62.0 Å , = 104.5 for the free enzyme and a = 42.9, b = 72.8, c = 61.7 Å , = 104.0 for the TSA complex. The Matthews parameter V M is 2.9 Å 3 Da À1 (Matthews, 1968; calculated for a unit cell containing two protomers per asymmetric unit), which corresponds to a solvent content of 57%. The dimeric nature of *MtCM is consistent with previous biochemical data (Sasso et al., 2005;Prakash et al., 2005). Since sequence alignments (Sasso et al., 2005) suggested homology to the E. coli AroQ p domain, phasing was attempted by molecular replacement using the E. coli structure (PDB code 1ecm; Lee, Karplus et al., 1995) as a search model. However, even exhaustive trials did not yield a solution to the problem. A probable reason for the failure is the relatively poor overall sequence homology (16% identity, according to BLASTP v.2.2.6 from NCBI; Altschul et al., 1997) between the entire 167-residue *MtCM poly-peptide and the structurally characterized 109-residue E. coli AroQ p domain, together with significant structural differences of the Cterminal helices (Ö kvist et al., 2006).
We have therefore prepared heavy-atom derivatives (Pb II acetate and Tl III acetate soaks) and solved the structure by the MAD phasing technique based on the Pb II derivative (Ö kvist et al., 2006).

Concluding remarks
While crystallization of the same protein has been reported very recently by another group (Qamra et al., 2005), our results are distinct since the crystal forms are different (space group P2 1 instead of C2) and we have significantly higher resolution data ( 1.6 Å versus 2.1 Å ). Furthermore, we describe the crystallization of free enzyme and a soak with a transition-state analogue, while Qamra and coworkers cocrystallized *MtCM with l-tryptophan.
In addition, this report addresses in detail a general and very challenging crystallization problem and its subsequent solution, which led to the development of an empirical method for crystal optimization for cases with short crystallization times. By applying the optimized protocol, the diffraction limit of the *MtCM crystals could be extended from $3.5 to 1.3 Å . This technique, which involves the transfer of crystallization drops to lower concentration reservoirs (varying both reservoir concentration and incubation time before transfer), is particularly useful if the protein of interest crystallizes in a narrow window of conditions and only little protein material is available.
We would like to thank Rosalino Pulido for TSA synthesis and Elin Grahn for help with data collection. At the synchrotron beamlines ID14-4, ESRF, Grenoble and I711, Max II, Lund, we had excellent support from Raimond Ravelli and Yngve Cerenius, respectively. This work has been supported by grants from the Carl Trygger foundation (research grant No. 02:158 to UK and postdoctoral fellowship of RD) from the Glycoconjugates in Biological Systems program of the Swedish National Foundation for Strategic Research (research position of UK), from Novartis Pharma (to SS) and the ETH Zü rich (PK).  . 2); however, final data sets only include data to $1.6 Å for the following reasons: for unliganded *MtCM, the high-resolution data set was the third of three data sets collected from the same crystal (the preceding two data sets were collected at the MAD peak and edge) and therefore reflections at higher resolution were no longer of acceptable quality and strength. For the TSA complex, data collection was limited to 1.6 Å for technical reasons. ‡ R merge = P jI i À hIij= P I i .