Structure of 5-formyltetrahydrofolate cyclo-ligase from Bacillus anthracis (BA4489)

The structure of 5-formyltetrahydrofolate cyclo-ligase from B. anthracis determined by X-ray crystallography at a resolution of 1.6 Å is described.


Introduction
The Oxford Protein Production Facility (OPPF) was established to develop methods for high-throughput protein production and crystallization. As part of these developments, a pilot project was undertaken to study 48 proteins from Bacillus anthracis, focusing on protein families that are well conserved across a wide range of bacteria. The causative agent of anthrax, B. anthracis is a large Gram-positive spore-bearing bacterium. Sequencing of the genome of the B. anthracis Ames strain (Read et al., 2003) revealed two plasmids, pXO1 and pXO2, that carry the major virulence factors, as well as 5.23 megabases of normal chromosomal DNA predicted to code for about 5311 genes. The proteins chosen for this study (Au et al., 2006) are all encoded by the chromosomal DNA.
Folate, a water-soluble B vitamin discovered in the 1940s, is a required cofactor for cell growth in all organisms and is synthesized by plants, prokaryotes and yeast. It acts as a carrier of one-carbon groups and is necessary for the synthesis of DNA and RNA precursors as well as for methionine biosynthesis. In the cell, folic acid exists as a family of cofactors with different oxidation states [folate, dihydrofolate (DHF) and tetrahydrofolate (THF)]; the one-carbon units carried by these cofactors can also have different oxidation states (methanol to formate), which are enzymatically interconvertible. These different forms of THF donate or accept one-carbon units in a range of metabolic reactions (the so-called one-carbon metabolism), which include the de novo synthesis of purines and thymidylate (methylation of dUMP to give dTMP) and the remethylation of homocysteine to methionine [methionine can subsequently be adenylated to form S-adenosylmethionine (SAM), itself a cofactor and one-carbon donor for numerous other methylation reactions]. An overview of bacterial folate metabolism is given in Fig. 1(a).
5-Formyl-THF is a member of the folate cofactor family. However, it is not used directly as a one-carbon donor, but instead acts as a regulator of folate metabolism (Stover & Schirch, 1991; Bertrand & Jolivet, 1989). The enzyme 5-formyl-THF cyclo-ligase (also known as 5,10-methenyltetrahydrofolate synthetase; MTHFS) has a scavenger role as the only enzyme that acts on 5-formyl-THF, converting it to 5,10-methenyl-THF in an ATP- and Mg²⁺-dependent reaction (Fig. 1b). The product is then converted into other reduced folates involved in one-carbon metabolism (Chen et al., 2004). Since 5-formyl-THF acts as an inhibitor of other folate-dependent enzymes, 5-formyl-THF cyclo-ligase may also indirectly regulate several key metabolic processes, such as purine, pyrimidine and amino-acid biosynthesis (Stover & Schirch, 1991; Bertrand & Jolivet, 1989). Since all of these processes are required for cell growth and development, 5-formyl-THF cyclo-ligase has been suggested as a target for drug discovery (Jolivet et al., 1996).

Materials and methods
Cloning, expression and protein purification followed standard OPPF pipeline protocols, as described previously (Ren et al., 2005; Alzari et al., 2006). Briefly, the 5-formyl-THF cyclo-ligase gene (BA4489) was amplified from genomic DNA by PCR with the forward primer ggggacaagtttgtacaaaaaagcaggcttcgaaggagatagaaccatggcacatcaccaccaccatcacGTGAGAGAAGAGAAGCTACGTTTACGTAAAC and the reverse primer ggggaccactttgtacaagaaagctgggtctcaCTATACAAGCCCATTTTTAACCATTGTTCC, incorporating a ribosome-binding site and an N-terminal hexahistidine tag. The gene was then inserted into the expression vector pDEST14 using Gateway recombinatorial cloning (Invitrogen), resulting in an expressed protein in which the N-terminal methionine residue is replaced by an uncleavable purification tag MAHHHHHHV-. The expression vector was transformed into Escherichia coli (strain Rosetta pLysS) and expression was induced by the addition of 0.5 mM isopropyl β-D-thiogalactopyranoside (IPTG). The protein was purified by a combination of Ni-NTA affinity chromatography and gel filtration.
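As an illustrative check of the construct design, the forward primer can be translated in silico from its first ATG to confirm that it encodes the stated tag. The following is a minimal sketch and not part of the OPPF pipeline; the function name `translate_from_first_atg` and the constant names are hypothetical, the primer is written without typesetting hyphens, and the standard genetic code is assumed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Forward primer as quoted in the text (lower case: Gateway/RBS/tag region,
# upper case: gene-specific region).
FORWARD_PRIMER = (
    "ggggacaagtttgtacaaaaaagcaggcttcgaaggagatagaacc"
    "atggcacatcaccaccaccatcac"
    "GTGAGAGAAGAGAAGCTACGTTTACGTAAAC"
)


def translate_from_first_atg(dna: str) -> str:
    """Translate complete codons starting at the first ATG; stop at a stop codon."""
    seq = dna.upper()
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for pos in range(start, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate_from_first_atg(FORWARD_PRIMER))
```

The first nine residues of the translation should read MAHHHHHHV, matching the tag described above, with the remaining residues coming from the gene-specific part of the primer.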
The crystal of 5-formyl-THF cyclo-ligase grown in the presence of ATP and folinate diffracted well, with measurements extending to Bragg spacings of 1.5 Å. Diffraction data were recorded (at the ESRF synchrotron, beamline ID14-EH1) in two passes: a high-resolution pass of 362 images was initially recorded using 0.5° oscillations, 6 s exposures and a crystal-to-detector distance of 114 mm, followed by a low-resolution pass of 113 images using 2° oscillations, 2 s (the first 21 images) or 1 s exposures and a crystal-to-detector distance of 286 mm. However, processing these images using either DENZO or MOSFLM was initially problematic owing to scaling problems that we attribute to a high mosaic spread (Bahar et al., 2006). In DENZO, refining the mosaicity proved unstable and the most plausible processing was obtained using a fixed mosaicity of 1.5°, while with MOSFLM the refined mosaicity varied between 1.2 and 4°. As part of a crystallographic workshop, the diffraction data were reprocessed using the program XDS (Kabsch, 1993), which yielded an improved data set with a mosaicity of 0.6° (using the XDS definition; see Table 1 and Bahar et al., 2006 for more details). Eventually, the structure was solved using the program MOLREP (Vagin & Teplyakov, 1997) with the crystal structure of a homologue from B. subtilis (Yqgn) as a search model (PDB code 1ydm; ~40% sequence identity). Based on the molecular-replacement solution, the program ARP/wARP (Perrakis et al., 1999) was used to automatically build the structure and carry out ligand fitting. Manual rebuilding used Coot (Emsley & Cowtan, 2004), and REFMAC (Murshudov et al., 1999) was employed to perform TLS refinement (using each chain as a separate body) together with isotropic B-factor refinement and weak restraints between main-chain atoms of the two chains. Crystallographic statistics are shown in Table 1. The structure was analysed using PROCHECK (Laskowski et al., 1993) and MolProbity (Lovell et al., 2003).
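For orientation, the total rotation range and exposure time of the two data-collection passes follow directly from the image counts quoted above. The short sketch below is illustrative arithmetic only; it assumes that the oscillation widths are in degrees per image and that the images within each pass are contiguous, and the function names are hypothetical.

```python
def total_rotation_deg(n_images: int, oscillation_deg: float) -> float:
    """Total crystal rotation covered by a pass of contiguous images."""
    return n_images * oscillation_deg


def total_exposure_s(exposure_blocks: list[tuple[int, float]]) -> float:
    """Sum exposure time over (number_of_images, seconds_per_image) blocks."""
    return sum(n * seconds for n, seconds in exposure_blocks)


# High-resolution pass: 362 images, 0.5 degree oscillation, 6 s each.
high_res_rotation = total_rotation_deg(362, 0.5)
high_res_exposure = total_exposure_s([(362, 6.0)])

# Low-resolution pass: 113 images, 2 degree oscillation,
# 2 s for the first 21 images and 1 s for the remaining 92.
low_res_rotation = total_rotation_deg(113, 2.0)
low_res_exposure = total_exposure_s([(21, 2.0), (113 - 21, 1.0)])

if __name__ == "__main__":
    print(f"high-resolution pass: {high_res_rotation} deg, {high_res_exposure} s")
    print(f"low-resolution pass:  {low_res_rotation} deg, {low_res_exposure} s")
```

Under these assumptions the high-resolution pass covers 181° of rotation and the low-resolution pass 226°.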
Structure superpositions were calculated using SHP.

Figure 1
Folate metabolism and the role of 5-formyl-THF cyclo-ligase. (a) An overview of bacterial folate metabolism (modified from Zittoun & Zittoun, 1972). Both folate and the one-carbon group it carries can exist in different oxidation states, which are interconvertible. The different forms of folate play a role in several important metabolic pathways, including the biosynthesis of purines, methionine and thymidylate. (b) The reaction converting 5-formyl-THF (top) to 5,10-methenyl-THF (bottom) that is catalysed by 5-formyl-THF cyclo-ligase. The metabolically important one-carbon group carried by 5-formyl-THF is highlighted in grey. The catalytically important N5 and N10 positions are labelled.

Results and discussion
The structure of 5-formyl-THF cyclo-ligase (Fig. 2) forms a single domain with an α+β fold: a central layer of mixed (parallel and antiparallel) β-sheet flanked by helices on either side. This fold belongs to the NagB/RpiA/CoA transferase-like superfamily of enzymes (Murzin et al., 1995), the members of which are involved in a diverse range of metabolic processes but share the common property of phosphate binding.
The crystal structure comprises two protein chains in the crystallographic asymmetric unit. Both chain traces are continuous and include parts of the hexahistidine purification tag (the A chain extends from residue −4 to 189 and the B chain from −1 to 188). The conformations of the two chains are nearly identical [a root-mean-square deviation (r.m.s.d.) between Cα-atom positions of 0.39 Å for residues 1-188 in each chain], but the electron density for the A chain is of somewhat better quality and the discussion focuses on this chain.
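The r.m.s.d. quoted here is the square root of the mean squared distance between equivalent atoms of the two chains. The sketch below illustrates that definition for paired coordinates that are assumed to be already superposed; it does not perform the superposition itself, and the function name and the toy coordinates are hypothetical.

```python
import math

Coordinate = tuple[float, float, float]


def rmsd(coords_a: list[Coordinate], coords_b: list[Coordinate]) -> float:
    """Root-mean-square deviation between paired, already superposed atoms."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


if __name__ == "__main__":
    # Toy example with two atom pairs displaced by 0.3 and 0.4 Angstrom.
    chain_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    chain_b = [(0.0, 0.0, 0.3), (1.0, 0.4, 0.0)]
    print(f"r.m.s.d. = {rmsd(chain_a, chain_b):.3f} A")
```

In practice the inputs would be the Cα coordinates of equivalent residues from the two chains after an optimal superposition.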
It has been suggested that 5-formyl-THF cyclo-ligase forms dimers in solution and that this oligomeric state is related to the cooperative binding of its substrate 5-formyl-THF (Chen et al., 2005). While the structure presented here is, in principle, consistent with a dimeric protein, an analysis (using PISA; Krissinel & Henrick, 2005) suggests that no biologically significant interfaces are present in our structure. All the potential interface regions in the crystal consist exclusively of hydrogen bonds and salt bridges and the largest possible interface area comprises only 500 Å² (~5% of the total surface of the protein monomer), which is significantly less than for most biological dimers (Lesk, 2004). However, it is conceivable that dimerization occurs transiently in solution, especially at higher concentrations (Chen et al., 2005).
The structure of B. anthracis 5-formyl-THF cyclo-ligase was obtained by cocrystallization with folinate, ATP and Mg²⁺. The structure shows a pocket in the protein containing a bound nucleotide cofactor (Fig. 3a). Interestingly, the ATP molecule which was present in the crystallization experiment has been hydrolysed to ADP and inorganic phosphate. The high-resolution map shows clear, distinct electron density for these moieties, which are separated by a water molecule. The conformation of the ADP α- and β-phosphates is stabilized as part of an octahedral Mg²⁺ coordination shell (Fig. 3b). There is some weak ill-defined electron density in the region expected to be occupied by the substrate and/or product, but it is not of sufficient quality to allow it to be built. However, ATP/ADP only occupies a small part of the active-site pocket, adjacent to a wider cavity which could accommodate the pteridine ring of folinate. Furthermore, this cavity is lined by a number of hydrogen-bond donors and acceptors, as well as hydrophobic residues, a suitable environment for binding a heterocyclic aromatic ring (Fig. 3a).
Our findings support a catalytic mechanism in which 5-formyl-THF interacts directly with the γ-phosphate of the ATP cofactor and thereby allows the formation of the azoline ring in the substrate. Based on NMR-labelling studies, a catalytic mechanism has been proposed in which 5-formyl-THF becomes transiently phosphorylated by ATP to form a phospho-enol intermediate (Chen et al., 2005; Fig. 3c), which is entirely consistent with our observations. Based on this scheme, our crystal structure would represent the final stage of the catalytic mechanism, after the enzyme has turned over and released the product.

Figure 2
Two orthogonal views of the secondary structure of B. anthracis 5-formyl-THF cyclo-ligase (coloured from blue at the N-terminus to red at the C-terminus). Secondary-structural elements are labelled on the right-hand view. The ADP and phosphate cofactors are shown in a stick representation, with the Mg²⁺ ion shown as a grey sphere.
pneumoniae (PDB codes 1sbq, 1u3f and 1u3g; Chen et al., 2004, 2005) and from Bacillus subtilis (PDB code 1ydm; unpublished work). Sequence and structural alignments (Fig. 4) reveal only moderate sequence similarities to the B. anthracis enzyme (~25% and ~40% identity, respectively) but a high degree of structure conservation (r.m.s.d. for Cα-atom superpositions of ~1.1 Å for 150 and 160 residues, respectively). These findings underscore the critical metabolic function of the enzyme, for which there appears to be strongly conservative evolutionary pressure. The B. anthracis and B. subtilis sequences both contain a 20 amino-acid insertion compared with the M. pneumoniae homologue (residues 93-112 in B. anthracis numbering). Structurally, this insertion lines part of the substrate-binding site and, interestingly, it is positioned such that it precludes the substrate-binding mode postulated for the M. pneumoniae structure (PDB code 1u3g; Chen et al., 2005). However, we note that in the model the atoms forming the portion of the substrate that impinges on this insertion have been assigned occupancies of zero, suggesting uncertainty in the interpretation. This difference, along with other changes to the amino acids lining the site (particularly Thr50, Met53 and Tyr138; Fig. 4a), suggests that despite the structural similarity, there is potential for designing inhibitors of 5-formyl-THF cyclo-ligase that are specific to particular pathogen species.

Figure 3
The mechanism of catalysis by B. anthracis 5-formyl-THF cyclo-ligase. (a) A cut-through of the 5-formyl-THF cyclo-ligase structure showing the cofactor/substrate-binding pocket with bound ADP and phosphate. The enlarged view shows 2Fo − Fc electron density contoured at 1.5σ (green) for these moieties. Weak electron density (not shown) suggests that the substrate/product binds in the right-hand side of this pocket. (b) A ball-and-stick diagram showing the octahedral coordination of the Mg²⁺ ion by the α- and β-phosphates of ADP, residues Asp144 and Asp173 and two tightly bound water molecules. Interaction distances are shown in Å. This coordination helps orient the γ-phosphate of ATP correctly for attack on the 5-formyl-THF. The green lines show 2Fo − Fc electron density contoured at 2.2σ. (c) The proposed catalytic mechanism of B. anthracis 5-formyl-THF cyclo-ligase via a phospho-enol intermediate. Our structure represents the final stage of the process, after the product has dissociated from the binding pocket.
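The sequence identities quoted in the comparison with the homologous structures are percentages of identical residues over aligned positions. The sketch below shows one common way of computing such a figure from a pairwise alignment; it assumes that gapped columns are excluded from the denominator, which may differ from the convention used for the values in the text, and the function name and toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of identical residues over columns with no gap in either sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip insertions/deletions
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Toy alignment: five ungapped columns, four of them identical.
    print(percent_identity("MKV-LA", "MRVTLA"))
```

With this convention, a long insertion such as the 20-residue segment discussed above does not lower the identity score, since its columns are gapped in the shorter sequence.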