The nucleotide-binding site of Aquifex aeolicus LpxC

The structure of UDP-3-O-acyl-N-acetylglucosamine deacetylase (LpxC) in complex with UDP is reported. The complex allows for a description of how the enzyme recognizes and binds a nucleotide moiety and enables the construction of an LpxC-substrate model.


Introduction
A unique biochemical pathway in Gram-negative bacteria leads to the production of lipopolysaccharide (LPS), a major component of the outer membrane of these organisms. LPS contributes to bacterial resistance to antibiotics (Nikaido, 2003) and hyperstimulation of the host immune system (Cohen, 2002;Lopez-Bojorquez et al., 2004). It has three structural components: core polysaccharide, O-antigen and lipid A, a fatty-acid (FA) substituted phosphorylated glucosamine disaccharide. Lipid A is required for both bacterial viability (Raetz & Whitfield, 2002) and modulation of the host immune system (Galanos et al., 1985).
Several of the enzymes involved in the synthesis of lipid A are essential for the growth and viability of Gram-negative bacteria (Raetz & Whitfield, 2002) and represent targets for chemotherapy. One such enzyme, UDP-3-O-acyl-N-acetylglucosamine deacetylase (LpxC), catalyzes the first committed step in the synthesis of lipid A (Raetz & Whitfield, 2002). Inhibition of LpxC in vivo has led to successful treatment of infection in mice (Kline et al., 2002;Onishi et al., 1996). In both Escherichia coli and Aquifex aeolicus, LpxC deacetylates UDP-(3-O-hydroxymyristoyl)-N-acetylglucosamine (Jackman et al., 2001;Young et al., 1995). LpxC is a metalloenzyme that uses Zn 2+ to activate a nucleophilic attack by water on the amide of the N-acetylglucosamine (GlcNAc) moiety of the substrate ( Fig. 1; Jackman et al., 1999); like other bacterial metalloamidases, a second inhibitory zinc-binding site is present (Jackman et al., 1999;Whittington et al., 2003). LpxC presents an + fold with no known mammalian homologues (Whittington et al., 2003). This uniqueness, its essentiality and the previous success in enzyme inhibition make LpxC an attractive target for structure-based drug design.
The elements required by LpxC to bind and catalyze substrate include a zinc-binding motif, UDP-binding and GlcNAc-binding sites and a hydrophobic cavity to bind FA. Structural and biochemical studies have identified HKX(L,F)D as the zinc-binding motif in LpxC (Jackman et al., 2001;Whittington et al., 2003) and several inhibitors containing hydroxamate groups exploit the presence of the divalent cation. Crystallographic studies of LpxC have elucidated a hydro-phobic FA-binding cavity that stabilizes the structure upon binding FA (Hernick et al., 2005;Whittington et al., 2003). The presence of a longer FA moiety on the GlcNAc results in a higher binding affinity to LpxC and is required for efficient catalysis (Jackman et al., 1999(Jackman et al., , 2000Whittington et al., 2003). The inhibitor 1,5-anhydro-2-C-(carboxymethyl-N-hydroxyamide)-2-deoxy-3-O-myristoyl-d-glucitol (TU-514; Fig. 1) is a substrate-analogue inhibitor that has been used to characterize the GlcNAc-binding site of LpxC (Coggins et al., 2003(Coggins et al., , 2005Gennadios et al., 2006). TU-514 exploits the zinc-coordination and FA-binding and GlcNAc-binding sites of LpxC and the result is an inhibitor with nanomolar affinity for the enzyme (Jackman et al., 2000). Inhibitor design would benefit from including UDP-recognition elements, but presently this binding site remains uncharacterized. Here, we present a crystallographic analysis of a complex of A. aeolicus LpxC with UDP that allows us to describe the molecular recognition of UDP in the nucleotide-binding site of the enzyme.

Sample preparation
A construct identical to that previously described by Whittington et al. (2003) was used. This produces a truncated version of the enzyme which also contains a cysteine-to-alanine mutation that has been found to reduce aggregation. Briefly, lpxC was amplified from genomic DNA of A. aeolicus (strain VF5) using the primers CATATGGGATTAGAAAAGACGGTTAAAGAG and GGATCC-TTATCAACGAGTAAGTTTTTGTTTCTTTG, which contained an N-terminal NdeI and a C-terminal BamHI restriction site, respectively, as indicated in bold. The PCR product, which encodes residues 1-271 of the protein, was ligated into pCR-Blunt II-TOPO using the Zero Blunt TOPO Cloning Kit (Invitrogen). The lpxC Á272-282 insert was subsequently ligated into NdeI-and BamHI-digested pET21a (Novagen). The C181A mutation was inserted using divergent PCR techniques (Higuchi et al., 1988). The integrity of this construct, pET21a-lpxC C181A Á272-282, was verified by DNA sequencing.
The plasmid containing modified lpxC was transformed into Rosetta(DE3) pLysS (Novagen) for expression. A single colony was used to inoculate 10 ml of enriched media (Nair et al., 1991) containing carbenicillin (50 mg l À1 ) and chloramphenicol (12 mg l À1 ) and was grown overnight at 310 K. The overnight culture was diluted (1:100) into 1 l fresh medium and grown at 310 K until the A 600 reached 0.6. Expression was induced at 298 K with 1 mM isopropyl -d-thiogalactopyranoside and the medium was supplemented with 0.1 mM ZnSO 4 at this point. The cells were harvested approximately 16 h later by centrifugation (Beckman JS-4.2, 3480g at 277 K for 60 min) and stored at 253 K.
The cell pellet was resuspended in 20 ml 20 mM NaCl, 20 mM Tris pH 8 and 50 mM EDTA and lysed using a constant cell disrupter (One Shot model, Constant Systems Ltd). The soluble fraction was separated by centrifugation (Beckman JA-25.50, 48 400g at 277 K for 30 min), passed through a 0.2 mm filter and loaded onto a 5 ml HiTrap Q FF column (GE Healthcare). A linear gradient from 20 to 500 mM NaCl was used to elute the protein. The fractions containing LpxC were pooled, treated with 20 mM EDTA pH 8 and dialyzed into 50 mM NaCl, 20 mM Tris pH 7.7 overnight. The sample was applied onto a 5 ml HisTrap HP column (GE Healthcare) charged with ZnSO 4 and the protein was eluted using a combination of step and linear gradients from 15 to 500 mM imidazole. Fractions containing LpxC were pooled and further purified by gel-filtration chromatography (Superdex 75, GE Healthcare) in 150 mM NaCl, 25 mM HEPES pH 7.5 and 0.5 mM ZnSO 4 . 10 mM magnesium acetate was added to the sample, which was subsequently concentrated to 184 mM as determined using a theoretical molar extinction coefficient of 22 900 M À1 cm À1 . Purity was ascertained by SDS-PAGE and matrixassisted laser desorption ionization-time-of-flight mass spectrometry. The experimentally determined weight of the protein was 30 811.6 Da, which agrees with the predicted weight of the protein in the absence of the initial methionine (30 810.3 Da).

Crystallization and data collection
Crystals of LpxC were grown by hanging-drop vapour diffusion at 293 K using 1 ml protein solution (184 mM) and 1 ml of a reservoir  solution containing 8% PEG 6000, 100 mM Tris pH 8.5 and 300 mM NaCl. Crystals were harvested and subsequently soaked for 24 h in a pre-equilibrated vapour-diffusion drop initially composed of 0.5 ml 500 mM UDP-N-acetylglucosamine (UDP-GlcNAc) in gel-filtration buffer and 0.5 ml reservoir solution containing 17% PEG 6000, 100 mM Tris pH 8.5 and 150 mM NaCl. The crystals were flash-cooled to $100 K (Oxford Cryosystems) in a solution consisting of 17% PEG 6000, 100 mM Tris pH 8.5, 150 mM NaCl and 20% glycerol. Data were collected at the European Synchrotron Radiation Facility on beamline ID23-2 using a MAR CCD 225 detector. A total of 80 of data were collected in 1 oscillations at a wavelength of 0.873 Å using an exposure time of 2 s. The data were integrated using MOSFLM (Leslie, 1992) as implemented in CCP4 (Collaborative Computational Project, Number 4, 1994) and then scaled and merged using SCALA (Evans, 1997). Significant radiation damage was evident on this undulator beamline and only the first 60 images were used. Statistics are given in Table 1.

Structural determination and refinement
The crystal is isomorphous with previously reported structures (Gennadios et al., 2006;Whittington et al., 2003) and contains two molecules per asymmetric unit. A single molecule of LpxC (PDB code 1p42) was used in molecular-replacement calculations (MOLREP; Vagin & Teplyakov, 2000) and two initial solutions were identified. These molecules will be referred to as chains A and B. Refinement involved rounds of electron-density and difference density map inspection and model building using the program Coot (Emsley & Cowtan, 2004) interspersed with restrained maximumlikelihood refinement (REFMAC5; Murshudov et al., 1997). The placement of ligand and water molecules completed the refinement. Noncrystallographic symmetry restraints were not imposed during refinement. The N-terminal methionine is absent and three residues at the C-terminus are disordered in both monomers. Statistics for the refined structure are presented in Table 1. An overlay of C positions for residues 2-266 of our model with the starting model gave an r.m.s.d. of 0.19 Å for chain A and 0.24 Å for chain B.

Modelling the substrate UDP-(3-O-hydroxymyristoyl)-N-acetylglucosamine
The crystal structure of LpxC with bound inhibitor TU-514 (PDB code 2go4) was overlaid on the structure of LpxC with bound UDP. The substrate was placed at the active site and manipulated using Coot and O (Jones et al., 1991) such that the uracil and pyranose moieties of the substrate overlaid with those of UDP and TU-514, respectively. The geometry of the model was idealized in REFMAC5 using molecular restraints obtained from PRODRG (Schü ttelkopf & van Aalten, 2004).

Results and discussion
3.1. Overall structure A detailed description of the LpxC structure has been published (Whittington et al., 2003). In brief, LpxC displays an + fold containing two domains, each consisting of a five-stranded -sheet and two -helices. These domains are sandwiched together with both sets of helices between the sheets (Fig. 2). The active site is located between the domains at the -sheet interface. The enzyme has been crystallized in the zinc-inhibited form, with two Zn 2+ ions present in the active site. A third Zn 2 is located between -helix C, which flanks the active site, and the loop joining the small domain to the larger -sheet and is coordinated by residues His58 and His188, a chloride and a water molecule (not shown). A fourth Zn 2+ is coordinated by Gly2 and Glu120 of chain A and by Glu90 and His29 of a symmetry-related chain B (not shown). This is a crystallization artefact rather than an indication of oligomerization, as gel-filtration experiments clearly show the protein to be a monomer (data not shown). Electron density was observed in the FA-binding cavity of both chains and was modelled as myristic acid. In chain A the FA directly binds the inhibitory Zn 2+ , while in chain B the carboxylic acid head extends away from the active site and is disordered over two   (Cruickshank, 1999).

Figure 2
Ribbon diagram of the LpxC-UDP complex. Domains I and II are shown in green and wheat, respectively. Secondary structure is labelled according to previous assignments (Whittington et al., 2003). UDP is depicted as spheres and myristic acid as sticks. positions (not shown). The FA was not added to the crystallization solution and is likely to have been acquired from the expression system. Similar FA binding has been observed in other LpxC structures (Gennadios et al., 2006;Whittington et al., 2003).

Nucleotide binding
Our current study, the crystal structure of the zinc-inhibited form of LpxC in complex with UDP, provides structural details concerning nucleotide binding for the first time. The nucleotide-binding site is located between strand 3 0 and helix A 0 of domain II (Fig. 2). The uracil moiety is well defined in the electron density, with the ribose and phosphates less so (Fig. 3). This clearly correlates with the average B factor observed for each moiety: 58.9, 65.6 and 76.0 Å 2 for uracil, ribose and diphosphate, respectively. We were unable to model the GlcNAc portion of the UDP-GlcNAc. This segment of the added ligand may be disordered or missing owing to hydrolysis. The presence of the inhibitory Zn 2+ at the entrance to the active-site cavity (Fig. 2) may preclude GlcNAc binding, leading to nonspecific association and disorder. Interestingly, the use of identical soaking conditions with UDP rather than UDP-GlcNAc resulted in crystal dissolution.
The uracil-binding pocket is mainly formed by residues contributed from 4 0 , the 8 0 --B 0 turn and one face of -A 0 (Figs. 2, 3 and 4). The uracil is 10.5 Å from the catalytic Zn 2+ (measured from O2) and is placed in a hydrophobic patch where it forms a -bond with Phe155. This is a highly conserved amino acid; in 241 of 248 assigned LpxC sequences found in a range of databases this position is an aromatic residue (Phe/Tyr/Trp). In the remaining seven sequences, derived from Burkholderia species, a leucine occupies this position. Phe155 is part of a hydrophobic patch that, along with Phe180 and Phe182, is recognized as an affinity determinant for binding to LpxC (Coggins et al., 2005). Both uracil N3 and O2 are within hydrogenbonding distance of the side-chain hydroxyl of Tyr151, which forms part of the / interface of the protein. In alignments of LpxC across 248 species, approximately 50% of all LpxC sequences have phenylalanine, 25% have isoleucine, leucine or valine and 15% have tyrosine at this position. This observation emphasizes the importance of a hydrophobic side chain in defining this part of the uracil-binding pocket. It is possible that in the absence of a tyrosine hydroxyl group a solvent could occupy this site and mediate interactions between the base and the enzyme.
Additional hydrogen-bonding interactions hold the nucleotide in the binding site. O4 accepts a hydrogen bond from the main-chain amide of Glu154; this side chain, which is exposed on the surface of the protein, does not appear to be important in organizing the binding site despite a high level of conservation as either aspartate or glutamate. The uracil O2 accepts a hydrogen bond donated from Lys227 NZ. This residue is conserved in LpxC as either lysine or arginine, where a similar hydrogen-bonding interaction is plausible. Mutagenesis studies  show Lys227 to be critical for enzyme activity since a K227A mutant retains only 0.05% activity of the wild-type LpxC. In the structure with the inhibitor TU-514, a water-mediated hydrogen bond links the C-6 hydroxyl of the inhibitor to the side chain of this lysine (Gennadios et al., 2006).
The ribose and phosphates are directed away from the active site in a position that is inconsistent with the substrate orientation required for catalysis. LpxC from A. aeolicus does not deacetylate UDP-GlcNAc (Whittington et al., 2003) and poor penetration of this molecule into the active site of the zinc-inhibited form is likely to be expected. The ribose interacts with the side chains of Arg137 and Glu185. This is a well conserved residue pairing in that a basic lysine is often observed at the corresponding arginine position and the glutamate is retained or replaced by aspartate or glutamine. The -phosphate interacts with the basic side chains of Arg250 and Lys156 (Fig. 3). The arginine is highly conserved, but not the lysine. The current positions of the ribose and phosphates and interactions with the enzyme may represent a configuration adopted by substrate as it binds in advance of catalysis or alternatively by product as it vacates the catalytic centre. UDP-binding site and the omit difference electron-density map. The map has coefficients |F o À F c |, c and is contoured at 3.5. F o represents the observed structure factors, F c the calculated structure factors and c the calculated phases from which ligand contributions were omitted. Protein residues within 4 Å of UDP are labelled and putative hydrogen bonds (within 3.6 Å ) are shown as black dashed lines. Atoms are coloured as described in Fig. 2, with the C atoms of the protein coloured salmon.

Figure 4
A model for substrate binding in the active site. LpxC is depicted as a van der Waals surface. The experimental models of UDP and TU-514, derived from crystal structures, provide the template to model the substrate. The ligands are shown as sticks. The experimentally derived C atoms are coloured black and the modelled substrate C atoms cyan. Conserved residues in the LpxC active site (at a level of >60%) are coloured by type: histidines green, lysine blue, aspartate red, threonine slate-blue, aromatic phenylalanines salmon, glycine and leucine magenta. The catalytic Zn 2+ and a water molecule implicated in the mechanism are shown as grey and marine spheres, respectively, and ligand atoms are depicted as in Figs. 2 and 3.

Implications for substrate binding
We propose that the nucleotide-binding site observed in this study represents the position that uracil adopts during substrate recognition and subsequently catalysis. Binding of the base involves highly conserved residues, which when altered influence both binding and enzyme activity (Coggins et al., 2005;McClerren et al., 2005). There is a notable absence of conserved hydrogen bonds between the hexose of the substrate analogue TU-514 and LpxC (Gennadios et al., 2006). Isooxazoline-based inhibitors carrying carboxylate or phosphonate head groups bind more tightly to E. coli LpxC if they contain an aliphatic substituent (IC 50 values of 4 mM with and >450 mM without; Pirrung et al., 2002), suggesting that hexose-protein interactions do not contribute significantly to substrate binding. Therefore, interactions between LpxC and the UDP and FA moieties of the substrate are likely to contribute the most to ordering of the substrate during catalysis. Using our experimentally derived uracil-binding site and a GlcNAc-binding site based on the position of the pyranose in the crystallographic model of LpxC with TU-514, a substrate model can be constructed by varying the torsion angles between the uracil and the ribose, in particular the glycosidic link between the base and ribose, and the ribose and the phosphates while maintaining binding of the FA within the hydrophobic cavity (Fig. 4). Note that the inhibitory Zn 2+ was removed from the structural template prior to modelling because it prevents access to the catalytic Zn 2+ . The r.m.s.d. of atoms from uracil common to those of the substrate is 0.03 Å and the r.m.s.d. of the inhibitor pyranose atoms to those of the substrate is 0.08 Å . The model places the functional groups in chemically plausible positions for catalysis, supporting the conclusion that the uracil placement we observe is biologically relevant. Our structure now completes the picture of how LpxC recognizes substrate and imparts information to support a structure-based approach to inhibitor development.