Received 20 May 2010
The structure of Haemophilus influenzae prephenate dehydrogenase suggests unique features of bifunctional TyrA enzymes
Hsiu-Ju Chiu,a,b Polat Abdubek,b,c Tamara Astakhova,b,d Herbert L. Axelrod,a,b Dennis Carlton,b,e Thomas Clayton,b,e Debanu Das,a,b Marc C. Deller,b,e Lian Duan,b,d Julie Feuerhelm,b,c Joanna C. Grant,b,c Anna Grzechnik,b,e Gye Won Han,b,e Lukasz Jaroszewski,b,d,f Kevin K. Jin,a,b Heath E. Klock,b,c Mark W. Knuth,b,c Piotr Kozbial,b,f S. Sri Krishna,b,d,f Abhinav Kumar,a,b David Marciano,b,e Daniel McMullan,b,c Mitchell D. Miller,a,b Andrew T. Morse,b,d Edward Nigoghossian,b,c Linda Okach,b,c Ron Reyes,a,b Henry J. Tien,b,e Christine B. Trame,a,b Henry van den Bedem,a,b Dana Weekes,b,f Qingping Xu,a,b Keith O. Hodgson,b,g John Wooley,b,d Marc-André Elsliger,b,e Ashley M. Deacon,a,b Adam Godzik,b,d,f Scott A. Lesleyb,c,e and Ian A. Wilsonb,e*
aStanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA,bJoint Center for Structural Genomics, http://www.jcsg.org , USA,cProtein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA,dCenter for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA,eDepartment of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA,fProgram on Bioinformatics and Systems Biology, Sanford-Burnham Medical Research Institute, La Jolla, CA, USA, and gPhoton Science, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Chorismate mutase/prephenate dehydrogenase from Haemophilus influenzae Rd KW20 is a bifunctional enzyme that catalyzes the rearrangement of chorismate to prephenate and the NAD(P)+-dependent oxidative decarboxylation of prephenate to 4-hydroxyphenylpyruvate in tyrosine biosynthesis. The crystal structure of the prephenate dehydrogenase component (HinfPDH) of the TyrA protein from H. influenzae Rd KW20 in complex with the inhibitor tyrosine and cofactor NAD+ has been determined to 2.0 Å resolution. HinfPDH is a dimeric enzyme, with each monomer consisting of an N-terminal / dinucleotide-binding domain and a C-terminal -helical dimerization domain. The structure reveals key active-site residues at the domain interface, including His200, Arg297 and Ser179 that are involved in catalysis and/or ligand binding and are highly conserved in TyrA proteins from all three kingdoms of life. Tyrosine is bound directly at the catalytic site, suggesting that it is a competitive inhibitor of HinfPDH. Comparisons with its structural homologues reveal important differences around the active site, including the absence of an - motif in HinfPDH that is present in other TyrA proteins, such as Synechocystis sp. arogenate dehydrogenase. Residues from this motif are involved in discrimination between NADP+ and NAD+. The loop between 5 and 6 in the N-terminal domain is much shorter in HinfPDH and an extra helix is present at the C-terminus. Furthermore, HinfPDH adopts a more closed conformation compared with TyrA proteins that do not have tyrosine bound. This conformational change brings the substrate, cofactor and active-site residues into close proximity for catalysis. An ionic network consisting of Arg297 (a key residue for tyrosine binding), a water molecule, Asp206 (from the loop between 5 and 6) and Arg365' (from the additional C-terminal helix of the adjacent monomer) is observed that might be involved in gating the active site.
The TyrA protein family comprises dehydrogenases that are dedicated to L-tyrosine biosynthesis. These dehydrogenases can be classified into three groups according to their substrate specificity. Prephenate dehydrogenases (TyrAp or PDH) only use prephenate as a substrate, arogenate dehydrogenases (TyrAa of ADH) only accept arogenate and cyclohexadienyl dehydrogenases (TyrAc or CDH) use either prephenate or arogenate. The TyrA dehydrogenases convert prephenate to tyrosine through two different routes (Fig. 1). In the prephenate pathway, PDH enzymes catalyze the NAD(P)+-dependent oxidative decarboxylation of prephenate to 4-hydroxyphenylpyruvate (HPP), which is then converted to L-tyrosine by an aminotransferase. In the arogenate route, prephenate is first transaminated to L-arogenate by prephenate aminotransferase. L-Arogenate is then decarboxylated and converted to L-tyrosine by ADH. In addition to diverse substrate specificity, the TyrA family also exhibits diversity with respect to its cofactor specificity. TyrA proteins may be specific for NAD+ or NADP+ or may use both. TyrA proteins exist as either monofunctional or bifunctional proteins. The common fusion partners of TyrA proteins include chorismate mutase (aroQ; Calhoun et al., 2001), 3-phosphoshikimate 1-carboxyvinyltransferase (aroF; Beller et al., 2006) and an ACT (aspartate kinase-chorismate mutase-TyrA) regulatory domain (Chipman & Shaanan, 2001; Grant, 2006).
| || Figure 1 |
Tyrosine biosynthetic pathway. The figure is modified from The Enzyme Database (http://www.enzyme-database.org ).
TyrAp is mostly present in low-GC Gram-positive organisms, such as Bacillus subtilis. TyrAa is abundant in higher plants and in at least three bacterial lineages, cyanobacteria, actinomycetes and Nitrosomonas europaea, whereas TyrAc is found in most bacteria. An analysis of the phylogenetic relationship of TyrA enzymes identified a distinct subgroup within the TyrAc group, denoted here as TyrAc_ (Song et al., 2005). When the primary sequences of these TyrAc_ proteins are aligned with those of other TyrA groups, it is immediately apparent that the TyrAc_ proteins contain a number of deletions within the catalytic core region and possess a second functional domain, which classifies them as bifunctional enzymes. Biochemical studies have shown that this subgroup displays narrower substrate and cofactor specificity compared with the parent TyrAc enzymes. The TyrA enzymes from Escherichia coli and Klebsiella pneumoniae are the two best characterized TyrAc_ enzymes and both prefer prephenate over arogenate by more than one order of magnitude and only use NAD+ as cofactor (Ahmad & Jensen, 1987; Turnbull et al., 1990). The studies further suggested that the TyrAc family, with its broad substrate specificity, represents the ancestral enzymes from which the TyrAc_, TyrAa and TyrAp enzymes have evolved to exhibit a narrower range of substrate specificity (Song et al., 2005).
The regulation of TyrA activity is important as prephenate is a common precursor in the biosynthesis of tyrosine and phenylalanine. TyrA enzymes are regulated by various mechanisms, including feedback inhibition and gene regulation by the Tyr operon (Cobbett & Delbridge, 1987). Kinetic studies of chorismate mutase/prephenate dehydrogenase (CM/PDH) from E. coli have led to the proposal of two different types of mechanism for tyrosine inhibition. Christopherson (1985) concluded that tyrosine acts as a competitive inhibitor in the dehydrogenase reaction, whereas Turnbull, Morrison et al. (1991) suggested that tyrosine binds at a distinct allosteric site. B. subtilis PDH is inhibited competitively by tyrosine and noncompetitively by tryptophan and HPP (Champney & Jensen, 1970). B. subtilis PDH has a C-terminal fusion of an ACT regulatory domain. The ACT domain was first identified in 1995 and is a small-molecule binding domain that is found in enzymes involved in amino-acid metabolism and transcription regulation. Small-molecule binding to the ACT domain is thought to control the enzyme activity through allosteric regulation. Thus, the noncompetitive inhibition by tryptophan and HPP in B. subtilis PDH might be a consequence of the presence of the ACT domain. Crystal structures of PDH enzymes from our study and from Aquifex aeolicus (Sun et al., 2009) both revealed bound tyrosine, but only at the active site, which supports the role of tyrosine as a competitive inhibitor. The source of tyrosine in these two structures originated from protein expression and cocrystallization, respectively. Not all TyrA enzymes are inhibited by tyrosine; Synechocystis ADH, for example, is completely insensitive to competitive inhibition by tyrosine (Legrand et al., 2006). Studies on A. aeolicus PDH (Sun et al., 2009) showed that His217 is critical for the inhibitory effect of tyrosine where a His217Ala mutation completely abolished the inhibitory effect of tyrosine. Studies on E. coli TyrA reported similar results, in which a His257Ala mutation (His257 is equivalent to His217 in A. aeolicus PDH) abolished inhibition by tyrosine (Christendat et al., 1998). Comparison of the crystal structure of Synechocystis ADH with that of A. aeolicus PDH revealed that Val182 is present in this location, which could account for the loss of tyrosine inhibition.
Studies of the enzymatic mechanisms of TyrA enzymes have revealed some variations in the reaction pathway. The kinetic data for Synechocystis ADH suggested a sequential substrate-binding event in which arogenate first binds to the protein, followed by the cofactor (Beller et al., 2006), whereas kinetic studies on E. coli and Arabidopsis thaliana suggested a random addition of NAD+ and prephenate (Sampathkumar & Morrison, 1982; Rippert & Matringe, 2002). Kinetic studies using 13C-labeled substrates suggested a mechanism in which decarboxylation and proton transfer occur in a concerted manner (Hermes et al., 1984). Kinetic and mutagenesis studies of several systems have identified key active-site residues (Christendat et al., 1998; Christendat & Turnbull, 1999). Conserved histidine, arginine and serine residues are critical for enzyme activity. For instance, in E. coli TyrA His197 has been proposed to facilitate hydride transfer from prephenate to NAD+ by polarizing the 4-OH group of prephenate, whereas Arg294 is critical for substrate binding. A His197Ala mutation decreased the dehydrogenase activity significantly and an Arg294Gln mutation greatly increased the Km, but did not affect the kcat. In A. aeolicus PDH, Ser126 is hydrogen bonded to the 4'-OH of tyrosine and to NAD+ and could participate in both catalysis and ligand binding. The Ser126Ala mutation reduced kcat 15-fold and increased Km tenfold.
The TyrA gene from Haemophilus influenza Rd KW20 encodes a bifunctional enzyme, chorismate mutase (EC 126.96.36.199)/prephenate dehydrogenase (EC 188.8.131.52) (CM/PDH), with a molecular weight of 43 kDa (residues 1-377) and a calculated isoelectric point of 5.56. The crystal structure of the prephenate dehydrogenase component (HinfPDH; residues 81-377) of this TyrA enzyme was determined in complex with tyrosine and NAD+ at 2.0 Å resolution and represents the first PDH structure from a bifunctional TyrA enzyme. The structure was determined using the semi-automated high-throughput pipeline of the Joint Center for Structural Genomics (JCSG; Lesley et al., 2002) as part of the National Institute of General Medical Sciences' Protein Structure Initiative (PSI; http://www.nigms.nih.gov/Initiatives/PSI/ ).
Clones were generated using the Polymerase Incomplete Primer Extension (PIPE) cloning method (Klock et al., 2008). The gene encoding HinfPDH (GenBank AAC22939, gi|1574749; Swiss-Prot TYRA) was amplified by polymerase chain reaction (PCR) from H. influenzae Rd KW20 genomic DNA using PfuTurbo DNA polymerase (Stratagene) and I-PIPE (Insert) primers (forward primer, 5'-ctgtacttccagggcATGCGTGAATCCTATGCCAATGAAAACC-3'; reverse primer, 5'-aattaagtcgcgttaGCATAAAACGGCGTAGAACATCTTCAAT-3'; target sequence in upper case) that included sequences for the predicted 5' and 3' ends. The expression vector pSpeedET, which encodes an amino-terminal tobacco etch virus (TEV) protease-cleavable expression and purification tag (MGSDKIHHHHHHENLYFQ/G), was PCR-amplified with V-PIPE (Vector) primers. The V-PIPE and I-PIPE PCR products were mixed to anneal the amplified DNA fragments together. E. coli GeneHogs (Invitrogen) competent cells were transformed with the V-PIPE/I-PIPE mixture and dispensed onto selective LB-agar plates. The cloning junctions were confirmed by DNA sequencing. Using the PIPE method, the part of the gene encoding residues Met1-Phe80 was excluded from the final construct. Expression was performed in selenomethionine-containing medium with suppression of normal methionine synthesis (Van Duyne et al., 1993). At the end of fermentation, lysozyme was added to the culture to a final concentration of 250 µg ml-1 and the cells were harvested and frozen. After one freeze-thaw cycle, the cells were sonicated in lysis buffer [50 mM HEPES pH 8.0, 50 mM NaCl, 10 mM imidazole, 1 mM tris(2-carboxyethyl)phosphine-HCl (TCEP)] and the lysate was clarified by centrifugation at 32 500g for 30 min. The soluble fraction was passed over nickel-chelating resin (GE Healthcare) pre-equilibrated with lysis buffer, the resin was washed with wash buffer [50 mM HEPES pH 8.0, 300 mM NaCl, 40 mM imidazole, 10%(v/v) glycerol, 1 mM TCEP] and the protein was eluted with elution buffer [20 mM HEPES pH 8.0, 300 mM imidazole, 10%(v/v) glycerol, 1 mM TCEP]. The eluate was buffer-exchanged with TEV buffer (20 mM HEPES pH 8.0, 200 mM NaCl, 40 mM imidazole, 1 mM TCEP) using a PD-10 column (GE Healthcare) and incubated with 1 mg TEV protease per 15 mg of eluted protein. The protease-treated eluate was run over nickel-chelating resin (GE Healthcare) pre-equilibrated with HEPES crystallization buffer (20 mM HEPES pH 8.0, 200 mM NaCl, 40 mM imidazole, 1 mM TCEP) and the resin was washed with the same buffer. The flowthrough and wash fractions were combined and concentrated to 19.6 mg ml-1 by centrifugal ultrafiltration (Millipore) for crystallization trials. HinfPDH was crystallized using the nanodroplet vapor-diffusion method (Santarsiero et al., 2002) with standard JCSG crystallization protocols (Lesley et al., 2002). Sitting drops composed of 200 nl protein mixed with 200 nl crystallization solution were equilibrated against a 50 µl reservoir at 293 K for 28 d prior to harvest. The crystallization reagent was composed of 0.04 M potassium dihydrogen phosphate, 20.0%(v/v) glycerol and 16.0%(w/v) PEG 8000. A rod-shaped crystal of approximate dimensions 0.1 × 0.05 × 0.05 mm was harvested for data collection. No additional cryoprotectant was added to the crystal. Initial diffraction screening was carried out using the Stanford Automated Mounting system (SAM; Cohen et al., 2002) at the Stanford Synchrotron Radiation Lightsource (SSRL, Menlo Park, California, USA). The crystal was indexed in the tetragonal space group P41212. The oligomeric state of HinfPDH was determined to be a dimer by gel filtration using a 0.8 × 30 cm Shodex Protein KW-803 column (Thomson Instruments) equilibrated in 20 mM Tris, 200 mM NaCl, 0.5 mM TCEP pH 7.5 and pre-calibrated with gel-filtration standards (Bio-Rad). Protein concentrations were determined using the Coomassie Plus assay (Pierce).
Multiple-wavelength anomalous diffraction (MAD) data were collected on beamline 11-1 at the SSRL using a 0.1 × 0.1 mm X-ray beam at wavelengths corresponding to the inflection (1) and remote (2) wavelengths of a selenium MAD experiment. The data sets were collected at 100 K using a Rayonix MAR Mosaic MX-325 CCD detector. The MAD data were integrated and reduced using XDS and scaled with the program XSCALE (Kabsch, 1993, 2010a,b). An initial substructure solution was obtained with SHELXD (Sheldrick, 2008) and the phases were refined using autoSHARP (Vonrhein et al., 2007), which gave a mean figure of merit of 0.48 with 14 selenium sites. Automated model building was performed with ARP/wARP (Cohen et al., 2004). Model completion and refinement were performed with Coot (Emsley & Cowtan, 2004) and REFMAC 5.2 (Winn et al., 2003) using the high-energy remote (2) data set. The refinement included phase restraints from SHARP and TLS refinement with three TLS groups per chain. CCP4 programs were used for data conversion and other calculations (Collaborative Computational Project, Number 4, 1994).
During structure refinement, additional electron density was found at the active site. The density was well defined and could be unambiguously assigned to a tyrosine molecule and an NAD+ cofactor. As these molecules were not added during the crystallization experiment, they must have been acquired during protein expression and have remained bound during purification and crystallization. Data-collection and refinement statistics are summarized in Table 1.
+Rmeas = (Diederichs & Karplus, 1997).
§Rcryst = , where Fcalc and Fobs are the calculated and observed structure-factor amplitudes, respectively.
##Rfree is the same as Rcryst but for 5.1% of the total reflections that were chosen at random and omitted from refinement.
++This value represents the total B that includes TLS and residual B components.
§§The estimated overall coordinate error (Collaborative Computational Project, Number 4, 1994; Cruickshank, 1999).
The quality of the crystal structure was analyzed using the JCSG Quality Control server (http://smb.slac.stanford.edu/jcsg/QC ). This server verifies the stereochemical quality of the model using AutoDepInputTool (Yang et al., 2004), MolProbity (Chen et al., 2010) and WHATIF v.5.0 (Vriend, 1990), the agreement between the atomic model and the data using SFCHECK v.4.0 (Vaguine et al., 1999) and RESOLVE (Terwilliger, 2004), the protein sequence using ClustalW (Thompson et al., 1994), the atom occupancies using MOLEMAN2 (Kleywegt, 2000) and the consistency of NCS pairs. Protein quaternary-structure analysis was conducted using the PISA server (http://www.ebi.ac.uk/msd-srv/prot_int/pistart.html ; Krissinel & Henrick, 2005). Fig. 2(d) was adapted from PDBsum (Laskowski, 2009); all other figures were prepared with PyMOL (DeLano, 2002). Atomic coordinates and experimental structure factors have been deposited in the PDB under accession code 2pv7 .
| || Figure 2 |
Structure of HinfPDH. (a) Ribbon diagram of a HinfPDH monomer complexed with NAD+ and a tyrosine molecule. Helices 1-12 and strands 1-7 are indicated. The C-terminal domain is colored green and the helices and -strands of the N-terminal domain are colored cyan and red, respectively. Bound tyrosine and NAD+ molecules are shown in ball-and-stick representation; C, O, N and phosphate atoms are colored yellow, red, blue and orange, respectively. (b) Ribbon diagram of the HinfPDH dimer; monomer A is colored green and monomer B is colored orange. Helices are shown as cylinders. (c) Top view compared with (b) of the HinfPDH dimer. (d) Diagram showing the secondary-structure elements of HinfPDH superimposed on its primary sequence. The labeling of secondary-structure elements is in accord with PDBsum (http://www.ebi.ac.uk/pdbsum ), where -helices (H1-H12) and -strands (1-7) are sequentially labeled, -turns and -turns are designated by Greek letters (, ) and -hairpins are indicated by red loops.
The final model consists of a dimer of the PDH domain of TyrA (residues 92-371 for chains A and B), two nicotinamide adenine dinucleotides (NAD+), two tyrosines and 393 water molecules in the asymmetric unit (Figs. 2a, 2b and 2c). No electron density was observed for Gly80-Phe91, Asp312-Glu315 and Asn372-Gly377 in chain A and Gly80-Val91 and Asn372-Gly377 in chain B. The side chains of Lys212 and Lys348 from chain A and of Arg132, Lys212, Lys239, Gln325 and Ala371 from chain B were omitted owing to poor electron density. The Matthews coefficient (VM; Matthews, 1968) was 2.98 Å3 Da-1 and the estimated solvent content was 58.7%. The Ramachandran plot produced by MolProbity showed that 98% of the residues are in favored regions, with no outliers.
Each monomer consists of an N-terminal / dinucleotide-binding domain (residues 92-243) and a C-terminal -helical dimerization domain (residues 244-371) (Fig. 2a). The active site is located at the domain interface. The N-terminal domain adopts a modified Rossmann fold, which consists of a parallel seven-stranded -sheet (strand order 2-1-3-4-5-6-7) with the 1 helix on one face of the -sheet and the 2, 3, 4 and 5 helices on the other. Structural comparisons of HinfPDH and other nucleotide-binding proteins, including other TyrA enzymes, show that HinfPDH lacks the - structural motif between 2 and 2 that is present and is part of the -sheet in other nucleotide-binding proteins. The C-terminal domain consists of seven helices (6-12) that form the dimer interface; the helices from each monomer are intertwined into a tightly packed helical bundle with a buried surface of 11 000 Å2 (Figs. 2b and 2c). The two monomers are structurally similar to each other, with an r.m.s.d. of 0.2 Å for 268 equivalent C atoms.
The active site is located in the cleft between the N- and C-terminal domains (Figs. 2a, 2b and 2c). One tyrosine and one NAD+ are bound in each monomer (Fig. 3a). His200, Ser179 and Arg297 (Fig. 3b) are among the important residues for enzyme catalysis and/or ligand binding and are conserved in TyrA enzymes across all kingdoms of life. The His200 imidazole is hydrogen bonded to the 4'-OH of the bound tyrosine. Ser179 hydrogen bonds to both the ribose O atom of nicotinamide nucleoside and the 4'-OH of the bound tyrosine and is important for orientating prephenate and NAD+ for catalysis. The Arg297 guanadinium forms a pair of electrostatic interactions with the tyrosine carboxyl, which also interacts with Gln301 from the 8 helix. The tyrosine amino group hydrogen bonds to Tyr306 from 9 and Tyr288' from 8 of the adjacent monomer. His260 is located close to the tyrosine amino group and could be involved in regulation of tyrosine inhibition in a similar way to His217 in A. aeolicus.
| || Figure 3 |
Active site of HinfPDDH. (a) Final model of tyrosine and NAD+ molecules fitted in 2Fo - Fc electron-density maps prior to model building as output from the last step of XSOLVE (JCSG, unpublished work). The map is contoured at 1. (b) Active site of HinfPDDH showing key active-site residues and the bound tyrosine and NAD+ molecules in ball-and-stick representation. C atoms are colored gray for the protein and yellow for the ligands; O, N and phosphate atoms are colored red, blue and orange, respectively.
The aromatic ring of the bound tyrosine packs against the nicotinamide ring of NAD+ such that the 4'-OH of tyrosine is approximately 4 Å away from C4 of the nicotinamide ring. Assuming that prephenate adopts the same binding mode, the structure suggests that the hydride is transferred from prephenate to the si face of NAD+, which is consistent with a previous proton NMR study on the E. coli TyrA enzyme using isotope-labeled NAD-4-d (Hermes et al., 1984).
The binding mode of NAD+ is similar to those of arogenate dehydrogenase from Synechocystis sp. and prephenate dehydrogenase from A. aeolicus (Legrand et al., 2006; Sun et al., 2006). The pyrophosphate of NAD+ interacts with the P-loop (Gly108-Gly113) between 1 and 1, forming hydrogen bonds to Lys111 and the main-chain amides of Lys111 and Leu112. The diol of the adenylyl ribose hydrogen bonds to Asp131 and Lys132 from the loop located between 2 and 2. The adenine ring is sandwiched between Val152 and Arg132 from helices 2 and 3, with its N1 hydrogen bonded to Trp135 from 2. The diol of the nicotinamide ribose is hydrogen bonded to Ser179 and Val152. The nicotinamide ring interacts with the protein mainly through hydrophobic interactions.
A search with FATCAT (Ye & Godzik, 2004) using the HinfPDH coordinates identified the closest structural homologues of HinfPDH in the PDB as the prephenate dehydrogenases from A. aeolicus (AaeoPDH; PDB code 2g5c ; Sun et al., 2006) and Streptococcus thermophilus (SthePDH; PDB code 3dzb ; Z. Zhang, S. Eswaramoorthy, S. K. Burley & S. Swaminathan, unpublished work) and the arogenate dehydrogenase from Synechocystis sp. (SyneADH; PDB code 2f1k ; Legrand et al., 2006). HinfPDH is bifunctional, whereas the other three enzymes are monofunctional. The pairwise sequence identities between HinfPDH and AaeoPDH, SthePDH and SyneADH are 20, 27 and 25%, respectively. Despite the low sequence identity, the overall structures of these enzymes are very similar. The structures of AaeoPDH complexed with the ligands NAD+ (PDB code 2g5c ), NAD+ and L-tyrosine (PDB code 3ggg ), NADH and 4-hydroxylphenylpyruvate (HPP; PDB code 3ggo ) and NAD+ and 4-hydroxyphenylpropionate (PDB code 3ggp ) are available (Sun et al., 2009), but only the first two structures were used for comparison because they are sufficient to represent the two unique enzyme states; they are denoted AaeoPDH and AaeoPDH-Tyr-NAD+, respectively. SyneADH has an NADP+ bound at the active site and SthePDH has no ligand bound. The dehydrogenase activity of SyneADH is strictly dependent on arogenate and NADP+ (Legrand et al., 2006). Conversely, NAD+ and prephenate are the preferred cofactor and substrate for AaeoPDH, although a very low level of dehydrogenase activity is detected when NADP+ with prephenate or NAD+ with arogenate are used (Bonvin et al., 2006).
Although the overall fold is similar, structural comparisons revealed important differences around the active site. Compared with other TyrA enzymes, HinfPDH lacks an - structural motif between 2 and 2, the loop between 5 and 6 (L5-6) is much shorter, and an extra helix 12 is found at the C-terminus (Fig. 4). Multiple sequence alignment of many representative TyrA proteins clearly shows that the - motif and L5-6 represent unique structural differences between bifunctional and monofunctional TyrA enzymes (Fig. 5), but the extra C-terminal helix is less obviously discernable from the sequence comparisons, presumably because the exact end point of the prephenate dehydrogenase domain is difficult to determine for cases such as TyrA-aroF or TyrA-ACT fusions in which aroF and ACT are fused at the C-terminal end.
| || Figure 4 |
Structural differences between HinfPDH and its structural homologues. (a) Superimposition of HinfPDH, AaeoPDH, SthePDH and SyneADH reveals unique structural differences in HinfPDH including a missing - motif, a shorter L5-6 loop, and an extra C-terminal helix. HinfPDH is colored green or orange for each monomer. The AaeoPDH, SthePDH and SyneADH structures are colored magenta, yellow and blue, respectively. Bound tyrosine and NAD+ molecules are shown in ball-and-stick representation. (b) Enlarged view of the area around the - motif showing that HinfPDH is missing the - motif compared with AaeoPDH, SthePDH and SyneADH. (c) Enlarged view of the area around L5-6. Lys111 of HinfPDH is shown with a green backbone, Ser155 of AaeoPDH is in magenta and Gln120 of SyneADH is in blue. (d) Enlarged view of the C-terminal area showing that an extra C-terminal helix is present in HinfPDH. The 7 and 8 helices of HinfPDH superimposed onto structurally equivalent helices in AaeoPDH, SthePDH and SyneADH are also indicated.
| || Figure 5 |
Multiple sequence alignment of representative TyrA enzymes. Regions around the - motif and L5-6 are shown. Abbreviations: Hinf, Haemophilus influenzae Rd KW20; Etas, Erwinia tasmaniensis Et1/99; Ecol, Escherichia coli str. K-12 substr. MG1655; Plum, Photorhabdus luminescens subsp. laumondii TTO1; Pmul, Pasteurella multocida subsp. multocida str. Pm70; Vcho, Vibrio cholerae O1 biovar eltor str. N16961; Vpar, Vibrio parahaemolyticus RIMD 2210633; Yent, Yersinia enterocolitica subsp. enterocolitica 8081; Aful, Archaeoglobus fulgidus; Syne, Synechocystis sp. (Legrand et al., 2006); Atha, Arabidopsis thaliana (Rippert & Matringe, 2002); Npun, Nostoc punctiforme PCC 73102 (Song et al., 2005); Zmob, Zymomonas mobilis (Zhao et al., 1993); Paer, Pseudomonas aeruginosa PA7 (Xia & Jensen, 1990); Pstu, Pseudomonas stutzeri (Xie et al., 2000); Aaeo, Aquifex aeolicus VF5 (Bonvin et al., 2006); Bsub, Bacillus subtilis (Champney & Jensen, 1970); Sthe: Streptococcus thermophilus LMG 18311 (Song et al., 2005); Efae, Enterococcus faecalis V583 (Song et al., 2005).
In HinfPDH, L5-6 is eight residues shorter than in AaeoPDH, SthePDH and SyneADH. This loop is well ordered in the HinfPDH, AaeoPDH and SyneADH structures (Fig. 4c) but is disordered in the SthePDH structure owing to the absence of bound cofactor. In AaeoPDH and SyneADH, this loop extends along the cofactor-binding site. Ser155 in AaeoPDH and Gln120 in SyneADH are in structurally equivalent positions (Fig. 4c) and both form hydrogen bonds to the pyrophosphate O atom of the bound cofactor. However, in HinfPDH no residue is structurally equivalent because of the shorter loop and Lys111 from 1 instead provides the equivalent interaction with the pyrophosphate O atom (Fig. 4c). The equivalent residues to Lys111 in AaeoPDH and SyneADH are Phe40 and Leu10, respectively, and neither side chain can form hydrogen bonds to the pyrophosphate. This suggests that although a shorter L5-6 loop has evolved in HinfPDH, the ability to bind cofactor is not affected.
In SyneADH, the phosphate group of the adenosine ribose of NADP+ is recognized by helical residues in the - motif. The phosphate group is stabilized by electrostatic interaction with Arg31 and hydrogen-bonding interactions with Gln32 and Thr35, as well as with the main chain of Arg31 and Gln32. The equivalent residues in AaeoPDH are Ile63, Asn64 and Ser67. A loss of electrostatic interaction caused by the substitution of Arg by Ile may explain why AaeoPDH prefers NAD+ over NADP+ as cofactor. In SthePDH, the equivalent residues are Arg36, Ser37 and Ser40. Thus, the ability to form electrostatic and hydrogen-bonding interactions is similar to that of SyneADH, suggesting that SthePDH is capable of binding NADP+. In HinfPDH, Arg132 is structurally equivalent to SyneADH Arg31, but residues equivalent to Gln32 and Thr35 are absent owing to the lack of the - structural motif. It is possible that HinfPDH also prefers NAD+ over NADP+ in a manner similar to bifunctional TyrA enzymes from E. coli and K. pneumonia.
A smaller local difference among these TyrA structures is the loop joining 7 and 8 and the adjacent residues (Fig. 4d). This loop takes a wider turn in HinfPDH compared with that in AaeoPDH, SthePDH and SyneADH and is positioned next to 12. Ser284-Leu290 in this region are highly conserved in TyrAc_ sequences, suggesting that they play important roles.
Another important difference between HinfPDH and other TyrA structures is the relative orientation of the respective N- and C-terminal domains. Pairwise structural alignment of HinfPDH with AaeoPDH, AaeoPDH-Tyr-NAD+, SthePDH and SyneADH using only the N-terminal domain gives r.m.s.d.s of 1.8, 1.7, 1.9 and 1.5 Å, respectively, for 128-132 superimposed C atoms (0.8-0.9 Å for core -sheet residues). Upon structural superimposition, it is immediately noticeable that differences in the relative orientation of the N- and C-terminal domains are present in these TyrA structures (Figs. 6a and 6b). A hinge region around Glu242-Asn244 connects the N- and C-terminal domains at the domain interface opposite to the substrate-binding site. Superimposition using the C-terminal domain gives similar results, although the results are less obvious owing to internal structural differences within the C-terminal domains of these TyrA structures. Therefore, the discussion below is based on superimpositions using the N-terminal domain.
| || Figure 6 |
Superimposition of TyrA structures showing differences in the relative orientation of the N- and C-terminal domains and a close-up view of the active site of HinfPDH. (a) Superimposition of HinfPDH, AaeoPDH, AaeoPDH-Tyr-NAD+, SthePDH and SyneADH reveals differences in the relative orientation of the N- and C-terminal domains in these TyrA proteins. HinfPDH, AaeoPDH, AaeoPDH (monomer B)-Tyr-NAD+, SthePDH and SyneADH are colored green, magenta, orange, cyan and blue, respectively. (b) Same orientation as (a); for clarity, only HinfPDH and SyneADH are shown. (c) An ionic network in the active site of HinfPDH consists of Arg297, a bridging water molecule, Asp206 and Arg365' from adjacent molecule in a dual conformation. His260, Tyr288, Gln301 and Tyr306 that could be involved in substrate selectivity are also shown. Hydrogen bonds are indicated as dashed lines.
Using the C-terminal domain of HinfPDH as a reference, the C-terminal domains of AaeoPDH, SthePDH and SyneADH are farther away from the substrate-binding site; therefore, HinfPDH represents the most closed form, SyneADH is the most open form and AaeoPDH and SthePDH are in intermediate states (Figs. 6a and 6b). Monomer B of AaeoPDH-Tyr-NAD+ has both NAD+ and tyrosine bound and the conformation is closed, similar to HinfPDH. Monomer A of AaeoPDH-Tyr-NAD+ only has NAD+ bound and the conformation is similar to AaeoPDH. Comparison of the AaeoPDH and AaeoPDH-Tyr-NAD+ structures suggest that tyrosine induces a conformational change upon binding. Hence, it is possible that the binding of tyrosine to HinfPDH also induces a conformational change from an open to a closed form and the closed form is captured in the current HinfPDH structure. A crystal structure of apo HinfPDH could provide direct evidence for this proposal. In the closed conformation, 6, 8, 9 and 8' (8 from monomer B) are close to the bound tyrosine and 8 and 12' (12 from monomer B) are near L5-6, facilitating closure of the active site and the proper alignment of active-site residues for catalysis. Key active-site residues in this region include His260 (6), Arg297, Gln301 (8), Tyr288' (8'), Tyr306 (9), Arg365' (12') and Asp206 (L5-6). If a tyrosine molecule is modeled adjacent to the cofactor in AaeoPDH and SthePDH, the residue equivalent to Arg297 of HinfPHDH is too distant to interact with the tyrosine. The conformational change is independent of cofactor binding as the structures of SyneADH with NADP+ bound, AaeoPDH with NAD+ bound and SthePDH without any cofactor are all in similar open conformations.
In the closed conformation of HinfPDH an ionic network consisting of Arg297, a bridging water molecule, Asp206 from L5-6 and Arg365' from 12' is observed that may be involved in gating the active site (Fig. 6c). The bridging water is present in both monomers. The Arg365' side chain adopts dual conformations, with one conformation participating in the ionic network and the other pointing away from the active site. This dual conformation may be part of the gating mechanism, in which an ionic network forms on closure of the active site after substrate is bound and is broken when the product is released. It is worth noting that Asp206 is absolutely conserved and Arg365 is highly conserved in chorismate mutase/prephenate dehydrogenase sequences but not in monofunctional TyrA enzymes. Thus, the active-site gating mechanism might be different in bifunctional and monofunctional TyrA enzymes.
Previous studies on the pH-dependence of the E. coli TyrA enzyme showed that a catalytic group with a pKa value of about 6.5 is deprotonated for dehydrogenase activity (Turnbull, Cleland et al., 1991). Subsequent site-directed mutagenesis experiments revealed that this critical catalytic residue is His197 (Christendat et al., 1998). The catalytic mechanism of the oxidative decarboxylation of E. coli TyrA was investigated and suggested a concerted mechanism in which hydride transfer and decarboxylation occur in a concerted manner. It was proposed that His197 provides the driving force for the dehydrogenase reaction by polarizing the 4'-hydroxyl group of prephenate (Christendat et al., 1998). It was also postulated that since the end-product of the reaction is aromatic, polarization of the 4'-OH group is sufficient to lower the energy barrier for the reaction, rather than deprotonation of the 4'-OH group to form a -keto acid intermediate in a stepwise mechanism. The crystal structure of HinfPDH shows that the N2 atom of His200 (equivalent to His197 in E. coli TyrA) is hydrogen bonded to the 4'-hydroxyl group of tyrosine at a distance of 2.6 Å. A hydrogen-bonding network between His200, His248 and Asp249 is observed in which His200 N1 hydrogen bonds to His248 N(H)2 and His248 N(H)1 hydrogen bonds to Asp249 O1. Asp249 is located near the protein surface. As for His200, His248 and Asp249 are highly conserved in TyrA sequences. This hydrogen-bonding network can help to maintain His200 N2 in a deprotonated state. The Ser179 hydrogen bond to the 4'-OH of the bound tyrosine can provide an additional driving force for the reaction by polarizing the 4'-OH group since the equivalent Ser126 in AaeoPDH is critical for catalysis. In addition, HinfPDH and E. coli TyrA share 57% sequence identity in their prephenate dehydrogenase domains and all key active-site residues are conserved. Hence, HinfPDH is likely to adopt a concerted mechanism for dehydrogenase reaction as found for E. coli TyrA. In HinfPDH, tyrosine is bound directly at the catalytic site, suggesting that it acts as a competitive inhibitor.
How TyrA enzymes evolved to be specific for prephenate or arogenate is intriguing because prephenate and arogenate have very similar structures. In HinfPDH, Gln301, Tyr306, Tyr288 and His260 are positioned close to the amino and carboxyl groups of the bound tyrosine and could be involved in substrate specificity (Fig. 6c). Gln301, Tyr306 and Tyr288 are highly conserved in bifunctional TyrA sequences. In other TyrA sequences, Gln301 is replaced by Gly, Ser or Thr. Tyr306 is relatively conserved as Trp in TyrAp and TyrAc, but is Trp, Gly or Val in TyrAa. Tyr288 is located at the N-terminus of 8, where some local structural differences are found between HinfPDH and other TyrA structures. Alignment of TyrA sequences shows a gap of approximately four residues around Tyr288 in the TyrAp, TyrAa and TyrAc sequences. His260 is conserved in TyrA, except for some TyrAas, where it is Val or Gln. Given the structural resemblance between prephenate and arogenate, further experiments to elucidate the exact prephenate-binding mode will advance our understanding of substrate specificity in TyrA enzymes.
HinfPDH is the first prephenate dehydrogenase structure to be determined from a bifunctional TyrA enzyme. This structure reveals active-site residues that are important for catalysis and/or ligand binding and are consistent with previously determined structures of other TyrA enzymes. The comparison of HinfPDH with other known TyrA structures indicates important differences that appear to be characteristic features that differentiate the bifunctional and monofunctional TyrA enzymes and suggest that the regulation of enzyme activity is likely to differ between bifunctional and monofunctional TyrA enzymes. These structural differences may be related to the presence of a chorismate mutase domain in the bifunctional TyrA enzymes. A crystal structure of full-length H. influenzae TyrA should provide insight into this question. Additional information about the proteins described in this study is available from TOPSAN (Krishna et al., 2010) at http://www.topsan.org/explore?PDBid=2pv7 .
This work was supported by the NIH, National Institute of General Medical Sciences, Protein Structure Initiative grant U54 GM074898. Portions of this research were carried out at the Stanford Synchrotron Radiation Lightsource (SSRL). The SSRL is a national user facility operated by Stanford University at the SLAC National Accelerator Laboratory on behalf of the US Department of Energy, Office of Basic Energy Sciences. The SSRL Structural Molecular Biology Program is supported by the Department of Energy, Office of Biological and Environmental Research and by the National Institutes of Health (National Center for Research Resources, Biomedical Technology Program and the National Institute of General Medical Sciences. Genomic DNA from H. influenzae Rd KW20 (ATCC No. 5190) was obtained from the American Type Culture Collection (ATCC). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences or the National Institutes of Health.
Ahmad, S. & Jensen, R. A. (1987). FEBS Lett. 216, 133-139.
Beller, H. R., Chain, P. S., Letain, T. E., Chakicherla, A., Larimer, F. W., Richardson, P. M., Coleman, M. A., Wood, A. P. & Kelly, D. P. (2006). J. Bacteriol. 188, 1473-1488.
Bonvin, J., Aponte, R. A., Marcantonio, M., Singh, S., Christendat, D. & Turnbull, J. L. (2006). Protein Sci. 15, 1417-1432.
Calhoun, D. H., Bonner, C. A., Gu, W., Xie, G. & Jensen, R. A. (2001). Genome Biol. 2, 1-16.
Champney, W. S. & Jensen, R. A. (1970). J. Biol. Chem. 245, 3763-3770.
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12-21.
Chipman, D. M. & Shaanan, B. (2001). Curr. Opin. Struct. Biol. 11, 694-700.
Christendat, D., Saridakis, V. C. & Turnbull, J. L. (1998). Biochemistry, 37, 15703-15712.
Christendat, D. & Turnbull, J. L. (1999). Biochemistry, 38, 4782-4793.
Christopherson, R. I. (1985). Arch. Biochem. Biophys. 240, 646-654.
Cobbett, C. S. & Delbridge, M. L. (1987). J. Bacteriol. 169, 2500-2506.
Cohen, A. E., Ellis, P. J., Miller, M. D., Deacon, A. M. & Phizackerley, R. P. (2002). J. Appl. Cryst. 35, 720-726.
Cohen, S. X., Morris, R. J., Fernandez, F. J., Ben Jelloul, M., Kakaris, M., Parthasarathy, V., Lamzin, V. S., Kleywegt, G. J. & Perrakis, A. (2004). Acta Cryst. D60, 2222-2229.
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760-763.
Cruickshank, D. W. J. (1999). Acta Cryst. D55, 583-601.
Diederichs, K. & Karplus, P. A. (1997). Nature Struct. Biol. 4, 269-275.
DeLano, W. L. (2002). PyMOL Molecular Viewer. DeLano Scientific LLC, Palo Alto, California, USA. http://www.pymol.org .
Emsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126-2132.
Grant, G. A. (2006). J. Biol. Chem. 281, 33825-33829.
Hermes, J. D., Tipton, P. A., Fisher, M. A., O'Leary, M. H., Morrison, J. F. & Cleland, W. W. (1984). Biochemistry 23, 6263-6275.
Kabsch, W. (1993). J. Appl. Cryst. 26, 795-800.
Kabsch, W. (2010a). Acta Cryst. D66, 125-132.
Kabsch, W. (2010b). Acta Cryst. D66, 133-144.
Kleywegt, G. J. (2000). Acta Cryst. D56, 249-265.
Klock, H. E., Koesema, E. J., Knuth, M. W. & Lesley, S. A. (2008). Proteins, 71, 982-994.
Krishna, S. S., Weekes, D., Bakolitsa, C., Elsliger, M.-A., Wilson, I. A., Godzik, A. & Wooley, J. (2010). Acta Cryst. F66, 1143-1147.
Krissinel, E. & Henrick, K. (2005). CompLife 2005, edited by M. R. Berthold, R. Glen, K. Diederichs, O. Kohlbacher & I. Fischer, pp. 163-174. Berlin/Heidelberg: Springer-Verlag.
Laskowski, R. A. (2009). Nucleic Acids Res. 37, D355-D359.
Legrand, P., Dumas, R., Seux, M., Rippert, P., Ravelli, R., Ferrer, J.-L. & Matringe, M. (2006). Structure, 14, 767-776.
Lesley, S. A. et al. (2002). Proc. Natl Acad. Sci. USA, 99, 11664-11669.
Matthews, B. W. (1968). J. Mol. Biol. 33, 491-497.
Rippert, P. & Matringe, M. (2002). Eur. J. Biochem. 269, 4753-4761.
Sampathkumar, P. & Morrison, J. F. (1982). Biochim. Biophys. Acta, 702, 212-219.
Santarsiero, B. D., Yegian, D. T., Lee, C. C., Spraggon, G., Gu, J., Scheibe, D., Uber, D. C., Cornell, E. W., Nordmeyer, R. A., Kolbe, W. F., Jin, J., Jones, A. L., Jaklevic, J. M., Schultz, P. G. & Stevens, R. C. (2002). J. Appl. Cryst. 35, 278-281.
Sheldrick, G. M. (2008). Acta Cryst. A64, 112-122.
Song, J., Bonner, C. A., Wolinsky, M. & Jensen, R. A. (2005). BMC Biol. 3, 13.
Sun, W., Shahinas, D., Bonvin, J., Hou, W., Kimber, M. S., Turnbull, J. & Christendat, D. (2009). J. Biol. Chem. 284, 13223-13232.
Sun, W., Singh, S., Zhang, R., Turnbull, J. L. & Christendat, D. (2006). J. Biol. Chem. 281, 12919-12928.
Terwilliger, T. (2004). J. Synchrotron Rad. 11, 49-52.
Thompson, J. D., Higgins, D. G. & Gibson, T. J. (1994). Nucleic Acids Res. 22, 4673-4680.
Turnbull, J., Cleland, W. W. & Morrison, J. F. (1990). Biochemistry, 29, 10245-10254.
Turnbull, J., Cleland, W. W. & Morrison, J. F. (1991). Biochemistry, 30, 7777-7782.
Turnbull, J., Morrison, J. F. & Cleland, W. W. (1991). Biochemistry 30, 7783-7788.
Vaguine, A. A., Richelle, J. & Wodak, S. J. (1999). Acta Cryst. D55, 191-205.
Van Duyne, G. D., Standaert, R. F., Karplus, P. A., Schreiber, S. L. & Clardy, J. (1993). J. Mol. Biol. 229, 105-124.
Vonrhein, C., Blanc, E., Roversi, P. & Bricogne, G. (2007). Methods Mol. Biol. 364, 215-230.
Vriend, G. (1990). J. Mol. Graph. 8, 52-56.
Winn, M. D., Murshudov, G. N. & Papiz, M. Z. (2003). Methods Enzymol. 374, 300-321.
Xia, T. & Jensen, R. A. (1990). J. Biol. Chem. 265, 20033-20036.
Xie, G., Bonner, C. A. & Jensen, R. A. (2000). Comput. Biochem. Physiol. C Toxicol. Pharmacol. 125, 65-83.
Yang, H., Guranovic, V., Dutta, S., Feng, Z., Berman, H. M. & Westbrook, J. D. (2004). Acta Cryst. D60, 1833-1839.
Ye, Y. & Godzik, A. (2004). Nucleic Acids Res. 32, W582-W585.
Zhao, G., Xia, T., Ingram, L. O. & Jensen, R. A. (1993). Eur. J. Biochem. 212, 157-165.