Received 7 August 2013
Structures of Saccharomyces cerevisiae D-arabinose dehydrogenase Ara1 and its complex with NADPH: implications for cofactor-assisted substrate recognition
aCollege of Life and Environment Science, Huangshan University, Huangshan, Anhui 245041, People's Republic of China, and bHefei National Laboratory for Physical Sciences at the Microscale and School of Life Sciences, University of Science and Technology of China, Hefei, Anhui 230027, People's Republic of China
The primary role of yeast Ara1, previously mis-annotated as a D-arabinose dehydrogenase, is to catalyze the reduction of a variety of toxic ,-dicarbonyl compounds using NADPH as a cofactor at physiological pH levels. Here, crystal structures of Ara1 in apo and NADPH-complexed forms are presented at 2.10 and 2.00 Å resolution, respectively. Ara1 exists as a homodimer, each subunit of which adopts an (/)8-barrel structure and has a highly conserved cofactor-binding pocket. Structural comparison revealed that induced fit upon NADPH binding yielded an intact active-site pocket that recognizes the substrate. Moreover, the crystal structures combined with computational simulation defined an open substrate-binding site to accommodate various substrates that possess a dicarbonyl group.
D-Erythroascorbic acid (eAsA) is a major antioxidant that is produced during the metabolic processes in several fungi, such as Saccharomyces cerevisiae (Nick et al., 1986), Neurospora crassa (Dumbrava & Pall, 1987) and Candida (Pall & Robertson, 1988), and corresponds to ascorbic acid (AsA) in animals (Meister, 1994) and plants (Asada, 1999). The biosynthetic pathway of eAsA is composed of two successive reactions that are catalyzed by D-arabinose dehydrogenase (Ara; Kim et al., 1998) and D-arabinono--lactone oxidase (Alo; Huh et al., 1998). Two S. cerevisiae genes, YBR149W and YMR041C, have been annotated as encoding two types of Ara: Ara1 and Ara2, respectively (Kim et al., 1998). However, the Km value of Ara1 towards D-arabinose is about 160 mM, which is 200 times that of Ara2 (0.78 mM). The high Michaelis constant suggests that Ara1 might possess ineffective D-arabinose dehydrogenase activity since the intracellular D-arabinose concentration in yeast is far lower than 100 mM (Amako, Fujita, Iwamoto et al., 2006). Moreover, the results of deletion of the ARA1 or ARA2 gene further confirmed that Ara2, and not Ara1, contributes the majority of the production of AsA from D-arabinose (Amako, Fujita, Shimohata et al., 2006; Amako, Fujita, Iwamoto et al., 2006).
Ara1 was subsquently re-annotated as an ,-dicarbonyl reductase which belongs to the aldo-keto reductase (AKR) family (van Bergen et al., 2006). That is, Ara1 catalyzes the reduction of ,-dicarbonyl compounds such as methylglyoxal, diacetyl and pentanedione, which are known to be toxic metabolic by-products. These reactive compounds react with proteins and nucleic acids, leading to mutagenesis and damage (Kovacic & Cooksy, 2005; Wondrak et al., 2002). In addition, the presence of the metabolite diacetyl in beverages such as beer gives a butterscotch-like aroma and an unpleasant flavour. Thus, it would be useful to engineer a strain of yeast that could enzymatically reduce diacetyl to acetoin (2-hydroxy-3-butanone), a more flavour-neutral compound in beer (van Bergen et al., 2006). In the presence of saturated NADPH, the Km values of Ara1 towards 2,3-pentanedione, diacetyl and methylglyoxal at pH 4-5 are 4.2, 5.0 and 14.3 mM, respectively. The kcat values towards these substrates are in the range 4.4-5.9 s-1, which is fast enough to catalyze degradation of these toxic compounds (van Bergen et al., 2006). Moreover, proteomics results demonstrated a twofold increase of Ara1 expression in response to H2O2 stimuli (Godon et al., 1998). Microarray data showed that environmental stresses, including heat shock and oxidative stress, markedly stimulate up-regulation of ara1 transcription in yeast cells (Gasch et al., 2000). It is suggested that the primary role of Ara1 is to reduce a variety of toxic aldehydes and ketones produced during stress.
Although some homologous proteins to Ara1, such as human Akr1b10 (4-3-ketosteroid 5-reductase; PDB entry 3cav ; Faucher et al., 2008) and murine FR-1 (fibroblast growth factor 1; PDB entry 1frb ; Wilson et al., 1995), which belong to the AKR family have been characterized, the crystal structure of yeast Ara1 is still not available. Here, we determined the first crystal structures of Ara1: in the apo form at 2.10 Å resolution and complexed with the coenzyme NADPH at 2.00 Å resolution. These two structures enabled us to illustrate an induced fit upon NADPH binding and to define an accommodative substrate-binding site which would detoxify a broad spectrum of substrates.
The coding sequence of ARA1/YBR149W was cloned into a pET28a-derived vector. This construct adds a hexahistidine tag to the N-terminus of the recombinant protein, which was overexpressed in E. coli BL21 (DE3) strain (Novagen, Madison, Wisconsin, USA) using 2×YT (yeast extract and tryptone) culture medium. The cells were induced with 0.2 mM isopropyl -D-1-thiogalactopyranoside (IPTG) at 289 K for 20 h when the OD600 nm reached 0.6. The cells were harvested by centrifugation at 8000g for 10 min and resuspended in lysis buffer (20 mM Tris-HCl pH 8.0, 200 mM NaCl). After 5 min of sonication and centrifugation at 12 000g for 25 min, the supernatant containing the soluble target protein was collected and loaded onto an Ni-NTA column (GE Healthcare) equilibrated with binding buffer (20 mM Tris-HCl pH 8.0, 200 mM NaCl). The target protein was eluted with 250 mM imidazole buffer and loaded onto a Superdex 200 column (GE Healthcare) equilibrated with 20 mM Tris-HCl pH 8.0, 50 mM NaCl. Fractions containing the target protein were pooled and concentrated to 20 mg ml-1. The purity of the protein was estimated by SDS-PAGE and the protein sample was stored at 193 K.
Crystals of Ara1 were obtained at 289 K using the hanging-drop vapour-diffusion technique by mixing 1 µl protein solution at 10 mg ml-1 with an equal volume of reservoir solution (25% polyethylene glycol 3350, 0.1 M HEPES pH 7.5).
Crystals of the Ara1-NADPH complex were obtained by co-crystallization with 5 mM NADPH in 25% polyethylene glycol 3350, 0.1 M bis-tris pH 6.5, 0.05 M CaCl2.
The crystals were flash-cooled in liquid nitrogen and data sets were collected at a radiation wavelength of 0.9795 Å on beamline BL17U at Shanghai Synchrotron Radiation Facility (SSRF) at 100 K using an MX-225 CCD detector (MAR Research). Data processing and scaling were performed using the HKL-2000 package (Otwinowski & Minor, 1997). The crystal structure of Ara1 was determined by the molecular-replacement method with MOLREP using the coordinates of human AKR in complex with NADPH and inhibitor (PDB entry 1zua ; Gallego et al., 2007) as the search model. Refinement was carried out using the maximum-likelihood method in REFMAC (Murshudov et al., 2011) and the interactive rebuilding process was performed using Coot (Emsley & Cowtan, 2004). The overall model quality was assessed with MolProbity (Chen et al., 2010). Atomic coordinates and structure factors have been deposited in the Protein Data Bank (PDB; http://www.rcsb.org ) under accession codes 4ijc and 4ijr . The crystallographic parameters of the structure are listed in Table 1. All structural figures were prepared using PyMOL (http://www.pymol.org ).
+R factor = , where Fobs and Fcalc are the observed and calculated structure-factor amplitudes, respectively.
§Rfree was calculated using 5% of the data, which were excluded from the refinement.
##Root-mean-square deviation from ideal values (Engh & Huber, 1991).
++Categories as defined by MolProbity (Chen et al., 2010).
The asymmetric unit contains a dimer of Ara1 with an interface area of 1030 Å2. The two subunits are very similar, with an overall root-mean-square deviation (r.m.s.d.) of 0.13 Å over 296 C atoms (Fig. 1a). Gel-filtration chromatography also indicated the existence of Ara1 as a dimer in solution. The dimeric interface is mainly mediated by strands A, B and two loops (Met15-Tyr24 and Lys91-Leu96) in each subunit and contains eight hydrogen bonds and 111 non-bonded contacts, which include hydrophobic interactions and salt bridges.
| || Figure 1 |
Overall structure. Schematic representation of (a) the Ara1 dimer and (b) the Ara1 monomer. (c) Cartoon representation and (d) molecular surface of the Ara1-NADPH complex. NADPH is shown as green sticks. All figures were drawn using PyMOL.
Each Ara1 subunit adopts an (/)8-barrel topology or TIM-barrel (Banner et al., 1975) motif (Fig. 1b). As in other AKR structures (Wilson et al., 1992), the TIM barrel is mainly composed of eight parallel -strands, and each -strand alternates with an -helix running antiparallel to the strand. The two antiparallel -strands (A and B) at the N-terminus cover the bottom of the barrel. The top of the barrel is partially covered by three large exposed loops, loops A (between 4 and 4; Glu133-Tyr166), B (between 7 and 7; Tyr241-Pro248) and C (at the carboxyl-terminus; Lys316-Leu342), and two -helices, helix A (Pro254-Ile263) between 7 and 7 and helix B (Lys303-Lys315) between helix 8 and loop C (Fig. 1b).
The structure of the Ara1-NADPH complex showed that a molecule of NADPH binds at the carboxyl edge of the -strands of the barrel in an extended conformation (Figs. 1c and 1d). In detail, the adenine ring of NADPH is stabilized by the main chains of Ala249 and Ser250 and the side chains of Ala248, Leu251, Asn268 and Arg291 via van der Waals interactions. The phosphate group of the adenosine ribose is fixed by Ser286 O, Leu287 N and Arg291 N1 and the hydroxyl group of the adenosine ribose is stabilized by Arg285 N1 through hydrogen bonds. The pyrophosphate is threaded through a short tunnel, with one side occupied by Ser241-His246. The other side is lined with Ile283, Pro284 and Arg285. The pyrophosphate group forms two hydrogen bonds to Ser241 N and O. One hydroxyl group of the nicotinamide ribose makes a hydrogen bond to Ala41 N. The nicotinamide moiety accommodates a wider cavity and forms hydrogen bonds to Gln214 O1 and Ser192 O (Fig. 2a). Superposition of apo-form Ara1 on the the Ara1-NADPH complex yields an r.m.s.d. of 0.25 Å over 297 C atoms. The major conformational change results from the induced fit upon NADPH binding. In order to accommodate the cofactor, loop B (Tyr240-Pro249) and a short segment (Ile283-Arg291) near the C-terminal end shift towards each other and lead to a narrower cleft (Fig. 2b). For example, Leu243, Ser245, His246 and Ala248 shift by 1.8, 1.0, 1.3 and 1.0 Å, respectively, whereas the phenolic ring of Tyr240 and the hydroxyl group of Ser241 shift by about 0.7 and 0.9 Å, respectively, leaving space for the NADPH nicotinamide moiety. On the other side, Arg285, Ser286 and Leu287 also shift by about 1.1, 0.7 and 0.3 Å, respectively, to stabilize the adenosine ribose of NADPH (Fig. 2b).
| || Figure 2 |
NADPH-binding site. (a) Interactions between NADPH and Ara1. (b) Induced fit upon NADPH binding. Ara1 is shown in cyan and the Ara1-NADPH complex is shown in orange. NADPH is shown in green lines and the interacting residues are shown as sticks.
The substrate-binding pocket of AKRs has been proposed and is reported to be close to the nicotinamide moiety of the NADPH cofactor (Jez et al., 1997; Di Costanzo et al., 2009; Wilson et al., 1992). To clarify the structural basis of the catalysis driven by Ara1, we attempted to obtain a crystal of the tertiary complex of Ara1 with NADP+ and an ,-dicarbonyl compound by either co-crystallization or crystal soaking, but were not successful. Therefore, we docked two typical ,-dicarbonyl substrates, diacetyl and 2,3-pentanedione, into the structure of Ara1-NADPH using HADDOCK (de Vries et al., 2010). The docking program was driven by interaction restraints between the active-site residues of Ara1 and the ,-dicarbonyl substrate, as defined by WHISCY (Adams et al., 2002) and previously reported by Jez et al. (1997). Docking produced 25 clusters for diacetyl and 32 for 2,3-pentanedione. The results for each substrate were selected as the cluster of lowest energy that satisfied the best interaction restraints. In this mode, the topology of the substrate-binding site resembles an open and accommodative cleft and includes three components: the oxyanion-binding site (Tyr71, His131 and C4 of the nicotinamide ring), residues at the edge of the active site (Ala41, Ala70, Trp102 and Trp132) forming a hydrophobic environment and amino acids from three loops forming the sides of the cleft: loop A contributes to one side (Lys150 and Thr151), with the opposite side being formed by loop B (Tyr240 and His246) and loop C (Ile321, Glu323 and Phe325) (Figs. 3a and 3b). The substrate-binding pocket is predominantly hydrophobic, in accord with the generally hydrophobic nature of the dicarbonyl substrates. In the binding mode, all of the substrate packed perpendicular to the nicotinamide ring, with one carbonyl O atom of the ,-dicarbonyl compound interacting with the side chain of Tyr71, His131 and the nicotinamide ring through hydrogen bonds (about 3.1 Å for diacetyl and 3.4 Å for 2,3-pentanedione; Figs. 3a and 3b). Tyr71 was proposed as the catalytic residue (proton donor) based on the proximity required between the C4 position of the nicotinamide ring and the anticipated position of the substrate carbonyl (Wilson et al., 1992; Jez et al., 1997). The other carbonyl is exposed towards the outside of the substrate-binding pocket (Figs. 3c and 3d). Meanwhile, the conserved hydrophobic residues Ala41, Ala70, Trp102, Trp132, Tyr240, Ile321 and Phe325 form a hydrophobic environment to accommodate the carbon skeleton of the ,-dicarbonyl compound (Figs. 3a and 3b). Sequence analysis reveals that the active-site residues Ala41, Asp66, Ala70, Tyr71, Lys100, Trp102, His131, Trp132, Tyr240, Ile321 and Phe325 are all conserved in the AKR family (Fig. 4) and possess a common substrate-binding pattern. Analysis of the Ara1 structure shows that the three loop regions (A, B and C) exhibit relatively high B factors, and this structural flexibility and plasticity was supposed to be necessary for the recognition of more than one substrate (Jez et al., 1997). Furthermore, multiple sequence alignment also shows that the composition and length of the amino acids in the three loops (A, B and C) varies (Fig. 4), which probably determines the substrate specificities of the different AKRs. In conclusion, the open and accommodative substrate-binding site forms a favourable environment for various ,-dicarbonyl substrates.
| || Figure 3 |
A docking model of Ara1 complexed with diacetyl or 2,3-pentanedione. (a, b) Binding patterns of (a) diacetyl and (b) 2,3-pentanedione. (c, d) Surface potentials of Ara1 complexed with (c) diacetyl and (d) 2,3-pentanedione. Residues are shown as cyan sticks and diacetyl or 2,3-pentanedione as grey sticks. NADPH is shown as thinner sticks. Hydrogen bonds are shown as black dashes.
| || Figure 4 |
Multiple sequence alignment of proteins in the aldo-keto reductase (AKR) family. Proteins are represented by their PDB codes: 4ijc , Saccharomyces cerevisiae Ara1 (NP_009707.3); 1mzr , Escherichia coli Dkga (NP_417485.4; Jeudy et al., 2006); 3h7u , Arabidopsis thaliana NADP-linked oxidoreductase (NP_001031505.1; Simpson et al., 2009); 4f4o , Leishmania braziliensis Ara1 (XP_001685202.1; Andersen et al., 2012); 1qwk , Caenorhabditis elegans Ara1 (NP_509242.1; Southeast Collaboratory for Structural Genomics, unpublished work); 1frb , Mus musculus aldose reductase (NP_032038.1; Wilson et al., 1995); 1zua , Homo sapiens Akr1b10 (NP_064695.3; Gallego et al., 2007). The secondary-structure elements of Ara1 (PDB entry 4ijc ) are shown at the top. Residues involved in substrate binding are labelled with blue triangles and catalytic residues are marked with red stars. The alignment was performed with ClustalW (Larkin et al., 2007) and ESPript (Gouet et al., 1999).
The docking results enable us to propose a plausible catalytic mechanism for Ara1. In the apo form, the cofactor-binding pocket and the active site are relatively open and relaxed. Upon binding of NADPH, loop B (Tyr240-Pro249) and a short segment (Ile283-Arg291) near the C-terminal end move towards each other to narrow the cofactor-binding cleft. Meanwhile, the active-site residues form a cavity favourable for substrate binding (Jez et al., 1997; Wilson et al., 1992). The substrate in the pocket is correctly positioned by the side chains of Ala70, Tyr71, His131, Trp102 and Trp132, as well as the NADPH nicotinamide ring. An electron immediately transfers from C4 of the nicotinamide to the carbonyl group of the substrate. Upon reduction of the carbonyl group of the ,-dicarbonyl substrate, the hydrogen bond between the catalytic residue Tyr71 and the carbonyl group of the substrate disappears. With the change in redox state, NADP+ may undergo a conformational change accompanied by the opening of the cofactor-binding cleft for release of the product.
We are grateful to the developers of the CCP4 suite, ESPript, MolProbity and PyMOL. This work was supported by the 973 Project (2012CB911000) from the Ministry of Science and Technology of China and the National Natural Science Foundation of China (Program 31170695).
Adams, P. D., Grosse-Kunstleve, R. W., Hung, L.-W., Ioerger, T. R., McCoy, A. J., Moriarty, N. W., Read, R. J., Sacchettini, J. C., Sauter, N. K. & Terwilliger, T. C. (2002). Acta Cryst. D58, 1948-1954.
Amako, K., Fujita, K., Iwamoto, C., Sengee, M., Fuchigami, K., Fukumoto, J., Ogishi, Y., Kishimoto, R. & Goda, K. (2006). Biosci. Biotechnol. Biochem. 70, 3004-3012.
Amako, K., Fujita, K., Shimohata, T. A., Hasegawa, E., Kishimoto, R. & Goda, K. (2006). FEBS Lett. 580, 6428-6434.
Andersen, C. B., Torvund-Jensen, M., Nielsen, M. J., de Oliveira, C. L., Hersleth, H. P., Andersen, N. H., Pedersen, J. S., Andersen, G. R. & Moestrup, S. K. (2012). Nature (London), 489, 456-459.
Asada, K. (1999). Annu. Rev. Plant Physiol. Plant Mol. Biol. 50, 601-639.
Banner, D. W., Bloomer, A. C., Petsko, G. A., Phillips, D. C., Pogson, C. I., Wilson, I. A., Corran, P. H., Furth, A. J., Milman, J. D., Offord, R. E., Priddle, J. D. & Waley, S. G. (1975). Nature (London), 255, 609-614.
Bergen, B. van, Strasser, R., Cyr, N., Sheppard, J. D. & Jardim, A. (2006). Biochim. Biophys. Acta, 1760, 1636-1645.
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12-21.
Di Costanzo, L., Drury, J. E., Christianson, D. W. & Penning, T. M. (2009). Mol. Cell. Endocrinol. 301, 191-198.
Dumbrava, V. A. & Pall, M. L. (1987). Biochim. Biophys. Acta, 926, 331-338.
Emsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126-2132.
Engh, R. A. & Huber, R. (1991). Acta Cryst. A47, 392-400.
Faucher, F., Cantin, L., Luu-The, V., Labrie, F. & Breton, R. (2008). Biochemistry, 47, 8261-8270.
Gallego, O., Ruiz, F. X., Ardevol, A., Dominguez, M., Alvarez, R., de Lera, A. R., Rovira, C., Farres, J., Fita, I. & Pares, X. (2007). Proc. Natl Acad. Sci. USA, 104, 20764-20769.
Gasch, A. P., Spellman, P. T., Kao, C. M., Carmel-Harel, O., Eisen, M. B., Storz, G., Botstein, D. & Brown, P. O. (2000). Mol. Biol. Cell, 11, 4241-4257.
Godon, C., Lagniel, G., Lee, J., Buhler, J.-M., Kieffer, S., Perrot, M., Boucherie, H., Toledano, M. B. & Labarre, J. (1998). J. Biol. Chem. 273, 22480-22489.
Gouet, P., Courcelle, E., Stuart, D. I. and Métoz, F. (1999). Bioinformatics, 15, 305-308.
Huh, W.-K., Lee, B.-H., Kim, S.-T., Kim, Y.-R., Rhie, G.-E., Baek, Y.-W., Hwang, C.-S., Lee, J.-S. & Kang, S.-O. (1998). Mol. Microbiol. 30, 895-903.
Jeudy, S., Monchois, V., Maza, C., Claverie, J.-M. & Abergel, C. (2006). Proteins, 62, 302-307.
Jez, J. M., Bennett, M. J., Schlegel, B. P., Lewis, M. & Penning, T. M. (1997). Biochem. J. 326, 625-636.
Kim, S.-T., Huh, W.-K., Lee, B.-H. & Kang, S.-O. (1998). Biochim. Biophys. Acta, 1429, 29-39.
Kovacic, P. & Cooksy, A. L. (2005). Arch. Toxicol. 79, 123-128.
Larkin, M. A., Blackshields, G., Brown, N. P., Chenna, R., McGettigan, P. A., McWilliam, H., Valentin, F., Wallace, I. M., Wilm, A., Lopez, R., Thompson, J. D., Gibson, T. J. & Higgins, D. G. (2007). Bioinformatics, 23, 2947-2948.
Meister, A. (1994). J. Biol. Chem. 269, 9397-9400.
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355-367.
Nick, J. A., Leung, C. T. & Loewus, F. A. (1986). Plant Sci. 46, 181-187.
Otwinowski, Z. & Minor, W. (1997). Methods Enzymol. 276, 307-326.
Pall, M. L. & Robertson, C. K. (1988). Biochem. Biophys. Res. Commun. 150, 365-370.
Simpson, P. J., Tantitadapitak, C., Reed, A. M., Mather, O. C., Bunce, C. M., White, S. A. & Ride, J. P. (2009). J. Mol. Biol. 392, 465-480,
Vries, S. J. de, van Dijk, M. & Bonvin, A. M. (2010). Nature Protoc. 5, 883-897.
Wilson, D. K., Bohren, K. M., Gabbay, K. H. & Quiocho, F. A. (1992). Science, 257, 81-84.
Wilson, D. K., Nakano, T., Petrash, J. M. & Quiocho, F. A. (1995). Biochemistry, 34, 14323-14330.
Wondrak, G. T., Cervantes-Laurean, D., Roberts, M. J., Qasem, J. G., Kim, M., Jacobson, E. L. & Jacobson, M. K. (2002). Biochem. Pharmacol. 63, 361-373.