research papers
The N-acetylglucosamine 2-epimerase from Nostoc sp. KVJ10 reveals the true dimer
of theaThe Norwegian Structural Biology Centre (NorStruct), Department of Chemistry, UiT – The Arctic University of Norway, 9037 Tromsø, Norway
*Correspondence e-mail: inger.l.rader@uit.no
N-Acetylglucosamine 2-epimerases (AGEs) catalyze the interconversion of N-acetylglucosamine and N-acetylmannosamine. They can be used to perform the first step in the synthesis of sialic acid from N-acetylglucosamine, which makes the need for efficient AGEs a priority. This study presents the structure of the AGE from Nostoc sp. KVJ10 collected in northern Norway, referred to as nAGE10. It is the third AGE structure to be published to date, and the first one in P42212. The nAGE10 monomer folds as an (α/α)6 barrel in a similar manner to that of the previously published AGEs, but the crystal did not contain the dimers that have previously been reported. The previously proposed `back-to-back' assembly involved the face of the AGE monomer where the barrel helices are connected by small loops. Instead, a `front-to-front' dimer was found in nAGE10 involving the long loops that connect the barrel helices at this end. This assembly is also present in the other AGE structures, but was attributed to crystal packing, even though the `front' interface areas are larger and are more conserved than the `back' interface areas. In addition, the front-to-front association allows a better explanation of the previously reported observations considering surface cysteines. Together, these results indicate that the `front-to-front' dimer is the most probable biological assembly for AGEs.
Keywords: N-acetylglucosamine 2-epimerase; AGE; sialic acid; crystal packing; ManNAc; GlcNAc; N-acetylmannosamine; Nostoc sp. KVJ10.
PDB reference: N-acetylglucosamine 2-epimerase, 6f04
1. Introduction
N-Acetylglucosamine 2-epimerases (AGEs; EC 5.1.3.8) catalyze the reversible of N-acetylmannosamine (ManNAc) to N-acetylglucosamine (GlcNAc) in both eukaryotic and prokaryotic cells (Ghosh & Roseman, 1965; Lee, Chien et al., 2007). It seems that their role, at least in mammals, is to divert the metabolic away from sialic acid synthesis (Luchansky et al., 2003). The reaction follows a deprotonation/reprotonation mechanism involving two key residues, which had been thought to be histidines, but which have recently been identified as a glutamine and an arginine (Lee, Wu et al., 2007; Takahashi, Takahashi et al., 2001; Wang et al., 2016). AGE activity is greatly enhanced by the presence of in particular ATP, which serve as allosteric activators (Datta, 1970; Ghosh & Roseman, 1965; Tabata et al., 2002; Takahashi, Hori et al., 2001).
From an industrial point of view, AGEs can be used for the synthesis of N-acetylneuraminic acid (Neu5Ac), which is also known as sialic acid (Hu et al., 2010; Kragl et al., 1991; Lee et al., 2004; Lee, Chien et al., 2007; Maru et al., 1996; Tabata et al., 2002; Wang et al., 2009). One current approach is a one-pot, coupled reaction with an N-acetylneuraminic acid lyase (NAL; EC 4.1.3.3), in which the AGE performs the (reverse) GlcNAc-to-ManNAc and the NAL performs the condensation with pyruvate, thus producing Neu5Ac. However, there are several challenges related to the use of AGEs in this step, such as the unfavourable thermodynamic equilibrium, the requirement for ATP and the inhibition by both pyruvate and Neu5Ac (Datta, 1970; Ghosh & Roseman, 1965; Klermund et al., 2013; Kragl et al., 1991).
In order to optimize Neu5Ac production, the search for better AGEs is a current area of focus and enzymes from cyanobacterial sources seem to be the most promising. Indeed, the AGEs from Anabaena sp. CH1 (AnaAGE) and A. variabilis ATCC 29413 (AvaAGE) have a that is almost four times that of the porcine enzyme, despite having similar affinities for GlcNAc (Klermund et al., 2013; Lee, Wu et al., 2007). Other enzymes present interesting properties, such as the AGE from Bacteroides ovatus ATCC 8483 (BoAGE), which shows a much higher affinity for GlcNAc than ManNAc, and that from Synechocystis PCC 6803 (SynAGE), which has the lowest Km values for both GlcNAc and ManNAc of all AGEs that have been characterized to date (Sola-Carvajal et al., 2012; Tabata et al., 2002).
In addition to bioprospecting for enzymes that are more suited for industrial purposes, the identification of the structural elements that govern the activity parameters of AGEs is a priority. Two AGEs had been crystallized prior to this study: the porcine enzyme (pAGE; PDB entry 1fp3) and that from Anabaena sp. CH1 (AnaAGE; PDB entry 2gz6), the latter of which used the former as a template for (Itoh et al., 2000; Lee, Wu et al., 2007). These studies revealed that the overall structure of AGEs is that of a (α/α)6 barrel. Together with mutagenesis studies, this led to the proposal of a involving two critical histidine residues as acid/base catalysts in the protonation/deprotonation of carbon C2 of GlcNAc/ManNAc. The enzyme from Pedobacter heparinus DSM2366 (PhGn2E) was used in a hydrogen/deuterium-exchange experiment, which confirmed this type of mechanism for the albeit with a glutamine/arginine pair as catalysts (Wang et al., 2016).
AGEs were found to be structurally similar to sulfoquinovose isomerases encoded by the yihS gene (EC 5.3.1.31) and to cellobiose 2-epimerases (CE; EC 5.1.3.11), despite a relatively low sequence identity (Denger et al., 2014; Fujiwara et al., 2013, 2014; Itoh et al., 2008; Tyler & Leatherwood, 1967). Together, they form the AGE superfamily (Pfam accession No. PF07221), for which 13 structures are currently available (Bateman et al., 2004; Finn et al., 2014). Several of these structures have been solved as protein–ligand complexes, and structural comparison of the relatively conserved active sites helped to form the current hypothesis for the catalytic mechanism of AGEs (Itoh et al., 2008).
In this study, we report the determination of the Nostoc sp. KVJ10, which we refer to as nAGE10 (Liaimer et al., 2016). We also demonstrate that the most probable biological assembly for the AGE dimer involves the opposite face of the barrel to that published previously. Finally, we reveal the presence of a putatively conserved chloride ion within the active site of AGE.
of the AGE from the local strain2. Materials and methods
2.1. Cloning, expression and purification
The gene coding for nAGE10, containing the coding sequence for a C-terminal hexahistidine tag and optimized for expression in Escherichia coli, was synthesized by GeneArt and subcloned into the pDEST14 expression vector using the Gateway cloning system (ThermoFisher Scientific). The protein was expressed in E. coli BL21 Star (DE3) cells (ThermoFisher Scientific), which were grown at 37°C to an OD600 of 0.6, brought to 20°C and induced with 0.5 mM isopropyl β-D-1-thiogalactopyranoside (IPTG; Sigma–Aldrich). The cells were incubated for approximately 16 h at 20°C and harvested by centrifugation at 6000g for 25 min. Bacterial cell pellets were resuspended in 50 mM Tris–HCl pH 7.5, 250 mM NaCl, 5 mM β-mercaptoethanol, 10 mM imidazole, along with half a tablet of protease-inhibitor cocktail (Roche) and 2 µl DNAse I. The cells were disrupted using a French press and the extract was centrifuged at 20 000g for 2 × 30 min to remove cell debris. The supernatant was filtered and purified by using a 5 ml HisTrap column (GE Healthcare). The binding buffer consisted of 20 mM sodium phosphate pH 7.4, 0.5 M NaCl, 20 mM imidazole. The elution buffer consisted of 20 mM sodium phosphate pH 7.4, 0.5 M NaCl, 500 mM imidazole. The protein was eluted using a gradient from 5% to 80% elution buffer over 60 ml. After purification, fractions containing protein were assessed by SDS–PAGE. Pure fractions were pooled and dialyzed overnight in 50 mM Tris–HCl pH 7.5, 250 mM NaCl. The protein solutions were further dialyzed into storage buffer (20 mM HEPES pH 7.4, 115 mM NaCl). The state of nAGE10 was determined by gel-filtration on a Superdex 75 10/300 GL column (GE Healthcare). The results are presented in the supporting information. The protein concentration was determined by measuring the absorbance at 280 nm using a theoretical extinction coefficient for the protein.
2.2. Differential scanning calorimetry
−1. A solution of nAGE10 at 1.5 mg ml−1 in storage buffer was used in the experiment. The results are presented in the supporting information.
(DSC) was used to determine the melting temperature of nAGE10. Thermal of nAGE10 was followed between 5 and 95°C using a heating/cooling rate of 1°C min2.3. Structure determination
The protein was concentrated to 10 mg ml−1 in storage buffer prior to crystallization. Screening for crystallization conditions was performed using the sitting-drop method (drop size of 200 + 200 nl). The initial screening was performed using in-house and commercially available screens at both room temperature and 4°C. From this, two initial hits were obtained (at both temperatures) with 0.1 M sodium acetate pH 5, 17.6% PEG 6000 or with a combination of 0.07 M sodium acetate pH 5, 0.05 M calcium acetate and 12% PEG 3350. The optimized crystallization condition was 0.1 M sodium acetate pH 5, 0.1 M calcium acetate, 10% PEG 3350, 3% dextran. Crystals were obtained at room temperature after 1–3 days of incubation and were flash-cooled in liquid nitrogen. 20% ethylene glycol was used as a cryoprotectant. Crystallographic data were collected at the BESSY II photon source, Helmholtz-Zentrum Berlin, Germany. The images were integrated using XDSapp (Sparta et al., 2016). The structure was solved by using the AnaAGE structure (PDB entry 2gz6) as a search model (Lee, Wu et al., 2007). was performed using PHENIX and the CCP4 program REFMAC5 (Adams et al., 2010; Murshudov et al., 2011; Winn et al., 2011). The waters were placed by Coot v.0.7.2 (Emsley et al., 2010). The crystallographic data and model statistics are summarized in Table 1.
|
2.4. Analysis of interface areas
The surfaces of nAGE10, AnaAGE and pAGE were analyzed using the PDBePISA server in order to obtain parameters pertaining to the structural and chemical properties of possible assemblies (Krissinel & Henrick, 2007).
2.5. Sequence comparison
The amino-acid sequence of nAGE10 and the previously characterized AGEs were aligned using the MUSCLE multiple sequence alignment tool with default settings (Edgar, 2004a,b). Secondary-structure data for nAGE10 and pAGE were obtained from their respective PDB files. The graphical output was created using the TeXshade package for LaTeX (Beitz, 2000).
2.6. Enzyme-activity measurement
The activity of nAGE10 was assessed by coupling the Aliivibrio salmonicida (AsNAL; EC 4.1.3.3; PDB entry 5afd; M. K. Gurung, B. Altermark, I. L. U. Rader, R. Helland & A. O. Smalas, unpublished work), resulting in the production of Neu5Ac. Its presence was detected using the thiobarbituric acid (TBA) assay (Warren, 1959). Samples consisting of 124 mM HEPES pH 8.0, either 15 mM GlcNAc or 15 mM ManNAc, 10 mM ATP, 15 mM pyruvate, 7 µg NAL and 10 µg AGE10 were incubated at room temperature for 1 h. The reactions were terminated by adding 137 µl sodium periodate (2.5 mg ml−1 sodium periodate in 57 mM H2SO4) followed by incubation at 37°C for 15 min with shaking. Arsenite solution (50 µl, 25 mg ml−1 in 0.5 M HCl) was added, resulting in a brown colour. The tubes were shaken until the brown colour disappeared and 100 µl TBA solution (71 mg ml−1 TBA pH 9.0) was then added. The samples were placed in boiling water for 7.5 min and then on ice for 5 min; they were then brought to room temperature for 5 min. Acidic n-butanol (5% HCl, 1 ml) was added and the samples were shaken for 10 min at room temperature. The tubes were spun down at 16 060g for 7 min and the absorbance of the upper layer was measured at 549 nm on a SpectraMax (Molecular Devices). A molar extinction coefficient of 57 000 M −1 cm−1 was used for concentration calculations (Warren, 1959).
to the condensation reaction catalyzed by the NAL from2.7. Graphical output generation
Molecular representations were generated using PyMOL (v.1.8; Schrödinger) and sequence alignments were rendered with the TeXshade package for LaTeX (Beitz, 2000). Figure editing was performed in Inkscape (https://inkscape.org/).
3. Results and discussion
3.1. Structure of nAGE10
nAGE10 was crystallized using the hanging-drop vapour-diffusion method. Just as for pAGE (PDB entry 1fp3) and AnaAGE (PDB entry 2gz6), nAGE10 crystallized at a pH below 6. Attempts to either crystallize it at a higher pH or to increase the pH of the existing crystals were unsuccessful. Co-crystallization trials with GlcNAc, ManNAc or ATP, as well as a combination of the latter with hexosamine, did not result in crystallized complexes, although crystals of the apoenzyme were obtained from these experiments. Soaking did not affect the crystals. These results are similar to those described for previously published AGE structures, indicating that this behaviour may be inherent to AGEs rather than to the crystallization method. Temperature did not seem to play a significant role in the crystallization process, as similar crystals were obtained at both room temperature and 4°C.
The nAGE10 crystal diffracted to 1.70 Å resolution, and the subsequent model, which was determined by 2gz6; 90% sequence identity), was refined to an R factor of 0.179 (Rfree = 0.209). The r.m.s.d. between the two structures is 0.8 (Holm & Laakso, 2016). The data are summarized in Table 1. In contrast to the pAGE and AnaAGE crystals, which both belonged to P212121, the nAGE10 crystal belonged to P42212.
using the AnaAGE structure (PDB entryThe a). The structure of the monomer was resolved for the amino-acid sequence from Tyr3 to Leu391, with residues 157–164 missing. The fold is that of an (α/α)6 barrel, with the helices connected by small loops at one end (the back) and long loops at the other end (the front). The nAGE10 barrel is composed of two concentric rings of six helices each, in which the sequence orientation is opposite. Helices H1, H3, H5, H7, H9 and H11 form the outer ring of the barrel, with the even-numbered helices as the inner ring (Fig. 1b). The nAGE10 barrel is about 50 Å in diameter and 30 Å in length.
contained one nAGE10 monomer, a chloride ion and three ethylene glycol molecules, as well as 318 waters (Fig. 13.2. Dimer assembly
Fig. 1(c) shows nAGE10 as a dimer, which was generated by twofold symmetry using the operation y − 1, x + 1, −z. The calculated interaction interface represents 9.1% of the accessible surface area, but this value is most likely to be underestimated owing to the disordered loop between residues 156 and 165 (indicated by red arrows in Fig. 1c). The interface is composed of 41 residues for each monomer, 35 of which participate in extensive interactions (Table 2). 11 hydrogen bonds involve the residue pairs Asp48/Arg234, Asp50/Arg167, Pro108/Thr166 (which only occurs once), Gln295/Lys367, Leu297/Gln301 and Trp299/Leu360. In addition, 53 pairs of residues form van der Waals (vdW) interactions. The proposed interactions involving residues Thr166 and Arg167 are probably influenced by the disorder of the aforementioned loop.
‡Monomers. In the cases of pAGE and AnaAGE, the chain is specified. |
The nAGE10 dimer, which involves the front faces of each monomer, differs from the back-to-back association presented for AnaAGE and pAGE (Itoh et al., 2000; Lee, Wu et al., 2007). A front-to-front association for AnaAGE (symmetry operator x − 1, y, z) is mentioned by Lee et al. (2007) and is attributed to crystal packing. However, the nAGE10 dimer can be superimposed onto that of AnaAGE assuming that the front-to-front packing in PDB entry 2gz6 is the biological dimer (instead of the back-to-back packing). pAGE, which has the same crystal packing as AnaAGE, also has a front-to-front dimer (Fig. 2). A comparison of the of nAGE10 with those of pAGE and AnaAGE reveals not only that the back-to-back assembly does not exist in the nAGE10 crystal, but also that either the front-to-front or the back-to-back assemblies can be used as the for the pAGE and AnaAGE structures, as presented in Fig. 2.
A comparison of the assembly parameters calculated by the PISA method indicates that the front-to-front organization is more favourable than the back-to-back organization in terms of interface area, number of interactions and solvation energy (Krissinel & Henrick, 2007). For AnaAGE, the interface area is only 346.8 Å2 for the back-to-back assembly, while it is 1386.3 Å2 for the front-to-front complex (again, the missing residues may be part of the dimer interface, so the interface area for the front-to-front assembly is probably larger). The two interfaces of pAGE have similar areas (1227.2 Å2 for the front and 1029 Å2 for the back), but the solvation energy is much more favourable for the front-to-front assembly at −22.3 kcal mol−1, compared with −1.2 kcal mol−1 for the back-to-back assembly. In terms of interactions, Itoh et al. (2000) reported that nine hydrogen bonds and 23 vdW contacts (<4.5 Å) occur between monomers in the back-to-back assembly. Using the same criteria, 13 hydrogen bonds and 52 vdW contacts can be found for the front-to-front dimer (Table 2). In the case of AnaAGE the front-to-front assembly is mentioned (as crystal packing), but only the interactions involving eight residues are described. Using the same criteria as the authors for identifying interactions and considering only hydrogen bonds leads to the involvement of 14 residues (Table 2). For example, the residues involved in the polar interactions that take place between Asp49 and Arg166, between Pro107 and Thr165 and between Trp298 and Leu359 below 3.5 Å are described, but not those between Asp47 and Arg233, between Leu296 and Gln300 and between Arg355 and Glu357 that also occur within the same distance (data not shown).
A look at the distribution of the interface residues, which is presented in Fig. 3, shows that the back interface areas of AnaAGE and pAGE are quite different (bottom surfaces in Figs. 3a and 3c). This is explained by the fact that the residues involved in this interface are not conserved between AnaAGE and pAGE (Fig. 3e). Indeed, the main interaction patch of pAGE, which involves the loops between H4 and H5 (residues 135–140) and between H6 and H7 (residues 194–198), is missing in AnaAGE. The shape of these loops is thus different and the monomers are further apart at this location in AnaAGE than they are in pAGE (data not shown). On the contrary, the residues involved in the front interface area are mostly conserved, with the exception of the loop between H5 and H6 (residues 170–173; Fig. 3d). This loop is also only resolved for pAGE, and it is likely that the interface parameters for the cyanobacterial AGEs, in particular the solvation energy, may be lower than calculated.
In addition to this, it was shown that cysteines are involved in the dimerization of pAGE (Takahashi et al., 1988). The front interface contains three cysteines (residues 41, 239 and 302 of pAGE), which is the exact number of cysteines that were alkylated by the treatment with N-ethylmaleimide performed in that study. Those cysteines lose at least 55% of their solvent-accessible area upon dimer formation, with Cys239 and Cys302 involved in hydrogen bonds to the neighbouring monomer (Table 2). Their positions are identical to those in human AGE when mapping the hAGE sequence onto the pAGE structure (data not shown); mutation studies revealed the importance of Cys41 for the stability of hAGE, while Cys239 and Cys302 do not seem to play critical roles (Takahashi, Takahashi et al., 2001; Takahashi, Takahashi, Kaneko, Ogasawara, Shindo, Saito et al., 1999).
3.3. Dimer organization and ATP-binding site
One of the principal consequences of the front-to-front dimer is for the ATP-binding site. To date, two hypotheses have been put forward regarding the location of this site. One involves the H5/H6 loop and the other involves a glycine-rich fragment (residues 363–369 of AnaAGE) in the C-terminus (Lee, Chien et al., 2007; Liao et al., 2012; Sola-Carvajal et al., 2012; Takahashi et al., 2002, 2005). While the latter, which is based on sequence similarity to the motif of NTPases, is conserved across species, the evidence supporting the nonconserved H5/H6 loop is built on experiments using chimeric constructs, mutagenesis and ATP footprinting. A nonconserved binding region would also better explain the differences in nucleotide affinities that are observed across both species and nucleotide type (Lee, Chien et al., 2007; Sola-Carvajal et al., 2012; Takahashi, Hori et al., 2001). This would mean that the H5/H6 loop is involved in both dimer interaction and ATP binding. However, the second putative site corresponds to part of the H11/H12 loop, which is also on the front face of the monomer in a position opposite to that of the H5/H6 loop. The loop points towards the centre of the barrel, and if the H5/H6 loop does the same, which is the case for the corresponding loop in pAGE, they might be in proximity to each other. This opens the possibility of both proposed sites being involved. Independently of which site is used, the localization of the ATP-binding site at the dimer interface is the determinant of its role in AGE activity, which may involve the oligomeric state of the enzyme. The inability to obtain crystals of the AGE–ATP complex either by co-crystallization or soaking could indicate the dissociation of the dimeric form upon ATP binding, as has been seen in different complexes (Nayar & Bhattacharyya, 1997; Ahmed et al., 2015; Du et al., 2014).
3.4. Active site and comparison with the AGE superfamily
The active site of nAGE10, which is presented in Fig. 4(a), contains an ethylene glycol molecule and a buried chloride ion. No ligand could be identified, although the enzyme was crystallized in the presence of ManNAc. The ethylene glycol interacts with the side chains of His373 and Arg58, as well as the main chain of Trp368 and a water molecule. The chloride ion is coordinated by the amine groups of His373 and the Gly370 main chain, which are within its first coordination sphere (3.4 Å), along with a water molecule (Carugo, 2014). However, the geometry of this three-atom coordination seems to be incorrect, as the chloride is completely outside the plane formed by these atoms. By investigating nearby atoms, the best geometry was obtained for a five-atom coordination, with the ethylene glycol (3.7 Å) and Arg58 (3.5 Å) as the two remaining partners (Fig. 4b). Other potential partners are the amine group of Phe372, the Gly370 carbonyl and the δ position of His373. The latter would mean that His373 is flipped, which has consequences for the interactions taking place within the active site.
The structures of pAGE and AnaAGE contain only waters within their active sites, and the active-site residues are conserved compared with those of nAGE10 (not shown). However, it is worth noting that for each of them there is a water molecule at the position where the chloride ion is found in the nAGE10 structure (water 389 in AnaAGE and water 521 in pAGE). The closest neighbour to both molecules is at 3.1 Å, and their B factors are below 3 Å2, while those of the neighbouring water molecules range from 11.9 to 29.3 Å2.
While none of the AGEs could be co-crystallized with ligands, several protein–ligand complex structures are available for other members of the AGE superfamily. A superimposition of the active sites of the cellobiose epimerase from Rhodothermus marinus (RmCE; PDB entry 3wki) and the isomerase YihS from Salmonella enterica (SeYihS; PDB entry 2zbl) is presented in Fig. 5 (Fujiwara et al., 2014; Itoh et al., 2008). It shows that the ethylene glycol molecule in the active site of nAGE10 is in proximity to the O5, C5, C6 and O6 positions of mannose, as well as the corresponding positions of cellobiitol. This placement is consistent with the suggested role of Arg63 of the epimerase PhGn2E as the agent that is responsible for the protonation and deprotonation of ManNAc (Wang et al., 2016). Indeed, the corresponding position in AGE10 (Ar68) is only 2.9 Å from the oxygen of ethylene glycol that mimicks O5 of mannose (Fig. 5b). The other catalytic residue, Glu314 in PhGn2E (Glu309 in nAGE10), is surprisingly not in the vicinity of the ligands, with those in SeYihS (Gln320) and RmCE (Gln326) being more than 7 Å away from the C2 atom of their respective `mannoses' (data not shown). Glu251 in SeYihS (Glu243 in nAGE10) is much closer: it is only 4.4 Å from the C2 atom.
Another interesting feature is the stacking interaction between Trp385 of RmCE and the glucose ring of cellobiitol (the same interaction is present with epilactose and glucosylmannose in the other RmCE structures; PDB entries 3wkg and 3wkh). This residue, which corresponds to Trp368 in nAGE10, is strictly conserved in AGEs but its function has not yet been investigated. Considering the position of the glucose relative to the mannose ring, it may be involved in the substrate-specificity differences observed by Wang et al. (2016) when using derivatives of glucosamine substituted at C4 (just like the glucose ring of cellobiitol is linked to C4 of the open mannose ring).
A structural comparison of the active sites within the AGE superfamily also reveals a putative binding site for the N-acetyl group of GlcNAc/ManNAc in AGEs. For RmCE (and SeYihS), the C2 position of the `mannose' and the OH that it carries interact with the side chains of Tyr124 (Tyr111), His200 (His176) and Asn196 (Asn172). Those positions are not conserved in nAGE10, where they are replaced by Phe116, Ile177 and Ala173, respectively (Fig. 5c). This leaves a cavity that would be large enough to accommodate an N-acetyl group and would also explain why PhGn2E is not active on glucosamine, as it would not be retained in the active site. All three residues are either almost or strictly conserved in AGEs.
RmCE also possesses a chloride ion within its active site, at the same location as that in nAGE10, and the coordination by ethylene glycol seen in nAGE10 may not reflect what takes place in the presence of the substrate. The structure of another CE from Ruminococcus albus (RaCE; PDB entry 3vw5; Fujiwara et al., 2013) contains a water (water 451) at the `chloride location', with a similar B factor to those of the neighbouring waters. Investigating the residues corresponding to coordinating residues in RmCE reveals that Gly387 is replaced by a cysteine (Cys371) in RaCE. In yihS-encoded proteins the spatial organization of waters differs from those in CEs and AGEs, despite their shared fold (PDB entries 2zbl, 2afa and 2rgk; Itoh et al., 2008; SGX Research Center for Structural Genomics, unpublished work). The coordinating glycine is replaced in this case by an aspartate (Asp380 in SeYihS) that is conserved within the structures. This strengthens the hypothesis of a conserved chloride within the AGE active site and opens the possibility of studying its role by mutating the glycine position of AGEs.
3.5. Sequence similarity to characterized AGEs
The amino-acid sequence of nAGE10 was compared with those of other characterized AGEs from cyanobacteria, as well as those from human, pig, rat, B. ovatus and P. heparinus (Klermund et al., 2013; Lee, Chien et al., 2007; Maru et al., 1996; Sola-Carvajal et al., 2012; Tabata et al., 2002; Takahashi, Takahashi, Kaneko, Ogasawara, Shindo, Saito & Kobayashi, 1999). Their sequences are presented in Fig. 6 as a multiple sequence alignment, which shows that the nAGE10 sequence is closest to that of N. punctiforme PCC 73102, with 96.14% sequence identity (Edgar, 2004a,b). AGEs are quite conserved amongst related species, with greater than 80% identity within mammalian sequences and greater than 90% in the Nostocaceae family (Anabaena and Nostoc genera). For the latter, sequences from Anabaena are grouped together along with that of Nostoc sp. PCC 7120, which is also known as Anabaena sp. PCC 7120 (Kaneko et al., 2001). This is consistent with the currently accepted phylogenic distribution of Nostoc and Anabaena strains (Svenning et al., 2005). The sequences for P. heparinus, B. ovatus and Synechocystis sp. stand out, which is expected considering that they do not belong to either of the two aforementioned groups.
3.6. Expression and purification
The AGE from Nostoc sp. KVJ10 was successfully expressed as a soluble, His-tagged protein in E. coli BL21 Star (DE3) cells. It could be purified in one step by giving a yield of purified protein of up to 40 mg per litre of culture when grown in LB medium or 87 mg per litre of culture when using TB medium (data not shown). Gel filtration showed that nAGE10 was present as a dimer, and a melting temperature of 72.3°C was determined by DSC. The SDS–PAGE gel and the chromatograms from the gel-filtration and DSC experiments are presented in the supporting information. Together with BoAGE and SynAGE, nAGE10 is the third report of a high-yielding AGE that does not form inclusion bodies when expressed without chaperones (Datta, 1970; Klermund et al., 2013, 2015; Lee, Chien et al., 2007; Maru et al., 1996; Sola-Carvajal et al., 2012; Tabata et al., 2002).
3.7. Verification of nAGE10 activity
In order to verify that the purified nAGE10 was active, a one-pot, coupled reaction with AsNAL was performed and the production of Neu5Ac was detected by the TBA assay (Fig. 7). The reaction was performed at 25°C, according to the optimal working conditions for the NAL. The yield from the condensation reaction using ManNAc as a substrate (`No AGE'; grey bar) was defined as 100% for the purposes of comparison with the coupled reaction. The results show that the synthesis of Neu5Ac from GlcNAc is possible and that 40% of the maximum yield could be achieved under the assay conditions. It is worth noting that different conditions can give a much higher yield (data not shown). For the condensation reaction, the presence of active AGE (i.e. with ATP) reduces the yield starting from ManNAc by 28%, showing that nAGE10 is active in both directions. AGE will initially speed up the formation of GlcNAc; however, when the NAL uses more of the ManNAc then the production of ManNAc by the AGE will increase. The incubation times used will determine how severe this effect is. In the absence of ATP, only a fraction of Neu5Ac could be synthetized from GlcNAc, while the condensation reaction was not affected.
4. Conclusion
The yihS-encoded isomerases, suggests that the chloride ion may be a conserved element of the active site and that ethylene glycol mimics the way that substrates of AGE may bind. In addition to this, it was possible to formulate a hypothesis for the binding of the N-acetyl group of hexosamines. nAGE10 can be used for the one-pot synthesis of sialic acid at 25°C when coupled with the NAL from the psychrophilic bacterium A. salmonicida. The insight gained from this structure offers new residues to consider for mutation studies. Together, these observations open new paths of investigation with new residues to consider, which may lead to a better understanding of AGEs.
of nAGE10 presents a different dimer organization to that previously published for AGE structures, while the monomeric unit is very similar. The dimerization interface involves the front faces of each AGE monomer, and analysis revealed that this assembly is the most probable dimer association for AGEs. The front-to-front dimer organization leaves open previous hypotheses regarding the location of the ATP-binding site and raises the question of whether ATP binding affects the dimer interactions at the interface. This opens new perspectives to analyze and understand the role of ATP in the regulation of AGE activity. Another difference from the previously published AGE structures was the presence of an ethylene glycol molecule and a chloride ion within the active site of nAGE10. Comparison with the active sites of other members of the AGE superfamily, cellobiose 2-epimerases andSupporting information
PDB reference: N-acetylglucosamine 2-epimerase, 6f04
Supplementary Figure S1. DOI: https://doi.org/10.1107/S2059798318017047/jc5019sup1.pdf
Acknowledgements
The authors would like to thank Seila Pandur for the cloning of the nAGE10 construct into the cloning vector, Adele Williamson for help with the DSC experiments, and Man Kumari Gurung and Tor Olav Berg for their help with the TBA assay.
Funding information
This work was funded by the Norwegian Research Council as part of the NorzymeD project (project ID 221568) and the University of Tromsø – The Arctic University of Norway.
References
Adams, P. D., Afonine, P. V., Bunkóczi, G., Chen, V. B., Davis, I. W., Echols, N., Headd, J. J., Hung, L.-W., Kapral, G. J., Grosse-Kunstleve, R. W., McCoy, A. J., Moriarty, N. W., Oeffner, R., Read, R. J., Richardson, D. C., Richardson, J. S., Terwilliger, T. C. & Zwart, P. H. (2010). Acta Cryst. D66, 213–221. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ahmed, Z., Timsah, Z., Suen, K. M., Cook, N. P., Lee, G. R., Lin, C.-C., Gagea, M., Marti, A. A. & Ladbury, J. E. (2015). Nature Commun. 6, 7354. CrossRef Google Scholar
Bateman, A., Coin, L., Durbin, R., Finn, R. D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E. L., Studholme, D. J., Yeats, C. & Eddy, S. R. (2004). Nucleic Acids Res. 32, D138–D141. Web of Science CrossRef PubMed CAS Google Scholar
Beitz, E. (2000). Bioinformatics, 16, 135–139. Web of Science CrossRef PubMed CAS Google Scholar
Carugo, O. (2014). BMC Struct. Biol. 14, 19. Google Scholar
Datta, A. (1970). Biochemistry, 9, 3363–3370. CrossRef CAS Google Scholar
Denger, K., Weiss, M., Felux, A. K., Schneider, A., Mayer, C., Spiteller, D., Huhn, T., Cook, A. M. & Schleheck, D. (2014). Nature (London), 507, 114–117. CrossRef CAS Google Scholar
Du, F., Zhang, M., Li, X., Yang, C., Meng, H., Wang, D., Chang, S., Xu, Y., Price, B. & Sun, Y. (2014). Biochem. Biophys. Res. Commun. 452, 1034–1039. CrossRef CAS Google Scholar
Edgar, R. C. (2004a). BMC Bioinformatics, 5, 113. Google Scholar
Edgar, R. C. (2004b). Nucleic Acids Res. 32, 1792–1797. CrossRef CAS Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Finn, R. D., Bateman, A., Clements, J., Coggill, P., Eberhardt, R. Y., Eddy, S. R., Heger, A., Hetherington, K., Holm, L., Mistry, J., Sonnhammer, E. L., Tate, J. & Punta, M. (2014). Nucleic Acids Res. 42, D222–D230. CrossRef CAS PubMed Google Scholar
Fujiwara, T., Saburi, W., Inoue, S., Mori, H., Matsui, H., Tanaka, I. & Yao, M. (2013). FEBS Lett. 587, 840–846. CrossRef CAS Google Scholar
Fujiwara, T., Saburi, W., Matsui, H., Mori, H. & Yao, M. (2014). J. Biol. Chem. 289, 3405–3415. CrossRef CAS Google Scholar
Ghosh, S. & Roseman, S. (1965). J. Biol. Chem. 240, 1531–1536. CAS Google Scholar
Holm, L. & Laakso, L. M. (2016). Nucleic Acids Res. 44, W351–W355. Web of Science CrossRef CAS PubMed Google Scholar
Hu, S., Chen, J., Yang, Z., Shao, L., Bai, H., Luo, J., Jiang, W. & Yang, Y. (2010). Appl. Microbiol. Biotechnol. 85, 1383–1391. CrossRef CAS Google Scholar
Itoh, T., Mikami, B., Hashimoto, W. & Murata, K. (2008). J. Mol. Biol. 377, 1443–1459. CrossRef CAS Google Scholar
Itoh, T., Mikami, B., Maru, I., Ohta, Y., Hashimoto, W. & Murata, K. (2000). J. Mol. Biol. 303, 733–744. CrossRef CAS Google Scholar
Kaneko, T., Nakamura, Y., Wolk, C. P., Kuritz, T., Sasamoto, S., Watanabe, A., Iriguchi, M., Ishikawa, A., Kawashima, K., Kimura, T., Kishida, Y., Kohara, M., Matsumoto, M., Matsuno, A., Muraki, A., Nakazaki, N., Shimpo, S., Sugimoto, M., Takazawa, M., Yamada, M., Yasuda, M. & Tabata, S. (2001). DNA Res. 8, 205–213. CrossRef CAS Google Scholar
Klermund, L., Groher, A. & Castiglione, K. (2013). J. Biotechnol. 168, 256–263. CrossRef CAS Google Scholar
Klermund, L., Riederer, A., Groher, A. & Castiglione, K. (2015). Protein Expr. Purif. 111, 36–41. CrossRef CAS Google Scholar
Kragl, U., Gygax, D., Ghisalba, O. & Wandrey, C. (1991). Angew. Chem. Int. Ed. Engl. 30, 827–828. CrossRef Google Scholar
Krissinel, E. & Henrick, K. (2007). J. Mol. Biol. 372, 774–797. Web of Science CrossRef PubMed CAS Google Scholar
Lee, J.-O., Yi, J.-K., Lee, S.-G., Takahashi, S. & Kim, B.-G. (2004). Enzyme Microb. Technol. 35, 121–125. CrossRef CAS Google Scholar
Lee, Y.-C., Chien, H.-C. R. & Hsu, W.-H. (2007). J. Biotechnol. 129, 453–460. CrossRef CAS Google Scholar
Lee, Y.-C., Wu, H.-M., Chang, Y.-N., Wang, W.-C. & Hsu, W.-H. (2007). J. Mol. Biol. 367, 895–908. Web of Science CrossRef PubMed CAS Google Scholar
Liaimer, A., Jensen, J. B. & Dittmann, E. (2016). Front. Microbiol. 7, 1693. CrossRef Google Scholar
Liao, H.-F., Kao, C.-H., Lin, W.-D., Hsiao, N.-W., Hsu, W.-H. & Lee, Y.-C. (2012). Process Biochem. 47, 948–952. CrossRef CAS Google Scholar
Luchansky, S. J., Yarema, K. J., Takahashi, S. & Bertozzi, C. R. (2003). J. Biol. Chem. 278, 8035–8042. CrossRef CAS Google Scholar
Maru, I., Ohta, Y., Murata, K. & Tsukada, Y. (1996). J. Biol. Chem. 271, 16294–16299. CrossRef CAS Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Nayar, S. & Bhattacharyya, D. (1997). FEBS Lett. 409, 449–451. CrossRef CAS Google Scholar
Sola-Carvajal, A., Sánchez-Carrón, G., García-García, M. I., García-Carmona, F. & Sánchez-Ferrer, Á. (2012). Biochimie, 94, 222–230. CAS Google Scholar
Sparta, K. M., Krug, M., Heinemann, U., Mueller, U. & Weiss, M. S. (2016). J. Appl. Cryst. 49, 1085–1092. Web of Science CrossRef CAS IUCr Journals Google Scholar
Svenning, M. M., Eriksson, T. & Rasmussen, U. (2005). Arch. Microbiol. 183, 19–26. CrossRef CAS Google Scholar
Tabata, K., Koizumi, S., Endo, T. & Ozaki, A. (2002). Enzyme Microb. Technol. 30, 327–333. CrossRef CAS Google Scholar
Takahashi, S., Hori, K., Takahashi, K., Ogasawara, H., Tomatsu, M. & Saito, K. (2001). J. Biochem. 130, 815–821. CrossRef CAS Google Scholar
Takahashi, S., Miura, R. & Miyake, Y. (1988). Biochem. Int. 16, 1053–1060. CAS Google Scholar
Takahashi, S., Ogasawara, H., Hiwatashi, K., Hata, K., Hori, K., Koizumi, Y. & Sugiyama, T. (2005). Biomed. Res. 26, 117–121. CrossRef CAS Google Scholar
Takahashi, S., Ogasawara, H., Takahashi, K., Hori, K., Saito, K. & Mori, K. (2002). J. Biochem. 131, 605–610. CrossRef CAS Google Scholar
Takahashi, S., Takahashi, K., Kaneko, T., Ogasawara, H., Shindo, S. & Kobayashi, M. (1999). J. Biochem. 125, 348–353. CrossRef CAS Google Scholar
Takahashi, S., Takahashi, K., Kaneko, T., Ogasawara, H., Shindo, S., Saito, K. & Kawamura, Y. (2001). J. Biochem. 129, 529–535. CrossRef CAS Google Scholar
Takahashi, S., Takahashi, K., Kaneko, T., Ogasawara, H., Shindo, S., Saito, K. & Kobayashi, M. (1999). J. Biochem. 126, 639–642. CrossRef CAS Google Scholar
Tyler, T. R. & Leatherwood, J. M. (1967). Arch. Biochem. Biophys. 119, 363–367. CrossRef CAS Google Scholar
Wang, S.-Y., Laborda, P., Lu, A.-M., Duan, X.-C., Ma, H.-Y., Liu, L. & Voglmeir, J. (2016). Catalysts, 6, 212. CrossRef Google Scholar
Wang, T.-H., Chen, Y.-Y., Pan, H.-H., Wang, F.-P., Cheng, C.-H. & Lee, W.-C. (2009). BMC Biotechnol. 9, 63. Google Scholar
Warren, L. (1959). J. Biol. Chem. 234, 1971–1975. CAS Google Scholar
Winn, M. D., Ballard, C. C., Cowtan, K. D., Dodson, E. J., Emsley, P., Evans, P. R., Keegan, R. M., Krissinel, E. B., Leslie, A. G. W., McCoy, A., McNicholas, S. J., Murshudov, G. N., Pannu, N. S., Potterton, E. A., Powell, H. R., Read, R. J., Vagin, A. & Wilson, K. S. (2011). Acta Cryst. D67, 235–242. Web of Science CrossRef CAS IUCr Journals Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.