research papers
The structure of Phocaeicola vulgatus sialic acid acetylesterase
aDepartment of Chemistry, University of York, Heslington, York YO10 5DD, United Kingdom, and bDepartment of Bioorganic Synthesis, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands
*Correspondence e-mail: z.w.b.armstrong@lic.leidenuniv.nl
Sialic acids terminate many N- and O-glycans and are widely distributed on cell surfaces. There are a diverse range of enzymes which interact with these sugars throughout the tree of life. They can act as receptors for influenza and specific betacoronaviruses in viral binding and their cleavage is important in virion release. Sialic acids are also exploited by both commensal and pathogenic bacteria for nutrient acquisition. A common modification of sialic acid is 9-O-acetylation, which can limit the action of sialidases. Some bacteria, including human endosymbionts, employ esterases to overcome this modification. However, few bacterial sialic acid 9-O-acetylesterases (9-O-SAEs) have been structurally characterized. Here, the of a 9-O-SAE from Phocaeicola vulgatus (PvSAE) is reported. The structure of PvSAE was determined to resolutions of 1.44 and 2.06 Å using crystals from two different crystallization conditions. Structural characterization revealed PvSAE to be a dimer with an SGNH fold, named after the conserved sequence motif of this family, and a Ser–His–Asp catalytic triad. These structures also reveal flexibility in the most N-terminal α-helix, which provides a barrier to active-site accessibility. Biochemical assays also show that PvSAE deacetylates both mucin and the acetylated chromophore para-nitrophenyl acetate. This structural and biochemical characterization of PvSAE furthers the understanding of 9-O-SAEs and may aid in the discovery of small molecules targeting this class of enzyme.
Keywords: acetylesterases; SGNH fold; serine hydrolases; sialic acid acetylesterases; gut bacteria; carbohydrate metabolism; Phocaeicola vulgatus.
PDB references: Phocaeicola vulgatus sialic acid acetylesterase, 7pgh; 7pzg
1. Introduction
Sialic acids are abundant et al., 2020). As a result, they play key roles in a range of functions within humans, bacteria and viruses (Tortorici et al., 2019). Sialic acids are also highly diverse, and are frequently 9-O-acetylated. In humans, 9-O-acetylation is regulated by CASD1 and sialic acid acetylesterase (SIAE), where CASD1 acetylates and (SIAE) deacetylates sialic acid (Baumann et al., 2015; Orizio et al., 2015; Fig. 1a). A key target molecule for SIAE is the 9-O-acylated GD3 (Neu5,9Ac2-α2,8-Neu5Ac-α2,3-Gal-β1,4-Glc-β1-ceramide); this ganglioside has an impact on a variety of biological functions. For example, elevated levels of 9-O-acetylsialic acid have been linked to apoptotic resistance by tumour cells, such as in medulloblastoma (a form of brain cancer; Mather et al., 2019).
that terminate and on cell surfaces (BarnardViruses such as influenza C and specific betacoronaviruses (human OC43 and HKU1 coronaviruses) use 9-O-acetylated sialylglycans as receptors. To promote viral release after infection, these viruses use 9-O-acetylesterases (9-O-SAEs) as receptor-destroying enzymes (RDEs). The RDE for influenza C is the hemagglutinin esterase fusion protein (HEF) and that for betacoronaviruses is hemagglutinin esterase (HE). These enzymes also bind to 9-O-acetylsialic acid during viral entry. To date, HEFs and HEs from four betacoronaviruses have been structurally characterized and their 9-O-acetylesterase domains have SGNH hydrolase-like folds (Rosenthal et al., 1998; Hurdiss et al., 2020; Zeng et al., 2008; Langereis et al., 2012).
In bacteria, esterases play an important role in glycan foraging and catabolism of sialic acid in the gut (Robinson et al., 2017). There is a high level of sialic acid diacetylation in humans, such as in the intestine distal column and rectum (Robinson et al., 2017). However, sialidases encoded by bacteria cannot access diacetylated sialic acid. Therefore, bacteria employ esterases to make acetylated sialic acid more accessible for cleavage by sialidases. Carbohydrate-active esterases have been classified into several different families based on sequence homology (Drula et al., 2022; Lenfant et al., 2013) and have domain architectures such as α/β-hydrolase, (β/α)7-barrel and SNGH folds. As with HE and HEF, the structurally characterized 9-O-acetylsialic acid esterase NanS, encoded by enterohemorrhagic Escherichia coli O157:H7, also has an SGNH hydrolase fold (Rangarajan et al., 2011).
The SGNH hydrolase superfamily was identified by Upton and Buckley through sequence comparisons of various motifs and was classified from members of the GDSL family (Upton & Buckley, 1995). GDSL esterases are named after the distinct GDSL motif near the N-terminus of the sequence that contains the catalytic (Upton & Buckley, 1995). SGNH have been defined by the structure of rhamnogalacturonan acetylesterase and classified with regard to the presence of Ser-Gly-Asn-His (SGNH) residues within four blocks of conserved sequences (I, II, III and V; Mølgaard et al., 2000). These residues play important roles in the of the enzymes. The nucleophilic serine is found in block I of the sequences. The glycine and asparagine residues, within blocks II and III, respectively, stabilize the oxyanion in the oxyanion hole. The catalytic histidine within block V acts as a base by deprotonating the nucleophilic serine (Mølgaard et al., 2000). Block V also contains an aspartic acid residue forming the third member of the Ser–His–Asp catalytic triad.
Phocaeicola vulgatus (previously known as Bacteroides vulgatus) is a commensal gut bacterium that has a cooperative role in the gut microbiome (Huang et al., 2015). Sialidases encoded by several Bacteriodetes species cleave sialic acids from the intestinal tissue, which in turn has been shown to increase E. coli growth, resulting in `enterobacterial blooms' which are apparent during inflammation (Huang et al., 2015). A close homolog of PvSAE from B. fragilis (EstA; 81% identity) has been shown to have a role in the complex network of interactions involving in the gut. Robinson and coworkers developed further understanding of the role of B. fragilis in this network and highlighted the community scavenging of sialic acid in the gut (Robinson et al., 2017). E. coli growth significantly increased in the presence of B. fragilis esterase with either B. fragilis or B. thetaiotaomicron sialidase, demonstrating how the microorganisms cooperate. Furthermore, PvSAE is found in a genomic locus that contains multiple glycoside including a sialidase (Fig. 1b), that has synteny with loci from several Bacteroidetes including P. massiliensis, B. fragilis and B. thetaiotaomicron (Briliūtė et al., 2019). The syntenic loci from B. thetaiotaomicron has recently been characterized as part of the sialylated N-glycan-degrading machinery of this host-associated bacterium (Briliūtė et al., 2019).
The structure of 9-O-SAEs, including PvSAE, should provide insight into the function of sialic acid esterases and add to our understanding of host-glycan degradation. Furthermore, as this class of enzymes is also preserved in pathogens such as Tanerella forsythia (Albers et al., 2021), investigating the structure of these enzymes should aid in the discovery of therapeutics for both bacterial and viral enzymes. Here, we show through SEC-MALLS that the 9-O-SAE from P. vulgatus (PvSAE) exists as a homodimer in solution. We also show through biochemical assays that PvSAE is a genuine SAE with activity on both chromogenic and natural mucins. The structure of PvSAE has previously been solved and deposited in the PDB (PDB entry 6njc). Here, we have solved the structure of PvSAE in two new crystal forms at 1.44 and 2.06 Å resolution, which reveal a previously unidentified conformational flexibility of this enzyme. We also show through that this enzyme may bind sialic acids.
2. Materials and methods
2.1. Enzyme production and purification
The gene encoding PvSAE (GenBank accession No. ABR41743.1) was predicted to contain a signal peptide with a cleavage site between amino acids 19 and 20 by SignalP5.0 (Almagro Armenteros et al., 2019). A codon- optimized version of this gene with a His6 tag (MGSSHHHHHHGTAENLYFQG) in place of the signal peptide was synthesized and cloned into a pET-28 vector with NcoI and BamHI restriction sites by GenScript (Leiden, Netherlands). The plasmid was transformed into chemically competent E. coli BL21(DE3) Gold cells, which were then plated onto LB agar containing 50 µg ml−1 kanamycin. A single colony was used to inoculate 1 l LBE-5052 autoinduction medium (Studier, 2005) containing 50 µg ml−1 kanamycin, which was incubated overnight with shaking at 300 rev min−1 and 37°C. Expression cultures were harvested by centrifugation (5000g, 30 min, 4°C). The pellets were resuspended in 25 ml resuspension buffer (20 mM HEPES, 200 mM NaCl pH adjusted to 7.4 with NaOH) and stored at −70°C until use.
Frozen pellets were thawed and 1.8 ml buffer B (20 mM HEPES, 500 mM imidazole, 200 mM NaCl, pH adjusted to 7.4 with HCl) was added with lysozyme (4 mg) and DNase I (1 mg). The volume was increased to 50 ml with buffer A (20 mM HEPES, 20 mM imidazole, 200 mM NaCl, 1 mM DTT, pH adjusted to 7.4 with HCl). After warming to room temperature, the cell suspension was vortexed and sonicated on ice for 1 min 40 s (1 s sonication, 2 s rest) in a sonicator (Qsonica) set to 40% amplitude. The lysed cells were centrifuged (18 000g, 30 min, 4°C) and the supernatant was directly loaded onto a 5 ml HisTrap Excel column (GE Healthcare). The bound protein was washed with ten columns of buffer A and then subsequently eluted using a 0–100% linear gradient of buffer B over 20 column volumes. Fractions containing PvSAE were determined by SDS–PAGE and the peak fractions were pooled. Some precipitate was present, so the pooled fractions were centrifuged and filtered (0.22 µm filter). The filtrate was concentrated to a final volume of 2 ml using a Vivaspin 10 kDa concentrator (GE Healthcare) and further purified by gel filtration (HiLoad 16/600 Superdex 75 pg) in buffer C (20 mM HEPES, 200 mM NaCl, 1 mM DTT, pH adjusted to 7.4 with NaOH) at a flow rate of 1 ml min−1. Fractions containing PvSAE were determined using SDS–PAGE and the peak fractions were pooled and concentrated using a Vivaspin 10 kDa concentrator (GE Healthcare). The concentrated protein was washed with buffer C and diluted to 50 mg ml−1. Aliquots (40 µl) were flash-frozen in liquid nitrogen and stored at −70°C until use. Protein concentrations were determined spectrophotometrically using a calculated A280 extinction coefficient of 32 890 M−1 cm−1. This purification yielded 45.6 mg protein per litre of culture. Macromolecule-production information is summarized in Table 1.
|
2.2. SEC-MALLS
Experiments were conducted on a system comprising a Wyatt HELEOS-II multi-angle light-scattering detector and a Wyatt rEX refractive-index detector linked to a Shimadzu HPLC system (SPD-20A UV detector, LC20-AD isocratic pump system, DGU-20A3 degasser and SIL-20A autosampler). Work was conducted at room temperature (20 ± 2°C). The sample-injection volume was at 100 µl at a protein concentration of 2 mg ml−1. The sample was separated on a Superdex 200 Increase 10/300 GL column (GE Healthcare) using 20 mM HEPES, 200 mM NaCl, pH adjusted to 7.4 with NaOH (0.2 µm filtered) as buffer. A further 0.1 µm filter was present in the flow path. Shimadzu LabSolutions software was used to control the HPLC and the ASTRA 7 software was used for the HELEOS-II and rEX detectors. The data were analysed using the ASTRA 7 software. Molecular masses were estimated using the Zimm fit method with degree 1 (Zimm, 1945). A value of 0.182 was used for the protein refractive-index increment (dn/dc).
2.3. Crystallization
Initial crystallization conditions were identified using a range of commercial crystallization screens in 96-well format plates, including PEG/Ion (Hampton Research), JCSG+ (Molecular Dimensions), The AmSO4 Suite (Qiagen), PACT (Molecular Dimensions), Index (Hampton Research) and Morpheus (Molecular Dimensions). The condition 0.2 M sodium malonate/malonic acid pH 5.0, 20%(w/v) polyethylene glycol 3350 with His6-PvSAE at 30 mg ml−1 in a 150 nl:150 nl ratio yielded thin rod-like crystals and this condition was further optimized. The optimized crystals were grown in MRC Maxi 48-well plates using the sitting-drop vapour-diffusion method at 20°C with 0.2 M sodium malonate/malonic acid pH 4.5, 18%(w/v) polyethylene glycol 3350 and His6-PvSAE at 30 mg ml−1 in buffer C with a 400 nl:400 nl protein:well solution ratio; this condition will be referred to as Apo I. Crystals were soaked in well solution consisting of 0.2 M sodium malonate/malonic acid pH 4.5, 18%(w/v) polyethylene glycol 3350, 15%(w/v) ethylene glycol before flash-cooling in liquid nitrogen.
Crystals were also observed in the Morpheus screen and were collected without cryoprotectant from a condition consisting of 50% precipitant mix 4 [25%(w/v) PEG 1000, 25%(w/v) PEG 3350, 25%(v/v) MPD], 0.03 M sodium nitrate, 0.03 M sodium phosphate, 0.03 M ammonium sulfate, 0.1 M MOPS/HEPES–Na pH 7.5 (prepared according to Gorrec, 2009), with His6-PvSAE at 30 mg ml−1 in buffer C. This condition will be referred to as Apo II. The crystals were harvested directly from the crystallization drop before being flash-cooled in liquid nitrogen. Crystallization information is summarized in Table 2.
|
2.4. Data collection, processing, structure solution and refinement
All data sets were collected on the I03 beamline at Diamond Light Source (DLS) and were integrated using the DIALS pipeline in xia2. All other calculations were carried out using the CCP4 suite (Winn et al., 2011). Data reduction was performed using AIMLESS (Evans & Murshudov, 2013) and the data for Apo I and Apo I crystals were processed to resolutions of 1.44 and 2.06 Å, respectively. The structures were solved by expert-mode with Phaser (McCoy et al., 2007) using the coordinates of BVU4141 from P. vulgatus as a search model (PDB entry 6njc; Midwest Center for Structural Genomics, unpublished work). The models were corrected and completed manually with multiple rounds of model building using Coot (Emsley et al., 2010) and refined using REFMAC5 (Murshudov et al., 2011). There was suspected in the structure obtained from the Apo II condition and this was confirmed by an L-test score of 0.42. To take into account, twin was used for the Apo II structure in REFMAC5 once the Rfree fell below 40%.
Water molecules for both models were added using REFMAC5 (Murshudov et al., 2011) and were manually inspected after using Coot (Emsley et al., 2010). The final can be found in Table 3. Data-collection and processing statistics are summarized in Table 3 and are summarized in Table 4.
|
|
2.5. Michaelis–Menten kinetics
Hydrolysis of chromogenic para-nitrophenyl acetate (pNP-Ac) was carried out in a 96-well plate at 25°C. Reactions were carried out in a total volume of 100 µl. Each reaction contained 10 nM PvSAE, 100 mM phosphate–citrate buffer pH 7.0 and 1 mM to 50 µM pNP-Ac in 1% DMSO. Reactions were initiated by the addition of 10 µl 100 nM enzyme to a reaction containing 90 µl water, buffer and pNP-Ac. All kinetic assays were completed in triplicate.
Reactions were measured using a Clariostar microplate reader (BMG Labtech; λex = 405 nm). The absorbance of released pNP was measured continuously over 15 min. The concentration of pNP released was calculated using a of pNP standards (100 µM to 1 µM) present on the same plate. Rates of substrate hydrolysis were determined by a linear fit of initial data points. Plots of the initial rate (vo) versus the substrate concentration were fitted with nonlinear regression using OriginPro 2021 (OriginLab) to the hyperbolic equation vo = kcat[E][S]/(Km + [S]) to determine Km and kcat (Table 5). The error values given for Km and kcat are fitting errors determined in OriginPro when fitting all data. The error values given for the specificity constant (kcat/Km) were propagated from the error determined for Km and kcat using the formula
|
2.6. Mucin activity assays
The hydrolysis of acetylated sialic acids present in mucin was assessed using bovine submaxillary mucin (Millipore, Lot No. 3716979). Purified PvSAE was added to a final concentration of 1 µM or 100 nM to a 100 µl reaction consisting of 6.25 mg ml−1 bovine submaxillary mucin, 20 mM HEPES, 100 mM NaCl pH 7.0 (adjusted with NaOH). The samples were incubated at 37°C for 1 h with shaking at 800 rev min−1. The concentration of acetate in each sample was quantified using a manual acetic acid assay kit (Megazyme, K-ACET) according to the manufacturer's instructions. HEPES buffer was used in place of phosphate buffer for this assay as phosphate interferes with coupled in the acetic acid assay kit. An acetate standard curve (0–800 µM acetate) in the presence of 6.25 mg ml−1 bovine submaxillary mucin was used to calibrate the acetate concentration. All measurements were performed in at least quadruplicate.
2.7. Molecular docking
Molecular docking of 9-O-acetylsialic acid into the of PvSAE was performed with AutoDock Vina (Trott & Olson, 2010). The ligand coordinates were made using AceDRG (Long et al., 2017) and molecular charges were computed with AutoDock (Morris et al., 2009). The ligand was docked into a grid box (30 × 20 × 20 Å) centred on the catalytic A search was performed on each of the active sites for the Apo I and Apo II structures. An exhaustiveness of 30 was used to compute 20 docking positions for each active site. The docked position with the best calculated energy and with the acetyl group positioned in the active site was used for further analysis.
3. Results and discussion
3.1. Purification and biochemical characterization
P. vulgatus 9-O-acetylsialic acid esterase (PvSAE) was produced in E. coli BL21(DE3) Gold cells. The protein contained an N-terminal His6 tag and was purified using immobilized nickel-affinity followed by The protein eluted earlier than expected for a monomer from a calibrated size-exclusion column. To determine whether the protein was a multimer in solution, it was subjected to SEC-MALLS (size-exclusion chromatography–multi-angle laser light scattering). A single peak was observed in the with a calculated molar mass of 51.6 kDa, indicating that PvSAE forms a dimer in solution.
To gain further insight into the action of PvSAE, we investigated the activity of PvSAE on both a synthetic substrate and a natural mucin. We first tested the activity of PvSAE on the activated, chromogenic substrate para-nitrophenyl acetate (pNP-Ac). This substrate has previously been used to detect 9-O-SAE activity for E. coli NanS, B. fragilis EstA and Tannerella forsythia NanS (Rangarajan et al., 2011; Nakayama-Imaohji et al., 2012; Albers et al., 2021). pNP-Ac hydrolysis was monitored at pH 7.0 to limit nonspecific acid- or base-catalyzed hydrolysis. The catalytic constants determined from a Michaelis–Menten plot of initial rates (Fig. 2a), revealed a high specificity (kcat/Km = 930 ± 70 mM−1 s−1) for this non-natural substrate, with rates slightly higher than that determined for the bifunctional NanS from T. forsythia (kcat/Km = 434 mM−1 s−1; Albers et al., 2021).
To examine the activity of PvSAE on a more natural substrate, we assessed its activity on bovine submaxillary mucin (BSM). BSM is purified directly from the submaxillary gland, allowing access to the natural substrate before it has been acted on by commensal microbes. As such, it contains several percent sialic acid by weight and is highly acetylated. We incubated either 100 nM or 1 µM PvSAE with BSM for 1 h at 37°C. At both enzyme concentrations PvSAE showed a significant release of acetate when compared with a buffer-only control (Fig. 2b). This indicates that PvSAE is indeed a genuine sialic acid esterase that is capable of action on natural substrates.
3.2. Protein crystallization and structure solution
Purified PvSAE crystallized in two different forms (Apo I and Apo II; see Table 2 for crystallization conditions). A crystal from the Apo I condition diffracted to a resolution of 1.44 Å and had P1211 symmetry, while a crystal from the Apo II condition diffracted to 2.06 Å resolution and had P61 symmetry. Both structures were solved by molecular replacement using a structure of PvSAE at 1.9 Å resolution deposited in the PDB (PDB entry 6njc; Midwest Center for Structural Genomics, unpublished work). The unit cells of these crystals contained four and eight protomers for the Apo I and Apo II forms, respectively. These protomers were arranged as dimers in both crystal forms, recapitulating the dimeric form observed in solution.
The PvSAE Apo I and Apo II structures displayed a near-complete overlap when aligned [root-mean-square deviation (r.m.s.d.) of 0.18 Å], with the secondary structures in the same conformation. The only major difference between these structures was a small region of the most N-terminal α-helix, which is discussed below. The individual protomers have a fold with a central five-stranded parallel β-sheet surrounded by α-helices (Fig. 3a), which is typical of an SNGH type I domain. This domain architecture is more similar to that of betacoronavirus HEs (Bakkers et al., 2017) than to the domain organization observed for E. coli NanS (Rangarajan et al., 2011; Fig. 3c). Betacoronavirus HEs also contain a central five-stranded parallel β-sheet surrounded by α-helices. However, unlike PvSAE these viral enzymes contain a hemagglutinin domain inserted between the third and fourth β-strands of the β-sheet. The structure of E. coli NanS, in contrast, contains a seven-stranded β-sheet with six parallel β-strands and one antiparallel β-strand (Rangarajan et al., 2011), as is typical of SNGH type II domains. The suspected serine of PvSAE, Ser52, is located at the beginning of the second α-helix, following the first β-strand from the N-terminus, a position that is preserved in both E. coli NanS and betacoronavirus HEs (Fig. 3c). The suspected catalytic histidine and aspartic acid, His202 and Asp199, are located on an Ω loop which is followed by the most C-terminal α-helix.
As stated above, each of the protomers is present in a dimer in both of the crystal structures determined here and in the previously determined structure of PvSAE. These dimers have C2 rotational symmetry, and interactions between the two interfaces were examined using PISA (Krissinel & Henrick, 2007). The dimer interface is mainly composed of hydrophilic residues that form salt bridges and hydrogen bonds (Fig. 3b). A majority of these interactions are formed between residues on the most N-terminal α-helix (residues 27–34) and residues on the Ω loop containing the catalytic histidine and aspartic acid (residues 198–205) (Fig. 3b). This interaction also involves the aspartic acid Asp199 (a constituent of the Asp–His–Ser catalytic triad), which forms a hydrogen bond to Gln31 in the neighbouring protomer. We expect that the dimerization of PvSAE observed here maintains the correct positioning of the catalytic residues. Dimerization results in a loss of 1300 Å2 of solvent-accessible surface for each monomer and leads to the formation of a 15 Å deep cleft. This cleft contains both active sites, which face each other and are separated by 15 Å, determined as the distance between the proposed serine nucleophiles.
The most significant difference between the Apo I and Apo II structures was the positioning of the most N-terminal residues (residues 24–27) of the first α-helix. In the Apo II structure and the previously solved structure of PvSAE (PDB entry 6njc) this α-helix extends above the active-site cleft, restricting the entrance to the active-site cleft to approximately 8 Å (Supplementary Fig. S1). In the Apo I structure residues 24–27 of one of the protomers in the homodimer are no longer part of this α-helix structure, but instead form a loop projecting out of the active site. This conformational change is supported by a hydrogen bond between the hydroxyl of Tyr25 and the carboxylic acid of Asp83 from the neighbouring protomer. The result of this change in conformation is that one active site is more accessible and has an `open' conformation, while the other is still `blocked' by the N-terminal α-helix (Supplementary Fig. S1). Although the observed conformational change may be an artefact of the crystallization conditions, this change indicates that the residues present in the N-terminus have inherent plasticity. We suspect that this conformational change may be important for the accommodation of large, complex that contain acetylated sialic acid.
3.3. Structural comparisons and active site
To investigate proteins with structural homology to PvSAE, a search was performed using the DALI server (Holm, 2020). This further supported PvSAE as a member of the SGNH family, as the search revealed that PvSAE shares low amino-acid sequence identity, but high structural similarity, with members of the SGNH family (Supplementary Fig. S2). Further comparison of the sequences of the top hits showed conservation of Ser52, Gly75, Asn103, His202 and Asp199. It was confirmed that the catalytic triad corresponds to Ser52, His202 and Asp199, with Gly75 and Asn103 responsible for forming the oxyanion hole. The residues within the conserved blocks of sequences common to the SGNH superfamily are also present. However, the block I sequence varied, with a GNS motif (Gly50, Asn51, Ser52) compared with the classic GDS motif (Supplementary Fig. S2). The presence of asparagine in the place of aspartate in PvSAE, however, does not change the positioning of this amino acid when compared with the structure of the top DALI result with a GDS motif (putative platelet-activating factor from Streptococcus pneumoniae; PDB entry 2hsj; Supplementary Fig. S2). The remaining three sequence blocks (II, III and V) were conserved across the DALI hits. These blocks contain the conserved glycine (block II), the conserved asparagine in a GXN motif (block III) and the catalytic serine and histidine in a DXXH motif (block V).
Both the Apo I and Apo II PvSAE structures had an Mg2+ metal ion in close proximity to the active site. This ion is also observed in the previously reported structure of PvSAE (PDB entry 6njc). The electron density surrounding Mg2+, the bond lengths and the octahedral coordination to six electron donors support this annotation. The metal was likely to be incorporated into the structure during the production of the enzyme. Mg2+ interacts with the carbonyl groups of Pro194 and Thr197, the side-chain hydroxyl group of Tyr146 and three waters. The Thr197 residue, in turn, is positioned to form a hydrogen bond to the catalytic aspartate (Asp199) (Fig. 4c).
One of the top DALI hits for the structure of PvSAE was PAF-AH (brain platelet-activating factor acetylhydrolase). A structure of PAF-AH (PDB entry 3dt9) contains the covalent oganophosphorus inhibitor soman bound to the nucleophilic active-site serine (Epstein et al., 2009). Overlay of PvSAE and pAF-AH displayed good overlap between the active-site residues (r.m.s.d. of 0.37 Å for residues within 8 Å of the nucleophilic serine), with the residues in similar conformations (Fig. 4a). This suggests that organophosphorus inhibitors may also be active against PvSAE.
All of the top five DALI hits are homodimeric structures, as seen for PvSAE. As in PvSAE, the dimer interface in these five structures involves the N-terminal α-helix. However, the N-terminal α-helix does not interact with the Ω loop that contains the catalytic histidine in all of these structures (Supplementary Fig. S3). In three of these structures (PDB entries 3p94, 4ppy and 4iyj) the N-terminal α-helix interacts with residues present in the third and fourth α-helices, which in turn interact with the third and fourth α-helices on the neighbouring protomer. This results in more exposed active sites that are not part of the same active-site cleft. The dimeric structure of S. pneumonia putative platelet-activating factor (PDB entry 2hsj) is more similar to PvSAE, with the dimer interface between the N-terminal α-helix and the C-terminal α-helix. However, in this structure the N-terminal α-helix does not involve the Ω loop. The structure of pAF-AH (PDB entry 3dt9) also forms a dimer, with an interface composed of salt bridges formed by hydrophilic residues between the loop containing the catalytic histidine and aspartate and the N-terminal α-helix (Epstein et al., 2009). The angle of this interaction interface, however, differs from that seen for PvSAE, resulting in a deeper active-site cleft than is observed for PvSAE.
The 9-O-SAE NanS from E. coli has a SGNH type II fold with catalytically active residues Ser19 and His301. This enzyme, however, lacks the catalytic aspartate residue from the catalytic triad. This interaction may be compensated for by a more remote carboxylate, the action of which is mediated through an ordered water molecule (Rangarajan et al., 2011). Overlay of the active site of NanS and PvSAE (r.m.s.d. of 0.32 Å for an 8 Å sphere surrounding the catalytic serine) reveals that the catalytic histidine and serine residues occupy a similar position, despite the absence of a catalytic acid in NanS (Fig. 4b). The oxyanion hole of NanS contains a glycine, as seen in PvSAE. However, in NanS the other constituent of the oxyanion hole is Gln18, as opposed to Asn103 in PvSAE (Fig. 4b). Unlike PvSAE, E. coli NanS forms a monomer in solution, indicating that homodimerization is not conserved for 9-O-SAEs. The monomeric nature of E. coli NanS is likely to be due to the absence of the most N-terminal α-helix, which forms a large part of the dimer interface in PvSAE (Fig. 3c).
EstA from S. pneumoniae has also been identified as a 9-O-SAE (Kahya et al., 2017). This enzyme displays a canonical α/β-hydrolase topology with a Ser–His–Asp catalytic triad (Kim et al., 2008). S. pneumoniae EstA, however, is not an SGNH hydrolase as the catalytic triad residues (Ser120–His231–Asp202) do not align with the conserved blocks typical of an SGNH hydrolase. The active site of EstA also differs from that of an SGNH hydrolase, with the backbone N atom from two methionine residues (Met45 and Met121) forming the oxyanion hole (Kim et al., 2008), whereas in PvSAE the backbone N atom of a glycine (Gly75) and the amide side chain of an asparagine (Asn103) form the oxyanion hole (Supplementary Fig. S4).
To further investigate the phylogenetic conservation of amino acids, conservation scores were determined using ConSurf (Ashkenazy et al., 2016) and mapped onto the PvSAE structure. This revealed the absolute conservation of residues in the active site (Fig. 5). There was also absolute conservation of the interface amino-acid residues involved in forming salt bridges between the two dimers, namely Arg32, Arg71, Glu58 and Glu61. The central five-stranded β-sheet was relatively well conserved (ConSurf scores of 6–9), with only Lys33 displaying low conservation (ConSurf score of 2). The level of conservation between the α-helices varied; helices distal to the active site and protein interface, Ala111–Glu128 and Asn150–Glu173, displayed increased variability. Loops at the protein surface were also poorly conserved, in contrast to those near the active site. The level of conservation for the Mg2+-binding site varied. Thr197, which may interact with the active site, was relatively well conserved (ConSurf score of 7). Conversely, Tyr146 and Pro194 displayed poor conservation (ConSurf scores of 1 and 2, respectively).
The high resolution of the Apo I structure allowed the identification of a malonate molecule in the active site of PvSAE. Malonate is positioned such that a carboxylate a). This mirrors the positioning of acetate in the previously determined structure of PvSAE, where the acetate interacts directly with the oxyanion hole (Fig. 6b). Automated docking of 9-O-acetylsialic acid into each active site showed that this substrate could fit into the active sites of both the Apo I and Apo II structures, with the 9-acetyl group correctly positioned for catalysis (Fig. 6 and Supplementary Fig. S6). As the most N-terminal α-helix of the Apo I structure has different conformations, the two active sites accommodate the modelled 9-O-acetylsialic acid in a different conformation. In the `open' active site, the 9-O-acetylsialic acid is positioned such that the 2-position projects towards the bulk solvent, indicating that this site could accommodate larger There also appears to be space for both acetylation at the 7-position and an N-glycolyl modification at the 5-position, both of which are modifications that occur in animal mucins (Park, 2019). The carboxylate on the modelled ligand is within hydrogen-bonding distance of the backbone N atom of Phe28, which only becomes accessible once the N-terminal α-helix has adopted the `open' conformation. This interaction with the carboxylate contrasts with the interaction seen in viral sialic acid esterases, which employ an active-site arginine as an interaction partner with the carboxylate (Rosenthal et al., 1998). Docking into the `blocked' site of the Apo I structure resulted in a ligand conformation with the 2-position projecting towards the N-terminal α-helix, suggesting that this active site could only accommodate monomeric substrates. A similar conformation to the blocked active site was observed in the Apo II structure (Supplementary Fig. S6).
is within hydrogen-bonding distance of the oxyanion-hole residues Asn103 and Gly75 (Fig. 64. Conclusion
9-O-Acetyl-N-acetylsialic acid esterases are found throughout the tree of life from viruses to bacteria and animals. Examination of the 9-O-SAE from the human commensal bacterium P. vulgatus (PvSAE) adds to our growing understanding of the structure and mechanisms underpinning this class of enzymes. Three-dimensional structures of PvSAE determined by X-ray crystallography reveal the homodimerization of PvSAE and the molecular architecture of the SGNH active site. This revealed a canonical Ser–His–Asp catalytic triad encompassed in a binding cleft formed through dimerization. Additionally, the conformational flexibility of the N-terminal α-helix appears to mediate the accessibility of the active cleft to large oligosaccharide substrates. The structural analysis of PvSAE also contributes to our understanding of other predicted SGNH that are yet to be solved, such as human SIAE. Looking forwards, the structural analysis of 9-O-SAE enzymes such as PvSAE promises to enable the development of specific inhibitors targeting this class of enzymes. This could be extended to human 9-O-SAE, bacterial pathogen 9-O-SAEs and viral 9-O-SAEs, such as those encoded by specific betacoronaviruses or influenza C.
Supporting information
PDB references: Phocaeicola vulgatus sialic acid acetylesterase, 7pgh; 7pzg
Supplementary Table and Figures. DOI: https://doi.org/10.1107/S2059798322003357/ud5031sup1.pdf
Acknowledgements
We thank Dr Andrew Leech, University of York Bioscience Technology Facility for assistance with SEC-MALLS. We thank Diamond Light Source for access to beamline I03 (proposal No. mx-18598) that contributed to the results presented here and Dr Johan Turkenburg and Sam Hart for coordinating data collection. GJD thanks the Royal Society for the Ken Murray Research Professorship.
Funding information
We acknowledge generous support from the BBSRC (BB/R001162/1 to GJD).
References
Albers, M., Schröter, L., Belousov, S., Hartmann, M., Grove, M., Abeln, M. & Mühlenhoff, M. (2021). Glycobiology, 31, 1176–1191. PubMed Google Scholar
Almagro Armenteros, J. J., Tsirigos, K. D., Sønderby, C. K., Petersen, T. N., Winther, O., Brunak, S., von Heijne, G. & Nielsen, H. (2019). Nat. Biotechnol. 37, 420–423. Web of Science CrossRef CAS PubMed Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990). J. Mol. Biol. 215, 403–410. CrossRef CAS PubMed Web of Science Google Scholar
Ashkenazy, H., Abadi, S., Martz, E., Chay, O., Mayrose, I., Pupko, T. & Ben-Tal, N. (2016). Nucleic Acids Res. 44, W344–W350. Web of Science CrossRef CAS PubMed Google Scholar
Bakkers, M. J. G., Lang, Y., Feitsma, L. J., Hulswit, R. J. G., de Poot, S. A. H., van Vliet, A. L. W., Margine, I., de Groot-Mijnes, J. D. F., van Kuppeveld, F. J. M., Langereis, M. A., Huizinga, E. G. & de Groot, R. J. (2017). Cell Host Microbe, 21, 356–366. Web of Science CrossRef CAS PubMed Google Scholar
Barnard, K. N., Alford-Lawrence, B. K., Buchholz, D. W., Wasik, B. R., LaClair, J. R., Yu, H., Honce, R., Ruhl, S., Pajic, P., Daugherity, E. K., Chen, X., Schultz-Cherry, S. L., Aguilar, H. C., Varki, A. & Parrish, C. R. (2020). J. Virol. 94, e01567-19. CrossRef CAS PubMed Google Scholar
Baumann, A.-M. T., Bakkers, M. J. G., Buettner, F. F. R., Hartmann, M., Grove, M., Langereis, M. A., de Groot, R. J. & Mühlenhoff, M. (2015). Nat. Commun. 6, 7673. CrossRef PubMed Google Scholar
Briliūtė, J., Urbanowicz, P. A., Luis, A. S., Baslé, A., Paterson, N., Rebello, O., Hendel, J., Ndeh, D. A., Lowe, E. C., Martens, E. C., Spencer, D. I. R., Bolam, D. N. & Crouch, L. I. (2019). Nat. Microbiol. 4, 1571–1581. PubMed Google Scholar
Drula, E., Garron, M.-L., Dogan, S., Lombard, V., Henrissat, B. & Terrapon, N. (2022). Nucleic Acids Res. 50, D571–D577. CrossRef CAS PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Epstein, T. M., Samanta, U., Kirby, S. D., Cerasoli, D. M. & Bahnson, B. J. (2009). Biochemistry, 48, 3425–3435. CrossRef PubMed CAS Google Scholar
Evans, P. R. & Murshudov, G. N. (2013). Acta Cryst. D69, 1204–1214. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gorrec, F. (2009). J. Appl. Cryst. 42, 1035–1042. Web of Science CrossRef CAS IUCr Journals Google Scholar
Holm, L. (2020). Protein Sci. 29, 128–140. Web of Science CrossRef CAS PubMed Google Scholar
Huang, Y.-L., Chassard, C., Hausmann, M., von Itzstein, M. & Hennet, T. (2015). Nat. Commun. 6, 8141. CrossRef PubMed Google Scholar
Hurdiss, D. L., Drulyte, I., Lang, Y., Shamorkina, T. M., Pronker, M. F., van Kuppeveld, F. J. M., Snijder, J. & de Groot, R. J. (2020). Nat. Commun. 11, 4646. CrossRef PubMed Google Scholar
Kahya, H. F., Andrew, P. W. & Yesilkaya, H. (2017). PLoS Pathog. 13, e1006263. CrossRef PubMed Google Scholar
Kim, M. H., Kang, B. S., Kim, S., Kim, K.-J., Lee, C. H., Oh, B.-C., Park, S.-C. & Oh, T.-K. (2008). Proteins, 70, 578–583. CrossRef PubMed CAS Google Scholar
Krissinel, E. & Henrick, K. (2007). J. Mol. Biol. 372, 774–797. Web of Science CrossRef PubMed CAS Google Scholar
Langereis, M. A., Zeng, Q., Heesters, B. A., Huizinga, E. G. & de Groot, R. J. (2012). PLoS Pathog. 8, e1002492. CrossRef PubMed Google Scholar
Lenfant, N., Hotelier, T., Velluet, E., Bourne, Y., Marchot, P. & Chatonnet, A. (2013). Nucleic Acids Res. 41, D423–D429. Web of Science CrossRef CAS PubMed Google Scholar
Long, F., Nicholls, R. A., Emsley, P., Gražulis, S., Merkys, A., Vaitkus, A. & Murshudov, G. N. (2017). Acta Cryst. D73, 112–122. Web of Science CrossRef IUCr Journals Google Scholar
Mather, R. L., Loveson, K. F. & Fillmore, H. L. (2019). Sci. Rep. 9, 8609. CrossRef PubMed Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
Mølgaard, A., Kauppinen, S. & Larsen, S. (2000). Structure, 8, 373–383. Web of Science CrossRef PubMed CAS Google Scholar
Morris, G. M., Huey, R., Lindstrom, W., Sanner, M. F., Belew, R. K., Goodsell, D. S. & Olson, A. J. (2009). J. Comput. Chem. 30, 2785–2791. Web of Science CrossRef PubMed CAS Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Nakayama-Imaohji, H., Ichimura, M., Iwasa, T., Okada, N., Ohnishi, Y. & Kuwahara, T. (2012). J. Med. Invest. 59, 79–94. PubMed Google Scholar
Orizio, F., Damiati, E., Giacopuzzi, E., Benaglia, G., Pianta, S., Schauer, R., Schwartz-Albiez, R., Borsani, G., Bresciani, R. & Monti, E. (2015). Glycobiology, 25, 992–1006. CrossRef CAS PubMed Google Scholar
Park, S. S. (2019). Vaccines, 7, 171. CrossRef Google Scholar
Rangarajan, E. S., Ruane, K. M., Proteau, A., Schrag, J. D., Valladares, R., Gonzalez, C. F., Gilbert, M., Yakunin, A. F. & Cygler, M. (2011). Protein Sci. 20, 1208–1219. CrossRef CAS PubMed Google Scholar
Robinson, L. S., Lewis, W. G. & Lewis, A. L. (2017). J. Biol. Chem. 292, 11861–11872. CrossRef CAS PubMed Google Scholar
Rosenthal, P. B., Zhang, X., Formanowski, F., Fitz, W., Wong, C. H., Meier-Ewert, H., Skehel, J. J. & Wiley, D. C. (1998). Nature, 396, 92–96. CrossRef CAS PubMed Google Scholar
Studier, F. W. (2005). Protein Expr. Purif. 41, 207–234. Web of Science CrossRef PubMed CAS Google Scholar
Tortorici, M. A., Walls, A. C., Lang, Y., Wang, C., Li, Z., Koerhuis, D., Boons, G.-J., Bosch, B.-J., Rey, F. A., de Groot, R. J. & Veesler, D. (2019). Nat. Struct. Mol. Biol. 26, 481–489. CrossRef PubMed Google Scholar
Trott, O. & Olson, A. J. (2010). J. Comput. Chem. 31, 455–461. Web of Science PubMed CAS Google Scholar
Upton, C. & Buckley, J. T. (1995). Trends Biochem. Sci. 20, 178–179. CrossRef CAS PubMed Web of Science Google Scholar
Winn, M. D., Ballard, C. C., Cowtan, K. D., Dodson, E. J., Emsley, P., Evans, P. R., Keegan, R. M., Krissinel, E. B., Leslie, A. G. W., McCoy, A., McNicholas, S. J., Murshudov, G. N., Pannu, N. S., Potterton, E. A., Powell, H. R., Read, R. J., Vagin, A. & Wilson, K. S. (2011). Acta Cryst. D67, 235–242. Web of Science CrossRef CAS IUCr Journals Google Scholar
Zeng, Q., Langereis, M. A., van Vliet, A. L. W., Huizinga, E. G. & de Groot, R. J. (2008). Proc. Natl Acad. Sci. USA, 105, 9065–9069. CrossRef PubMed CAS Google Scholar
Zimm, B. H. (1945). J. Chem. Phys. 13, 141–145. CrossRef CAS Web of Science Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.