topical reviews
Many locks to one key: N-acetylneuraminic acid binding to proteins
aDepartment of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA, and bWeldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA
*Correspondence e-mail: subram68@purdue.edu
This article is part of a collection of articles from the IUCr 2023 Congress in Melbourne, Australia, and commemorates the 75th anniversary of the IUCr.
Sialic acids play crucial roles in cell surface β-anomeric form of N-acetylneuraminic acid (Neu5Ac), particularly its binding affinity towards various proteins, as elucidated by solved protein structures. Specifically, we delve into the binding mechanisms of Neu5Ac to proteins involved in sequestering and transporting Neu5Ac in Gram-negative bacteria, with implications for drug design targeting these proteins as antimicrobial agents. Unlike the initial assumptions, structural analyses revealed significant variability in the Neu5Ac binding pockets among proteins, indicating diverse evolutionary origins and binding modes. By comparing these findings with existing structures from other systems, we can effectively highlight the intricate relationship between protein structure and Neu5Ac recognition, emphasizing the need for tailored drug design strategies to inhibit Neu5Ac-binding proteins across bacterial species.
of both eukaryotic and prokaryotic organisms, mediating various biological processes, including cell–cell interactions, development, oncogenesis and host–pathogen interactions. This review focuses on theKeywords: sialic acids; N-acetyl neuraminic acid; Neu5Ac; Gram-negative bacteria; binding sites; drug discovery; protein structures; molecular recognition.
1. Sialic acids
Sialic acids are a group of over 50 different types of nine-carbon sugars (Varki, 1992). One of the most widely studied sialic acids is 2-keto-3-deoxy-5-acetamido D-glycero-D-galacto-nonulosonic acid, also known as N-acetyl neuraminic acid (Neu5Ac). The diversity of sialic acids is brought about by the generation of amine- or hydroxyl-linked modifications. The most common modification is the O-acetylation of carbons 4, 7, 8 and 9 (Angata & Varki, 2002). The other major form is N-glycolyl neuraminic acid (Neu5Gc), which is generated by hydroxylation of the 5-acetamido position in Neu5Ac (Fig. 1). Neu5Gc is absent in humans due to a frameshift mutation in the cytidine-5′-monophospho-N-acetylneuraminic acid (CMP-Neu5Ac) hydroxylase gene, but is present in our closest primate relatives (Angata et al., 2001).
Sialic acids are typically found at the terminal position in both eukaryotic and prokaryotic cell surface α2–3)- or (α2–6)-glycosidically linked to galactosides or (α2–6)-linked to N-acetylgalactosamine (Boons & Demchenko, 2000). Polysialic acids of Neu5Ac and Neu5Gc, characterized by (α2–8), (α2–9) or alternating (α2–8)/(α2–9) glycosidic linkages, have been found in the of neural cell adhesion molecules (NCAMs) in fish eggs and in the capsules of certain bacteria, such as Neisseria meningitidis group B (Sato & Kitajima, 2013).
They are (In humans, sialic acid (Neu5Ac) is the most abundant sugar in the central nervous system, hence the name neuraminic acid (Haines-Menges et al., 2015). In the brain, polysialic acids coat neurons, glia and ganglia. Another major reservoir of sialic acids is the gastrointestinal tract, followed by other mucin-rich regions such as the lungs and vagina (Schnaar et al., 2014). Neu5Ac often occurs at the terminal positions of glycoconjugates; is involved in cell–cell interactions; and mediates processes such as development, oncogenesis and host–pathogen interactions (Varki, 2017; Jennings et al., 2022).
Notably, both the α-anomer and β-anomer forms of Neu5Ac exist in biological systems. In this review, we focus on the β-anomer form of Neu5Ac to identify the similarities and the significance of the protein environment to the binding affinity towards the β-anomer of Neu5Ac in solved protein structures. Over a decade ago, the laboratory started working on the sequestering, transport and catabolism of Neu5Ac in bacteria that persist in different niches of the human body. Many of these proteins bind to Neu5Ac, and analysing the nature of these binding pockets will provide crucial insights for designing drugs to inhibit these proteins, often cited as possible new antimicrobial targets. Our naïve hypothesis was that, given that the sequestering proteins, transporters and the first enzymes that utilize Neu5Ac all bind Neu5Ac, we should be able to design inhibitors that will bind to all these proteins. As the structures began to be determined, it was apparent that each of these proteins probably evolved from a different protein and bound to Neu5Ac differently. Here, we review how Neu5Ac binds to different proteins in this class of Gram-negative bacteria and compare them with other published structures of β-Neu5Ac binding in other systems. The collective data reveal substantial diversity among the locks binding to the identical key (Neu5Ac).
2. Bacterial sialometabolism
The first evidence that bacteria possess sialic acids was obtained from E. coli isolates in 1958 (Barry, 1959). Sialidases are enzymes that cleave sialic acids from their glycoconjugates in bacteria, suggesting that they utilize sialic acids. In bacteria, sialometabolism involves the following: (1) biosynthesis and scavenging of sialic acids for sialylation of their cell surface; and (2) sialic acid catabolism, utilizing it as a carbon and nitrogen source.
2.1. Utilization of sialic acid by bacteria to evade host immune response
Studies have shown that H. influenzae, E. coli, C. jejuni, P. multocida, H. ducreyi, N. meningitidis and N. gonorrhoeae can coat the capsular polysaccharide or lipooligosaccharide layer with Neu5Ac to evade the host immune system via molecular mimicry (Vimr et al., 2000). To do so, they evolved at least four mechanisms to acquire sialic acids, as outlined below.
2.1.1. De novo synthesis
E. coli and N. meningitidis utilize this mechanism to obtain sialic acids for cell surface sialylation. Neu5Ac is synthesized from uridine diphosphate, N-acetyl glucosamine (UDP–GlcNAc) by hydrolyzing UDP–GlcNAc-2. The ManNAc produced is condensed with phosphoenol pyruvate to yield Neu5Ac (Vimr & Lichtensteiger, 2002). Neu5Ac is then activated to form CMP–Neu5Ac through the action of CMP–Neu5Ac synthetase. Specific sialyltransferases add sialic acids to the capsular polysaccharide or lipooligosaccharide layers (Vimr et al., 2004).
2.1.2. Precursor scavenging
Neu5Ac is scavenged from the environment through this mechanism. This requires sialic acid transporters to transport sialic acid from the environment to the cytoplasm. Once the scavenged sialic acid is transported into the cell, CMP–Neu5Ac is synthesized and transferred onto the LPS/LOS by sialyltransferases (Vimr & Lichtensteiger, 2002; Apicella, 2012). H. influenza is an example of a pathogen that utilizes this mechanism. This review primarily focuses on the pathways for Neu5Ac sequestration, import and utilization by Gram-negative bacteria.
2.2. Utilization of sialic acid as a carbon and nitrogen source
The other utilization of sialic acids is as a source of carbon and nitrogen. Bacteria colonizing mucin-rich environments have a low supply of glucose, especially in the gastrointestinal tract (Jeong et al., 2009). Bacteria that have evolved the sialic acid catabolism pathway have considerable advantages in colonizing sialic acid-rich environments (Jeong et al., 2009; Almagro-Moreno & Boyd, 2010; Olson et al., 2013). Sialic acid is degraded to fructose-6-phosphate, which subsequently undergoes glycolysis. The intermediate GlcNAc-6-P can also be a precursor of cell wall biosynthetic pathways. The enzymes involved in sialic acid catabolism are generally found in the tightly regulated nan–nag operon (Almagro-Moreno & Boyd, 2010).
3. Transport of sialic acids
For bacteria to utilize sialic acids obtained from host scavenging, sialic acid must be transported across the cell membrane (Fig. 2). Transport from the outer membrane to the periplasmic space occurs through porins (NanC). Transport into bacterial cells from the periplasmic space occurs via two types of transporters: primary active transporters [ATP synthase binding cassette transporters (ABC transporters)] and secondary active transporters. Three distinct secondary active transporters of Neu5Ac have been identified: (1) a single-component sodium solute symporter (SSS), (2) a tripartite ATP-independent periplasmic transporter (TRAP) and (3) a single-component sugar proton symporter (NanT) (Thomas, 2016).
3.1. Primary active transporters: ABC transporters
ABC transporters, known as primary active transporters, use energy driven by ATP hydrolysis to import various substrates. In bacteria, ABC transporters are generally assembled into four subunits: two transmembrane domains (TMDs) and two nucleotide-binding domains (NBDs). In Gram-negative bacteria, importers contain a substrate-binding protein in the periplasm that traps the substrate and delivers it to the TMD. These proteins transport a diverse range of substrates and thus lack sequence similarity in their TMDs (Dawson & Locher, 2006).
Haemophilus ducreyi is a Gram-negative coccobacillus that belongs to the Pasteurellaceae family and causes sexually transmitted chancroid disease. It has been shown that the LOS of H. ducreyi also acts as a putative virulence factor and is heavily sialylated. It has been shown that H. ducreyi uses a novel ABC-transport system to transport sialic acid encoded by the gene SatABCD (Post et al., 2005). SatA encodes a periplasmic binding protein that binds sialic acid, and SatBCD encodes an integral membrane protein with a nucleotide-binding domain in this operon. No structure off the ABC-type Neu5Ac transporter (SatBCD) has been reported. The periplasmic binding protein SatA has been well studied (Setty et al., 2018). Substrate-bound Hd-SatA acts as an anchor for the SatBCD membrane transporter, facilitating the transport of Neu5Ac/Neu5Gc. Hd-SatA is an α/β protein comprising three domains, with the N- and C-terminal domains connected by two β-strands serving as a hinge region. Interestingly, multiple main-chain atoms interact with the polar groups of Neu5Ac. The only residue that could neutralize the charge on the carboxylate group is histidine 333 (Fig. 3 insert). On ligand binding, domains I and II converge to trap the ligand, which is reminiscent of the Venus flytrap mechanism (Setty et al., 2018).
3.2. Secondary active transporters
Secondary + or Na+ across the membrane. The three main families in this category are the major facilitator superfamily (MFS), the SSS and TRAP transporters.
is a mechanism in which solutes are transported across a membrane owing to the difference in the electrochemical gradient established by the transport of ions such as H3.2.1. SSS transporter
In the SSS transport system, an Na+ ion electrochemical gradient is used to uptake sialic acids. The Na+/galactose/glucose symporter from Vibrio parahaemolyticus (vSGLT) was the first representative of the SSS transport family (Faham et al., 2008). Uropathogenic Proteus mirabilis transports and catabolizes host-derived sialic acids using the SSS transport system (SiaT). Functional and studies of SiaT have shown that the uptake and transport of sialic acid are also controlled by two Na+ (Na2 and Na3) ions (Wahlgren et al., 2018). The high-resolution structure of the outward-facing and substrate-bound form of the SiaT sialic acid transporter from Proteus mirabilis (Pm–SiaT), a member of the SSS family, has been elucidated previously (Wahlgren et al., 2018). Pm–SiaT adopts the LeuT-fold, with Neu5Ac bound centrally and two Na+ ions facilitating its transport. Comprising 13 transmembrane helices, Pm–SiaT positions its N- and C-termini towards the periplasmic and cytoplasmic regions, respectively (Fig. 4). Two inverted repeats of five transmembrane helices form a core structural fold. Structural studies show that Na2 occupies a conserved site and Na3 occupies a unique position, which is present at ∼14 Å from the substrate-binding site and 6.5 Å away from Na2. Microscale thermophoresis experiments revealed that Staphylococcus aureus SiaT exhibits a significantly higher affinity for Neu5Gc sialic acid (Kd = 39 ± 5 µM) than for Neu5Ac (Kd = 113 ± 6 µM) (North, Wahlgren et al., 2018). Interestingly, Pm–SiaT displays a reversed preference, exhibiting a slightly higher binding affinity for Neu5Ac (Kd = 58 ± 1 µM) than for Neu5Gc (Kd = 85 ± 2 µM). Neu5Ac is stabilized at its binding pocket by a number of polar interactions. There are no water-mediated polar interactions on the C1 side of the Neu5Ac, suggesting strong electrostatic interactions. The C9 and C5 extensions have a number of water-mediated interactions that decrease the contribution of electrostatics towards binding affinity in these regions.
3.2.2. MFS transporters
MFS is one of the major transport systems found in bacteria, archaea and eukaryotes. Unlike ABC transporters, which transport both small molecules and macromolecules, MFS transporters only transport small solutes in response to the electrochemical ion gradient. Based on the direction of solute/ion transport, MFS transporters are classified as uniporters, symporters or antiporters. They transport essential nutrients and export toxic substances or end products of metabolism. NanT, which transports sialic acid, is part of the MFS transport system. NanT was first identified in E. coli and its mutation leads to a loss of function. Unlike traditional MFS family members, this protein contains 14 predicted transmembrane helices and an α-helix that might play an important role in the structure and function of the protein (Martinez et al., 1995).
3.2.3. TRAP transporters
TRAP transporters are extracytoplasmic solute receptors (ESRs) that belong to the category of secondary transporters and are present only in bacteria and archaea. Similar to the MFS transport system, the electrochemical ion gradient provides a driving force for solute transport across the membrane. Unlike the conventional MFS and SSS transport systems, the TRAP transport system contains an extra substrate binding protein (SBP) to trap the substrate in the periplasm and confers unidirectionality to substrate transport (Mulligan et al., 2009). The TRAP transport system is a three-component system consisting of an extracellular solute receptor (SiaP, periplasmic binding protein) and two unequal-sized distinct integral membrane proteins (Q and M domains). The periplasmic binding protein SiaP delivers Neu5Ac to the TRAP transporter and binds to Neu5Ac with nanomolar affinity. The sequence similarity between SatA and other SiaPs that bind to sialic acids is approximately 20–25%. In Gram-negative bacteria, solute-binding proteins are present in the periplasm, whereas in Gram-positive bacteria, they are anchored to the inner membrane of the bacteria. Several periplasmic sialic acid-binding proteins have been shown to bind to Neu5Ac (North, Horne et al., 2018; Gangi Setty et al., 2014). The SiaP protein of H. influenzae, referred to as Hi–SiaP, is a prominent example of a sialic acid-binding protein within the TRAP transport system. Initially, Müller et al. (2006) elucidated the structure of Hi–SiaP bound to the sialic acid analog 2,3-didehydro-2-deoxy-N-acetylneuraminic acid (Neu5Ac2en), revealing a two-domain protein configuration, followed by the demonstration that Neu5Ac binds enthalpically and with nanomolar affinity to SiaPs (Fig. 5) (Johnston et al., 2008). The enthalpic component of binding is not surprising given the large number of electrostatic charge interactions in the binding pocket. Mutations of R149 and R129 have been shown to decrease the binding affinity (Gangi Setty et al., 2014).
Recently, membrane protein components of two TRAP transporter protein structures have been reported, one from Haemophilus influenzae and the other from Photobacterium profundum, using cryo-EM (Davies et al., 2023; Peter et al., 2022; Currie et al., 2023). The structure revealed that the protein was in an inward-open conformation with two sodium sites. We recently determined the TRAP structure from Fusobacterium nucleatum at 2.8 Å resolution. The Fn–SiaQM structure was in an inward-open conformation with three sodium sites.
4. Sialic acid catabolism pathway
Once β-Neu5Ac is present in the cytoplasm, it can be used as a carbon or nitrogen source (Fig. 6). The first evidence that bacteria utilize sialic acids as the sole carbon source came from studies on Clostridium perfringens (Nees et al., 1976). Further studies have shown that only a limited number of commensals and pathogenic bacteria that are closely associated with the host have genes for sialic acid catabolism. H. influenzae, F. nucleatum, P. multocida, S. aureus, V. cholera, V. vulnificus and E. coli have been shown to possess a nan cluster of genes (Olson et al., 2013; Almagro-Moreno & Boyd, 2009). Alternatively, cytidine-5′-monophosphate (CMP) is utilized as a substrate by sialic acid synthetase enzymes, and catalyzes the conversion of CMP and siliac acid to cytidine-5′-monophosphate–N-acetylneuraminic acid (CMP–Neu5Ac), which is a key intermediate in the sialic acid biosynthetic pathway. Neu5Ac (from CMP-Neu5Ac) is incorporated into lipopolysaccharide (LPS) molecules through the action of sialyltransferase enzymes. Sialyltransferases are glycosyltransferases that catalyze the transfer of sialic acid residues from CMP–Neu5Ac to terminal sugar residues on glycoconjugates such as LPS. During this process, the sialyltransferase enzyme binds to both the CMP–Neu5Ac substrate and the acceptor molecule, which typically contains a terminal galactose or N-acetylgalactosamine residue (Mizanur & Pohl, 2008). The enzyme then facilitates the transfer of the sialic acid residue from CMP–Neu5Ac to the terminal sugar residue on the acceptor molecule, forming a glycosidic bond and thereby incorporating the sialic acid moiety into the LPS molecule. LPS sialylation can have significant effects on bacterial physiology, including host–pathogen interactions, immune evasion and bacterial virulence.
4.1. Regulation of sialic acid catabolic genes
NanR is a transcriptional regulator (repressor) involved in the regulation of genes related to sialic acid metabolism in bacteria such as E. coli. The mechanism underlying NanR gene repression and allosteric induction of bacterial sialic acid metabolism involves intricate molecular interactions between NanR, DNA operator sites and sialic acid molecules, such as Neu5Ac (Fig. 7). The dissociation constant (KD) for Neu5Ac binding to NanR determined using isothermal titration yielded a consistent Kd of 16 µM, with one Neu5Ac molecule binding per NanR dimer (Horne et al., 2021). The presence of Zn2+ in the binding pocket influences the impact of the charged residues. There is a salt bridge formed between C1-carboxylate and R128, and another electrostatic interaction with R203. However, the Kd value does not reflect nanomolar affinity, as Zn2+ can affect the charge on the C1-carboxylate.
4.2. Conversion of Neu5Ac into fructose-6-phosphate
Five enzymes govern catabolism, with N-acetylneuraminate lyase (NanA) being the first committed enzyme. NanA cleaves Neu5Ac into N-acetyl-D-mannosamine (ManNAc) and pyruvate, initiating this pathway. Enzymes including N-acetylmannosamine kinase (NanK), N-acetylmannosamine-6-phosphate epimerase (NanE), N-acetylglucosamine-6-phosphate deacetylase (NagA) and glucosamine-6-phosphate deaminase (NagB) further metabolize the resulting products. NanA, also known as sialic acid aldolase, is present in both pathogenic and non-pathogenic bacteria and in various mammalian tissues and plays a vital role in the metabolism of sialic acids. NanA, which belongs to the class I aldolase family, exhibits a triosephosphate isomerase (TIM)-barrel fold and employs aldol condensation by forming a Schiff base intermediate. The high-resolution structures of N-acetylneuraminate lyase from F. nucleatum, H. influenzae, E. coli, S. aureus and P. multicoda revealed that NanA adopts a classical TIM (triosephosphate isomerase) barrel fold (Campeotto et al., 2009; North et al., 2016; Kumar et al., 2018) with the addition of three extra helices (I, J and K) at the C-terminus (Fig. 8). Sialic acid binds in the extended conformation (against the pyranose form in the other cases), and there are no basic residues that are close to the C1-carboxylate group. The structures and functions of the other enzymes have also been well characterized. As they do not bind to sialic acid, their structures and properties are not discussed in this review.
4.3. Incorporation of Neu5Ac as the outermost sugar of LPS
Most pathogenic bacteria incorporate Neu5Ac onto their surfaces to evade the host immune system. The primary enzymes responsible for constructing and synthesizing Neu5Ac-containing glycoconjugates are sialyltransferases, which utilize CMP–Neu5Ac as an activated sugar nucleotide donor substrate (Bose et al., 2019). CMP-sialic acid synthetases (CSSs) facilitate the activation of the β-anomer of free Neu5Ac, forming a β-linked sialyl monophosphate diester bond between the C2 of Neu5Ac and the α-phosphate of cytidine-5′-triphosphate (CTP), subsequently releasing pyrophosphate (Fig. 9). Fig. 9 also suggests that the specificity of binding probably originates from the nucleotide as there are more direct interactions with the side chain of the protein. Neu5Ac interactions occur through the solvent molecules. The next enzyme binds to CMP–Neu5Ac and transfers Neu5Ac to the outermost sugar of LPS.
5. Discussion and conclusions
As we suggested in the introduction, we hoped that of the two periplasmic binding proteins that sequester Neu5Ac, the three types of transport proteins, the transcription factor that binds to sialic acid and the two enzymes that bind to Neu5Ac, a few would have a common binding motif. This would allow us to pursue a drug discovery program targeting multiple proteins in the same pathogen, thereby reducing the ability to acquire resistant mutations. However, as described above, these proteins bind differently to Neu5Ac.
We searched for other proteins in the Protein Data Bank (PDB) with bound β-Neu5Ac and classified them based on their structural homology and binding pocket topology. Here, we discuss examples of unique binding pockets. Two major surface antigenic proteins are present in influenza viruses: hemagglutinin (HA) and sialidase (neuraminidase). Their roles in the influenza virus infection cycle are well understood. Neuraminidases of viral origin have been well studied, and inhibitors of influenza virus neuraminidase are currently available as antiviral agents (Gubareva & Mohan, 2022). The structure and mode of binding of neuraminidase (Fig. 10) show a number of polar interactions (Vavricka et al., 2013), with some elements common to that of the SiaP of the TRAP transport system.
Pathogenic bacteria use molecular mimicry to infiltrate and establish themselves within their hosts, often by mimicking host structures, notably those present in capsular E. coli K1, a neuroinvasive bacterium responsible for neonatal sepsis and meningitis, employs this strategy prominently. Its capsule comprises poly α2,8-sialic acid (polySia). This sugar polymer acts as a critical modulator of neuronal plasticity in adult human hosts and evades the mammalian immune system, making the host tolerant to polySia-shielded bacteria. Several coliphages that infect polySia-encapsulated E. coli K1 have been identified, each harbouring endosialidase tail spikes which are crucial for infection. Cloned from various K1-specific phages, these endo-sialidases belong to the GH-58 family of glycosylhydrolases and exhibit a modular composition with a highly conserved catalytically active central region and distinctive N- and C-terminal domains. The tertiary structures of endo-sialidases from phages differ greatly (Stummeyer et al., 2005) (Fig. 11). However, they interact with fewer polar interactions, and the orientation of the different interacting residues is different from that of the neuraminidases.
Botulinum neurotoxins (BoNTs) produced by Clostridium botulinum, are potent inhibitors of synaptic transmission in the peripheral cholinergic nervous system (Montal, 2010). Neurotoxins bind to Neu5Ac, and their binding sites have been characterized (Fig. 12). Here, Neu5Ac was bound to the surface (Emsley et al., 2000). Arg 1226 is the residue anchoring the orientation, with direct hydrogen bonds to the carboxylate atom and the pyranose oxygen.
The last example is that of the adenovirus fibre knob protein that binds to Neu5Ac as well (Baker et al., 2021). Here, the protein forms a trimer (a cartoon for only one subunit is shown), and three molecules of Neu5Ac bind in between two subunits (Fig. 13). As the binding occurs at the interface, Neu5Ac binds to residues on two different subunits. The authors modelled the Lys349 in multiple conformations, even though the terminal amino group hydrogen bonds to the C1-carboxylate oxygen in both conformations.
Figs. 3–5 and 7–13 illustrate the different modes of β-Neu5Ac binding to different proteins. The different tertiary structures of the binding motifs clearly show that they evolved independently. This hypothesis was also based on sequence comparisons (Severi et al., 2021). However, one could hypothesize convergent evolution of different proteins to organize similarities in the binding motif of Neu5Ac. However, this is not the case. If we organize all the PDB files of the ten different motifs that have been reported and superpose Neu5Ac, it is clear that some interactions occur more often than others (Fig. 14). The C1-carboxylate, in several cases, interacts with positively charged Arg or Lys. This is not surprising; however, it is surprising that in several cases, there seems to be no obvious charge neutralization. We used all of the structures except lyase and the CMP–sialic acid synthase. PyMOL was used to superimpose sialic acid coordinates (version 1.8; Schrödinger LLC, 2015). The figure shows that Neu5Ac superposes well and that there are minor rotations around the free single bonds. Neu5Ac is shown as a stick for only one structure (Fig. 14). The surrounding residues are shown in lines. There are over 40 structures in the PDB that contain bound β-Neu5Ac. We used only one representative of each binding mode to illustrate that the different binding modes are distinct. All superposed structures were in the pyranose form. The enzyme NanA, which is a lyase, binds to sugar in the open-chain form (Fig. 8) and is not included in the figure and analysis. Although the figure looks like all the residues are randomly oriented, there is a cluster of basic residues around the C1-carboxylate (blue lines on the right side of the figure). Although there are exceptions, this may be necessary to neutralize the negative charge at the physiological pH of the carboxylate. Similarly, a less populated cluster of hydrophobic residues is on the opposite side (around the N-acetyl part of the sugar).
Bhagavat & Chandra (2014) published an analysis of Neu5Ac binding sites and classified them into six groups. Numerous additional structures have been reported in subsequent studies. In this review, we have demonstrated the existence of at least ten distinct classes of binding sites. In summary, we see the evolutionary origins of various proteins that engage with Neu5Ac lead to diverse mechanisms through which nature achieves specific binding to this ligand. Although the lock-and-key concept is fascinating for ligand binding, nature has evolved multiple locks for a single key. Attempts to synthesize tight binding inhibitors towards one target that binds to sialic acid do not automatically result in binding to other sialic acid-binding proteins (Bozzola et al., 2022). This prompts questions about the insights that can be gained for designing a molecule capable of targeting multiple active sites simultaneously.
Acknowledgements
We thank the IUCr for providing the opportunity to write this review. The work that led to this review was motivated by Professor Michael A. Apicella when RS was at the University of Iowa, and the investigations continued for 15 years. This work would not have been possible but for the collaborating laboratories of Professor Ren Dobson in New Zealand, Rosmarie Friemann in Sweden and Ceasare Indiveri in Italy.
Funding information
The following funding is acknowledged: Department of Biotechnology, Ministry of Science and Technology, India (grant Nos. BT/PR5081/INF/156/2012; BT/IN/SWEDEN/06/SR/2017-18; BT/PR12422/MED/31/287/214 awarded to RS). We would also like to acknowledge startup support from Purdue University awarded to RS.
References
Almagro-Moreno, S. & Boyd, E. F. (2009). BMC Evol. Biol. 9, 118. Google Scholar
Almagro-Moreno, S. & Boyd, E. F. (2010). Gut Microbes, 1, 45–50. Web of Science PubMed Google Scholar
Angata, T. & Varki, A. (2002). Chem. Rev. 102, 439–470. Web of Science CrossRef PubMed CAS Google Scholar
Angata, T., Varki, N. M. & Varki, A. (2001). J. Biol. Chem. 276, 40282–40287. Web of Science CrossRef PubMed CAS Google Scholar
Apicella, M. (2012). Front. Cell. Inf. Microbio. 2, 19. Web of Science CrossRef Google Scholar
Baker, A. T., Davies, J. A., Bates, E. A., Moses, E., Mundy, R. M., Marlow, G., Cole, D. K., Bliss, C. M., Rizkallah, P. J. & Parker, A. L. (2021). J. Virol. 95, e01849–20. Web of Science CrossRef CAS PubMed Google Scholar
Baker, A. T., Mundy, R. M., Davies, J. A., Rizkallah, P. J. & Parker, A. L. (2019). Sci. Adv. 5, eaax3567. Web of Science CrossRef PubMed Google Scholar
Barry, G. T. (1959). Nature, 183, 117–118. CrossRef PubMed CAS Web of Science Google Scholar
Bhagavat, R. & Chandra, N. (2014). Glycobiology, 24, 5–16. Web of Science CrossRef CAS PubMed Google Scholar
Boons, G.-J. & Demchenko, A. V. (2000). Chem. Rev. 100, 4539–4566. Web of Science CrossRef PubMed CAS Google Scholar
Bose, S., Purkait, D., Joseph, D., Nayak, V. & Subramanian, R. (2019). Acta Cryst. D75, 564–577. Web of Science CrossRef IUCr Journals Google Scholar
Bozzola, T., Johnsson, R. E., Nilsson, U. J. & Ellervik, U. (2022). ChemMedChem, 17, e202200351. Web of Science CrossRef PubMed Google Scholar
Campeotto, I., Carr, S. B., Trinh, C. H., Nelson, A. S., Berry, A., Phillips, S. E. V. & Pearson, A. R. (2009). Acta Cryst. F65, 1088–1090. Web of Science CrossRef IUCr Journals Google Scholar
Currie, M. J., Davies, J. S., Scalise, M., Gulati, A., Wright, J. D., Newton-Vesty, M. C., Abeysekera, G. S., Subramanian, R., Wahlgren, W. Y., Friemann, R., Allison, J. R., Mace, P. D., Griffin, M. D. W., Demeler, B., Wakatsuki, S., Drew, D., Indiveri, C., Dobson, R. C. J. & North, R. A. (2023). eLife, 12, RP92307. Google Scholar
Davies, J. S., Currie, M. J., North, R. A., Scalise, M., Wright, J. D., Copping, J. M., Remus, D. M., Gulati, A., Morado, D. R., Jamieson, S. A., Newton-Vesty, M. C., Abeysekera, G. S., Ramaswamy, S., Friemann, R., Wakatsuki, S., Allison, J. R., Indiveri, C., Drew, D., Mace, P. D. & Dobson, R. C. J. (2023). Nat. Commun. 14, 1120. Web of Science CrossRef PubMed Google Scholar
Dawson, R. J. & Locher, K. P. (2006). Nature, 443, 180–185. Web of Science CrossRef PubMed CAS Google Scholar
Emsley, P., Fotinou, C., Black, I., Fairweather, N. F., Charles, I. G., Watts, C., Hewitt, E. & Isaacs, N. W. (2000). J. Biol. Chem. 275, 8889–8894. Web of Science CrossRef PubMed CAS Google Scholar
Faham, S., Watanabe, A., Besserer, G. M., Cascio, D., Specht, A., Hirayama, B. A., Wright, E. M. & Abramson, J. (2008). Science, 321, 810–814. Web of Science CrossRef PubMed CAS Google Scholar
Gangi Setty, T., Cho, C., Govindappa, S., Apicella, M. A. & Ramaswamy, S. (2014). Acta Cryst. D70, 1801–1811. Web of Science CrossRef IUCr Journals Google Scholar
Gangi Setty, T., Mowers, J. C., Hobbs, A. G., Maiya, S. P., Syed, S., Munson, R. S., Apicella, M. A. & Subramanian, R. (2018). J. Biol. Chem. 293, 20073–20084. Web of Science CrossRef PubMed Google Scholar
Gubareva, L. & Mohan, T. (2022). Cold Spring Harb. Perspect. Med. 12, a038455. Web of Science CrossRef PubMed Google Scholar
Haines-Menges, B. L., Whitaker, W. B., Lubin, J. B. & Boyd, E. F. (2015). Microbiol. Spectr. 3, 3.4.07. Google Scholar
Horne, C. R., Venugopal, H., Panjikar, S., Wood, D. M., Henrickson, A., Brookes, E., North, R. A., Murphy, J. M., Friemann, R., Griffin, M. D. W., Ramm, G., Demeler, B. & Dobson, R. C. J. (2021). Nat. Commun. 12, 1988. Web of Science CrossRef PubMed Google Scholar
Huynh, N., Aye, A., Li, Y., Yu, H., Cao, H., Tiwari, V. K., Shin, D. W., Chen, X. & Fisher, A. J. (2013). Biochemistry, 52, 8570–8579. Web of Science CrossRef CAS PubMed Google Scholar
Jennings, M. P., Day, C. J. & Atack, J. M. (2022). Microbiology, 168, 001157. Web of Science CrossRef PubMed Google Scholar
Jeong, H. G., Oh, M. H., Kim, B. S., Lee, M. Y., Han, H. J. & Choi, S. H. (2009). Infect. Immun. 77, 3209–3217. Web of Science CrossRef PubMed CAS Google Scholar
Johnston, J. W., Coussens, N. P., Allen, S., Houtman, J. C. D., Turner, K. H., Zaleski, A., Ramaswamy, S., Gibson, B. W. & Apicella, M. A. (2008). J. Biol. Chem. 283, 855–865. Web of Science CrossRef PubMed CAS Google Scholar
Kalivoda, K. A., Steenbergen, S. M. & Vimr, E. R. (2013). J. Bacteriol. 195, 4689–4701. Web of Science CrossRef CAS PubMed Google Scholar
Kumar, J. P., Rao, H., Nayak, V. & Ramaswamy, S. (2018). Acta Cryst. F74, 725–732. Web of Science CrossRef IUCr Journals Google Scholar
Martinez, J., Steenbergen, S. & Vimr, E. (1995). J. Bacteriol. 177, 6005–6010. CrossRef CAS PubMed Web of Science Google Scholar
Matthews, M. M., McArthur, J. B., Li, Y., Yu, H., Chen, X. & Fisher, A. J. (2020). Biochemistry, 59, 3157–3168. Web of Science CrossRef CAS PubMed Google Scholar
Mizanur, R. M. & Pohl, N. L. (2008). Appl. Microbiol. Biotechnol. 80, 757–765. Web of Science CrossRef PubMed CAS Google Scholar
Montal, M. (2010). Annu. Rev. Biochem. 79, 591–617. Web of Science CrossRef CAS PubMed Google Scholar
Müller, A., Severi, E., Mulligan, C., Watts, A. G., Kelly, D. J., Wilson, K. S., Wilkinson, A. J. & Thomas, G. H. (2006). J. Biol. Chem. 281, 22212–22222. Web of Science PubMed Google Scholar
Mulligan, C., Geertsma, E. R., Severi, E., Kelly, D. J., Poolman, B. & Thomas, G. H. (2009). Proc. Natl Acad. Sci. USA, 106, 1778–1783. Web of Science CrossRef PubMed CAS Google Scholar
Nees, S., Schauer, R., Mayer, F. & Ehrlich, K. (1976). Hoppe-Seyler's Z. Physiol. Chem. 357, 839–854. CrossRef CAS Web of Science Google Scholar
North, R. A., Horne, C. R., Davies, J. S., Remus, D. M., Muscroft-Taylor, A. C., Goyal, P., Wahlgren, W. Y., Ramaswamy, S., Friemann, R. & Dobson, R. C. J. (2018). Biophys. Rev. 10, 219–227. CrossRef CAS PubMed Google Scholar
North, R. A., Wahlgren, W. Y., Remus, D. M., Scalise, M., Kessans, S. A., Dunevall, E., Claesson, E., Soares da Costa, T. P., Perugini, M. A., Ramaswamy, S., Allison, J. R., Indiveri, C., Friemann, R. & Dobson, R. C. J. (2018). Front. Chem. 6, 233. Web of Science CrossRef PubMed Google Scholar
North, R. A., Watson, A. J., Pearce, F. G., Muscroft–Taylor, A. C., Friemann, R., Fairbanks, A. J. & Dobson, R. C. (2016). FEBS Lett. 590, 4414–4428. Web of Science CrossRef CAS PubMed Google Scholar
Olson, M. E., King, J. M., Yahr, T. L. & Horswill, A. R. (2013). J. Bacteriol. 195, 1779–1788. Web of Science CrossRef CAS PubMed Google Scholar
Peter, M. F., Ruland, J. A., Depping, P., Schneberger, N., Severi, E., Moecking, J., Gatterdam, K., Tindall, S., Durand, A., Heinz, V., Siebrasse, J. P., Koenig, P.-A., Geyer, M., Ziegler, C., Kubitscheck, U., Thomas, G. H. & Hagelueken, G. (2022). Nat. Commun. 13, 4471. Web of Science CrossRef PubMed Google Scholar
Post, D. M., Mungur, R., Gibson, B. W. & Munson, R. S. Jr (2005). Infect. Immun. 73, 6727–6735. Web of Science CrossRef PubMed CAS Google Scholar
Sato, C. & Kitajima, K. (2013). J. Biochem. 154, 115–136. Web of Science CrossRef CAS PubMed Google Scholar
Schnaar, R. L., Gerardy-Schahn, R. & Hildebrandt, H. (2014). Physiol. Rev. 94, 461–518. Web of Science CrossRef PubMed Google Scholar
Severi, E., Rudden, M., Bell, A., Palmer, T., Juge, N. & Thomas, G. H. (2021). Microb. Genom, 7, 000614. Web of Science PubMed Google Scholar
Stummeyer, K., Dickmanns, A., Mühlenhoff, M., Gerardy-Schahn, R. & Ficner, R. (2005). Nat. Struct. Mol. Biol. 12, 90–96. Web of Science CrossRef PubMed CAS Google Scholar
Thomas, G. H. (2016). Biochem. Soc. Trans. 44, 760–765. Web of Science CrossRef CAS PubMed Google Scholar
Varki, A. (1992). Glycobiology, 2, 25–40. CrossRef PubMed CAS Web of Science Google Scholar
Varki, A. (2017). Glycobiology, 27, 3–49. Web of Science CrossRef CAS PubMed Google Scholar
Vavricka, C. J., Liu, Y., Kiyota, H., Sriwilaijaroen, N., Qi, J., Tanaka, K., Wu, Y., Li, Q., Li, Y., Yan, J., Suzuki, Y. & Gao, G. F. (2013). Nat. Commun. 4, 1491. Web of Science CrossRef PubMed Google Scholar
Vimr, E. & Lichtensteiger, C. (2002). Trends Microbiol. 10, 254–257. Web of Science CrossRef PubMed CAS Google Scholar
Vimr, E., Lichtensteiger, C. & Steenbergen, S. (2000). Mol. Microbiol. 36, 1113–1123. Web of Science CrossRef PubMed CAS Google Scholar
Vimr, E. R., Kalivoda, K. A., Deszo, E. L. & Steenbergen, S. M. (2004). Microbiol. Mol. Biol. Rev. 68, 132–153. Web of Science CrossRef PubMed CAS Google Scholar
Wahlgren, W. Y., Dunevall, E., North, R. A., Paz, A., Scalise, M., Bisignano, P., Bengtsson-Palme, J., Goyal, P., Claesson, E., Caing-Carlsson, R., Andersson, R., Beis, K., Nilsson, U. J., Farewell, A., Pochini, L., Indiveri, C., Grabe, M., Dobson, R. C. J., Abramson, J., Ramaswamy, S. & Friemann, R. (2018). Nat. Commun. 9, 1753. Web of Science CrossRef PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.