research papers
An intermolecular binding mechanism involving multiple LysM domains mediates carbohydrate recognition by an endopeptidase
aCentre for Carbohydrate Recognition and Signalling, Department of Molecular Biology and Genetics, Aarhus University, Gustav Wieds Vej 10C, 8000 Aarhus, Denmark, bNiels Bohr Institute, Faculty of Science, University of Copenhagen, Universitetsparken 5, 2100 Copenhagen, Denmark, and cCentre for Carbohydrate Recognition and Signalling, Department of Chemistry, Faculty of Science, University of Copenhagen, Thorvaldsensvej 40, 1871 Frederiksberg C, Denmark
*Correspondence e-mail: mickael.blaise@cpbs.cnrs.fr
LysM domains, which are frequently present as repetitive entities in both bacterial and plant proteins, are known to interact with N-acetylglucosamine (GlcNAc) moieties, such as chitin and In bacteria, the functional significance of the involvement of multiple LysM domains in substrate binding has so far lacked support from high-resolution structures of ligand-bound complexes. Here, a structural study of the Thermus thermophilus NlpC/P60 endopeptidase containing two LysM domains is presented. The and small-angle X-ray scattering solution studies of this endopeptidase revealed the presence of a homodimer. The structure of the two LysM domains co-crystallized with N-acetyl-chitohexaose revealed a new intermolecular binding mode that may explain the differential interaction between LysM domains and short or long chitin oligomers. By combining the structural information with the three-dimensional model of a model suggesting how protein dimerization enhances the recognition of is proposed.
containingKeywords: D,L-endopeptidase; LysM domains.
3D view: 4xcm,4uz2,4uz3
PDB references: P60_tth, 4xcm; P60_2LysM, 4uz2; P60_2LysM bound to chitohexaose, 4uz3
1. Introduction
Most bacteria are protected from their environment by a rigid cell wall containing N-acetylglucosamine (GlcNAc) and N-acetylmuramic acid (MurNAc) (Dworkin, 2014). A three-dimensional PGN network is formed owing to cross-linking of the peptide stems attached to MurNAc (Meroueh et al., 2006; Kim et al., 2015). The composition of the peptide stems differs among species, but is usually made up of three to five amino acids that include noncanonical D-amino acids. The greatest variation lies in the third amino acid, which is often α-L,∊-D-diaminopimelic acid (meso-DAP) or L-lysine; L-ornithine has also been reported (Vollmer, Blanot et al., 2008; Quintela et al., 1995). Although PGN is very rigid, it has also been shown to be sufficiently dynamic to allow bacterial to elongate and separate during cell division (Typas et al., 2012). During these dynamic phases, PGN is remodelled, and the balance between PGN synthesis and hydrolysis has to be tightly controlled to ensure bacterial survival (Egan & Vollmer, 2013).
(PGN), a disaccharide polymer ofNumerous enzymes termed autolysins are involved in PGN remodelling. Glycosidases, such as muraminidases and glucosaminidases, hydrolyze glycosidic bonds between carbohydrate units, while peptidases, such as amidases, L,D-endopeptidases, D,L-endopeptidases, L,D-carboxypeptidases and D,D-carboxypeptidases, hydrolyze amide bonds of the peptide stem at specific positions (Vollmer, Joris et al., 2008). D,L-Endopeptidases belonging to the papain-like peptidase superfamily possess an NlpC/P60 domain which is responsible for their (Anantharaman & Aravind, 2003). This domain is commonly associated with PGN-binding domains such as the SH3b domain, choline-binding domain or lysin motif (LysM) (Anantharaman & Aravind, 2003; Xu et al., 2009). These domains are assumed to assist in anchoring the protein to the cell wall. However, many of the crystallographic or NMR structures of NlpC/P60 endopeptidases deposited in the Protein Data Bank (PDB) contain only the catalytic domains. To date, only the structures of the NlpC/P60 proteins from the cyanobacteria Anabaena variabilis, Nostoc punctiforme and Bacillus cereus have been solved with their N-terminal SH3b domains (Xu et al., 2009, 2010). In addition, the structure of the NlpC/P60-related amidase of AmiA from Bacteroides uniformis has recently been solved in complex with GlcNAc and GlcNAc-1,6-anhydro-MurNAc, providing insights into the substrate recognition and specificity of the enzyme (Xu et al., 2014). However, no NlpC/P60 structures associated with the choline-binding domain or LysM domains have been solved, and there remains a lack of NlpC/P60 structures that have been solved in complex with PGN fragments containing carbohydrate units. As such, there is a limited understanding of how these enzymes anchor onto PGN and how the substrates are delivered to the catalytic domain.
We and others have shown that the LysM domain does indeed mediate recognition of PGN (Visweswaran et al., 2013, 2014; Wong et al., 2014; Maolanon et al., 2014; Frankel & Schneewind, 2012; Mesnage et al., 2014; Schanda et al., 2014). Our study of the multiple LysM-containing protein CwlS from B. subtilis also demonstrated that the NlpC/P60 endopeptidase displays an affinity towards PGN in the micromolar range. This modest affinity suggests that the multiple LysM modules present in the N-terminus of NlpC/P60 proteins may be crucial for anchoring the proteins to PGN and consequently for their hydrolytic function (Wong et al., 2014). Recent biochemical approaches have suggested that multiple LysM domains cooperate to enhance binding to GlcNAc polymers. However, none of these studies were able to conclude whether this affinity enhancement was owing to the fact that each LysM domain can bind a carbohydrate molecule or to the fact that several LysM domains can bind to the same carbohydrate molecule (Wong et al., 2014; Mesnage et al., 2014), or a combination of both.
The et al., 2013). Dimerization of plant AtCERK1 receptors on long chitin polymers has also been demonstrated and has been suggested to be important for immune signalling (Liu et al., 2012). Recently, a `sandwich-type' dimerization mode has also been proposed for the recognition of chitin by the CEBiP–OsCERK1 receptor complex that is involved in plant immunity (Hayafune et al., 2014). However, no structural information has supported this sandwich model of intermolecular dimerization.
of the fungal Ecp6 chitin-scavenger protein containing three LysM domains has shown that chitin is recognized at the interface of two intrachain LysM domains (Sánchez-ValletIn this study, we unravel the crystallographic and solution structure of TTHA0266 (renamed P60_tth), an NlpC/P60 D,L-endopeptidase from Thermus thermophilus that possesses an N-terminal PGN-anchoring domain made up of two LysM domains (Fig. 1a). We also report a structure of P60_2LysM (P60_tth with no catalytic domain) bound to N-acetyl-chitohexaose (henceforth referred to as chitohexaose), which sheds light on how LysM domains cooperate to bind long chitin/PGN polymers. Based on these high-resolution structural investigations, we propose a model describing how LysM domains may help to anchor the catalytic domains of the D,L-endopeptidase onto PGN.
2. Materials and methods
2.1. Gene cloning, protein expression and purification
The TTHA0266 gene was cloned and the P60_tth protein was expressed and purified as described previously (Wong & Blaise, 2013). The gene was cloned in frame with a Trx tag, a His tag and an S-tag into pET-32 Ek/LIC expression vector, which served as a template for generating the truncation mutants P60_2LysM (no catalytic domain), P60_1LysM (no N-terminal LysM domain) and P60_cata (catalytic domain alone) using the following primers: for P60_2LysM, the reverse primer GAGGAGAAGCCCGGTTACGCCTCGCCCTCTTCGGGAAGCCTCAGGACCTGCCCCACCTTG; for P60_1LysM, the forward primer GACGACGACAAGATGGAGAATCTGTACTTCCAGGGATCGAGGGAAAGGACCCACGTGGTGGCCCCGGGGGACACC; and for P60_cata, the forward primer GACGACGACAAGATGGAGAATCTGTACTTCCAGGGAGAAAGCCCCCTCCTCCGGGCCGTCCTCCGCTACCTGGGG. The sequence in bold encodes the Tobacco etch virus (TEV) protease cleavage site that was introduced to facilitate the removal of the affinity tags during the protein purification process. The genes for the aforementioned truncation mutants were all cloned into the pET-32 Ek/LIC vector (Novagen). The P60_tth_LysM1_mut and P60_tth_LysM2_mut binding mutants in the pET-44 and pET-32 Ek/LIC vectors (Novagen), respectively, were generated using the QuikChange Lightning Site-Directed Mutagenesis Kit (Agilent Technologies) according to the manufacturer's instructions. The pET-44 Ek/LIC vector encodes a His tag, a Nus tag, a His tag and an S-tag at the N-terminus. All mutants were produced and purified using the same procedures as used for the wild-type protein. Briefly, the recombinant proteins were produced in Escherichia coli BL21 Rosetta 2 (DE3) competent cells (Novagen), which were lyzed by sonication. The purification steps included an initial round of nickel-affinity (IMAC), TEV protease cleavage, a second round of IMAC and using a Superdex 75 10/300 GL column (GE Healthcare). For P60_cata, thrombin cleavage was performed after the first round of IMAC. For the P60_tth_LysM1_mut construct, an additional anion-exchange step was introduced after the second round of IMAC to separate the cleaved tags from the protein; this was performed using a 1 ml HiTrap DEAE FF column (GE Healthcare). All purification steps were performed at 4°C and all proteins were at least 95% pure after the final step of purification.
2.2. Crystallization and structure determination
The full-length protein structure was solved using selenomethionine-derivative crystals as described previously (Wong & Blaise, 2013). Briefly, the selenomethionine-derivative protein was crystallized at 19°C in hanging drops composed of 1 µl protein solution at 24 mg ml−1 and 1 µl reservoir solution consisting of 0.1 M sodium citrate pH 5.5, 16%(w/v) PEG 4000, 15%(v/v) 2-propanol equilibrated against 500 µl reservoir solution. Crystals were soaked briefly in cryoprotectant solution consisting of 0.1 M sodium citrate pH 5.5, 16%(w/v) PEG 4000 and 20% ethylene glycol prior to being cryocooled in liquid nitrogen. Data collection was performed at a wavelength of 0.978 Å on the I911-3 beamline at the MAX-lab synchrotron, Lund, Sweden (Ursby et al., 2013) as described previously (Wong & Blaise, 2013). The structure was solved by single-wavelength (SAD) phasing (Hendrickson & Teeter, 1981) as described in Wong & Blaise (2013).
P60_2LysM was crystallized at 19°C in conditions consisting of 28%(w/v) PEG MME 2000 and 0.1 M potassium thiocyanate. Sitting drops set up by adding 1 µl reservoir solution to 1 µl 50 mg ml−1 protein solution were equilibrated against 500 µl reservoir solution. The crystal was soaked briefly in mother liquor containing 34%(w/v) PEG MME 2000 prior to cryocooling in liquid nitrogen. Data were collected on the I911-3 beamline at MAX-lab. The data set consisted of 200 frames collected with 1° oscillation range, 5 s exposure time, a wavelength of 0.98 Å and a crystal-to-detector distance of 204.8 mm.
The structure of P60_2LysM bound to chitohexaose was obtained by co-crystallizing the two LysM domains with chitohexaose (Megazyme) at a protein:sugar molar ratio of 1:2 by dissolving the carbohydrate powder directly in the protein solution and incubating it overnight on ice. The complex was crystallized at 19°C in conditions consisting of 1.6 M ammonium sulfate, 0.1 M MES pH 6.5 and 5%(v/v) 1,4-dioxane. Hanging drops set up by adding 0.5 µl reservoir solution to 0.5 µl protein solution at 32 mg ml−1 were equilibrated against 500 µl reservoir solution. The crystal was soaked briefly in a solution consisting of 1.6 M ammonium sulfate, 0.1 M MES pH 6.5, 20%(v/v) 1,4-dioxane and 5%(v/v) glycerol prior to cryocooling in liquid nitrogen. Data were collected on the I911-2 beamline at MAX-lab (Mammen et al., 2002). The data set consisted of 200 frames collected with 1° oscillation range, 5.2 s exposure time, a wavelength of 1.04 Å and a crystal-to-detector distance of 100 mm.
All three structures were refined with the PHENIX package (Adams et al., 2011) and model building was performed with Coot (Emsley et al., 2010). The quality of the three structures was checked with MolProbity (Chen et al., 2010), giving the following core/allowed statistics for the Ramachandran plot: 95.3/4.7% for the full-length structure, 97.6/2.4% for the P60_2LysM–chitohexaose structure and 97.4/2.6% for the P60_2LysM structure.
2.3. Size-exclusion chromatography
A Kav against the logarithm of the molecular weight of the standard proteins. Proteins were loaded onto a Superdex 75 10/300 GL column (GE Healthcare) and eluted with buffer consisting of 50 mM Tris–HCl pH 8, 200 mM NaCl and 5 mM β-mercaptoethanol at a flow rate of 0.5 ml min−1.
was obtained using the Gel Filtration Markers Kit for Protein Molecular Weights 6 500–66 000 Da (Sigma–Aldrich) by plotting the2.4. Small-angle X-ray scattering (SAXS) experiments
SAXS data were obtained at the European Synchrotron Radiation Facility (ESRF), Grenoble, France. The data were recorded on beamline BM-29 and absolute-scale calibration was performed with bovine serum albumin (BSA) and water as references. The obtained data were azimuthally averaged, normalized and background-subtracted using the BsxCuBE software suite available at the beamline (Pernot et al., 2010). This yielded the scattering intensity I(q), where the scattering vector q is defined by q = 4πsin(θ)/λ, where θ is half of the scattering angle and λ is the wavelength of the incoming beam. All modelling was performed with CORAL (Petoukhov et al., 2012) and the scattering from all structures was evaluated with CRYSOL (Svergun et al., 1995). CORAL runs were made without any imposed symmetry and CRYSOL was run using the default settings. Both software packages are from the ATSAS suite v.2.4 (Petoukhov et al., 2012).
2.5. Microscale thermophoresis binding studies
Protein interactions with chitohexaose were assessed using microscale thermophoresis (MST; Seidel et al., 2013). Proteins were labelled using the Monolith NT.115 Protein Labeling Kit BLUE (NanoTemper Technologies), and a labelling efficiency of approximately 2:1 molar ratio of labelled protein to dye was achieved. A twofold titration series was prepared in which the concentration of the labelled proteins was kept constant at 200 nM and the concentration of the titrant, chitohexaose, was varied from 152 nM to 5 mM in thermophoresis buffer consisting of 50 mM phosphate pH 7.5 and 0.1% Tween 20. After incubation for 1 h at 60°C in the dark, MST measurements were performed at room temperature on a Monolith NT.115 instrument (NanoTemper Technologies). Standard capillaries were used and the LED power was adjusted to 50%. Negative controls for each protein were performed using 200 nM labelled protein in thermophoresis buffer in all 16 capillaries under the same conditions as mentioned above. For each measurement the laser was switched on for 30 s and off for 5 s. Binding curves were obtained from the thermophoresis phase with an infrared laser power of 20%. For each protein, three sets of titration series were prepared and the sigmoidal dose-response curves were fitted with GraphPad Prism 6 to yield an average Kd value.
3. Results and discussion
3.1. of P60_tth
The P60_tth protein was expressed and crystallized as described previously (Wong & Blaise, 2013). The P60_tth structure was solved by the single-wavelength (SAD) method (Hendrickson & Teeter, 1981) using selenomethionine-derivative protein as described previously (Wong & Blaise, 2013).
The structure was refined to 2.65 Å resolution and the . The final model contained two molecules in the (chains A and B). The molecules are not equivalent in terms of model-building completion. In the two catalytic domains, residues 117–246 could be built for chain A and residues 122–245 for chain B. Only three of the four LysM domains could be traced; the two N-terminal LysM domains (LysM1) could be modelled but no electron density was observed for one of the second LysM domains: LysM2 from chain A. In addition, the linker between LysM1 and LysM2 of chain B could be traced (Figs. 1b and 1c).
are shown in Table 1
|
Analysis of the crystal packing with the PISA server (Krissinel & Henrick, 2007) indicates that a stable homodimer is formed within the crystal. Dimer formation is mediated by interactions between the two catalytic domains and between the LysM1 domain of chain B and the of chain A and vice versa (Figs. 1b and 2). The catalytic domains dimerize via a surface area of about 980 Å2. This dimerization interface involves 17 residues mainly belonging to strand 8 (S8) and helix 5 (H5) of each monomer (Fig. 2). Two salt bridges are established between the side chains of Arg223 and Glu230 from each Additionally, 11 hydrogen bonds and van der Waals interactions stabilize the dimer interface (Fig. 2, upper panel). The interaction surface between LysM1 and the is 593 Å2. This interface involves residues in the vicinity of H6 from the and residues from H1 and H2 of LysM1. 21 residues of the contact 14 residues of the LysM1 domain. Six of these interactions are mediated by hydrogen bonds (Fig. 2, lower panel). We analyzed all of the NlpC/P60 structures deposited in the PDB and observed that three NlpC/P60 proteins with the PDB codes 4hpe (Joint Center for Structural Genomics, unpublished work), 3pvq (Joint Center for Structural Genomics, unpublished work) and 2evr (Xu et al., 2009) seem to be able to form stable homodimers. Nevertheless, none of them have the same dimer interface as observed in the P60_tth structure (not shown).
The structure can be divided into two parts: the anchoring domain at the N-terminus and the a). The anchoring domain is made up of two LysM domains that are connected to the by a polyproline linker that could not be traced (Fig. 1c).
at the C-terminus (Fig. 1The β-sheet composed of five antiparallel β-strands that are surrounded by four α-helices (Figs. 1c and 3a). A search for structurally related proteins using the DALI server (Holm & Rosenström, 2010) shows that the matches structures from the NlpC/P60 protein family. The most similar structures are the putative cell-wall hydrolase from Clostridium difficile (PDB entry 4hpe ; Joint Center for Structural Genomics, unpublished work), with a Z-score of 16.3 and a root-mean-square deviation (r.m.s.d.) of 2.7 Å over the Cα atoms of 116 residues, and the D,L-endopeptidase YkfC from B. cereus (PDB entry 3h41 ; Xu et al., 2010), with a Z-score of 16.1 and an r.m.s.d. of 1.9 Å over the Cα atoms of 111 residues. These two catalytic domains share 32% sequence identity with the of P60_tth. The comparison to the YkfC structure is of interest since it was solved with a bound ligand: the L-Ala-D-Glu peptide (Xu et al., 2010). As such, we can use the YkfC model to identify the putative catalytic residues of P60_tth and to propose its probable function.
is made up of a centralSuperposition of the P60_tth and YkfC structures indicates that the two catalytic domains are indeed highly similar (Figs. 3b and 3c). The Cys, His and His catalytic triad in the active site of YkfC is conserved in P60_tth (Fig. 3c). Moreover, all residues whose side chains are involved in the binding of the L-Ala-D-Glu product are either semi-conserved or fully conserved (Fig. 3c). We notice, however, that H6 of the is one turn longer in P60_tth compared with YkfC. Consequently, a steric clash is observed between H6 of P60_tth and the L-Ala-D-Glu product in YkfC when superposing the two structures (Fig. 3c). This indicates that the substrate/product of P60_tth may differ from that of YkfC and/or that the catalytic site requires some structural rearrangement before it can bind its substrate/product. Overall, comparisons to known NlpC/P60 structures strongly suggest that P60_tth also functions as a D,L-endopeptidase involved in PGN hydrolysis.
Despite numerous efforts, we have not been able to identify any hydrolytic activity of the P60_tth protein on E. coli, B. subtilis or T. thermophilus cells or purified cell walls. We have also tried unsuccessfully to assess the in vitro activity of P60_tth on commercial PGN fragments and chemically synthesized cross-linked PGN from T. thermophilus (Supporting Information). This absence of activity is puzzling, but similar difficulties in establishing NlpC/P60 enzymatic assays have also been reported recently (Gomez et al., 2014). Furthermore, it cannot be excluded that we did not identify the optimal conditions for P60_tth activity and/or that the enzyme needs to undergo proteolytic activation, as demonstrated for the M. tuberculosis NlpC/P60 protein RipA (Ruggiero et al., 2010; Chao et al., 2013).
The anchoring domain is composed of two LysM domains (Fig. 1a). Each LysM domain adopts a βααβ fold (Figs. 1c and Supplementary Fig. S1). The primary sequences of LysM1 and LysM2 are very similar since they share 72% sequence identity. Superposition of the LysM domains yields an r.m.s.d. of 0.55 Å over the main chain of 42 residues.
The two LysM domains are very similar to the LysM structures deposited in the PDB, notably to the LysM domain of the B. subtilis YkuD protein (Bielnicki et al., 2006; Lecoq et al., 2012), the LysM2 domain of the fungal Ecp6 protein (Sánchez-Vallet et al., 2013) and the LysM2 domain of the plant CERK1 receptor (AtCERK1; Liu et al., 2012) (Supplementary Fig. S1). The main difference is the existence of an extra helix turn between H2 and S2 in both the AtCERK1 and the YkuD LysM structures; only a loop is present in the corresponding region of P60_tth LysM1 (Supplementary Fig. S1).
3.2. P60_tth is a homodimer in solution
To investigate whether the homodimer exists in solution, we first estimated the oligomeric state of P60_tth using a). This corresponds to a dimer since the theoretical molecular weight of the monomer is 26.5 kDa. From the crystal-packing analysis, it seems that the strongest interactions are established between the two catalytic domains. To verify this, we expressed and purified truncated versions of the protein in which one (P60_1LysM) or two (P60_cata) LysM domains were deleted. These two proteins with predicted molecular weights of 21.4 and 18.3 kDa, respectively, eluted with apparent molecular weights of 43.9 and 40.4 kDa, respectively, which corresponds to dimers (Fig. 4a). In contrast, a construct possessing only the two LysM domains (P60_2LysM), i.e. without a with a predicted molecular weight of 10.9 kDa, elutes with an apparent mass of 11.2 kDa, reflecting the presence of a monomer (Fig. 4a). In summary, the SEC experiments indicate that P60_tth is a homodimer in solution and that the catalytic domains mediate the dimerization.
(SEC). The indicates that the full-length protein (P60_tth) has an apparent molecular weight of 58.5 kDa (Fig. 4Additionally, small-angle X-ray scattering (SAXS) data were collected for the full-length and LysM-truncated forms of the protein (Fig. 4b). The estimated molecular weight of the full-length protein is 56.7 kDa, supporting our observation that P60_tth forms stable dimers in solution. The P60_1LysM and P60_cata truncation mutants also behave as dimers in solution. Hence, the SAXS experiments confirm the existence of stable dimers in solution.
Furthermore, we combined our SAXS and crystallographic data to model the complete P60_tth dimer. To do so, we superposed the most complete molecule from the i.e. chain B was superposed on chain A. This dimeric model (dimer), a monomeric model (monomer) and a dimeric model including dummy residues representing the amino acids not seen in the (dimer + linker + C-t) were evaluated against the SAXS data (Fig. 4c). CORAL (Petoukhov et al., 2012) was used to model the missing residues and the evaluation of the fit of the models to the SAXS data was performed with CRYSOL (Svergun et al., 1995). The resulting fits (Fig. 4c) showed a clear improvement from the monomer (χ value of 97.34) to the dimer (χ value of 10.97). The fit was further improved when the dimeric model containing dummy residues (χ value of 7.29) was used in the evaluation (Fig. 4c). Further of the model did not improve the fit significantly, thereby confirming that the solution structure is highly similar to the crystal structure.
with the less complete molecule,3.3. LysM domains cooperate to bind long carbohydrates
Since we aimed to understand how LysM domains anchor the a). As PGN fragments with long MurNAc-GlcNAc chains are very difficult to obtain, we tried to co-crystallize P60_2LysM with GlcNAc polymers. This approach is relevant since we have shown previously that bacterial LysM domains bind MurNAc-GlcNAc and GlcNAc polymers with similar affinities (Wong et al., 2014). Using this strategy, we successfully crystallized and solved the of P60_2LysM bound to chitohexaose (Figs. 5a and 5b).
onto PGN, we tried to obtain a structure of full-length P60_tth bound to ligands, unfortunately without any success. Attempts to soak chitin and PGN carbohydrate polymers into P60_tth crystals were also futile. Alternatively, we tried to co-crystallize the construct containing only two LysM domains, P60_2LysM (Fig. 1The structure was solved to 1.75 Å resolution (Table 1). Three molecules of P60_2LysM (monomers 1–3) are present in the (Fig. 5b). Monomers 1 and 2 are identical, while no electron density was observed for the LysM2 domain of monomer 3. All LysM domains that could be traced, however, bind a chitohexaose molecule (Fig. 5b).
The LysM binding cleft is similar to those described for the crystal structures of the plant CERK1 receptor (Liu et al., 2012) and the fungal Ecp6 protein (Sánchez-Vallet et al., 2013) and the NMR solution structures of the bacterial AtlA autolysin (Mesnage et al., 2014) and the fungal CVNH-LysM lectin (Koharudin et al., 2011). A similar LysM binding cleft has also been characterized biochemically by NMR for plant chitinase A (Ohnuma et al., 2008). The binding pocket is delimited by the loop between S1 and H1 and the loop between H2 and S2. Monomers 1 and 2 bind chitohexaose in the same manner, but differently from monomer 3. For monomers 1 and 2, the carbohydrate induces intermolecular dimerization with symmetry-related molecules (Fig. 5c).
LysM1 of monomer 1A (Fig. 5c) mainly contacts GlcNAc 6 to GlcNAc 3. The side chain of Gln53 contacts O3 of GlcNAc 6, while the main chains of Gly24 and Leu52 contact its N-acetyl group. Additionally, the side chain of Val21 mediates a with the N-acetyl group of GlcNAc 6. The main chain of Phe50 recognizes O6 of GlcNAc 5. Phe50 also mediates a hydrogen bond to GlcNAc 4 via a water molecule that is stabilized by the main chain of Leu27. Furthermore, the Phe50 side chain mediates a with the N-acetyl group of GlcNAc 4, which is also recognized by the main chain of Tyr28. Finally, Thr26 mediates a hydrogen bond via O4 of GlcNAc 3.
The recognition of GlcNAc 3, GlcNAc 2 and GlcNAc 1 is achieved by LysM1 and LysM2 of monomer 1A and LysM2 of the symmetry-related monomer 1B. The side chain of Val69 mediates a with the N-acetyl group of GlcNAc 3, while the main chains of Ile100 and the carboxylic group of Glu99 establish hydrogen bonds to O6 of GlcNAc 2. The main chains of Pro98 and Leu75 stabilize a water molecule which mediates a hydrogen bond to O3 of GlcNAc 1, while the main chains of Leu75 and Phe76 recognize O7 of the N-acetyl group. Thr74 binds the O1 group of GlcNAc 1. Finally, Arg32 and Arg80 from LysM1 and LysM2 of monomer 1A, respectively, recognize GlcNAc 2 via two water-mediated hydrogen bonds.
In summary, we observed that LysM1 of monomer 1A mainly contacts the last four GlcNAc residues (GlcNAc 6 to GlcNAc 3), while LysM2 of monomer 1B mainly contacts the first three GlcNAc residues (GlcNAc 3 to GlcNAc 1); this LysM2 domain could also potentially interact with a fourth GlcNAc residue if a longer chitin polymer was present. There is a strong difference between this intermolecular dimerization mode and the intramolecular dimerization mode observed in the of fungal Ecp6 bound to chitin (Fig. 6; Sánchez-Vallet et al., 2013). In the Ecp6 structure, the four GlcNAc residues are sandwiched between the two intrachain LysM domains. This sandwich mode of binding has also recently been proposed to occur for chitin recognition by the CERK1–OsCEBiP complex involved in rice immune responses (Hayafune et al., 2014). Our structure offers an alternative binding mode that could explain how LysM receptors dimerize and signal upon recognition of long chitin oligomers.
Very interestingly, the fact that chitohexaose is recognized by two LysM domains from different monomers supports several biochemical observations made on LysM proteins from different phyla (Wong et al., 2014; Hayafune et al., 2014; Liu et al., 2012). We and others have recently proposed that LysM domains in multiple LysM-containing proteins act cooperatively to enhance the binding of these proteins to long (Wong et al., 2014; Mesnage et al., 2014). However, we could not explain whether this was owing to the fact that each LysM domain can bind a chitin molecule or because several LysM domains can bind to the same chitin molecule. With our we now claim that both events occur, since each LysM domain in the binds a chitohexaose molecule which could also be bound by a LysM domain from a different monomer.
Dimerization of LysM domains through et al., 2014; Liu et al., 2012; Madsen et al., 2011), and models of dimerization have been proposed. Our now provides the structural basis for these observations and therefore aids in the design of receptor-dimerization models that are of great importance in this field of research.
has also been demonstrated to be very important for LysM receptors involved in plant defence and symbiotic mechanisms (Hayafune3.4. Comparison of the chitin-binding site in LysM from different phyla
Although a chitohexaose molecule occupies the same binding pocket in the LysM1 domain of monomer 3, a second type of binding is observed (Fig. 7a). This LysM1 domain recognizes GlcNAc 5 to GlcNAc 1, while GlcNAc 6 is not contacted. The Gln53 side chain binds to both GlcNAc 5 and GlcNAc 4, and the main chains of Gly24 and Leu52 bind to the N-acetyl group of GlcNAc 4. The main chains of Phe50 and Leu52 mediate interactions with the O6 group of GlcNAc 3, while the main chains of Leu27 and Phe50 bind to O3 of GlcNAc 2. Finally, the main chain of Tyr28 contacts the N-acetyl group of GlcNAc 2, while the side chain stacks with GlcNAc 1.
By comparing the two different positions of the chitohexaose molecules observed in the LysM1 domains (monomers 1A and 3) of P60_tth with the position of the chitopentaose molecule observed in the LysM2 domain of the plant AtCERK1 receptor (Liu et al., 2012), we see that the GlcNAc 6 position in the LysM binding site of monomer 1A (Fig. 6) corresponds to the GlcNAc 4 position in monomer 3 and the GlcNAc 3 position in AtCERK1 LysM2 (Figs. 5c and 7a). It is therefore tempting to propose that the LysM domains might be able to `slide' along carbohydrates.
3.5. Mutations in the LysM binding site affect chitohexaose dissociation constants
To further validate that the binding site observed in the et al., 2013) was used to measure binding affinities towards chitohexaose in solution, as it has previously been shown to be suitable for measuring such interactions (Wong et al., 2014; Maolanon et al., 2014; Broghammer et al., 2012).
is biologically relevant, we used an alanine-scanning approach to mutate residues in the binding site. Subsequently, microscale thermophoresis (MST; SeidelWe compared the binding capacity of full-length P60_tth to two mutant proteins, P60_tth_LysM1_mut (Y28A, R32A, F50A, Q53A) and P60_tth_LysM2_mut (F76A, R80A, E99A), in which the residues involved in carbohydrate binding via side-chain interactions were mutated to Ala (Supplementary Fig. S2). The full-length P60_tth protein has an apparent Kd of 90 ± 19.8 µM for chitohexaose (Supplementary Fig. S2). P60_tth_LysM1_mut and P60_tth_LysM2_mut have similar Kd values of 320.7 ± 112.3 and 292.1 ± 86.5 µM, respectively, which are approximately three times lower than that of the wild-type protein (Supplementary Fig. S2). It appears that numerous interactions between the carbohydrate and the protein main chain may be sufficient to preserve binding. Although the mutations did not abolish protein–carbohydrate interactions, the reduced binding affinities help to validate the biological relevance of the chitohexaose binding site observed in the Considering the similarity of the binding sites determined in this bacterial endopeptidase LysM domains to the binding sites observed in plant and fungal LysM domains (Sánchez-Vallet et al., 2013; Liu et al., 2012; Ohnuma et al., 2008; Koharudin et al., 2011), we conclude that LysM–carbohydrate binding sites are conserved among phyla.
3.6. Multiple LysM domains are flexible
We have also successfully solved the ). By comparing the LysM domains in this structure with the LysM domains in the full-length P60_tth and the P60_2LysM–chitohexaose structures, we observe that binding of the carbohydrate triggers only minor structural rearrangements in the binding pocket. Only the side chains of Gln53 and Arg32 reorient upon carbohydrate binding (not shown).
of P60_2LysM without any ligand (Table 1However, the relative positions of the two LysM domains differ significantly in the bound state compared with the unbound states (Fig. 8). The linker seems to allow some flexibility between LysM domains, but we cannot claim for certain that the movement is triggered by carbohydrate binding because such movements may arise owing to crystal packing. Nonetheless, we recently showed through SAXS experiments that the four LysM domains of the B. subtilis CwlS protein are flexible in solution (Wong et al., 2014). Our structural data reinforce this observation and clearly indicate that the two LysM domains in P60_tth are flexible despite being separated by a short linker of only four amino acids.
3.7. A model of P60_tth interacting with peptidoglycan
Although we could not obtain a
of P60_tth bound to PGN fragments, the structures of the full-length P60_tth and chitohexaose-bound P60_2LysM enable us to propose a model suggesting how NlpC/P60 proteins possessing multiple LysM domains might recognize PGN.First of all, we superposed a MurNAc peptide (Hoyland et al., 2014) onto GlcNAc 6, GlcNAc 4 and GlcNAc 2 of the chitohexaose from our P60_2LysM–chitohexaose structure. With minimal additional modelling (rotating only the bond between L-Ala and MurNAc), we could fit the peptide stem without inducing any with the residues from the LysM binding site (Fig. 9a). This suggests that a MurNAc-GlcNAc oligosaccharide might interact in a similar way to that observed with a GlcNAc oligosaccharide and that the peptide portion of PGN might not be recognized at all by the residues in the LysM domains. However, we do not exclude the possibility that the peptide portion of PGN might trigger upon binding in the LysM groove. This hypothesis was demonstrated in a recent study by Mesnage and coworkers, who proposed that the peptide portion of PGN reduces the affinity of the Enterococcus faecalis AtlA single LysM domain for PGN (Mesnage et al., 2014).
The distance between the binding sites of the two LysM domains is about 35 Å, which interestingly is the same as the distance between MurNAc-GlcNAc strands that are cross-linked by the PGN peptide stem in the three-dimensional model of PGN proposed by Meroueh et al. (2006). Moreover, the distance between the two putative catalytic cysteines of the is about 27 Å, while the length of the peptide stem is about 25 Å in the PGN model (Fig. 9b). These simple distance observations led us to propose a model of interaction in which individual LysM domains bind opposite carbohydrate strands of PGN, enabling favourable positioning of the catalytic domains to cleave the peptide stems (Fig. 9c). With the knowledge that P60_tth behaves as a homodimer, and assuming that the protein cleaves the peptide arm between the second and third amino acids, we postulate that the two catalytic domains might cleave two bonds simultaneously. This could confer an advantage since PGN fragments released during PGN remodelling are recycled (Reith & Mayer, 2011; Boudreau et al., 2012; Johnson et al., 2013). The P60_tth homodimer could release two GlcNac-MurNac in each hydrolysis step instead of one, therefore enhancing the PGN recycling efficiency.
4. Concluding remarks
In this study, we present a novel structure of an NlpC/P60 protein containing multiple LysM domains. Additionally, the structure of P60_2LysM bound to chitohexaose provides the first structural evidence for intermolecular dimerization of LysM-containing proteins on a GlcNAc polymer. Based on investigations of the crystal structures of P60_tth and P60_2LysM, we have proposed models describing how bacterial LysM domains recognize PGN and how the dimerization of the LysM and catalytic domains may be features that enhance the recognition of PGN and efficiency of PGN hydrolysis by P60_tth.
5. Related literature
The following references are cited in the Supporting Information for this article: Agnihotri et al. (2011), Kok et al. (2009) and Kumar et al. (2013).
Supporting information
3D view: 4xcm,4uz2,4uz3
PDB references: P60_tth, 4xcm; P60_2LysM, 4uz2; P60_2LysM bound to chitohexaose, 4uz3
Supporting Information. DOI: 10.1107/S139900471402793X/rr5089sup1.pdf
Footnotes
‡Present address: Université de Montpellier, Centre d'études d'agents Pathogènes et Biotechnologies pour la Santé (CPBS), CNRS, UM5236, F-3689 Montpellier, France.
Acknowledgements
JEMMW and MB performed the purification, crystallization, biochemical experiments, structures
and analyses. SRM performed the SAXS experiments and data analyses. KG refined the structure of P60_2LysM alone. MBT and KKS performed the chemical synthesis of PGN and hydrolysis assays. KJJ, JS and ST supervised the research. MB designed the project and wrote the paper together with JEMMW. All of the authors contributed to the proofreading of the manuscript. The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007–2013) under BioStruct-X (grant agreement No. 283570), the Danish National Research Foundation grant No. DNRF79 and Danscatt. We thank Professor Mobashery from the University of Notre Dame for the gift of the structure coordinates and also the staff at the MAX-lab synchrotron for their help during data collection. We thank G. Bajic and J. Bauer for collecting data sets. We thank R. Fitchett for proofreading the manuscript.References
Adams, P. D. et al. (2011). Methods, 55, 94–106. Web of Science CrossRef CAS PubMed Google Scholar
Agnihotri, G., Ukani, R., Malladi, S. S., Warshakoon, H. J., Balakrishna, R., Wang, X. & David, S. A. (2011). J. Med. Chem. 54, 1490–1510. CrossRef CAS PubMed Google Scholar
Anantharaman, V. & Aravind, L. (2003). Genome Biol. 4, r11. CrossRef PubMed Google Scholar
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucleic Acids Res. 28, 235–242. Web of Science CrossRef PubMed CAS Google Scholar
Bielnicki, J., Devedjiev, Y., Derewenda, U., Dauter, Z., Joachimiak, A. & Derewenda, Z. S. (2006). Proteins, 62, 144–151. Web of Science CrossRef PubMed CAS Google Scholar
Boudreau, M. A., Fisher, J. F. & Mobashery, S. (2012). Biochemistry, 51, 2974–2990. CrossRef CAS PubMed Google Scholar
Broghammer, A., Krusell, L., Blaise, M., Sauer, J., Sullivan, J. T., Maolanon, N., Vinther, M., Lorentzen, A., Madsen, E. B., Jensen, K. J., Roepstorff, P., Thirup, S., Ronson, C. W., Thygesen, M. B. & Stougaard, J. (2012). Proc. Natl Acad. Sci. USA, 109, 13859–13864. CrossRef CAS PubMed Google Scholar
Chao, M. C., Kieser, K. J., Minami, S., Mavrici, D., Aldridge, B. B., Fortune, S. M., Alber, T. & Rubin, E. J. (2013). PLoS Pathog. 9, e1003197. Web of Science CrossRef PubMed Google Scholar
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12–21. Web of Science CrossRef CAS IUCr Journals Google Scholar
Dworkin, J. (2014). Annu. Rev. Microbiol. 68, 137–154. CrossRef CAS PubMed Google Scholar
Egan, A. J. F. & Vollmer, W. (2013). Ann. N. Y. Acad. Sci. 1277, 8–28. CrossRef CAS PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Frankel, M. B. & Schneewind, O. (2012). J. Biol. Chem. 287, 10460–10471. CrossRef CAS PubMed Google Scholar
Hayafune, M., Berisio, R., Marchetti, R., Silipo, A., Kayama, M., Desaki, Y., Arima, S., Squeglia, F., Ruggiero, A., Tokuyasu, K., Molinaro, A., Kaku, H. & Shibuya, N. (2014). Proc. Natl Acad. Sci. USA, 111, E404–E413. CrossRef CAS PubMed Google Scholar
Hendrickson, W. A. & Teeter, M. M. (1981). Nature (London), 290, 107–113. CrossRef CAS Web of Science Google Scholar
Holm, L. & Rosenström, P. (2010). Nucleic Acids Res. 38, W545–W549. Web of Science CrossRef CAS PubMed Google Scholar
Hoyland, C. N., Aldridge, C., Cleverley, R. M., Duchêne, M.-C., Minasov, G., Onopriyenko, O., Sidiq, K., Stogios, P. J., Anderson, W. F., Daniel, R. A., Savchenko, A., Vollmer, W. & Lewis, R. J. (2014). Structure, 22, 949–960. Web of Science CrossRef CAS PubMed Google Scholar
Johnson, J. W., Fisher, J. F. & Mobashery, S. (2013). Ann. N. Y. Acad. Sci. 1277, 54–75. CrossRef CAS PubMed Google Scholar
Kim, S. J., Chang, J. & Singh, M. (2015). Biochim. Biophys. Acta, 1848, 350–362. CrossRef CAS PubMed Google Scholar
Koharudin, L. M. I., Viscomi, A. R., Montanini, B., Kershaw, M. J., Talbot, N. J., Ottonello, S. & Gronenborn, A. M. (2011). Structure, 19, 662–674. CrossRef CAS PubMed Google Scholar
Kok, W. M., Scanlon, D. B., Karas, J. A., Miles, L. A., Tew, D. J., Parker, M. W., Barnham, K. J. & Hutton, C. A. (2009). Chem. Commun. 41, 6228–6230. CrossRef Google Scholar
Krissinel, E. & Henrick, K. (2007). J. Mol. Biol. 372, 774–797. Web of Science CrossRef PubMed CAS Google Scholar
Kumar, A., Kumar, S., Kumar, D., Mishra, A., Dewangan, R. P., Shrivastava, P., Ramachandran, S. & Taneja, B. (2013). Acta Cryst. D69, 2543–2554. CrossRef IUCr Journals Google Scholar
Laverde Gomez, J. A., Bhatty, M. & Christie, P. J. (2014). J. Bacteriol. 196, 527–539. Web of Science CrossRef PubMed Google Scholar
Lecoq, L., Bougault, C., Hugonnet, J.-E., Veckerlé, C., Pessey, O., Arthur, M. & Simorre, J.-P. (2012). Structure, 20, 850–861. Web of Science CrossRef CAS PubMed Google Scholar
Liu, T., Liu, Z., Song, C., Hu, Y., Han, Z., She, J., Fan, F., Wang, J., Jin, C., Chang, J., Zhou, J.-M. & Chai, J. (2012). Science, 336, 1160–1164. Web of Science CrossRef CAS PubMed Google Scholar
Madsen, E. B., Antolín-Llovera, M., Grossmann, C., Ye, J., Vieweg, S., Broghammer, A., Krusell, L., Radutoiu, S., Jensen, O. N., Stougaard, J. & Parniske, M. (2011). Plant. J. 65, 404–417. CrossRef CAS PubMed Google Scholar
Mammen, C. B., Ursby, T., Cerenius, Y., Thunnissen, M., Als-Nielsen, J., Larsen, S. & Liljas, A. (2002). Acta Phys. Pol. A, 101, 595–602. CAS Google Scholar
Maolanon, N. N., Blaise, M., Sørensen, K. K., Thygesen, M. B., Cló, E., Sullivan, J. T., Ronson, C. W., Stougaard, J., Blixt, O. & Jensen, K. J. (2014). Chembiochem, 15, 425–434. CrossRef CAS PubMed Google Scholar
Meroueh, S. O., Bencze, K. Z., Hesek, D., Lee, M., Fisher, J. F., Stemmler, T. L. & Mobashery, S. (2006). Proc. Natl Acad. Sci. USA, 103, 4404–4409. Web of Science CrossRef PubMed CAS Google Scholar
Mesnage, S., Dellarole, M., Baxter, N. J., Rouget, J.-B., Dimitrov, J. D., Wang, N., Fujimoto, Y., Hounslow, A. M., Lacroix-Desmazes, S., Fukase, K., Foster, S. J. & Williamson, M. P. (2014). Nature Commun. 5, 4269. CrossRef Google Scholar
Ohnuma, T., Onaga, S., Murata, K., Taira, T. & Katoh, E. (2008). J. Biol. Chem. 283, 5178–5187. CrossRef PubMed CAS Google Scholar
Pernot, P., Theveneau, P., Giraud, T., Fernandes, R. N., Nurizzo, D., Spruce, D., Surr, J., McSweeney, S., Round, A., Felisaz, F., Foedinger, L., Gobbo, A., Huet, J., Villard, C. & Cipriani, F. (2010). J. Phys. Conf. Ser. 247, 012009. CrossRef Google Scholar
Petoukhov, M. V., Franke, D., Shkumatov, A. V., Tria, G., Kikhney, A. G., Gajda, M., Gorba, C., Mertens, H. D. T., Konarev, P. V. & Svergun, D. I. (2012). J. Appl. Cryst. 45, 342–350. Web of Science CrossRef CAS IUCr Journals Google Scholar
Quintela, J. C., Pittenauer, E., Allmaier, G., Arán, V. & de Pedro, M. A. (1995). J. Bacteriol. 177, 4947–4962. CAS PubMed Web of Science Google Scholar
Reith, J. & Mayer, C. (2011). Appl. Microbiol. Biotechnol. 92, 1–11. CrossRef CAS PubMed Google Scholar
Ruggiero, A., Marasco, D., Squeglia, F., Soldini, S., Pedone, E., Pedone, C. & Berisio, R. (2010). Structure, 18, 1184–1190. Web of Science CrossRef CAS PubMed Google Scholar
Sánchez-Vallet, A., Saleem-Batcha, R., Kombrink, A., Hansen, G., Valkenburg, D.-J., Thomma, B. P. & Mesters, J. R. (2013). eLife, 2, e00790. PubMed Google Scholar
Schanda, P., Triboulet, S., Laguri, C., Bougault, C. M., Ayala, I., Callon, M., Arthur, M. & Simorre, J.-P. (2014). J. Am. Chem. Soc. 36, 17852–17860. CrossRef Google Scholar
Seidel, S. A. I. et al. (2013). Methods, 59, 301–315. CrossRef CAS PubMed Google Scholar
Svergun, D., Barberato, C. & Koch, M. H. J. (1995). J. Appl. Cryst. 28, 768–773. CrossRef CAS Web of Science IUCr Journals Google Scholar
Typas, A., Banzhaf, M., Gross, C. A. & Vollmer, W. (2012). Nature Rev. Microbiol. 10, 123–136. CAS Google Scholar
Ursby, T., Unge, J., Appio, R., Logan, D. T., Fredslund, F., Svensson, C., Larsson, K., Labrador, A. & Thunnissen, M. M. G. M. (2013). J. Synchrotron Rad. 20, 648–653. Web of Science CrossRef CAS IUCr Journals Google Scholar
Visweswaran, G. R. R., Leenhouts, K., van Roosmalen, M., Kok, J. & Buist, G. (2014). Appl. Microbiol. Biotechnol. 98, 4331–4345. CAS PubMed Google Scholar
Visweswaran, G. R. R., Steen, A., Leenhouts, K., Szeliga, M., Ruban, B., Hesseling-Meinders, A., Dijkstra, B. W., Kuipers, O. P., Kok, J. & Buist, G. (2013). PLoS One, 8, e72167. CrossRef PubMed Google Scholar
Vollmer, W., Blanot, D. & De Pedro, M. A. (2008). FEMS Microbiol. Rev. 32, 149–167. CrossRef PubMed CAS Google Scholar
Vollmer, W., Joris, B., Charlier, P. & Foster, S. (2008). FEMS Microbiol. Rev. 32, 259–286. Web of Science CrossRef PubMed CAS Google Scholar
Wong, J. E. M. M., Alsarraf, H. M. A. B., Kaspersen, J. D., Pedersen, J. S., Stougaard, J., Thirup, S. & Blaise, M. (2014). FEBS J. 281, 1196–1208. CrossRef CAS PubMed Google Scholar
Wong, J. E. M. M. & Blaise, M. (2013). Acta Cryst. F69, 1291–1294. CrossRef IUCr Journals Google Scholar
Xu, Q. et al. (2010). Acta Cryst. F66, 1354–1364. Web of Science CrossRef CAS IUCr Journals Google Scholar
Xu, Q. et al. (2009). Structure, 17, 303–313. Web of Science CrossRef PubMed CAS Google Scholar
Xu, Q., Mengin-Lecreulx, D., Patin, D., Grant, J. C., Chiu, H.-J., Jaroszewski, L., Knuth, M. W., Godzik, A., Lesley, S. A., Elsliger, M.-A., Deacon, A. M. & Wilson, I. A. (2014). Structure, 22, 1799–1809. CrossRef CAS PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.