structural communications
Structure of the C-terminal domain of the surface antigen SpaP from the caries pathogen Streptococcus mutans
aDepartment of Odontology, Umeå University, SE-901 87 Umeå, Sweden
*Correspondence e-mail: karina.persson@odont.umu.se
SpaP is a 1500-residue adhesin expressed on the surface of the caries-implicated bacterium Streptococcus mutans. SpaP is a member of the antigen I/II (AgI/II) family of proteins expressed by oral streptococci. These surface proteins are crucial for the incorporation of streptococci into dental plaque. The structure of the C-terminal domain of SpaP (residues 1136–1489) was solved and refined to 2.2 Å resolution with six molecules in the Similar to a related AgI/II structure, SpaP is stabilized by isopeptide bonds between lysine and asparagine side chains.
Keywords: antigen I/II; isopeptide bonds; dental plaque.
3D view: 3opu
PDB reference: C-terminal domain of SpaP, 3opu
1. Introduction
Streptococcus mutans is the causative agent of caries (Loesche, 1986) and its surface adhesin SpaP is important for its incorporation into the oral biofilm (dental plaque). SpaP is a member of the antigen I/II (AgI/II) family of proteins that are expressed on the surfaces of oral streptococci and are key molecules for networking with other microorganisms and host cells in the oral biofilm. AgI/II proteins are approximately 1500 amino acids in length and consist of seven regions: a leader peptide (∼38 amino acids), an N-terminal region (∼80 amino acids), an alanine-rich region (∼320 amino acids), a variable domain (∼360 amino acids), a proline-rich region (∼180 amino acids), a C-terminal domain (∼500 amino acids) and a cell-wall-anchoring segment (Jenkinson & Demuth, 1997). The structure of AgI/II proteins is unique in the sense that the proline-rich region and the alanine-rich region wrap around each other, forming an elongated fibrillar structure that presents the variable domain at the tip (Larson et al., 2010). The structure further consists of a C-terminal domain which contains an LPXTG motif that anchors the protein to the cell wall. A of the major part of the C-terminal domain of the AgI/II protein SspB from the commensal bacterium S. gordonii (Forsgren et al., 2010) showed that the protein consists of two structurally similar β-sandwich domains. Sequence analysis indicates that the C-terminal region also contains a third β-sandwich domain. It has been shown that the C-terminal domain of S. gordonii SspB, but not S. mutans SpaP, functions as an adherence site for the oral pathogen Porphyromonas gingivalis, which is associated with adult periodontitis, a condition that may result in tooth loss. In order to compare the structure of S. mutans SpaP with the previously determined structure of SspB from S. gordonii, we determined the of SpaP-C1136–1489, which is reported here.
2. Materials and methods
2.1. Cloning
The preparation of chromosomal DNA from S. mutans SK773 was performed using BacReady from GenScript. Constructs were designed based on the spaP gene from S. mutans strain NG5 (accession code X17390) and the domain borders that were used to solve the analogous structure SspB from S. gordonii (Forsgren et al., 2010). The fragment was PCR-amplified using the forward primer 5′-AAAAACCATGGAAGCTAAATTAGCAAAA-3′ and the reverse primer 5′-TTTTTGGTACCTTATGGAGTTTTTACCGGTGGAGT-3′ (restriction sites are shown in bold). The PCR product was digested with NcoI and Acc65I and ligated into the equivalent sites of the pET-M11 expression vector (kindly provided by Gunter Stier, EMBL, Germany) containing a His tag at the N-terminus. The final construct encodes His6-PMSDYDIPTTENLYFQGAM-SpaP1136–1489 with a molecular weight of 44.1 kDa. The plasmid was transformed into Escherichia coli DH5α and colonies were subsequently selected on kanamycin plates. The positive clones were verified by DNA sequencing.
2.2. Overexpression and purification
The SpaP-C1136–1489 construct was overexpressed in E. coli BL21 pLysS cells at 310 K in Luria broth supplemented with 50 µg ml−1 kanamycin and 25 µg ml−1 chloramphenicol. When the culture reached an OD600 of 0.6, the temperature was lowered to 298 K, protein expression was induced with 0.5 mM IPTG and the culture was allowed to grow for 4 h. The cells were harvested by centrifugation at 5300g for 20 min and the cell pellets were stored at 193 K. The cell pellets were resuspended in 20 mM Tris pH 7.5, 150 mM NaCl and 10 mM imidazole pH 8.0 supplemented with EDTA-free protease-inhibitor cocktail (Roche). The cells were lysed on ice by sonication and cellular debris was removed by centrifugation at 39 000g for 50 min. The supernatant was loaded onto an Ni–NTA agarose column (Qiagen). The SpaP-C1136–1489 protein was washed with 20 mM Tris pH 7.5, 150 mM NaCl and 20 mM imidazole pH 8.0 and eluted in 20 mM Tris pH 7.5, 150 mM NaCl and 300 mM imidazole pH 8.0. The buffer was exchanged to 20 mM Tris pH 7.5, 150 mM NaCl and 0.5 mM EDTA and the protein was further purified by on a Superdex 200 16/60 PG column (Amersham Biosciences). The protein purity was analyzed by SDS–PAGE (data not shown).
2.3. Crystallization
The SpaP-C1136–1489 protein was concentrated to 15 mg ml−1 in 0.2 mM Tris pH 7.5 using an Amicon Ultra centrifugal filter device (Millipore). Initial crystallization trials were performed with a Mosquito pipetting robot using sitting-drop vapour diffusion and standard crystal screening kits (Hampton Research and Molecular Dimensions) in 96-well plate format. The initial hits were optimized using the hanging-drop method by mixing 2 µl protein solution at a concentration of 15 mg ml−1 and 1 µl reservoir solution, resulting in crystals that were suitable for data collection. The final crystallization condition was 0.1 M sodium acetate pH 5.0 and 15%(w/v) polyethylene glycol 6000.
2.4. Data collection and processing
Prior to data collection, the crystals were soaked in crystallization solution supplemented with 20% polyethylene glycol 400, mounted on loops and vitrified in liquid nitrogen. Diffraction data were collected to 2.2 Å resolution at 100 K using a MAR CCD 165 detector on beamline I911-5, MAX-lab, Lund, Sweden. The diffraction data were processed with XDS (Kabsch, 2010) and scaled with SCALA from the CCP4 program suite (Collaborative Computational Project, Number 4, 1994).
2.5. Structure solution and refinement
The initial molecular-replacement search model was generated from S. gordonii SspB-C (PDB code 2woy ; Forsgren et al., 2010) and the S. mutans SpaP protein sequence using the program CHAINSAW (Stein, 2008) The sequence identity between the two proteins is 67% in this region. The structure was solved using the program Phaser (McCoy et al., 2007) and was refined with phenix.refine (Afonine et al., 2005). The first steps included rigid-body and simulated annealing starting at 5000 K; translation–liberation–screw (TLS) was used in the last rounds of treating each chain as an individual TLS group (Winn et al., 2001). Manual inspection, rebuilding and addition of water molecules were performed with Coot (Emsley et al., 2010). The quality of the model was analyzed with WHAT_CHECK (Hooft et al., 1996). The model was subjected to sixfold NCS restraints throughout are given in Table 1. Figures were drawn with CCP4MG (Potterton et al., 2004)
‡Rwork = , where Fobs and Fcalc are the observed and calculated structure-factor amplitudes, respectively. Rfree is Rwork calculated using 5% of the data that were omitted from |
Structure factors and coordinates have been deposited in the Protein Data Bank (PDB; https://www.rcsb.org/pdb ) under accession code 3opu .
3. Results and discussion
3.1. Crystallization, data collection and structure determination
Crystals of the SpaP-C1136–1489 protein were obtained by hanging-drop vapour diffusion. The crystals belonged to P21212, with six molecules in the which corresponds to a Matthews coefficient of 2.68 Å3 Da−1 (54% solvent content; Matthews, 1968). The unit-cell parameters were a = 135.8, b = 238.4, c = 78.5 Å. The structure was solved by using a model based on the analogous S. gordonii structure as a search model. and manual fitting of the model resulted in a final Rwork of 21.0% and an Rfree of 24.8%.
3.2. The overall structure of SpaP-C1136–1489
The final model of SpaP-C1136–1489 from S. mutans consists of six independent protein chains, each binding to two Ca2+ ions. 545 water molecules were included in the model. The refined monomers consist of residues 1152–1489. The 16 N-terminal residues and residues from the linker could not be seen in the electron density and were not included. The crystallized protein consists of two domains, C2 and C3, each of which adopts a β-sandwich fold (Fig. 1). The C2 domain consists of two β-sheets with five and six strands. The domain also contains two short α-helices located on top of the β-sandwich, as well as one helix positioned perpendicular to the β-sandwich. The C3 domain consists of two sheets built up of five strands each. On the side facing the C2 domain the β-sandwich strands are connected by long loops, which also include a short helix and a short β-hairpin. On the other side the strands are connected by short turns, with one exception in which a long coil (16 amino acids) connects two strands. Each domain is stabilized by a metal ion, modelled as Ca2+. In each β-sandwich the two sheets are linked by a covalent formed between the side chains of a lysine and an asparagine.
The six monomers in the α atoms.
are generally very similar, with a root-mean-square deviation (r.m.s.d.) ranging from 0.3 to 0.7 calculated on all C3.3. Isopeptide bonds and Ca2+ sites
In the C2 β-sandwich an is formed between Lys1157 of one sheet and Asn1307 of the other. Similarly, an isopeptide bond is formed between Lys1334 and Asn1469 of the C3 β-sandwich. Both isopeptide bonds are formed between the NZ atom of the lysine and the CG atom of the asparagine, thereby releasing ammonium. A conserved aspartic acid forms hydrogen bonds to the peptide C=O and NH groups. The isopeptides are further surrounded by hydrophobic and aromatic residues. The hydrophobic environment is more prominent in the C3 domain. The isopeptide bonds and their surrounding residues are very similar to what was reported previously for the analogous protein SspB from S. gordonii (Forsgren et al., 2010). Isopeptides are being described in an increasing number of Gram-positive surface proteins, for example in the pili from S. pyogenes and Corynebacterium diphtheriae (Kang et al., 2007, 2009). It is hypothesized that these covalent bonds are used as a means of stabilizing these surface proteins against mechanical force.
The Ca2+ ions are likely to play important structural roles since they are located in close proximity to the in each domain. In both domains the aspartic acid that forms hydrogen bonds to the isopeptide is followed by an aspartic acid that coordinates a Ca2+ ion. The Ca2+ ion in the C2 domain is coordinated by carboxylate groups from two residues, Asp1208 and Glu1211, the carbonyl O atoms of Lys1261, Tyr1209 and Ala1263 and one water molecule. In C3 the Ca2+ ion is coordinated by side-chain O atoms from Glu1387 and Asp1384, the main-chain O atoms of Lys1430, Gly1431 and Tyr1385 and one water molecule.
3.4. Comparison with the S. gordonii SspB structure
The overall structure of S. mutans SpaP-C is generally very similar to the structure of S. gordonii SspB-C, with the isopeptide bonds and Ca2+ atoms in identical positions. The approximate r.m.s.d. between the two structures is 1.0 Å. The largest deviation in sequence is found in the helix located perpendicular to the β-sandwich of the C2 domain and the following extended region. This region is described as the BAR region (SspB adherence region) in S. gordonii SspB and is used as a recognition handle by the periodontal pathogen P. gingivalis (Brooks et al., 1997). This region of SpaP is not recognized by P. gingivalis despite its very similar structure (Fig. 2a). Most of the charged residues in the helix facing the solvent are different in the two proteins (QEIRDVLSK in SpaP and KKVQDLLKK in SspB), creating different conditions for recognition. The following extended regions are also dissimilar in sequence (GIRPK in SpaP and NITVK in SspB). Previous studies have shown that two single mutations, N1182G and V1185P, in the extended region of SspB that make the region more like that of the S. mutans SpaP protein completely abolish P. gingivalis adherence (Demuth et al., 2001). The of SpaP shows that the natural occurrence of these amino acids in the protein does not alter the secondary structure of the region and gives limited clarification of the importance of these two single sites with regard to P. gingivalis recognition and adherence. However, the electrostatic surfaces are considerably different (Figs. 2b and 2c) and create very different recognition properties for any interaction partner. Comparison of the structures of SpaP and SspB opens the possibility of developing substances to eliminate the adherence of P. gingivalis to the oral biofilm.
Acknowledgements
We are grateful for access to beamline I911-5 at MAX-lab, Lund and for the support of the beamline scientists. We thank Gunter Stier, EMBL, Germany for cloning vectors and Ulla Öhman and Nicklas Strömberg at Umeå University for the S. mutans strain. This work was supported by the Swedish Research Council and the National Research School in Odontology.
References
Afonine, P. V., Grosse-Kunstleve, R. W. & Adams, P. D. (2005). CCP4 Newsl. 42, contribution 8. Google Scholar
Brooks, W., Demuth, D. R., Gil, S. & Lamont, R. J. (1997). Infect. Immun. 65, 3753–3758. CAS PubMed Web of Science Google Scholar
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760–763. CrossRef IUCr Journals Google Scholar
Demuth, D. R., Irvine, D. C., Costerton, J. W., Cook, G. S. & Lamont, R. J. (2001). Infect. Immun. 69, 5736–5741. Web of Science CrossRef PubMed CAS Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Forsgren, N., Lamont, R. J. & Persson, K. (2010). J. Mol. Biol. 397, 740–751. Web of Science CrossRef CAS PubMed Google Scholar
Hooft, R. W., Vriend, G., Sander, C. & Abola, E. E. (1996). Nature (London), 381, 272. CrossRef PubMed Web of Science Google Scholar
Jenkinson, H. F. & Demuth, D. R. (1997). Mol. Microbiol. 23, 183–190. CrossRef CAS PubMed Web of Science Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kang, H. J., Coulibaly, F., Clow, F., Proft, T. & Baker, E. N. (2007). Science, 318, 1625–1628. Web of Science CrossRef PubMed CAS Google Scholar
Kang, H. J., Paterson, N. G., Gaspar, A. H., Ton-That, H. & Baker, E. N. (2009). Proc. Natl Acad. Sci. USA, 106, 16967–16971. CrossRef PubMed CAS Google Scholar
Larson, M. R., Rajashankar, K. R., Patel, M. H., Robinette, R. A., Crowley, P. J., Michalek, S., Brady, L. J. & Deivanayagam, C. (2010). Proc. Natl Acad. Sci. USA, 107, 5983–5988. Web of Science CrossRef CAS PubMed Google Scholar
Loesche, W. J. (1986). Microbiol. Rev. 50, 353–380. CAS PubMed Web of Science Google Scholar
Matthews, B. W. (1968). J. Mol. Biol. 33, 491–497. CrossRef CAS PubMed Web of Science Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
Potterton, L., McNicholas, S., Krissinel, E., Gruber, J., Cowtan, K., Emsley, P., Murshudov, G. N., Cohen, S., Perrakis, A. & Noble, M. (2004). Acta Cryst. D60, 2288–2294. Web of Science CrossRef CAS IUCr Journals Google Scholar
Stein, N. (2008). J. Appl. Cryst. 41, 641–643. Web of Science CrossRef CAS IUCr Journals Google Scholar
Winn, M. D., Isupov, M. N. & Murshudov, G. N. (2001). Acta Cryst. D57, 122–133. Web of Science CrossRef CAS IUCr Journals Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.