research communications
Structural characterization of the Streptococcus pneumoniae carbohydrate substrate-binding protein SP0092
aDiamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, England, and bResearch Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, England
*Correspondence e-mail: simone.culurgioni@diamond.ac.uk
Streptococcus pneumoniae is an opportunistic respiratory pathogen that remains a major cause of morbidity and mortality globally, with infants and the elderly at the highest risk. S. pneumoniae relies entirely on as a source of carbon and dedicates a third of all uptake systems to carbohydrate import. The structure of the carbohydrate-free substrate-binding protein SP0092 at 1.61 Å resolution reveals it to belong to the newly proposed subclass G of substrate-binding proteins, with a ligand-binding pocket that is large enough to accommodate complex SP0092 is a dimer in solution and the reveals a domain-swapped dimer with the monomer subunits in a closed conformation but in the absence of carbohydrate ligand. This closed conformation may be induced by dimer formation and could be used as a mechanism to regulate carbohydrate uptake.
Keywords: Streptococcus pneumoniae; substrate-binding protein SP0092; ABC transporters; carbohydrate uptake.
PDB reference: SP0092, 5mlt
1. Introduction
Streptococcus pneumoniae (the pneumococcus) resides asymptomatically in the upper airway tract but can migrate to normally sterile locations to cause diseases such as otitis, pneumonia, sepsis, septicaemia and meningitis (Weiser, 2010; Bogaert et al., 2004). S. pneumoniae relies solely on as a source of carbon and, as these are limited in the nasopharynx, it dedicates over 30% of its transport systems to the uptake of which are scavenged from host complex (Burnaugh et al., 2008; King, 2010; King et al., 2006; Buckwalter & King, 2012; Bidossi et al., 2012). These transport systems include phosphotransferase systems, ATP-binding cassette (ABC) transporters and porins, which provide the potential to convey up to 32 different carbohydrates (Bidossi et al., 2012). In ABC transporters the ligand is translocated through the membrane by transmembrane permease domains activated by a pair of conserved cytoplasmic nucleotide-binding domains. In the case of type I and II ABC importers a substrate-binding protein (SBP) presents the bound substrate to the outward-facing side of the transporter, which selectively binds the ligand and transfers it to the transmembrane domains (Hopfner, 2016; Locher, 2016). SBPs are formed by two α/β domains connected by a hinge region, which are interdependent in the apo form (Tang et al., 2007). Upon ligand binding at the interface between the two domains, the protein closes around the ligand in a more rigid conformation; ligand binding in this way has been termed the `Venus fly trap' mechanism (Mao et al., 1982). As the number of SBP structures determined has increased, the level of structural diversity has concomitantly grown. Six distinct structural groups have been proposed based on structural similarity, size and the presence of notable structural features (Berntsson et al., 2010). This has recently been extended to a seventh structural class (G) following the structural characterization of FusA, a frucotoligosaccharide SBP from S. pneumoniae (Culurgioni et al., 2016).
Here, we describe the
of the SBP SP0092 in an atypically closed and ligand-free conformation. SP0092 oligomerizes in solution in a concentration-dependent manner and we propose that dimerization could induce a closed conformation in which ligand binding is modulated. SP0092 belongs to the newly identified `cluster G' structural class of SBPs, possessing an extended fold and large ligand-binding cavity that typifies this cluster.2. Materials and methods
2.1. Macromolecule production
SP009239–491 was cloned into the pOPINF vector (OPPF-UK), truncating the first 39 residues coding for the periplasmic localization signal. The native His-tag fusion protein was expressed in Escherichia coli BL21 Rosetta cells by autoinduction using Overnight Express medium (Millipore) supplemented with 1%(v/v) glycerol, while selenomethionine-labelled protein was expressed using SelenoMethionine Medium Complete (Molecular Dimensions) supplemented with 0.5 mM IPTG for induction. Cells were lysed in 0.1 M HEPES pH 7.5, 0.5 M NaCl, 0.02 M imidazole, 10%(v/v) glycerol supplemented with EDTA-free protease inhibitors (Roche) and cleared for 1 h at 100 000g. Cleared lysates were loaded onto an affinity HisTrap HP column (GE Healthcare). The fusion protein was eluted with lysis buffer supplemented with 0.2 M imidazole and, after dilution, was treated with HRV 3C protease overnight at 4°C. The mixture was loaded onto a HisTrap HP column and the cleaved protein was immediately eluted. The resulting sample was loaded onto a Superdex 200 column equilibrated with 0.02 M MES pH 6.5, 0.2 M NaCl, 2.5%(v/v) glycerol, 0.5 mM TCEP. Fractions of the two peaks observed from gel filtration were collected separately and concentrated to 170 and 154 mg ml−1 for the oligomeric and monomeric states, respectively. Macromolecule-production information is summarized in Table 1.
|
2.2. and multiangle light scattering
SP009239–491 samples at different protein concentrations were loaded onto a Superdex 200 5/150 GL column equilibrated with running buffer [0.02 M HEPES pH 7.5, 0.2 M NaCl, 2.5%(v/v) glycerol, 0.5 mM TCEP]. Relevant collected fractions were loaded onto an SDS–PAGE gel. Static light-scattering experiments were performed at room temperature using a Superdex 200 Increase 10/300 GL column (GE Healthcare) in-line with a DAWN HELEOS II light-scattering detector (Wyatt). The column was equilibrated with running buffer. Samples of 100 µl protein solution at 5 mg ml−1 were analysed. Data acquisition and analysis were carried out using the ASTRA software.
2.3. Crystallization
Initial crystals of SP009239–491 were obtained by sitting-drop vapour diffusion at 20°C. These initial crystals were obtained by mixing equal volumes of protein (at a concentration of 50 mg ml−1) and a reservoir solution consisting of 20%(w/v) PEG 6000, 0.1 M Tris–HCl pH 8.0, 0.02 M zinc chloride. Optimization of the crystallization conditions resulted in single crystals of about 200 µm in size using the conditions detailed in Table 2. Selenomethionine-labelled SP009239–491 yielded similar crystals in the same crystallization conditions.
|
2.4. Data collection and processing
For data collection, crystals were first transferred to a cryoprotectant solution [reservoir buffer supplemented with 25%(v/v) glycerol] and then flash-cooled in liquid nitrogen. Crystal screening and initial crystal characterization were carried out on the I03 and I04 beamlines at Diamond Light Source. Diffraction data for selenomethionine-derivatized SP009239–491 crystals were collected at the Se K edge. All data were processed with xia2 and resolution limits were defined using a half-data-set (CC1/2) limit of 0.5, although the crystals diffracted to 1.48 Å resolution in the detector corners (Winter et al., 2013). Data-collection and processing statistics are summarized in Table 3.
|
2.5. Structure solution and refinement
The SHELX suite was used to determine the selenium (Sheldrick, 2010). Analysis of the data with SHELXC showed a strong anomalous signal to high resolution, with a CC1/2 of 0.28 at 2.35 Å between observed and calculated E values (Schneider & Sheldrick, 2002). Data to 2.5 Å resolution (anomalous CC1/2 of 0.35) were used for the search, which located all seven Se atoms. The atomic model was completed automatically with ARP/wARP with starting phases generated by SHELXE. The autotraced model was then completed through iterative cycles of manual model building and using REFMAC5 in the CCP4 suite (Murshudov et al., 2011; Langer et al., 2008; Winn et al., 2011) and Coot (Emsley et al., 2010), respectively. The final are reported in Table 4. The final electron density was of high quality for the complete polypeptide chain except for the loop region formed by residues 90–96 (PDB entry 5mlt). The structure was visualized with PyMOL (https://www.schrodinger.com/pymol).
|
3. Results and discussion
3.1. SP0092 state
Although the majority of SBPs are monomeric in solution, a few cases of higher order et al., 1994, 2004; Friedman et al., 1995; Ramseier et al., 1993). Following the observation of multiple elution peaks from we measured the absolute molar mass of purified SP009239–491 samples by multiangle (MALS). At least four different states were detected with good agreement to the theoretical molecular weights of SP009239–491 monomer, dimer, trimer and tetramer species of 49.4, 97.0, 140.8 and 187.2 kDa, respectively (Fig. 1a). To investigate whether the is dependent on protein concentration, we analysed the gel-filtration elution profile of the monomeric and oligomeric samples at different dilutions. From this analysis, although the main species remained the same at different concentrations, we observed an increase in of the monomeric sample at higher concentration (increasing from 10 to 13%); inversely, the monomeric state in the oligomeric sample increased from 14 to 30% of the total amount when diluted (Fig. 1b). This points towards a dynamic equilibrium between the different species that is dependent on protein concentration (Figs. 1c and 1d).
states have been detailed (Schumacher3.2. of SP0092
Both the monomeric and oligomeric species of SP009239–491 isolated after were subjected to extensive crystallization trials, but only the latter yielded crystals and enabled the structure of oligomeric SP009239–491 to be determined to 1.61 Å resolution (PDB entry 5mlt).
SP009239–491 folds similarly to other substrate-binding proteins, presenting two globular α/β domains linked by a hinge region formed by three loops. The first domain (residues 39–154 and 321–396) is composed of one central β-sheet of four strands surrounded by seven α-helices, two 310-helices and an additional three-stranded β-sheet. The second domain (residues 155–320 and 394–491) consists of a three-stranded β-sheet enclosed by eight α-helices, two 310-helices and an extra three-stranded β-sheet.
The most striking feature of the oligomeric SP009239–491 structure is the presentation of a domain-swapped dimer structure. Crystals were only obtained from pooled samples of oligomeric SP009239–491 and in the crystal the `open' monomer subunit of the swapped domain dimer extends its C-terminal domain (residues 367–491), positioning its β16 and β17 strands, its η4–η6 310-helices and the last α13–α16 helices onto the neighbouring chain which is generated by the crystal symmetry (Figs. 2a and 2b). This domain swap generates an extended interface of 7550 Å2. The hinge loop connecting the swapped and main domains is located at residues Gly366 and Lys367, which are positioned between the β15 and β16 strands. The hinge loop is modelled in well defined electron density (Fig. 2c). Apart from this hinge loop, the overall architecture of the two functional monomeric units is identical. A domain-swapped dimer structure has also been observed in the α-keto acid substrate-binding protein TakP (Gonin et al., 2007). However, as of yet, there is no evidence that a domain-swapped dimer is a functional state of these SBPs.
3.3. Structural classification of SP0092
The recent S. pneumoniae allowed a new subclass of SBPs to be defined. This structural subclass, annotated as subclass G, allowed the grouping of four SBP structures, including that of FusA. The members of subclass G are characterized by their larger molecular weight, additional structural elements, an enlarged ligand-binding cavity and a regulatory EF-hand-like calcium-binding site (Culurgioni et al., 2016). SP009239–491 possesses all of the features characterizing this subfamily apart from the calcium-binding site and shows approximately 24% sequence identity to the other subclass G members (Fig. 3). Independent structural superpositions of domains I and II, which make up the functional SP009239–491 monomer, onto the equivalent domains of the other members of subclass G resulted in a maximum root-mean-square deviation of 2.92 Å for both domains of the monomers. The only prominent difference that is observed in the SP009239–491 structure, when compared with the other subclass G members, is in the hinge region between the two α/β domains. In the case of SP009239–491 the loop spanning residues 315–319 is reorganized to form an additional helix, α10. This helix is positioned in the central part of the ligand-binding cavity and may play a role in substrate interaction or recognition. Thus, in summary, we propose SP0092 to be a fifth, albeit atypical, member of the structural subclass G of SBPs.
of the fructooligosaccharide substrate-binding protein FusA from3.4. Carbohydrate-binding cavity
Comparison of the SP0092 functional monomeric unit with the other members of subclass G reveals the subunit to be in a closed conformation even though no ligand is bound (Figs. 4a–4d). This may be a consequence of the domain-swapped dimer structure. Thus, variation in protein concentration may modulate ligand binding through the formation of a domain-swapped dimer, which presents a closed SBP monomer conformation.
Despite predictions for the binding of N-acetylmannosamine (ManNAc) by SP0092, the nature of the carbohydrate ligand still remains unknown (Bidossi et al., 2012). The ligand-binding cavity of SP009239–491 extends in volume to 2692 Å3, which is comparable to the closed ligand cavity of FusA (∼2218 Å3; Fig. 4e). Thus, the structure of SP0092 shows that the SBP has the ability to bind complex which extend by at least three sugar moieties.
ranging from galactose, mannose and4. Closing remarks
The pneumococcus relies solely on S. pneumoniae SBP SP0092, which delineates a large substrate-binding cavity and an overall structure which shows that it belongs to the newly described structural subclass G of the SBP family. Further structural analyses of the full complement of carbohydrate substrate-binding proteins could aid the investigation of these proteins as potential vaccine candidates and their potential suitability as novel drug-delivery systems (Saxena et al., 2015; Garmory & Titball, 2004; Ahuja et al., 2015).
as a carbon source, with at least seven ABC transporters encoded in the reference genome strain TIGR4 annotated as carbohydrate importers. Here, we have determined the high-resolution of theNote added in proof. During the review of this paper, three entries were released by the PDB describing the SP0092 structure in a monomeric configuration with and without oligosaccharide bound (PDB entries 5swb, 5swa and 5suo).
Acknowledgements
We thank OPPF-UK for technical support in cloning and protein production, Gemma Harris (SEC-MALLS) and the scientists of beamlines I03 and I04 at Diamond Light Source for their support and provision of excellent facilities.
References
Ahuja, S., Rougé, L., Swem, D. L., Sudhamsu, J., Wu, P., Russell, S. J., Alexander, M. K., Tam, C., Nishiyama, M., Starovasnik, M. A. & Koth, C. M. (2015). Structure, 23, 713–723. CrossRef Google Scholar
Berntsson, R. P.-A., Smits, S. H. J., Schmitt, L., Slotboom, D.-J. & Poolman, B. (2010). FEBS Lett. 584, 2606–2617. Web of Science CrossRef CAS PubMed Google Scholar
Bidossi, A., Mulas, L., Decorosi, F., Colomba, L., Ricci, S., Pozzi, G., Deutscher, J., Viti, C. & Oggioni, M. R. (2012). PLoS One, 7, e33320. CrossRef Google Scholar
Bogaert, D., de Groot, R. & Hermans, P. W. (2004). Lancet Infect. Dis. 4, 144–154. CrossRef Google Scholar
Buckwalter, C. M. & King, S. J. (2012). Trends Microbiol. 20, 517–522. CrossRef Google Scholar
Burnaugh, A. M., Frantz, L. J. & King, S. J. (2008). J. Bacteriol. 190, 221–230. Web of Science CrossRef PubMed CAS Google Scholar
Culurgioni, S., Harris, G., Singh, A. K., King, S. J. & Walsh, M. A. (2016). Structure, https://doi.org/10.1016/j.str.2016.11.008. Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Friedman, A. M., Fischmann, T. O. & Steitz, T. A. (1995). Science, 268, 1721–1727. CrossRef CAS PubMed Web of Science Google Scholar
Garmory, H. S. & Titball, R. W. (2004). Infect. Immun. 72, 6757–6763. Web of Science CrossRef PubMed CAS Google Scholar
Gonin, S., Arnoux, P., Pierru, B., Lavergne, J., Alonso, B., Sabaty, M. & Pignol, D. (2007). BMC Struct. Biol. 7, 11. Google Scholar
Hopfner, K. P. (2016). Biopolymers, 105, 492–504. CrossRef Google Scholar
King, S. J. (2010). Mol. Oral Microbiol. 25, 15–24. CrossRef Google Scholar
King, S. J., Hippe, K. R. & Weiser, J. N. (2006). Mol. Microbiol. 59, 961–974. CrossRef Google Scholar
Langer, G., Cohen, S. X., Lamzin, V. S. & Perrakis, A. (2008). Nature Protoc. 3, 1171–1179. Web of Science CrossRef CAS Google Scholar
Locher, K. P. (2016). Nature Struct. Mol. Biol. 23, 487–493. CrossRef Google Scholar
Mao, B., Pear, M. R., McCammon, J. A. & Quiocho, F. A. (1982). J. Biol. Chem. 257, 1131–1133. CAS PubMed Web of Science Google Scholar
Mishima, Y., Momma, K., Hashimoto, W., Mikami, B. & Murata, K. (2003). J. Biol. Chem. 278, 6552–6559. Web of Science CrossRef PubMed CAS Google Scholar
Momma, K., Mishima, Y., Hashimoto, W., Mikami, B. & Murata, K. (2005). Biochemistry, 44, 5053–5064. Web of Science CrossRef PubMed CAS Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ramseier, T. M., Nègre, D., Cortay, J. C., Scarabel, M., Cozzone, A. J. & Saier, M. H. Jr (1993). J. Mol. Biol. 234, 28–44. CrossRef Google Scholar
Robert, X. & Gouet, P. (2014). Nucleic Acids Res. 42, W320–W324. Web of Science CrossRef CAS PubMed Google Scholar
Saxena, S., Khan, N., Dehinwal, R., Kumar, A. & Sehgal, D. (2015). PLoS One, 10, e0118154. CrossRef Google Scholar
Schneider, T. R. & Sheldrick, G. M. (2002). Acta Cryst. D58, 1772–1779. Web of Science CrossRef CAS IUCr Journals Google Scholar
Schumacher, M. A., Allen, G. S., Diel, M., Seidel, G., Hillen, W. & Brennan, R. G. (2004). Cell, 118, 731–741. Web of Science CrossRef PubMed CAS Google Scholar
Schumacher, M. A., Choi, K. Y., Zalkin, H. & Brennan, R. G. (1994). Science, 266, 763–770. CrossRef CAS PubMed Web of Science Google Scholar
Sheldrick, G. M. (2010). Acta Cryst. D66, 479–485. Web of Science CrossRef CAS IUCr Journals Google Scholar
Tang, C., Schwieters, C. D. & Clore, G. M. (2007). Nature (London), 449, 1078–1082. Web of Science CrossRef PubMed CAS Google Scholar
Weiser, J. N. (2010). J. Mol. Med. 88, 97–102. CrossRef Google Scholar
Winn, M. D. et al. (2011). Acta Cryst. D67, 235–242. Web of Science CrossRef CAS IUCr Journals Google Scholar
Winter, G., Lobley, C. M. C. & Prince, S. M. (2013). Acta Cryst. D69, 1260–1273. Web of Science CrossRef CAS IUCr Journals Google Scholar
Zhang, Z., Li, Y., Lin, B., Schroeder, M. & Huang, B. (2011). Bioinformatics, 27, 2083–2088. CrossRef Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.