research communications
Structure and stability of the Human respiratory syncytial virus M2–1 RNA-binding core domain reveals a compact and cooperative folding unit
aProtein Structure–Function and Engineering Laboratory, Fundación Instituto Leloir and IIBBA–CONICET, Avenida Patricias Argentinas 435, C1405BWE Buenos Aires, Argentina, bThe Hamburg Centre for Ultrafast Imaging and Department of Chemistry, Institute for Biochemistry and Molecular Biology, University of Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany, and cEuropean Molecular Biology Laboratory (EMBL), Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
*Correspondence e-mail: gpg@leloir.org.ar, tidow@chemie.uni-hamburg.de
Human syncytial respiratory virus is a nonsegmented negative-strand RNA virus with serious implications for respiratory disease in infants, and has recently been reclassified into a new family, Pneumoviridae. One of the main reasons for this classification is the unique presence of a transcriptional antiterminator, called M2–1. The puzzling mechanism of action of M2–1, which is a rarity among antiterminators in viruses and is part of the RNA polymerase complex, relies on dissecting the structure and function of this multidomain tetramer. The RNA-binding activity is located in a monomeric globular `core' domain, a high-resolution of which is now presented. The structure reveals a compact domain which is superimposable on the full-length M2–1 tetramer, with additional electron density for the C-terminal tail that was not observed in the previous models. Moreover, its folding stability was determined through chemical which shows that the secondary and unfold concomitantly, which is indicative of a two-state equilibrium. These results constitute a further step in the understanding of this unique RNA-binding domain, for which there is no sequence or structural counterpart outside this virus family, in addition to its implications in transcription regulation and its likeliness as an antiviral target.
Keywords: viral proteins; crystal structure; small-angle X-ray scattering; protein folding; Human syncytial respiratory virus.
PDB references: HRSV M2–1 core, 5nkx; 5noh
1. Introduction
Human respiratory syncytial virus (HRSV) is a nonsegmented negative-strand RNA virus of the recently reclassified Pneumoviridae family (Afonso et al., 2016) and is a major cause of lower respiratory tract infections in children, the elderly and the immunocompromised, including acute bronchiolitis and pneumonia (Reed et al., 1997; Lay et al., 2013; Amarasinghe et al., 2017). HRSV is an enveloped virus; its genome consists of a 15.2 kb single-stranded nonsegmented negative-sense RNA that contains ten genes which encode 11 proteins. Four of them are part of the nucleocapsid: the nucleocapsid protein N, the polymerase cofactor phosphoprotein P, the transcription antitermination factor M2–1 and the viral polymerase L. The virus also encodes two nonstructural proteins (NS1 and NS2) and three transmembrane proteins: the fusion F protein, the small hydrophobic SH protein and the G glycoprotein.
The viral polymerase complex is responsible for carrying out genome replication, transcription and post-transcriptional modifications of mRNAs and consists of genomic RNA tightly bound to the nucleocapsid N protein and the P, L and M2–1 proteins (Collins et al., 2001). The M2–1 transcription antiterminator factor is a basic protein of 194 amino acids that exists as a stable tetramer in solution (Tran et al., 2009; Tanner et al., 2014; Esperante et al., 2011) and is essential for the complete sequential transcription of mRNAs in HRSV (Collins et al., 1995, 1996; Yu et al., 1995; Hardy & Wertz, 1998). It harbours four distinct regions: an N-terminal zinc-binding domain (ZBD; residues 7–25), an α-helical tetramerization domain (TD; residues 32–49), an RNA-binding core domain (RBD; residues 69–172) and a C-terminal unstructured region (residues 173–194). A of the tetrameric M2–1 assembly (Tanner et al., 2014) and an NMR structure of the monomeric M2–1 RBD (residues 58–177; Blondot et al., 2012) have been determined. M2–1 is primarily driven by a central tetramerization α-helix and is also stabilized by contacts between the zinc-binding domain and the core of an adjacent monomer (Tanner et al., 2014; Fig. 1). The removal of zinc leads to tetramer dissociation into a monomeric apo M2–1 species in solution. Thus, the integrity of the ZBD seems to be critical for the tetrameric assembly and for protein function (Esperante et al., 2013).
The crystallographic structure of the closely related Human metapneumovirus (HMPV) M2–1 protein revealed that the ZBD interacts with the tetramerization helix through a hydrophobic interface. The core domain interacts with the ZBD, and the tetramerization and core domains of adjacent protomers, mostly through polar contacts (Leyrat et al., 2014). Interestingly, each protomer exists in an equilibrium between an open and a closed conformation. RNA binding induces the closed state, while the addition of subdenaturing concentrations of guanidinium chloride (GdmCl) induces an open state, consistent with the polar nature of the interactions between the core domain and the rest of the molecule (Leyrat et al., 2014).
Given the relevance and uniqueness of this viral transcription antiterminator, we set out to investigate its RNA-binding domain (residues 73–194) using X-ray crystallography and folding thermodynamics. The structure of the monomeric domain is superimposable with that of the tetramer, but uncovers additional structured residues at the C-terminus that have not been observed previously. The folding studies reveal that although the RBD shows inter-monomer contacts and contacts the ZBD from neighbouring monomers in the tetramer, these do not influence its structure or stability.
2. Materials and methods
2.1. Protein expression and purification
The HRSV M2–1 protein from strain A was recombinantly expressed in bacteria and purified as described previously (Esperante et al., 2013). The highly purified full-length M2–1 tetramer was then subjected to limited proteolysis using chymotrypsin from bovine pancreas (Sigma–Aldrich) at a 1:60(w:w) protein:chymotrypsin ratio in 50 mM Tris–HCl pH 8.0 for 90 min at 28°C. The reaction was stopped by adding 1 mM phenylmethylsulfonyl fluoride (PMSF) and subsequently subjected to on a Superdex 75 column in 20 mM Tris–HCl pH 7.0, 300 mM NaCl. The resulting RBD protein after proteolysis comprises amino acids Ala73–Tyr194. Its molecular weight of 13.6 kDa was confirmed by MALDI–TOF (Bruker, Daltonics, Billerica, Massachusetts, USA) and the protein elutes as a monomeric peak from (Supplementary Fig. S2). The protein concentration was determined spectrophotometrically using an extinction coefficient (∊280) of 4470 M−1 cm−1.
2.2. Folding and stability
Equilibrium M protein sample with increasing concentrations of guanidinium chloride (GdmCl) in 20 mM Tris–HCl pH 7.0. The samples were incubated for 16 h at 20°C prior to measurements. CD spectra were recorded on a Jasco 815 spectropolarimeter. The CD data were fitted to a two-state model as described previously (Pretel et al., 2013). Fluorescence spectroscopy was carried out using a Horiba Fluoromax-4 spectrofluorometer with excitation at 275 nm. Intensity at 310 nm, corresponding to tyrosine fluorescence, was plotted as a function of the denaturant concentration. All measurements were performed at 20°C.
experiments were performed by incubating a 5 µ2.3. Crystallization and data collection
Initial M2–1 RBD crystallization conditions were identified using the vapour-diffusion technique in sitting drops in a high-throughput crystallization screen. The most promising hits were then optimized by mixing 1 µl M2–1 RBD solution (11 mg ml−1) with 1 µl reservoir solution and equilibrating against reservoir solution consisting of either condition 1 (2.1 M DL-malic acid pH 7) or condition 2 (2.4 M sodium malonate pH 7) (both from the JCSG plus screen) at 293 K. The crystals belonged to space groups P2 and P3221, respectively, and both diffracted to better than 2 Å resolution. Crystals appeared after 1 d and grew to maximum dimensions of 0.6 × 0.8 × 1.0 mm within one week (Supplementary Fig. S1). Crystals were mounted and flash-cooled in liquid nitrogen with cryoprotection (by the addition of 20% glycerol to the crystallization solution). Diffraction data were collected to a resolution limit of 1.8–2.0 Å. Full data sets with an interval of 0.1° were collected for both crystal forms on the P14 beamline at EMBL/DESY, Hamburg, Germany. All data sets were processed with XDS (Kabsch, 2010) and merged with AIMLESS (Evans, 2006). A summary of the data statistics is given in Table 1.
|
2.4. and analysis
The structures were solved by Phaser (McCoy et al., 2007) using PDB entry 4c3b (Tanner et al., 2014) as the search model. Subsequent rounds of manual building using Coot (Emsley & Cowtan, 2004) and using phenix.refine (Afonine et al., 2012) allowed the model building of a short C-terminal extension of the M2–1 RBD (residues 73–194) in P3221. The final models yielded crystallographic R factors of 17 and 19% and free R factors of 21 and 23% for the crystals in space groups P2 and P3221, respectively (see Table 1 for details). The models were validated using MolProbity (Chen et al., 2010). Evaluation of the Ramachandran plot showed all residues to be in allowed regions (97–99% in favoured regions). All figures were prepared using PyMOL (DeLano, 2002). The data have been deposited in the Protein Data Bank with PDB codes 5noh (P2) and 5nkx (P3221).
with2.5. Small-angle X-ray scattering (SAXS)
SAXS data were collected on the P12 beamline at EMBL/DESY, Hamburg following standard procedures. Repetitive data collection from the same sample was performed and no radiation damage was detected. Samples of M2–1 RBD (residues Ala73–Tyr194) solution were prepared in the concentration range 4.5–9.0 mg ml−1 in a buffer consisting of 20 mM Tris–HCl pH 7, 300 mM NaCl. All SAXS data were analyzed using the ATSAS package (Franke et al., 2017). Raw data were processed using PRIMUS (Konarev et al., 2003). The (Rg) was evaluated using the Guinier approximation [I(s) = I(0)exp(−s2Rg2/3) for sRg < 1.3] and also from the entire scattering curve with GNOM (Svergun, 1992); the latter also provided the distance distribution function p(r) and the maximum dimension Dmax. The masses of the solutes were evaluated by comparison of the intensity with that from a BSA reference solution (66 kDa). Low-resolution SAXS models were obtained using the ab initio simulated-annealing (SA) program DAMMIF (Franke & Svergun, 2009), which generates models consisting of dummy atoms to fit the experimental data.
3. Results and discussion
The HRSV M2–1 RBD domain crystallized in two different space groups (Table 1) and the structure could be determined to 2.0 Å resolution. The structure consists of six α-helices connected by short loops (Fig. 2a). The structures from both crystal forms are virtually identical in the core region (residues 58–177), with a root-mean-square deviation (r.m.s.d. on Cα atoms) of 0.3 Å. They are also very similar to the core domains observed in the of full-length tetrameric HRSV M2–1 (Tanner et al., 2014; r.m.s.d. of 0.7 Å) and in the NMR structure (Blondot et al., 2012; r.m.s.d. of 1.39 Å) (Fig. 2b). A notable exception is the C-terminal region (residues 178–185), which could be clearly modelled in one monomer in the P3221 crystal form. In the tetrameric HRSV M2–1 this region was disordered and thus not visible (Tanner et al., 2014). In our P3221 core-domain structure it is located in a cleft between helices H3 and H6 and occupies a position that would interfere with the zinc-binding domain (ZBD) of another monomer in the tetrameric assembly (Figs. 2c and 3). We have shown that removing the zinc leads to a monomeric `apo' form, which otherwise contains the same secondary structure as the tetrameric species, indicating that the ZBD modulates M2–1 In fact, this tetramer–monomer transition is increased at pH 5.5, strongly suggesting that the ZBD acts as a pH sensor that modulates (Esperante et al., 2013).
The HRSV M2–1 RBD crystallized with two molecules in the The elution profile from indicates that the HRSV M2–1 RBD exists as a monomer in solution (Supplementary Fig. S2). However, in order to confirm the oligomeric state of the HRSV M2–1 RBD in solution at the high concentration used in the crystallization conditions, we performed small-angle X-ray scattering (SAXS) experiments. Model-free analysis of the SAXS data yielded an Rg of 20.3 Å from the (Fig. 4a) and a Porod volume of 26.4 nm3, indicating a molecular weight of 15 kDa. The pair distance distribution function indicates a globular particle with a tail, with maximum dimensions (Dmax) of 8 nm (Fig. 4c). The ab initio model of RBD obtained using DAMMIF is comparable to that of the monomer (Fig. 4d). The C-terminal tail observed in our P3221 model could be extended in solution and would therefore explain the lack of globularity in our SAXS model. Electron density for this region has not been observed in previous crystal structures.
The RNA-binding core domain appears to be a compact and cooperative folding unit in the et al., 2014). However, there are several intermonomer contacts in the tetramer, and other contacts connecting with other regions of the protein, which raises the questions of whether the domain is stabilized by such interactions and to what extent the domain is compact and stable in solution. We carried out equilibrium unfolding experiments using guanidine chloride, and followed the secondary-structure transition by monitoring changes in (CD) in the far-UV region. The mostly α-helical spectrum with the characteristic minima at 208 and 222 nm is gradually lost upon an increase in denaturant, indicating complete unfolding of the protein (M2–1 RBD; Fig. 5a). The molar ellipticity change at 222 nm shows a highly suggestive of a two-state transition (Fig. 5b). Since there are no tryptophan residues, we used tyrosine intrinsic fluorescence as a probe for which produces a transition superimposable with the secondary structure, which is a strong indication of a two-state unfolding process (Fig. 5b). For a quantitative analysis of the thermodynamic stability, we fitted the ellipticity data to a two-state model (Pretel et al., 2013) and obtained an unfolding free-energy change () of 5.21 ± 0.92 kcal mol−1 and an m value of 2.28 ± 0.38 kcal mol−1 M−1; the latter matches that expected for a globular two-state folder of this size (Myers et al., 1995).
reported here and the previous NMR structure of a similar fragment (TannerWe have previously shown that zinc removal dissociates the tetramer, and the `apo' M2–1 monomer obtained showed a secondary structure surprisingly identical to that of the tetramer (Esperante et al., 2013). The unfolding transition of the M2–1 RBD is superimposable on that of the apo M2–1 monomer (Fig. 5b, inset), with a of 5.89 ± 0.33 kcal mol−1 and an m value of 2.62 ± 0.15 kcal mol−1 M−1, which is in excellent agreement with the stability and m value of the monomeric RBD, confirming that the isolated monomeric RNA-binding domain is a folding unit that is independent of the rest of the molecule either as a tetramer or an apo monomer. These results indicate that the rest of the polypeptide (residues 1–72) is disordered when the zinc is removed and the tetramer dissociates, or at least there is no cooperative compact structure.
The fact that the M2–1 RBD is a stable and compact domain showing two-state cooperative unfolding, with identical thermodynamic stability to the apo M2–1 monomer, shows that the region that is missing in the isolated domain (residues 1–72) does not partake in any persistent structure, is not connected to the M2–1 RBD and forms the tetrameric interface only when the zinc-binding motif is intact (Esperante et al., 2013; Fig. 1). Although only one highly symmetric structure was observed in the of the HRSV M2–1 tetramer (Tanner et al., 2014), the homologue from Human metapneumovirus (HMPV) showed open and closed conformations in which different RBDs move away from the tetrameric arrangement, and this conformational change was proposed to be the result of RNA binding (Leyrat et al., 2014). The structure of the HMPV RBD does not change in the open or closed conformations, suggesting that there must be a flexible regulatory region which moves the RBD back and forth. In this picture, contacts of the RBD with other RBDs or other regions (i.e. the zinc-binding motif) must be regulatory, with no effect on RBD conformation or stability. In the HRSV tetramer, three of the chains show a lack of electron density between the tetramerization domain and the RBD (Tanner et al., 2014), coincident with what could constitute a flexible hinge region that may modulate the open–closed transition. In the HMPV tetramer structure the same region also lacks electron density in the linker region between the TD and RBD, indicating high flexibility (as reflected by very high B factors for adjacent residues). On the other hand, the crystal structures presented here strongly suggest that the carboxy-terminal region of the protein would be unstructured in the closed conformation present in the RNA-free tetramer and folds upon the helical cleft corresponding to the ZBD-binding region in the RBD. In any case, the structures of both complexes with RNA plus mechanistic studies of the interaction with RNA are required for a complete picture of this puzzling transcription antiterminator activity that governs the relative viral protein levels, as determined by the action of the M2–1 tetramer unique to these two viruses within the pneumovirus family.
Supporting information
PDB references: HRSV M2–1 core, 5nkx; 5noh
Supplementary Figures S1 and S2. DOI: https://doi.org/10.1107/S2053230X17017381/ow5002sup1.pdf
Acknowledgements
We thank members of the Tidow and Prat-Gay laboratories for valuable discussions and Katharina Veith for technical assistance. We are very grateful to the staff of beamlines P12, P13 and P14 at EMBL/DESY, Hamburg. The Sample Preparation and Characterization (SPC) Facility of EMBL Hamburg is acknowledged for support in crystallization screening. Author contributions are as follows: conceptualization, GP-G and HT; investigation, IGM, IJ, YAH and SE; analysis and interpretation, IGM, IJ, YAH, SE, MS, MGA, GP-G and HT; manuscript writing, IGM, IJ, GP-G and HT; funding acquisition and supervision, GP-G and HT. We declare no competing financial interests.
Funding information
Support from the European Community Research Infrastructure Action under the FP7 is acknowledged for access to EMBL/DESY, Hamburg. IJ, YAH and HT are supported by the excellence cluster `The Hamburg Center for Ultrafast Imaging – Structure, Dynamics and Control of Matter at the Atomic Scale' of the German Research Foundation (DFG). In addition, HT is grateful for support by an Emmy Noether Fellowship from the German Research Foundation (DFG). IGM is grateful for the financial support provided by Boehringer Ingelheim Fonds to travel to Hamburg, Germany to perform parts of these experiments.
References
Afonine, P. V., Grosse-Kunstleve, R. W., Echols, N., Headd, J. J., Moriarty, N. W., Mustyakimov, M., Terwilliger, T. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2012). Acta Cryst. D68, 352–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Afonso, C. L. et al. (2016). Arch. Virol. 161, 2351–2360. CrossRef CAS PubMed Google Scholar
Amarasinghe, G. K. et al. (2017). Arch Virol. 162, 2493–2504. CrossRef CAS PubMed Google Scholar
Blondot, M.-L., Dubosclard, V., Fix, J., Lassoued, S., Aumont-Nicaise, M., Bontems, F., Eléouët, J.-F. & Sizun, C. (2012). PLoS Pathog. 8, e1002734. CrossRef PubMed Google Scholar
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12–21. Web of Science CrossRef CAS IUCr Journals Google Scholar
Collins, P. L., Chanock, R. M. & Murphy, B. R. (2001). Fields Virology, 4th ed., edited by D. M. Knipe & P. M. Howley, pp. 1443–1486. Philadelphia: Lippincott, Williams & Wilkins. Google Scholar
Collins, P. L., Hill, M. G., Camargo, E., Grosfeld, H., Chanock, R. M. & Murphy, B. R. (1995). Proc. Natl Acad. Sci. USA, 92, 11563–11567. CrossRef CAS PubMed Google Scholar
Collins, P. L., Hill, M. G., Cristina, J. & Grosfeld, H. (1996). Proc. Natl Acad. Sci. USA, 93, 81–85. CrossRef CAS PubMed Google Scholar
DeLano, W. L. (2002). PyMOL. https://www.pymol.org. Google Scholar
Emsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126–2132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Esperante, S. A., Chemes, L. B., Sánchez, I. E. & de Prat-Gay, G. (2011). Biochemistry, 50, 8529–8539. CrossRef CAS PubMed Google Scholar
Esperante, S. A., Noval, M. G., Altieri, T. A., de Oliveira, G. A., Silva, J. L. & de Prat-Gay, G. (2013). Biochemistry, 52, 6779–6789. CrossRef CAS PubMed Google Scholar
Evans, P. (2006). Acta Cryst. D62, 72–82. Web of Science CrossRef CAS IUCr Journals Google Scholar
Franke, D., Petoukhov, M. V., Konarev, P. V., Panjkovich, A., Tuukkanen, A., Mertens, H. D. T., Kikhney, A. G., Hajizadeh, N. R., Franklin, J. M., Jeffries, C. M. & Svergun, D. I. (2017). J. Appl. Cryst. 50, 1212–1225. Web of Science CrossRef CAS IUCr Journals Google Scholar
Franke, D. & Svergun, D. I. (2009). J. Appl. Cryst. 42, 342–346. Web of Science CrossRef CAS IUCr Journals Google Scholar
Hardy, R. W. & Wertz, G. W. (1998). J. Virol. 72, 520–526. CAS PubMed Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Konarev, P. V., Volkov, V. V., Sokolova, A. V., Koch, M. H. J. & Svergun, D. I. (2003). J. Appl. Cryst. 36, 1277–1282. Web of Science CrossRef CAS IUCr Journals Google Scholar
Lay, M. K., González, P. A., León, M. A., Céspedes, P. F., Bueno, S. M., Riedel, C. A. & Kalergis, A. M. (2013). Microbes Infect. 15, 230–242. CrossRef CAS PubMed Google Scholar
Leyrat, C., Renner, M., Harlos, K., Huiskonen, J. T. & Grimes, J. M. (2014). Elife, 3, e02674. CrossRef PubMed Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
Myers, J. K., Pace, C. N. & Scholtz, J. M. (1995). Protein Sci. 4, 2138–2148. CrossRef CAS PubMed Google Scholar
Pretel, E., Camporeale, G. & de Prat-Gay, G. (2013). PLoS One, 8, e74338. CrossRef PubMed Google Scholar
Reed, G., Jewett, P. H., Thompson, J., Tollefson, S. & Wright, P. F. (1997). J. Infect. Dis. 175, 807–813. CrossRef CAS PubMed Google Scholar
Svergun, D. I. (1992). J. Appl. Cryst. 25, 495–503. CrossRef CAS Web of Science IUCr Journals Google Scholar
Tanner, S. J., Ariza, A., Richard, C.-A., Kyle, H. F., Dods, R. L., Blondot, M.-L., Wu, W., Trincão, J., Trinh, C. H., Hiscox, J. A., Carroll, M. W., Silman, N. J., Eléouët, J.-F., Edwards, T. A. & Barr, J. N. (2014). Proc. Natl Acad. Sci. USA, 111, 1580–1585. CrossRef CAS PubMed Google Scholar
Tran, T.-L., Castagné, N., Dubosclard, V., Noinville, S., Koch, E., Moudjou, M., Henry, C., Bernard, J., Yeo, R. P. & Eléouët, J.-F. (2009). J. Virol. 83, 6363–6374. CrossRef PubMed CAS Google Scholar
Yu, Q., Hardy, R. W. & Wertz, G. W. (1995). J. Virol. 69, 2412–2419. Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.