research papers
Structural analysis of a replication protein encoded by a plasmid isolated from a multiple sclerosis patient
aSchaller Research Group at the University of Heidelberg, Heidelberg, Germany, bDepartment of Infectious Diseases, Virology, University of Heidelberg, Heidelberg, Germany, cStructural Biology Group, European Synchrotron Radiation Facility (ESRF), Grenoble, France, dGerman Cancer Research Center (DKFZ), Heidelberg, Germany, and eBiosciences Faculty, University of Heidelberg, Heidelberg, Germany
*Correspondence e-mail: t.bund@dkfz-heidelberg.de, g.hansman@dkfz.de
Bovine meat and milk factors (BMMFs) are circular, single-stranded episomal DNAs that have been detected in bovine meat and milk products. BMMFs are thought to have roles in human malignant and degenerative diseases. BMMFs encode a replication initiator protein (Rep) that is actively transcribed and translated in human cells. In this study, a Rep WH1 domain encoded on a BMMF (MSBI1.176) isolated from a multiple sclerosis human brain sample was determined to 1.53 Å resolution using X-ray crystallography. The overall structure of the MSBI1.176 WH1 domain was remarkably similar to other Rep structures, despite having a low (28%) amino-acid sequence identity. The MSBI1.176 WH1 domain contained elements common to other Reps, including five α-helices, five β-strands and a hydrophobic pocket. These new findings suggest that the MSBI1.176 Rep might have comparable roles and functions to other known Reps of different origins.
Keywords: bovine meat and milk factors; episomal circular DNA; trans-acting replication initiator protein; multiple sclerosis; X-ray crystal structure.
PDB reference: MSBI1.176 WH1 domain, 6h24
1. Introduction
The consumption of bovine meat and milk is considered to be a risk factor for the development of colon and breast cancers (Chan et al., 2011; Corpet, 2011; Huxley et al., 2009). Indeed, epidemiologic data suggest that there is a correlation of these cancers with the consumption of bovine products from cattle derived from Eurasian aurochs (zur Hausen & de Villiers, 2015; zur Hausen et al., 2017; zur Hausen, 2012). Recently, it was suggested that bovine meat and milk factors (BMMFs), which are circular, single-stranded episomal DNAs (<3 kb) that are found in bovine meat and milk products, might represent a possible etiological agent of such diseases (Funk et al., 2014; Falida et al., 2017; zur Hausen et al., 2019). More recently, BMMFs were isolated from patients with multiple sclerosis and studies suggested these to be a possible infectious agent of this disease (Whitley et al., 2014; Lamberto et al., 2014; Gunst et al., 2014; zur Hausen et al., 2019).
Typically, BMMFs encode an autonomous plasmid trans-acting replication initiator protein (Rep). Rep binds at an origin of replication on the DNA (termed ori) and in most cases comprises a set of repetitive DNA elements (termed iterons), which are present within most BMMFs (zur Hausen et al., 2017). Replication of various plasmids, including circular Rep-encoding single-stranded (CRESS) DNA viruses, also requires the binding of the Rep to a specific DNA sequence (Kornberg & Baker, 1992). Within prokaryotes, Rep plays a central role in maintaining the plasmid copy number, as reported for the F plasmid in Escherichia coli (Kline, 1985). This regulation is also critical for the replication of plasmid-derived, bacteriophage-like or virus-like DNA genomes (Ruiz-Masó et al., 2015). Reps are essential for the replication of multidrug-resistant bacteria in humans (Schumacher et al., 2014) and studies have suggested that Reps have a role in transmissible amyloid proteinopathy (Molina-García et al., 2018; Giraldo et al., 2016, 2011).
Recently, an episomal circular DNA (isolate MSBI1.176, accession LK931491.1) was isolated from a brain sample of a patient with multiple sclerosis (Whitley et al., 2014). The MSBI1.176-encoded Rep exhibits 98% amino-acid sequence identity to a Rep encoded on the Sphinx-1.76 genome (GenBank ADR65123.1 and HQ444404.1), which was isolated from culture and brain preparations of transmissible encephalopathy-related agents (Manuelidis, 2011). Moreover, there were indications of the detection of Sphinx-1.76-encoded Reps in neural cells (GT1 cell line) and brain samples of mouse CNS, hamster CNS and human glioblastoma based on Sphinx-1.76-correlated antibodies (Yeh et al., 2017). Serology based on the MSBI1.176 Rep antigen showed positive immune responses for healthy human blood donors and indicated a possible pre-exposure towards these agents (Eilebrecht et al., 2018). Therefore, deciphering the functions of BMMFs in human malignant and degenerative disease is becoming increasingly important.
The X-ray crystal structures of Reps have been well documented and the structural basis for autonomous replication has been described (Giraldo et al., 2003; Komori et al., 1999; Nakamura et al., 2007; Swan et al., 2006). Reps are composed of two winged-helix domains (termed WH1 and WH2) that are essentially a fused N- and C-terminal protein. Reps transform between monomeric and dimeric forms depending on their specific function and binding to DNA (see Forest & Filutowicz, 2003). Large structural changes involving both domains complement these oligomeric forms. The structural transformation requires certain α-helices and β-strands on the Rep to be refolded and/or shifted (Nakamura et al., 2007). In the dimeric form, the Rep functions as a repressor, where WH2 binds to each operator DNA repeat and WH1 functions to form the dimerization interface. In the monomeric form, the Rep functions as a replication initiator, where WH1 undergoes a large structural movement, i.e. dimer dissociation, thereby allowing WH1 to bind to the iteron end, while WH2 binds to the opposite iteron end.
In this study, we determined the X-ray
of MSBI1.176 WH1 in the dimeric form to 1.53 Å resolution. Overall, the structures of MSBI1.176 WH1 and other Reps were remarkably similar, despite having low amino-acid sequence identities. Although structural differences were also observed, our findings suggested that the MSBI1.176 Rep might have similar roles and functions to other Reps. Moreover, this new structural information could be important for defining vulnerable regions on the Rep and possibly aid in future inhibitor design.2. Materials and methods
2.1. Protein expression and purification
The MSBI1.176 DNA (LK931491.1) was isolated from a brain sample of a patient with multiple sclerosis (Whitley et al., 2014). The MSBI1.176 WH1 domain (residues 1–135) was expressed in E. coli and purified as previously described for human norovirus protruding domains (Hansman et al., 2011). Briefly, the codon-optimized WH1 was cloned in a modified pMal-c2X expression vector (GeneArt) and transformed into E. coli BL21 cells for protein expression. Transformed cells were grown in LB medium supplemented with 100 µg ml−1 ampicillin for 4 h at 37°C. Expression was induced with 0.75 mM IPTG at an OD600 of 0.7 for 18 h at 22°C. The cells were harvested by centrifugation at 6000 rev min−1 for 15 min and were disrupted by sonication on ice. His-tagged MBSI1.176 WH protein was initially purified from an Ni column (Qiagen), dialyzed in gel-filtration buffer (GFB; 25 mM Tris–HCl pH 7.6, 300 mM NaCl) with 10 mM imidazole and digested with HRV-3C protease (Novagen) overnight at 4°C. The cleaved MSBI1.176 WHI domain was then applied onto the Ni column again to separate and collect the cleaved protein, and dialyzed in GFB overnight at 4°C. The MSBI1.176 WH1 protein was further purified by concentrated to 5 mg ml−1 and stored in GFB at 4°C.
2.2. Crystallization
Crystals of MSBI1.176 WH1 grew using the hanging-drop vapor-diffusion method at 18°C in ∼6–10 days in a 1:1 mixture of protein sample and mother liquor (0.2 M magnesium acetate, 20% PEG 3350). Prior to data collection, MSBI1.176 WH1 crystals were transferred to a cryoprotectant containing the mother liquor with 40% PEG 3350, followed by flash-cooling in liquid nitrogen.
2.3. Data collection and processing, and refinement
X-ray diffraction data for the MSBI1.176 WH1 domain were collected on beamlines ID23-1 and ID30B at the European Synchrotron Radiation Facility (ESRF). For the single-wavelength anomalous diffraction using native sulfur (S-SAD) experiments, diffraction data were collected from seven crystals at λ = 1.850 Å on beamline ID23-1 equipped with a Dectris PILATUS 6M pixel-array detector. The X-ray beam size at the sample position was 50 µm and the dimensions of the crystals were approximately 70 × 70 × 200 µm. To decrease the radiation-damage effects, the helical data-collection strategy was applied. One native data set was collected on ID23-1 at λ = 0.972 Å for initial phase extension and a second native data set was collected on ID30B at λ = 0.979 Å for structure Optimal experimental parameters for data collection were designed using BEST (Bourenkov & Popov, 2010) incorporated into the MXCuBE software (Gabadinho et al., 2010) at the ESRF.
The single native data set was processed with XDS, while the multiple data sets for S-SAD were processed with XDS and then merged using XSCALE (Kabsch, 2010). Our initial attempts to solve the structure of MSBI1.176 WH1 by using prokaryotic RepA proteins as search models failed. Therefore, several data sets were collected for further processing using S-SAD (Liu et al., 2012). The S-SAD phasing protocol was carried out using the SHELXC/D/E pipeline as implemented in HKL2MAP (Sheldrick, 2010). 1000 trials were carried out for determination in SHELXC. Using a resolution of 2.3 Å and an anomalous signal truncated to 3.1 Å, where the self-correlation coefficient for the anomalous signal decreased to 25%, SHELXD correctly identified all 24 sulfur sites. 415 residues were built automatically by SHELXE, which resulted in an interpretable map for further processing. Finally, ARP/wARP was used for automated model building based on the first native data set collected (Langer et al., 2008). The structure was refined using the second high-resolution data set in multiple rounds of manual model building in Coot (Emsley et al., 2010) and PHENIX (Adams et al., 2010). The structure was validated using MolProbity and PROCHECK. Interactions were analyzed using Accelrys Discovery Studio (v.4.1), with hydrogen-bond distances of between 2.4 and 3.5 Å. Figures and protein contact potentials were generated using PyMOL. Atomic coordinates and structure factors have been deposited in the Protein Data Bank (PDB) with accession code 6h24.
3. Results
3.1. X-ray of MSBI1.176 WH1
The structure of MSBI1.176 WH1 (residues 2–133) was solved to 1.53 Å resolution (data statistics are given in Table 1). The consisted of one WH1 dimer, i.e. two protomers (termed A and B). The electron density was well resolved for most of the protein (average B factor of 29.98 Å2). However, residues 36–39 could not be fitted into the B protomer owing to a lack of discernible electron density, although the electron density was distinct in the other protomer. The WH1 structure comprised five α-helices (α1–α5) and five β-strands (β1–β5) in each protomer (Fig. 1). The A and B protomers were closely related (r.m.s.d. of 0.37 Å); however, a minor structural shift was observed at the β2–β3 hairpin, suggesting some flexibility of this region. Importantly, with the improved resolution over those of previous structures (Giraldo et al., 2003; Komori et al., 1999; Nakamura et al., 2007; Swan et al., 2006), water molecules were effectively added to this Rep structure.
|
3.2. Structural comparison with other replication proteins (Reps)
A database search for closely related structures and sequences revealed that MSBI1.176 WH1 has 28% and 17% amino-acid identity to Pseudomonas syringae RepA WH1 (RepA; PDB entry 1hkq; Giraldo et al., 2003) and E. coli RepE (PDB entry 1rep; Komori et al., 1999), respectively. Similar to that from RepA, the MSBI1.176 WH1 was also folded as the replication-inert dimer, while RepE (a WH1–2 construct) was crystallized in the monomeric initiator form. Superposition of MSBI1.176 WH1 and RepA WH1 showed that these two domains were structurally similar (r.m.s.d. of 1.20 Å), with both having the typical five α-helices and five β-strands (Fig. 2). A number of structural similarities and differences were observed between these two Reps.
The dimeric interface of MSBI1.176 and RepA, which involves β3–β4, was held with a similar number of main-chain binding interactions, although not at identical residues (Fig. 3a). This result suggested that the dimeric interface feature was likely to be related to function among the diverse Rep isolates. We also observed that water molecules bound at this dimeric interface (data not shown). However, how these water molecules stabilize the dimeric interface and/or are displaced after binding DNA and changing conformation is not yet known.
We also observed that the MSBI1.176 WH1 region that comprised α1–α2–α5 was similar in orientation to that in RepA, having the typical α1–α2 bend and thereby making a V-shaped structure (Fig. 3b). This region, which forms the linker to WH2, also contains the hydrophobic heptad pocket, which typically contains a number of leucine residues (for example Leu12, Leu19 and Leu26 in RepA, and Leu24, Leu31 and Leu39 in RepE). The MSBI1.176 WH1 hydrophobic pocket also contained three leucine residues, i.e. Leu11, Leu18 and Ile25, which were similarly positioned as in RepA. Not surprisingly, water molecules were absent in the hydrophobic pocket of MSBI1.176 WH1.
In general, many of the structural features of MSBI1.176 WH1 are conserved in other known dimeric Rep structures (Giraldo et al., 2003; Komori et al., 1999; Nakamura et al., 2007; Swan et al., 2006). However, loop movements and different α-helices and β-strands have been observed among the different structures. In the case of MSBI1.176, the β2–β3 hairpin shifted approximately 23 Å when compared with the RepA β2–β3 hairpin (Fig. 4). In the case of RepE, the equivalent β2–β3 hairpin (residues 97–110) was not added to the structure since electron density was lacking (Giraldo et al., 2003, 2011). It was suggested that the RepE β2–β3 hairpin was flexible and this flexibility might function by destabilizing the antiparallel β2–β3 hairpin and blocking dimerization (Giraldo et al., 2003, 2011). However, the MSBI1.176 WH1 β2–β3 hairpin was clearly held by direct main-chain interactions, not unlike the RepA structure (Fig. 3a). Moreover, we perceived that water-mediated interactions at this dimeric interface might also add further stability to this hairpin (Fig. 3b).
3.3. Modeling of DNA binding
Previous modeling analysis of the RepA domain indicated that six basic residues on α2, β2, β3 and adjacent loops (Lys74, Arg81, Arg91, Arg93, Lys62 and Arg78; RepA numbering) might follow the minor groove of a DNA backbone (Giraldo et al., 2003). In the MSBI1.176 WH1 structure, six basic residues were also found in this region, i.e. Lys69, Lys73 (both located on α4), Lys85 (β2), Arg90 (β3), Arg78 and Arg96 (both on adjacent loops). Although the electron density for the Lys73 (chains A and B), Lys85 (chains A and B) and Arg90 (chain A) side chains in MSBI1.176 WH1 was weak, two of these residues (Arg78 and Arg96) were at equivalent positions in RepA and were suggested to interact with a DNA molecule (Giraldo et al., 2003; Fig. 5). The function of the MSBI1.176 WH1 β2–β3 sheet orientation is not obvious, although MSBI1.176 WH1 has a three-amino-acid insertion in the β2 strand that extended the sheet. Presumably, this insertion elegantly shifted Lys85 (β2) and Arg90 (β3) on the β2–β3 sheet in MSBI1.176 WH1 when compared with the equivalent RepA residues Arg81 and Arg91. Seen in another way, the MSBI1.176 β2–β3 sheet was rather flattened, whereas the RepA β2–β3 hairpin was hooked in the opposite direction (Fig. 2).
4. Conclusions
Reps are important for the replication of plasmids or autarkic episomal et al., 2014). Rep-specific serum antibodies have been found in a set of healthy human blood-bank donors, indicating general human exposure to such agents (Eilebrecht et al., 2018). The discovery that this MSBI1.176-encoded Rep WH1 protein was closely similar to a prokaryote Rep structure might have important consequences and point towards a possible disease-correlated adaptation of these agents towards humans. This new structural information might aid in the development and design of therapeutic/preventive drugs that can inhibit these Reps of diverse origin.
in different hosts. It is speculated that such proteins and Rep-encoding DNAs might be linked to disease. Thus, careful structural and functional characterization of Reps is needed. The Rep described in this study is encoded by the human bioactive bovine meat and milk factor MSBI1.176, which was isolated from a patient with multiple sclerosis (WhitleyAcknowledgements
This work was supported by the CHS Foundation, Heidelberg. We thank the protein crystallization platform facility (CellNetworks of the University of Heidelberg) for initial crystal screening. We acknowledge the European Synchrotron Radiation Facility (ESRF) for the provision of synchrotron-radiation facilities. We thank Ethel-Michele de Villiers for the helpful discussions and for defining the importance of the MSBI1 Rep.
References
Adams, P. D., Afonine, P. V., Bunkóczi, G., Chen, V. B., Davis, I. W., Echols, N., Headd, J. J., Hung, L.-W., Kapral, G. J., Grosse-Kunstleve, R. W., McCoy, A. J., Moriarty, N. W., Oeffner, R., Read, R. J., Richardson, D. C., Richardson, J. S., Terwilliger, T. C. & Zwart, P. H. (2010). Acta Cryst. D66, 213–221. Web of Science CrossRef CAS IUCr Journals Google Scholar
Bourenkov, G. P. & Popov, A. N. (2010). Acta Cryst. D66, 409–419. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chan, D. S., Lau, R., Aune, D., Vieira, R., Greenwood, D. C., Kampman, E. & Norat, T. (2011). PLoS One, 6, e20456. CrossRef PubMed Google Scholar
Corpet, D. E. (2011). Meat Sci. 89, 310–316. CrossRef CAS PubMed Google Scholar
Eilebrecht, S., Hotz-Wagenblatt, A., Sarachaga, V., Burk, A., Falida, K., Chakraborty, D., Nikitina, E., Tessmer, C., Whitley, C., Sauerland, C., Gunst, K., Grewe, I. & Bund, T. (2018). Sci. Rep. 8, 2851. CrossRef PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Falida, K., Eilebrecht, S., Gunst, K., zur Hausen, H. & de Villiers, E.-M. (2017). Genome Announc. 5, e00266-17. CrossRef PubMed Google Scholar
Forest, K. T. & Filutowicz, M. S. (2003). Nature Struct. Biol. 10, 496–498. CrossRef PubMed CAS Google Scholar
Funk, M., Gunst, K., Lucansky, V., Muller, H., zur Hausen, H. & de Villiers, E.-M. (2014). Genome Announc. 2, e00846-14. CrossRef PubMed Google Scholar
Gabadinho, J., Beteva, A., Guijarro, M., Rey-Bakaikoa, V., Spruce, D., Bowler, M. W., Brockhauser, S., Flot, D., Gordon, E. J., Hall, D. R., Lavault, B., McCarthy, A. A., McCarthy, J., Mitchell, E., Monaco, S., Mueller-Dieckmann, C., Nurizzo, D., Ravelli, R. B. G., Thibault, X., Walsh, M. A., Leonard, G. A. & McSweeney, S. M. (2010). J. Synchrotron Rad. 17, 700–707. Web of Science CrossRef CAS IUCr Journals Google Scholar
Giraldo, R., Fernandez, C., Moreno-del Alamo, M., Molina-Garcia, L., Revilla-Garcia, A., Sanchez-Martinez, M. C., Gimenez-Abian, J. F. & Moreno-Diaz de la Espina, S. (2016). Prion, 10, 41–49. CrossRef CAS PubMed Google Scholar
Giraldo, R., Fernández-Tornero, C., Evans, P. R., Díaz-Orejas, R. & Romero, A. (2003). Nature Struct. Biol. 10, 565–571. CrossRef PubMed CAS Google Scholar
Giraldo, R., Moreno-Díaz de la Espina, S., Fernández-Tresguerres, M. E. & Gasset-Rosa, F. (2011). Prion, 5, 60–64. CrossRef CAS PubMed Google Scholar
Gunst, K., zur Hausen, H. & de Villiers, E.-M. (2014). Genome Announc. 2, e00847-14. CrossRef PubMed Google Scholar
Hansman, G. S., Biertümpfel, C., Georgiev, I., McLellan, J. S., Chen, L., Zhou, T., Katayama, K. & Kwong, P. D. (2011). J. Virol. 85, 6687–6701. CrossRef CAS PubMed Google Scholar
Hausen, H. zur (2012). Int. J. Cancer, 130, 2475–2483. PubMed Google Scholar
Hausen, H. zur, Bund, T. & de Villiers, E.-M. (2019). Int. J. Cancer, 144, 1574–1583. PubMed Google Scholar
Hausen, H. zur, Bund, T. & de Villiers, E.-M. (2017). Curr. Top. Microbiol. Immunol. 407, 83–116. PubMed Google Scholar
Hausen, H. zur & de Villiers, E.-M. (2015). Int. J. Cancer, 137, 959–967. PubMed Google Scholar
Huxley, R. R., Ansary-Moghaddam, A., Clifton, P., Czernichow, S., Parr, C. L. & Woodward, M. (2009). Int. J. Cancer, 125, 171–180. CrossRef PubMed CAS Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kline, B. C. (1985). Plasmid, 14, 1–16. CrossRef CAS PubMed Web of Science Google Scholar
Komori, H., Matsunaga, F., Higuchi, Y., Ishiai, M., Wada, C. & Miki, K. (1999). EMBO J. 18, 4597–4607. Web of Science CrossRef PubMed CAS Google Scholar
Kornberg, A. & Baker, T. A. (1992). DNA Replication, 2nd ed. New York: W. H. Freeman & Co. Google Scholar
Lamberto, I., Gunst, K., Müller, H., zur Hausen, H. & de Villiers, E.-M. (2014). Genome Announc. 2, e00848-14. CrossRef PubMed Google Scholar
Langer, G., Cohen, S. X., Lamzin, V. S. & Perrakis, A. (2008). Nature Protoc. 3, 1171–1179. Web of Science CrossRef CAS Google Scholar
Liu, Q., Dahmane, T., Zhang, Z., Assur, Z., Brasch, J., Shapiro, L., Mancia, F. & Hendrickson, W. A. (2012). Science, 336, 1033–1037. Web of Science CrossRef CAS PubMed Google Scholar
Manuelidis, L. (2011). J. Neurovirol. 17, 131–145. CrossRef CAS PubMed Google Scholar
Molina-García, L., Gasset-Rosa, F., Álamo, M. M., de la Espina, S. M. & Giraldo, R. (2018). Methods Mol. Biol. 1779, 289–312. PubMed Google Scholar
Nakamura, A., Wada, C. & Miki, K. (2007). Proc. Natl Acad. Sci. USA, 104, 18484–18489. CrossRef PubMed CAS Google Scholar
Robert, X. & Gouet, P. (2014). Nucleic Acids Res. 42, W320–W324. Web of Science CrossRef CAS PubMed Google Scholar
Ruiz-Masó, J. A., Machón. C., Bordanaba-Ruiseco, L., Espinosa, M., Coll, M. & Del Solar, G. (2015). Microbiol. Spectr. 3, PLAS-0035-2014. PubMed Google Scholar
Schumacher, M. A., Tonthat, N. K., Kwong, S. M., Chinnam, N. B., Liu, M. A., Skurray, R. A. & Firth, N. (2014). Proc. Natl Acad. Sci. USA, 111, 9121–9126. CrossRef CAS PubMed Google Scholar
Sheldrick, G. M. (2010). Acta Cryst. D66, 479–485. Web of Science CrossRef CAS IUCr Journals Google Scholar
Swan, M. K., Bastia, D. & Davies, C. (2006). Proc. Natl Acad. Sci. USA, 103, 18481–18486. CrossRef PubMed CAS Google Scholar
Whitley, C., Gunst, K., Muller, H., Funk, M., zur Hausen, H. & de Villiers, E.-M. (2014). Genome Announc. 2, e00849-14. CrossRef PubMed Google Scholar
Yeh, Y.-H., Gunasekharan, V. & Manuelidis, L. (2017). Proc. Natl Acad. Sci. USA, 114, 7118–7123. CrossRef CAS PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.