12-Fold symmetry of the putative portal protein from the Thermus thermophilus bacteriophage G20C determined by X-ray analysis
aYork Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, England, bDepartment of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA, cWaksman Institute for Microbiology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA, and dSt Petersburg State Polytechnical University, St Petersburg 195251, Russian Federation
*Correspondence e-mail: email@example.com
In tailed bacteriophages and several animal viruses, the portal protein forms the gateway through which viral DNA is translocated into the head structure during viral particle assembly. In the mature virion the portal protein exists as a dodecamer, while recombinant portal proteins from several phages, including SPP1 and CNPH82, have been shown to form 13-subunit assemblies. A putative portal protein from the thermostable bacteriophage G20C has been cloned, overexpressed and purified. Crystals of the protein diffracted to 2.1 Å resolution and belonged to space group P4212, with unit-cell parameters a = b = 155.3, c = 115.4 Å. The unit-cell content and self-rotation function calculations indicate that the protein forms a circular 12-subunit assembly.
During the assembly of tailed dsDNA bacteriophages, a copy of the viral genome is packaged into a preformed protein shell known as a procapsid. The portal protein, a circular oligomer, is embedded into one of the vertices of the icosahedral procapsid (Rao & Feiss, 2008; Casjens, 2011). The portal protein primarily functions to connect other viral proteins to the procapsid and as the gateway through which DNA is translocated into the procapsid and out of the mature capsid. Typically, following replication of the viral genome, a complex comprising the small and large terminase proteins and the viral DNA binds to the portal protein to form an ATPase-driven DNA-translocating motor. The motor drives the viral DNA through the portal protein and into the procapsid, where the DNA is packaged to near-crystalline density. Following DNA packaging and dissociation of the terminase complex, the portal protein binds components of the tail structure to complete the viral assembly process. On infection of a host cell, DNA leaves the mature capsid through the portal protein and tail structure.
In functional mature viral particles and following isolation in complex with tail proteins, the portal proteins from several bacteriophages, such as SPP1 and T3, have consistently been identified as dodecameric rings with 12-fold rotational symmetry (Lurz et al., 2001; Donate et al., 1988; Rao & Feiss, 2008). These results strongly suggest that the biologically relevant oligomeric state of these portal proteins is a dodecamer. Following heterologous expression, however, viral portal proteins have been found to display 11-fold, 12-fold, 13-fold or 14-fold symmetry (Trus et al., 2004; Rao & Feiss, 2008). For example, the SPP1 and CNPH82 portal proteins exhibit 13-fold symmetry following heterologous expression in Escherichia coli (Lebedev et al., 2007; Lurz et al., 2001; Luan et al., 2012). This suggests that dodecamers may be selected for, or their assembly may be promoted, during the native oligomerization process and that the dodecameric arrangement of the portal protein is important for its function (Rao & Feiss, 2008; Lurz et al., 2001).
Despite the portal proteins from different tailed bacteriophages varying significantly in both amino-acid sequence and molecular mass, they all assemble into circular homo-oligomers that have a turbine-like shape and contain a central channel for DNA translocation (Luan et al., 2012; Orlova et al., 1999). The X-ray structures of several bacteriophage portal proteins subsequently revealed additional common structural features and shed light on the mode of action of the portal proteins (Lebedev et al., 2007; Olia et al., 2011; Simpson et al., 2001). Common features of bacteriophage portal proteins include the presence of negatively charged residues lining the central channel, which would favour translocation of negatively charged DNA through the channel, and several conserved structural motifs. One such motif is formed by three α-helices comprising two tunnel helices, a perpendicular long helix and the tunnel loop. Another prominent conserved feature is a `clip' structure at the base of the portal protein (Lebedev et al., 2007; Rao & Feiss, 2008).
In this paper, we report the expression and purification of a putative portal protein from the Thermus thermophilus bacteriophage G20C, a close relative of bacteriophages P23-45 and P74-26 (Minakhin et al., 2008). Initial trials with the wild-type protein comprising 448 residues resulted in the production of an insoluble protein, but this was remedied by the use of N- and C-terminal truncations. A soluble and stable protein construct was crystallized and its symmetry was deduced from the crystal data. This provides a route to characterize the structure and mechanism of action of this protein.
In bacteriophage genomes, the gene encoding the portal protein is usually positioned directly after the gene encoding the large terminase. The gene encoding the G20C large terminase containing the classical Walker motifs was annotated by sequence homology to the large terminase from bacteriophage P23-45. Based on the genomic context and size of the gene, and the predicted secondary structure of the gene product, the gene directly following the large terminase is likely to encode the portal protein of G20C. The gene corresponds to the ORF P23p86 (UniProtKB/TrEMBL A7XXB9) in the closely related phage P23-45 (Minakhin et al., 2008).
Forward and reverse primers containing the NdeI and BamHI restriction-site sequences, respectively, were designed to incorporate a hexahistidine tag at the N-terminus of the DNA sequence encoding the truncated putative portal protein (Ser21–Asp438). The amplified segment was cloned into the pET-22a vector (Novagen). Sequencing and alignment were performed to confirm the sequence of the insert.
The truncated putative G20C portal protein bearing an N-terminal hexahistidine tag was overexpressed in E. coli strain B834. Cells were grown in Luria–Bertani medium with 100 µg ml−1 ampicillin at 310 K to mid-log phase (optical density of approximately 0.6 at 600 nm). Expression of the portal protein was induced by the addition of 0.1 mM IPTG followed by incubation at 289 K for 20 h. The cell pellet was lysed by sonication at 277 K in lysis buffer consisting of 50 mM HEPES pH 7.5, 1 M NaCl, 5 mM imidazole and one cOmplete EDTA-free Protease-Inhibitor Cocktail tablet per 25 ml of solution (Roche). Nickel-affinity chromatography was performed on a 5 ml HisTrap Chelating HP column (GE Healthcare). The binding and elution buffers consisted of 50 mM HEPES, 1 M NaCl pH 7.5 with 5 and 500 mM imidazole, respectively. The protein was concentrated to approximately 10 mg ml−1 using a 30 kDa Ultra centrifugal filter (Amicon). The protein sample was purified further on a Superose 6 size-exclusion column (GE Life Sciences) in buffer consisting of 10 mM HEPES, 1 M NaCl pH 7.5. Purity was assigned by denaturing PAGE. The molecular mass of the purified sample was confirmed by matrix-assisted laser desorption/ionization mass spectrometry (MALDI–MS).
The protein was concentrated to approximately 10 mg ml−1 using a 30 kDa Ultra centrifugal filter (Amicon) in 10 mM HEPES, 1 M NaCl pH 7.5. Crystallization conditions were evaluated using standard commercial screens [Index and MPD (Hampton Research) and PACT (Molecular Dimensions)]. Drops composed of 150 nl purified protein solution and 150 nl reservoir solution were dispensed by a Mosquito nanolitre pipetting robot (TTP LabTech) and equilibrated against 60 µl reservoir solution. The best crystal was obtained from the MPD screen with a reservoir consisting of 0.2 M magnesium chloride, 40%(v/v) MPD.
X-ray data were collected from a single cryocooled crystal on the I02 beamline at the Diamond Light Source, UK equipped with a Dectris Pilatus detector. Data were collected at a wavelength of 0.9795 Å with a crystal-to-detector distance of 321.2 mm, a 0.2° crystal rotation per image and a total crystal rotation range of 180°. The data were processed with XDS using the xia2 program (Kabsch, 2010; Winter et al., 2013). The self-rotation function was calculated using MOLREP (Vagin & Teplyakov, 2010) with a resolution range of 51–2.89 Å and a radius of integration of 52.5 Å.
The truncated construct comprising an N-terminal methionine-hexahistidine tag and the Ser21–Asp438 protein segment contains 425 amino acids with a theoretical molecular mass of 47.2 kDa. This protein construct was cloned and overexpressed in E. coli B834 cells. Homogeneous protein was obtained after Ni-affinity and size-exclusion chromatography. The molecular weight of the purified protein measured by MALDI–MS was 47.022 kDa, which is in good agreement with the theoretical value of 47.027 kDa for the protein construct lacking the initial methionine residue.
The best crystal was obtained using ∼10 mg ml−1 protein solution in 10 mM HEPES, 1 M NaCl pH 7.5 and a reservoir consisting of 40% MPD, 0.2 M magnesium chloride. The crystal belonged to the tetragonal space group P4212, with unit-cell parameters a = b = 155.3, c = 115.4 Å. A complete X-ray data set to a resolution of 2.1 Å was collected on the I02 beamline at the Diamond Light Source (Table 1).
‡Rmeas is the redundancy-independent R factor (Diederichs & Karplus, 1997).
§CC1/2 is the percentage correlation between intensities from random half data sets (Karplus & Diederichs, 2012).
The self-rotation function (Crowther, 1972) was calculated to deduce the internal symmetry of the portal protein. Peaks appearing in the κ = 180° section are related by a 30° rotation around the axis coinciding with the crystallographic fourfold axis (Fig. 1a). Consistent with the presence of the 12-fold rotation symmetry, there is a peak in the κ = 30° section (Fig. 1b) which is approximately 40% higher than the peaks in the κ = 32.7° (i.e. 360°/11) and κ = 27.7° (i.e. 360°/13) sections. Three subunits in the asymmetric unit correspond to a specific volume of 2.46 Å3 Da−1 and a solvent content of 50% (Matthews, 1968). The crystallographic fourfold symmetry generates a 12-subunit oligomer.
Following heterologous expression, the putative portal protein from bacteriophage G20C has been purified and crystallized. Analysis of the X-ray data collected to 2.1 Å resolution indicates that the protein forms a 12-subunit circular assembly. The genomic context, the size and the oligomeric state of the protein are consistent with it being a portal protein. Determination of the structure of this putative portal protein by molecular replacement is not possible owing to a complete lack of sequence similarity to portal proteins for which the three-dimensional structure is available. The next stage of this project will focus on experimental phasing.
We would like to thank Johan Turkenburg and Sam Hart for collecting the X-ray data. This project was supported by the Wellcome Trust (fellowship 081916 and equipment grant No. 077371 to AAA). Work in the laboratories of KS is supported by NIH grant R01 59295 and The Ministry of Education and Science of the Russian Federation, project No. 14.B25.31.0004.
Casjens, S. R. (2011). Nature Rev. Microbiol. 9, 647–657. Web of Science CrossRef CAS
Crowther, R. A. (1972). The Molecular Replacement Method, edited by M. G. Rossmann, pp. 173–178. New York: Gordon & Breach.
Diederichs, K. & Karplus, P. A. (1997). Nature Struct. Biol. 4, 269–275. CrossRef CAS PubMed Web of Science
Donate, L. E., Herranz, L., Secilla, J. P., Carazo, J. M., Fujisawa, H. & Carrascosa, J. L. (1988). J. Mol. Biol. 201, 91–100. CrossRef CAS PubMed Web of Science
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals
Karplus, P. A. & Diederichs, K. (2012). Science, 336, 1030–1033. Web of Science CrossRef CAS PubMed
Lebedev, A. A., Krause, M. H., Isidro, A. L., Vagin, A. A., Orlova, E. V., Turner, J., Dodson, E. J., Tavares, P. & Antson, A. A. (2007). EMBO J. 26, 1984–1994. Web of Science CrossRef PubMed CAS
Luan, W., Fesseler, J., Chechik, M., Buttner, C. R., Antson, A. A. & Smits, C. (2012). Acta Cryst. F68, 1267–1270. Web of Science CrossRef CAS IUCr Journals
Lurz, R., Orlova, E. V., Günther, D., Dube, P., Dröge, A., Weise, F., van Heel, M. & Tavares, P. (2001). J. Mol. Biol. 310, 1027–1037. Web of Science CrossRef PubMed CAS
Matthews, B. W. (1968). J. Mol. Biol. 33, 491–497. CrossRef CAS PubMed Web of Science
Minakhin, L., Goel, M., Berdygulova, Z., Ramanculov, E., Florens, L., Glazko, G., Karamychev, V. N., Slesarev, A. I., Kozyavkin, S. A., Khromov, I., Ackermann, H. W., Washburn, M., Mushegian, A. & Severinov, K. (2008). J. Mol. Biol. 378, 468–480. Web of Science CrossRef PubMed CAS
Olia, A. S., Prevelige, P. E., Johnson, J. E. & Cingolani, G. (2011). Nature Struct. Mol. Biol. 18, 597–603. Web of Science CrossRef CAS
Orlova, E. V., Dube, P., Beckmann, E., Zemlin, F., Lurz, R., Trautner, T. A., Tavares, P. & van Heel, M. (1999). Nature Struct. Biol. 6, 842–846. Web of Science PubMed CAS
Rao, V. B. & Feiss, M. (2008). Annu. Rev. Genet. 42, 647–681. Web of Science CrossRef PubMed CAS
Simpson, A. A., Leiman, P. G., Tao, Y., He, Y., Badasso, M. O., Jardine, P. J., Anderson, D. L. & Rossmann, M. G. (2001). Acta Cryst. D57, 1260–1269. Web of Science CrossRef CAS IUCr Journals
Trus, B. L., Cheng, N., Newcomb, W. W., Homa, F. L., Brown, J. C. & Steven, A. C. (2004). J. Virol. 78, 12668–12671. Web of Science CrossRef PubMed CAS
Vagin, A. & Teplyakov, A. (2010). Acta Cryst. D66, 22–25. Web of Science CrossRef CAS IUCr Journals
Winter, G., Lobley, C. M. C. & Prince, S. M. (2013). Acta Cryst. D69, 1260–1273. Web of Science CrossRef CAS IUCr Journals
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.