research papers
Structure of the cyanobactin oxidase ThcOx from Cyanothece sp. PCC 7425, the first structure to be solved at Diamond Light Source beamline I23 by means of S-SAD
aBSRC, University of St Andrews, North Haugh, St Andrews, Fife KY16 9ST, Scotland, bMarine Biodiscovery Center, Department of Chemistry, University of Aberdeen, Meston Walk, Aberdeen AB24 3UE, Scotland, cPharmacognosy Department, Faculty of Pharmacy, Mansoura University, Mansoura 35516, Egypt, dDiamond Light Source Ltd, Diamond House, Harwell Science and Innovation Campus, Didcot OX11 0DE, England, and eState Key Laboratory of Biotherapy and Collaborative Innovation Center for Biotherapy, West China Hospital, Sichuan University, Chengdu, People's Republic of China
*Correspondence e-mail: armin.wagner@diamond.ac.uk, naismith@st-andrews.ac.uk
Determination of protein crystal structures requires that the phases are derived independently of the observed measurement of diffraction intensities. Many techniques have been developed to obtain phases, including heavy-atom substitution,
and substitution during protein expression of the amino acid methionine with selenomethionine. Although the use of selenium-containing methionine has transformed the experimental determination of phases it is not always possible, either because the variant protein cannot be produced or does not crystallize. Phasing of structures by measuring the anomalous diffraction from S atoms could in theory be almost universal since almost all proteins contain methionine or cysteine. Indeed, many structures have been solved by the so-called native sulfur single-wavelength anomalous diffraction (S-SAD) phasing method. However, the anomalous effect is weak at the wavelengths where data are normally recorded (between 1 and 2 Å) and this limits the potential of this method to well diffracting crystals. Longer wavelengths increase the strength of the anomalous signal but at the cost of increasing air absorption and scatter, which degrade the precision of the anomalous measurement, consequently hindering phase determination. A new instrument, the long-wavelength beamline I23 at Diamond Light Source, was designed to work at significantly longer wavelengths compared with standard synchrotron beamlines in order to open up the native S-SAD method to projects of increasing complexity. Here, the first novel structure, that of the oxidase domain involved in the production of the natural product patellamide, solved on this beamline is reported using data collected to a resolution of 3.15 Å at a wavelength of 3.1 Å. The oxidase is an example of a protein that does not crystallize as the selenium variant and for which no suitable homology model for was available. Initial attempts collecting anomalous diffraction data for native sulfur phasing on a standard macromolecular crystallography beamline using a wavelength of 1.77 Å did not yield a structure. The new beamline thus has the potential to facilitate by native S-SAD phasing for what would previously have been regarded as very challenging cases with modestly diffracting crystals and low sulfur content.Keywords: cyanobactins; azoline oxidase; S-SAD; RIPPs; phasing; structure; sulfur; long wavelength.
PDB reference: ThcOx, 5lq4
1. Introduction
The majority of structures from macromolecular crystals can nowadays be solved by
However, in the absence of suitable homology models, experimental phasing is the method of choice to overcome the crystallographic This method requires measurements of the small anomalous differences arising when tuning the wavelength of the X-rays towards the absorption edges of atoms bound to the structures. The most successful label used is selenium by substituting the amino acid methionine by selenomethionine. However, this technique is not universal as it is not compatible with all expression systems and, even if the labelled protein can be produced, crystallization fails in some cases.The native sulfur single-wavelength anomalous diffraction (S-SAD) method can overcome this by exploiting the intrinsic anomalous signal from sulfur. Over the past years, several studies have demonstrated the applicability of this method to problems of increasing complexity (Liu et al., 2012; Weinert et al., 2015; El Omari et al., 2014; Rose et al., 2015). On standard experimental setups, typically optimized for wavelengths around the selenium K edge (λ = 0.97 Å) and operated in air, S-SAD experiments are performed in the wavelength range 1.7–2.3 Å. The wavelength choice is limited mainly by the increased background noise from air scattering and the reduced signal from air absorption, as well as the size of the detector, given that the diffraction angles needed to measure data at constant resolution increase with the wavelength. To measure the small anomalous signals, high-redundancy data from well diffracting crystals or merging data from multiple are needed. The sulfur K edge is at λ = 4.96 Å and the anomalous signal increases approximately with the cube of the wavelength as it moves towards the edge. The new long-wavelength MX beamline I23 at Diamond Light Source has been designed to provide an optimized environment for native SAD experiments, enabling experiments at longer wavelengths to increase the anomalous signal with minimal noise (Wagner et al., 2016).
Cyanobactins are a family of ribosomally synthesized and post-translationally modified et al., 2013) from cyanobacteria. Each product is tailored using a selected set of enzymes to produce linear or macrocyclic (Sivonen et al., 2010). The patellamide pathway (Schmidt et al., 2005) is the best characterized pathway. The first step in patellamide biosynthesis is the ATP-dependent dehydration of cysteine, threonine and serine by the heterocyclase PatD to form thiazoline, methyl oxazoline and oxazoline, respectively (Fig. 1). Following this cyclodehydration, the N-terminus is cleaved off by the PatA protease domain and subsequently macrocyclized by the macrocyclase domain of PatG. Structures have been determined for the heterocyclase (cyclic dehydratase; Koehnke et al., 2013, 2015), protease (Houssen et al., 2012; Agarwal et al., 2012), prenyltransferase (Bent et al., 2013), a conserved domain of unknown function (Mann et al., 2014) and macrocyclase (Koehnke et al., 2012; Agarwal et al., 2012) proteins from various cyanobactin pathways. The only enzyme function remaining to be solved in the patellamide cluster is that of the FMN-dependent cyanobactin azoline oxidase. The oxidase converts thiazolines to thiazoles, with some enzymes also capable of the oxidation of oxazolines to oxazoles (Fig. 1, Table 1). The oxidase domain is conserved in patellamide-like pathways either as a fused domain to the G protein or as a standalone protein (Martins & Vasconcelos, 2015); the exception to this is the trunkamide pathway, in which the final compounds are not oxidized.
(RiPPs; Arnison
|
Here, we present ThcOx, the first novel structure to be solved by in-vacuum long-wavelength macromolecular crystallography on beamline I23 at Diamond Light Source (Wagner et al., 2016), and its subsequent high-resolution structure solved by molecular replacement.
2. Materials and methods
2.1. Cloning, expression and purification of ThcOx
The ThcOx gene was amplified from gDNA of cultured Cyanothece sp. PCC 7425 obtained from the Pasteur Culture Collection of Cyanobacteria (Paris) and subcloned with an N-terminal Tobacco etch virus (TEV) protease-cleavable His6 tag followed by a four-glycine linker into a pJexpress 401 vector from DNA2.0 (Table 2). It was expressed in Escherichia coli BL21 (DE3) cells grown on autoinduction medium supplemented with 50 µM riboflavin for 48 h at 20°C. Cell pellets were resuspended in lysis buffer plus EDTA-free protease-inhibitor tablets (Roche) and DNase at 0.4 mg per gram of wet cell pellet. The resuspension was lysed by passage through a cell disruptor at 207 MPa (Constant Systems). The lysate was cleared by centrifugation (40 000g, 4°C, 20 min) and then loaded onto an Ni Sepharose 6 Fast Flow column (GE Healthcare) equilibrated with lysis buffer [150 mM NaCl, 20 mM Tris–HCl pH 8.0, 20 mM imidazole pH 8.0, 50 µM flavin mononucleotide (FMN), 3 mM β-mercaptoethanol (BME)]. The column was washed with lysis buffer and ThcOx was then eluted with elution buffer (150 mM NaCl, 20 mM Tris–HCl pH 8.0, 250 mM imidazole pH 8.0, 50 µM FMN, 3 mM BME). The protein was then passed over a desalting column (16/10 Desalting, GE Healthcare) back into the lysis buffer. TEV protease was added at a mass ratio of 1:10 and the protein was digested for 2 h at 20°C to remove the His6 tag. The sample was then loaded onto a second nickel column in lysis buffer and directly onto an ion-exchange column (HiTrap Q Sepharose FF, GE Healthcare) and eluted with a 0.15–1 M NaCl gradient. The peak fraction was then concentrated to 7.5 ml (Vivaspin concentrators, 30 kDa molecular-weight cutoff) and applied onto a Superdex 200 gel-filtration column (GE Healthcare) equilibrated with gel-filtration buffer (150 mM NaCl, 10 mM HEPES pH 7.4, 1 mM TCEP). The lack of FMN in the gel-filtration buffer combined with the elution fraction remaining yellow in colour confirmed that FMN was indeed bound to the protein. The protein was concentrated to 4 mg ml−1 for crystallography. The purity of the protein was confirmed by SDS–PAGE analysis and its identity was confirmed by (MS).
|
2.2. Crystallization
ThcOx protein was mixed in a 1:1.1 ratio with the peptide NILPQQGQPVIR plus an additional 1 mM FMN cofactor and incubated overnight at 273 K prior to setting up crystallization trials. Initial crystallization screening was performed by sitting-drop vapour diffusion in a 96-well Intelli-Plate (300 nl protein mixture plus 150 nl reservoir solution) using a Gryphon robot (Art Robbins; https://www.artrobbins.com/). The only crystals produced from a wide range of screening trials were found in condition No. 16 from Wizard Classic Screen 2 (1.0 M sodium citrate, 0.1 M CHES pH 9.5) at 293 K. These crystals were subsequently optimized in Linbro hanging-drop plates using a drop volume of 2.5 µl + 1.5 µl over a well of 0.5 ml, with the best crystals found in a condition consisting of 1.0 M sodium citrate, 0.1 M CHES pH 7.75 at 293 K (Table 3, Fig. 2a).
|
2.3. Data collection, processing and phasing
The best crystals of ThcOx were cryoprotected in paraffin oil and either flash-cooled directly at the synchrotron or cooled and shipped to the synchrotron. The crystals showed a high level of non-isomorphism and diffracted to between 4.0 and 2.65 Å resolution (Figs. 2b and 2c). In order to phase the protein structure by several failed attempts were made using such proteins as putative nitroreductases from Anabaena variabilis (PDB entry 3eo7) and Ralstonia eutropha (PDB entry 3hj9), both of which were deposited by the Joint Center for Structural Genomics (JCSG). We were consistently unable to crystallize a selenomethionine variant of the protein under either the native or alternative crystallization conditions. A range of heavy-atom soaks were prepared including sodium bromide, lead(II) acetate, potassium tetrabromoplatinate(II), sodium iodide and potassium iodide, but none of these were successful in obtaining phases. The ThcOx protein contains a total of ten S atoms after removal of the His6 tag (Table 2). With a total of 477 amino-acid residues, the Bijvoet ratio at a wavelength of 2.0 Å is 0.98%. Our attempts to solve the structure by S-SAD phasing with in-house data collected at a wavelength of 1.54 Å or on beamline I02 at Diamond Light Source using data collected at a wavelength of 1.77 Å both failed owing to the low anomalous signal at those wavelengths.
We therefore used the new long-wavelength in-vacuum beamline I23 with its custom detector to investigate the use of longer wavelengths for phasing. A wavelength of 3.1 Å appeared to be a good compromise between the increase in the anomalous signal and the decrease in data quality owing to the greater sample absorption at longer wavelengths. At this wavelength the Bijvoet ratio is 2.1%, which is more than 2.5 times the value at a wavelength of 1.77 Å. Owing to the large crystal-to-crystal variability, 48 crystals had to be transferred into the beamline vacuum end station before a crystal of suitable quality could be found. The best crystal diffracted to a resolution limit of 3.15 Å. Data were collected with the large in-vacuum PILATUS 12M detector (Dectris, Switzerland) at a wavelength of 3.1 Å with 0.1° oscillations and 0.1 s exposure time. A total of 4000 images were measured in inverse-beam mode with wedge sizes of 20° and were subsequently processed and scaled with XDS/XSCALE (Kabsch, 2010). No additional absorption correction was performed. The was determined with SHELXD (Sheldrick, 2010) with a success rate of one hit in 1000 trials using a resolution cutoff of 4.2 Å when searching for 18 sites. Using this AutoSol (Terwilliger et al., 2009) found noncrystallographic symmetry, and after density modification automatic model building placed about 20% of the amino-acid residues in an electron-density map of sufficient quality to allow manual model building. The accuracy of the anomalous measurements can be seen in a phased anomalous difference map, which shows strong peaks for the S atoms (Fig. 3).
This initial model was refined and then used as a search model for xia2 (Winter, 2010) using XDS (Kabsch, 2010) and SCALA (Evans, 2006). The structure was determined using Phaser (McCoy et al., 2007) as part of the CCP4 suite (Winn et al., 2011). All data-collection statistics can be found in Table 4.
to solve a previous data set of higher resolution collected on beamline I02 at Diamond Light Source. The high-resolution data were processed and scaled in
|
2.4. Refinement
The high-resolution model was refined by iterative cycles of manual rebuilding using Coot (Emsley et al., 2010) and using REFMAC5 (Murshudov et al., 2011) in the CCP4 suite (Winn et al., 2011). TLS restraints were calculated using the TLSMD server (Painter & Merritt, 2006) and were used in (Winn et al., 2001). Final can be found in Table 5. The structure was validated using MolProbity (Chen et al., 2010) and the coordinates were deposited in the Protein Data Bank (PDB entry 5lq4). The anomalous data have also been deposited.
|
3. Results
The protein crystals belonged to P412121, with two molecules found in the Simple phase extension from the long-wavelength data used for phasing was not possible owing to non-isomorphism between the two crystals. The c axis is more than 10% larger in the long-wavelength data, and the unit-cell volume is about 15% larger compared with the data obtained from the crystal which diffracted to the highest resolution. Hence, was needed to phase the high-resolution data. This crystal-to-crystal variability was typical. The structure is composed of a novel domain and an FMN nitroreductase domain. Each molecule in the consists of 16 α-helices and 17 β-sheets (Fig. 4; structural figures were created using PyMOL v.1.5.0.4 from Schrödinger). The final structure contains residues 3–222 and 233–469 in chain A and 1–222 and 231–473 in chain B. In both monomers, the missing residues are located between α-helices 7 and 8 (the linker between the two domains) and at the N- and C-termini. These residues are presumed to be disordered in the structure. There are four cysteine residues per molecule, none of which form disulfide bonds.
PISA (Krissinel & Henrick, 2007; Krissinel, 2010) analysis infers that the dimer is biologically relevant, with each molecule containing a single FMN. The N-terminal domain contains a portion (residues 7–86) which, by visual analysis, supported by Phyre2 (Kelley & Sternberg, 2009), possesses the same fold as the leader binding domain of TruD (Koehnke et al., 2013), the so-called `peptide-clamp domain'. These domains are common in the RiPP family (Melby et al., 2014), and its orientation here suggests a possible role in function. The remaining residues in domain 1 (87–194) form a second peptide-clamp domain, but its binding site is buried by an interaction with the other clamp domain and thus it is unlikely to be active.
The C-terminal domain (residues 323–469) contains the FMN molecule and has a high degree of homology (r.m.s.d. = 1.45 Å) to the putative nitroreductase from A. variabilis (PDB entry 3eo7; residues 97–245; Fig. 5). Crystals were only observed when co-crystallized in the presence of the peptide NILPQQGQPVIR. The peptide was included as there was evidence that it may bind to the oxidase; however, there was no evidence of any density for the peptide. We have attempted to achieve a complex structure with substrate bound at the catalytic site but have so far been unsuccessful.
4. Discussion
We have reported the first structure of an oxidase protein from a cyanobactin pathway. The structure was determined by native S-SAD phasing on the new I23 beamline at Diamond Light Source. A wavelength of 3.10 Å was used to measure a sufficiently large anomalous signal to experimentally phase the structure of ThcOx from a crystal which diffracted to 3.15 Å resolution with useful anomalous data extending to only 4.2 Å resolution. These resolution limits would traditionally be unfavourable for phase determination by native S-SAD, particularly given the large size of the
(a dimer of 954 residues in total) with relatively few S atoms. Furthermore, the inability to extend the phases to a higher resolution data set, caused by non-isomorphism, required that the initial phases at 3.15 Å resolution had to be of sufficient quality to allow model building. Thus, ThcOx represented a particularly challenging test for the I23 beamline and the successful validates its performance.This
demonstrates the great potential of the novel in-vacuum long-wavelength MX beamline I23. The combination of the in-vacuum sample environment with the large semi-cylindrical PILATUS 12M pixel hybrid detector allows very accurate measurements with minimal background, giving very high signal-to-noise ratios in a wavelength range typically not accessible at standard MX beamlines. This new beamline should significantly extend the range of protein crystals that are amenable to native S-SAD structure determination.Footnotes
‡Joint first authors.
Acknowledgements
We thank Diamond Light Source for providing access to beamline I23. We also thank Thomas Sorensen and his staff for access to and support on beamline I02. This research was supported by grants from the UK Biotechnology and Biological Research Council (No. BB/K015508/1; JHN and MJ) and the European Research Council (No. 339367; JHN and MJ). Mass-spectrometric analysis was carried out by the Biomedical Sciences Research Complex
and Proteomics Facility, University of St Andrews and was funded by the Wellcome Trust (grant Nos. 094476/Z/10/Z and WT079272AIA). JHN is a Royal Society Wolfson Merit Award Holder and 1000 Talent scholar at Sichuan University.References
Agarwal, V., Pierce, E., McIntosh, J., Schmidt, E. W. & Nair, S. K. (2012). Chem. Biol. 19, 1411–1422. Web of Science CrossRef CAS PubMed Google Scholar
Arnison, P. A. et al. (2013). Nat. Prod. Rep. 30, 108–160. CrossRef CAS PubMed Google Scholar
Bent, A. F., Koehnke, J., Houssen, W. E., Smith, M. C. M., Jaspars, M. & Naismith, J. H. (2013). Acta Cryst. F69, 618–623. Web of Science CrossRef IUCr Journals Google Scholar
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12–21. Web of Science CrossRef CAS IUCr Journals Google Scholar
El Omari, K., Iourin, O., Kadlec, J., Fearn, R., Hall, D. R., Harlos, K., Grimes, J. M. & Stuart, D. I. (2014). Acta Cryst. D70, 2197–2203. Web of Science CrossRef IUCr Journals Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Evans, P. (2006). Acta Cryst. D62, 72–82. Web of Science CrossRef CAS IUCr Journals Google Scholar
Houssen, W. E., Koehnke, J., Zollman, D., Vendome, J., Raab, A., Smith, M. C. M., Naismith, J. H. & Jaspars, M. (2012). Chembiochem, 13, 2683–2689. Web of Science CrossRef CAS PubMed Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kelley, L. A. & Sternberg, M. J. (2009). Nature Protoc. 4, 363–371. Web of Science CrossRef CAS Google Scholar
Koehnke, J., Bent, A., Houssen, W. E., Zollman, D., Morawitz, F., Shirran, S., Vendome, J., Nneoyiegbe, A. F., Trembleau, L., Botting, C. H., Smith, M. C. M., Jaspars, M. & Naismith, J. H. (2012). Nature Struct. Mol. Biol. 19, 767–772. Web of Science CrossRef CAS Google Scholar
Koehnke, J., Bent, A. F., Zollman, D., Smith, K., Houssen, W. E., Zhu, X., Mann, G., Lebl, T., Scharff, R., Shirran, S., Botting, C. H., Jaspars, M., Schwarz-Linek, U. & Naismith, J. H. (2013). Angew. Chem. Int. Ed. 52, 13991–13996. CrossRef CAS Google Scholar
Koehnke, J., Mann, G., Bent, A. F., Ludewig, H., Shirran, S., Botting, C., Lebl, T., Houssen, W. E., Jaspars, M. & Naismith, J. H. (2015). Nature Chem. Biol. 11, 558–563. CrossRef CAS Google Scholar
Krissinel, E. (2010). J. Comput. Chem. 31, 133–143. Web of Science CrossRef PubMed CAS Google Scholar
Krissinel, E. & Henrick, K. (2007). J. Mol. Biol. 372, 774–797. Web of Science CrossRef PubMed CAS Google Scholar
Liu, Q., Dahmane, T., Zhang, Z., Assur, Z., Brasch, J., Shapiro, L., Mancia, F. & Hendrickson, W. A. (2012). Science, 336, 1033–1037. Web of Science CrossRef CAS PubMed Google Scholar
Mann, G., Koehnke, J., Bent, A. F., Graham, R., Houssen, W., Jaspars, M., Schwarz-Linek, U. & Naismith, J. H. (2014). Acta Cryst. F70, 1597–1603. CrossRef IUCr Journals Google Scholar
Martins, J. & Vasconcelos, V. (2015). Mar. Drugs, 13, 6910–6946. CrossRef CAS PubMed Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
Melby, J. O., Li, X. & Mitchell, D. A. (2014). Biochemistry, 53, 413–422. CrossRef CAS PubMed Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Painter, J. & Merritt, E. A. (2006). J. Appl. Cryst. 39, 109–111. Web of Science CrossRef CAS IUCr Journals Google Scholar
Rose, J. P., Wang, B.-C. & Weiss, M. S. (2015). IUCrJ, 2, 431–440. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Schmidt, E. W., Nelson, J. T., Rasko, D. A., Sudek, S., Eisen, J. A., Haygood, M. G. & Ravel, J. (2005). Proc. Natl Acad. Sci. USA, 102, 7315–7320. Web of Science CrossRef PubMed CAS Google Scholar
Sheldrick, G. M. (2010). Acta Cryst. D66, 479–485. Web of Science CrossRef CAS IUCr Journals Google Scholar
Sivonen, K., Leikoski, N., Fewer, D. P. & Jokela, J. (2010). Appl. Microbiol. Biotechnol. 86, 1213–1225. Web of Science CrossRef CAS PubMed Google Scholar
Terwilliger, T. C., Adams, P. D., Read, R. J., McCoy, A. J., Moriarty, N. W., Grosse-Kunstleve, R. W., Afonine, P. V., Zwart, P. H. & Hung, L.-W. (2009). Acta Cryst. D65, 582–601. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wagner, A., Duman, R., Henderson, K. & Mykhaylyk, V. (2016). Acta Cryst. D72, 430–439. Web of Science CrossRef IUCr Journals Google Scholar
Weinert, T. et al. (2015). Nature Methods, 12, 131–133. Web of Science CrossRef CAS PubMed Google Scholar
Winn, M. D. et al. (2011). Acta Cryst. D67, 235–242. Web of Science CrossRef CAS IUCr Journals Google Scholar
Winn, M. D., Isupov, M. N. & Murshudov, G. N. (2001). Acta Cryst. D57, 122–133. Web of Science CrossRef CAS IUCr Journals Google Scholar
Winter, G. (2010). J. Appl. Cryst. 43, 186–190. Web of Science CrossRef CAS IUCr Journals Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.