Crystallization and preliminary structural analysis of the giant haemoglobin from Glossoscolex paulistus at 3.2 Å
aCentro de Biotecnologia Molecular Estrutural, Instituto de Fisica de São Carlos, Universidade de São Paulo, São Carlos – SP, CEP 13566-590, Brazil, and bInstituto de Química de São Carlos, Universidade de São Paulo, São Carlos – SP, CEP 13566-590, Brazil
*Correspondence e-mail: firstname.lastname@example.org, email@example.com
Glossoscolex paulistus is a free-living earthworm encountered in south-east Brazil. Its oxygen transport requirements are undertaken by a giant extracellular haemoglobin, or erythrocruorin (HbGp), which has an approximate molecular mass of 3.6 MDa and, by analogy with its homologue from Lumbricus terrestris (HbLt), is believed to be composed of a total of 180 polypeptide chains. In the present work the full 3.6 MDa particle in its cyanomet state was purified and crystallized using sodium citrate or PEG8000 as precipitant. The crystals contain one-quarter of the full particle in the asymmetric unit of the I222 cell and have parameters of a = 270.8 Å, b = 320.3 Å and c = 332.4 Å. Diffraction data were collected to 3.15 Å using synchrotron radiation on beamline X29A at the Brookhaven National Laboratory and represent the highest resolution data described to date for similar erythrocruorins. The structure was solved by molecular replacement using a search model corresponding to one-twelfth of its homologue from HbLt. This revealed that HbGp belongs to the type I class of erythrocruorins and provided an interpretable initial electron density map in which many features including the haem groups and disulfide bonds could be identified.
The annelid erythrocruorins are giant hexagonal bilayer haemoglobins with molecular masses within the megadalton range. They are multimeric assemblies built from the association of both globin and non-globin chains which typically present a highly cooperative behaviour (Weber & Vinogradov, 2001). The crystal structures of two types of such erythrocruorins have thus far been described, those from Lumbricus terrestris (HbLt) (Royer et al., 2006) and Arenicola marina (HbAc) (Royer et al., 2007). In both cases the architecture of the full particle of approximately 3.6 MDa is based on two hexagonal discs in which the most prominent substructure corresponds to one-twelfth of the particle (a protomer) and is composed of a dodecamer of globins together with three non-globin chains called linkers. Both structures possess 622 (or D6) symmetry but present slightly different relative orientations of the two discs. In type I structures, such as that from HbLt, the hexagonal layers are staggered with respect to one another by approximately 16°, while in type II structures, such as that seen in HbAc, the two layers are eclipsed. These arrangements as seen in the crystal structures are consistent with those observed in three-dimensional reconstructions based on cryo-electron microscopy images (Jouan et al., 2001) and would therefore appear to be intrinsic structural differences inherent to the two classes of molecule rather than alternative quarternary states accessible to a single species. Besides earthworms, cryo-electron microscopy has also established that the type I architecture is also observed in both leeches (de Haas, Biosset et al., 1996) and a hydrothermal vent tube worm (de Haas, Zal et al., 1996), whereas the type II class is only well established for polychaetes (de Haas, Taveau et al., 1996).
A full particle of HbLt is composed of 144 globin chains and 36 linkers. Four different types of globin chain (a, b, c and d) unite to form abcd tetramers in which the a subunit is covalently linked to b and c by disulphide bridges. Three such tetramers form the dodecameric head of the one-twelfth protomers which are further stabilized by the presence of one copy of each of the three linker chains L1, L2 and L3. The latter protrude from the head into the centre of the particle giving the protomer a mushroom-like appearance and are primarily responsible for stabilizing the structure of the full particle. It is also worth noting that only quite recently the primary structures of the HbLt linker chains became available and a fourth linker, L4, was also reported (Kao et al., 2006).
From a physico-chemical viewpoint, besides the HbLt, one of the erythrocruorins which has been most extensively studied is HbGp, an oligochaete earthworm readily encountered in the south-east of Brazil and whose molecular mass was originally reported to be of the order of 3.1 MDa (Costa et al., 1988). More recently this value has been questioned after a careful re-examination using analytical ultracentrifugation consistently yielded values between 3.6 and 3.7 MDa (Carvalho et al., 2009). This value is compatible with those described above for HbLt and with the fact that at alkaline pH HbGp dissociates into monomeric and disulphide-linked trimeric globin chains as well as non-globin linkers (Imasato et al., 1995). More specifically, the monomeric HbGp globin chain (d) has been fully sequenced and presents 57% amino acid sequence identity with its homologue HbLt, indicating a common evolutionary origin (Bosch Cabral et al., 2002). All these data suggest that HbGp is expected to present an overall architecture very similar to that seen for HbLt. This is borne out by results from SAXS studies (Gelamo et al., 2004; Krebs et al., 2004) as well as from mass spectrometric measurements (Oliveira et al., 2007; Martin et al., 1996) which have shown a great deal of similarity between the two haemoglobins. Furthermore, the stability of the HbGp particle as a function of pH, temperature and the presence of different detergents, as well as the close interdependence of subunit dissociation and auto-oxidation of the haem groups, have all been either well characterized or work is in progress to achieve this goal, making it, together with HbLt, one of the best studied members of the family described to date (Santiago et al., 2007, 2008; Oliveira et al., 2008). The relevance of these physico-chemical studies is also due to the fact that, since HbGp is an extracellular haemoglobin freely circulating in the worm haemolymph, besides its main oxygenation function it could possibly perform several other unknown biological roles as a carrier of small-molecular-weight biomolecules.
In spite of the significant advances made in understanding the architecture of these giant molecular assemblies, many outstanding questions remain concerning the details of the molecular mechanisms involved in their complex physiology. Some of the current limitations are the consequence of the relatively low resolution of the structures thus far reported for the full particles, 3.5 Å and 6.2 Å for L. terrestris (Royer et al., 2006) and A. marina (Royer et al., 2007), respectively. However, these limitations have been partially overcome by higher resolution studies of the (abcd)3 dodecamer in isolation which is believed to form the allosteric core of the molecule (Strand et al., 2004). Nevertheless, the interest in obtaining closer to atomic resolution structures for the full particle in different ligand-bound states would appear to strongly justify continued effort in the search for high-quality crystals of erythrocruorins from other species. The many previously described studies on HbGp, together with the research currently under way regarding its oligomeric stability, make it an attractive system on which to work and in this paper we describe progress made towards its structure determination.
The whole HbGp complex was purified directly from adult earthworms as previously described (Agustinho et al., 1996, 1998). Preparation of cyanomet-HbGp was made by addition of five-fold excess, relative to haem, of potassium ferrocyanide and potassium cyanide. After incubation for 1 h, a dialysis against the original buffer was performed to eliminate excess oxidation reagents (Agustinho et al., 1996, 1998; Carvalho et al., 2009). Initial crystallization conditions were screened using the sitting-drop vapour-diffusion method employing the Nextal Classic and PEG suites. Crystals appeared in droplets composed of 0.5 µl of an 18 mg ml−1 solution of cyanomet-HbGp together with an equal volume of the reservoir solution (Classic Suit Screen conditions Nos. 44 and 63) equilibrated against 100 µl of the latter. After optimization, hanging drops were mounted under the following two crystallization conditions: (i) 1.2 M sodium citrate, 2.5 mM CaCl2, 50 mM Tris-HCl pH 7.5, and (ii) 10% PEG8000, 2.5 mM CaCl2, 50 mM Tris-HCl pH 7.5, where crystals of HbGp grew in 72 h. Crystals obtained under the condition containing sodium citrate also grew in the presence of 5% ethylene glycol or 5% glycerol. The cryo-protection procedure consisted of the addiction of small volumes (0.5 µl) of cryo-protective solution to the droplet containing the crystals up to a final concentration of 10% ethylene glycol. Crystals were picked from the droplets and flash frozen in liquid nitrogen. X-ray images were collected at the same temperature with an ADSC Quantum 315 detector using synchrotron radiation of wavelength 1.00 Å at beamline X29A of the NSLS (Brookhaven National Laboratory, USA). The crystal-to-detector distance was set to 350 mm and oscillation images were collected at intervals of 0.5° with an exposure time of 1 s. Diffraction data were processed using iMOSFLM (Leslie, 2006) and SCALA from the CCP4 package (Collaborative Computational Project, Number 4, 1994; Potterton et al., 2004). The phase problem was solved by molecular replacement employing the one-twelfth protomer of the erythrocruorin from L. terrestris as the search model (Protein Data Bank code 2gtl ) using the program Phaser (McCoy et al., 2007). The haem groups were omitted during this procedure.
HbGp was successfully purified to homogeneity and crystals of the cyanomet derivative obtained under two different conditions, both at pH 7.5 and in the presence of calcium, as described above. The condition containing PEG8000 was not pursued further as these crystals proved difficult to reproduce, were very fragile during manipulation and usually grew as clusters which degraded rapidly with time, normally disappearing after a few days [Figs. 1(a) and 1(b)]. No significant diffraction pattern could be collected from these crystals. On the other hand, those obtained from the condition containing sodium citrate were easy to reproduce, stable, well formed and resistant to manipulation (Fig. 1c). They also provided significant initial diffraction patterns which extended to approximately 4.8 Å resolution on a rotating-anode generator (data not shown). However, these crystals were highly sensitive during transfer to the cryo-protectant solution. This problem could be significantly diminished by growing the crystals in the presence of 5% ethylene glycol followed by successively adding small volumes of a more concentrated ethylene glycol solution up to a final concentration of 10%. This procedure minimized the osmotic shock, and typically resulted in several unbroken crystals in the droplet suitable for flash cooling in liquid nitrogen and subsequent diffraction. The same procedure was adopted when using glycerol as cryo-protectant but none of the crystals obtained under these conditions provided useful diffraction data. Larger crystals, of approximately 0.5 mm in length, generally did not withstand the cryogenic cooling and the data described here were therefore collected from smaller or medium-sized crystals (Fig. 1d).
Diffraction data were obtained from the cyanomet derivative of HbGp and diffracted to a minimum d-spacing of 3.2 Å. Crystal parameters and diffraction data statistics are summarized in Table 1 and a typical diffraction image is shown in Fig. 2. The space group was initially determined to be I222 or I212121, with unit-cell parameters of a = 272.68, b = 319.90 and c = 333.18 Å. From the total of 1105247 measured reflections, 237062 independent reflections were obtained with an Rmerge of 11.2% (50.0% in the outermost shell). The data set was 99.8% (99.2%) complete at a final resolution limit of 3.2 Å. The phase problem was readily solved and the space group ambiguity broken by molecular replacement using the crystallographic structure of the homologue from L. terrestris as template (2gtl ). The search model corresponded to a one-twelfth protomer from which the haem groups had been excluded in order to be used subsequently as a criterion for evaluating the molecular replacement solution. The localization of the first protomer yielded an R value of 57.2% and a log likelihood gain of 1446. These values improved to 53.5% and 5509, respectively, after localization of the second protomer and then to 49.6% and 12050 after the third. The initial electron density maps clearly showed evidence of haem groups at the expected positions as well as many side chains including disulphide bonds. Furthermore, densities consistent with possible calcium ions are observed close to the linker chains in positions equivalent to those seen for HbLt. The overall electron density is consistent with a similar subunit composition as that seen in HbLt. Fig. 3 shows some examples of electron density maps from selected regions.
‡R is the conventional crystallographic R-factor, Σ||Fobs| − |Fcalc||/Σ|Fobs|, where Fobs and Fcalc are the observed and calculated structure factors, respectively.
Three mushroom-like protomers comprise the asymmetric unit. This corresponds to one-quarter of the full particle, consisting of 36 globin chains and nine linkers. The full particle lies on a special position at the intersection of the three twofold axes with two of the three protomers of the asymmetric unit belonging to one hexagonal disc and the third to the other. The full particle (Fig. 4) is therefore generated by the application of crystallographic symmetry and clearly shows the vertices of the six protomers in one layer to be staggered with respect to those in the other, identifying it to be of the type I architecture, very similar to that observed for HbLt (Royer et al., 2007).
The content of the asymmetric unit of the crystals described here is considerably smaller than that for HbAc and HbLt which have one and two full particles in the asymmetric unit, respectively. This may represent an advantage in the search for higher resolution data and it is already encouraging that those reported here are already the highest described to date. This has the potential to be further extended if crystal optimization together with the use of appropriate synchrotron sources can be successfully employed. This may open up the route towards higher resolution studies of the full particle bound to different ligands and therefore a better understanding of cooperativity and allosterism. Moreover, studies on the binding of small biomolecules to HbGp would potentially contribute to the deeper understanding of some unknown roles of the linker chains besides their assumed requirement to maintain the oligomeric structure of the complex.
Currently the refinement of the structure is hampered by a lack of complete amino acid sequences for six (or seven) of the expected seven (or eight) chains. We are in the process of addressing this using a combination of mass spectrometry and recombinant DNA technologies.
The authors are indebted to Mr Ézer Biazin, Mr Francisco Adriano de Oliveira Carvalho and Mr Jose Wilson Pires Carvalho for help with the purification and preparation of the protein samples used in the crystallization experiments. We thank the people from the RapiData course (NSLS) led by Bob Sweet for the opportunity to use the X25, X26 and X29c beamlines. We gratefully acknowledge the financial support of CNPq and FAPESP.
Agustinho, S., Tinto, M., Imasato, H., Tominaga, T., Perussi, J. & Tabak, M. (1996). Biochim. Biophys. Acta, 1298, 148–158. CrossRef CAS PubMed Web of Science Google Scholar
Agustinho, S., Tinto, M., Perussi, J., Tabak, M. & Imasato, H. (1998). Comparat. Biochem. Physiol. 118A, 171–181. Google Scholar
Bosch Cabral, C., Imasato, H., Rosa, J., Laure, H., da Silva, C., Tabak, M., Garratt, R. & Greene, L. (2002). Biophys. Chem. 97, 139–157. CrossRef PubMed CAS Google Scholar
Carvalho, F., Santiago, P., Borges, J. & Tabak, M. (2009). Anal. Biochem. 385, 257–263. Web of Science CrossRef PubMed CAS Google Scholar
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760–763. CrossRef IUCr Journals Google Scholar
Costa, M., Bonafé, C., Meirelles, N. & Galembeck, F. (1988). Braz. J. Med. Biol. Res. 21, 115–118. CAS PubMed Web of Science Google Scholar
Gelamo, E. L., Itri, R. & Tabak, M. (2004). J. Biol. Chem. 279, 33298–33305. Web of Science CrossRef PubMed CAS Google Scholar
Haas, F. de, Biosset, N., Taveau, J., Lambert, O., Vinogradov, S. & Lamy, J. (1996). Biophys. J. 70, 1973–1984. PubMed Web of Science Google Scholar
Haas, F. de, Taveau, J., Boisset, N., Lambert, O., Vinogradov, S. & Lamy, J. (1996). J. Mol. Biol. 255, 140–153. CrossRef PubMed Google Scholar
Haas, F. de, Zal, F., Lallier, F., Toulmond, A. & Lamy, J. (1996). Proteins Struct. Funct. Bioinform. 26, 241–256. Google Scholar
Imasato, H., Tinto, M. H., Perussi, J. R. & Tabak, M. (1995). Comparat. Biochem. Physiol. 112, 217–226. CrossRef Web of Science Google Scholar
Jouan, L., Taveau, J., Marco, S., Lallier, F. & Lamy, J. (2001). J. Mol. Biol. 305, 757–771. Web of Science CrossRef PubMed CAS Google Scholar
Kao, W., Qin, J., Fushitani, K., Smith, S. S., Gorr, T. A., Riggs, C. K., Knapp, J. E., Chait, B. T. & Riggs, A. F. (2006). Proteins Struct. Funct. Bioinform. 63, 174–187. Web of Science CrossRef CAS Google Scholar
Krebs, A., Durchschlag, H. & Zipper, P. (2004). Biophys. J. 87, 1173–1185. Web of Science CrossRef PubMed CAS Google Scholar
Leslie, A. G. W. (2006). Acta Cryst. D62, 48–57. Web of Science CrossRef CAS IUCr Journals Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
Martin, P. D., Kuchumov, A. R., Green, B. N., Oliver, R. W. A., Braswell, E. H., Wall, J. S. & Vinogradov, S. N. (1996). J. Mol. Biol. 255, 154–169. CrossRef CAS PubMed Web of Science Google Scholar
Oliveira, M., Moreira, L. & Tabak, M. (2008). Intl. J. Biol. Macromol. 42, 111–119. Web of Science CrossRef CAS Google Scholar
Oliveira, M. S., Moreira, L. M. & Tabak, M. (2007). Intl. J. Biol. Macromol. 40, 429–436. Web of Science CrossRef CAS Google Scholar
Potterton, L., McNicholas, S., Krissinel, E., Gruber, J., Cowtan, K., Emsley, P., Murshudov, G. N., Cohen, S., Perrakis, A. & Noble, M. (2004). Acta Cryst. D60, 2288–2294. Web of Science CrossRef CAS IUCr Journals Google Scholar
Royer, W. J., Omartian, M. & Knapp, J. (2007). J. Mol. Biol. 365, 226–236. Web of Science CrossRef PubMed CAS Google Scholar
Royer, W. J., Sharma, H., Strand, K., Knapp, J. & Bhyravbhatla, B. (2006). Structure, 14, 1167–1177. Web of Science CrossRef PubMed CAS Google Scholar
Santiago, P., Moreira, L., de Almeida, E. & Tabak, M. (2007). Biochim. Biophys. Acta, 1770, 506–517. Web of Science CrossRef PubMed CAS Google Scholar
Santiago, P., Moura, F., Moreira, L., Domingues, M., Santos, N. & Tabak, M. (2008). Biophys. J. 94, 2228–2240. Web of Science CrossRef PubMed CAS Google Scholar
Strand, K., Knapp, J., Bhyravbhatla, B. & Royer, W. J. (2004). J. Mol. Biol. 344, 119–134. Web of Science CrossRef PubMed CAS Google Scholar
Weber, R. & Vinogradov, S. (2001). Physiol. Rev. 81, 569–628. Web of Science PubMed CAS Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.