The crystal structure of haemoglobin from Atlantic cod

The isolation of the dominant isoform of haemoglobin from Atlantic cod caught in southwest Norway is reported and its X-ray crystal structure is presented.


Introduction
Increasing the knowledge of how Atlantic cod (Gadus morhua) is affected by chemicals that contaminate the aquatic environment has been a longstanding effort Pampanin, 2017;Pampanin et al., 2014Pampanin et al., , 2016Pampanin & Sydnes, 2013;Sundt et al., 2012). Environmental pollutants such as polycyclic aromatic hydrocarbons (PAHs), resulting for example from oil exploration and production operations or accidental oil spills, have received attention from researchers since the 1970s owing to their carcinogenic potential (United States Environmental Protection Agency, 2009). However, only in recent years, with the development of new and more sensitive analytical methods, has it become possible to obtain a deeper understanding of the adverse effects and the mode of action of these contaminants (Beyer et al., 2011;Enerstvedt, Sydnes, Larssen et al., 2018;Enerstvedt et al., 2017;Karlsen et al., 2011).
In fish, PAHs are metabolized in vivo in order to make them more water-soluble, thus facilitating excretion (Pampanin & Sydnes, 2013). However, this process generates products that can potentially be more toxic than the parent compounds. Some of them can react with molecules such as DNA and proteins and form adducts, producing genotoxic effects or impairment of metabolism . In humans, adducts of haemoglobin have been studied substantially as an important method of evaluating individual health conditions (Stedingk et al., 2012). As an example, PAHs derived from tobacco smoking have been found to make adducts with particular amino acids in haemoglobin (Phillips & Venitt, 2012). It is reasonable to believe that this would also be the case for Atlantic cod, and this hypothesis is strengthened by the fact that PAH adducts with other blood proteins have been identified in this fish species (Enerstvedt et al., 2017;. The potential use of haemoglobin adducts as potential biomarkers of PAH exposure in Atlantic cod is under investigation in our research ISSN 2053-230X group. Therefore, in order to study the structural effects that a PAH adduct would have on haemoglobin, the structure of this protein has to be defined in the presence and the absence of the adduct. Firstly, however, pure haemoglobin needs to be obtained. The full genome of Atlantic cod has been sequenced (Star et al., 2011), and about 70 isoforms of haemoglobin have been proposed by the EST translation of the genome. However, sequence similarity within the and forms of cod haemoglobin is high, simplifying the structure-identification research challenge. Here, isolation of the dominant isoform of haemoglobin from Atlantic cod caught in southwest Norway is reported and its X-ray crystal structure is presented.

Materials and methods
2.1. Macromolecule production 2.1.1. Blood sampling. An individual Atlantic cod caught in Idsefjord, Stavanger, Norway using baited traps was kept in a 1000 l glass-fibre tank with a continuous-flow system (water temperature 8 C, salinity 33%, light/darkness 8/16 h, water flow 7-9 l min À1 ). Haemoglobin was obtained from the fish by extracting blood samples over a period of four weeks (total amount 10 ml). Heparin was used as an anticoagulant and the blood samples were centrifuged for 5 min at 2000g and 4 C. Plasma fractions were removed and the remaining red blood cell (RBC) fractions were kept at À80 C until further analysis.
2.1.2. Purification. A blood sample of about 500 ml from the cod was thawed and lysed by incubating the sample overnight at 4 C in 8 ml 4 mM Tris pH 7.5, 10 mM NaCl, 2 ml DNaseI.
2 ml of the lysed sample was diluted to 10 ml using 50 mM Tris pH 9.5 (buffer A). Samples were subsequently applied onto a 5 ml HiTrap QHP column (IEX) for anion-exchange chromatography. A washing step using 60% 50 mM bis-Tris propane pH 5.5 (buffer B) was included before eluting the protein using a gradient from 60% to 100% buffer B over 60 ml. Fractions containing haemoglobin were subjected to sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) for assessment of purity ( Fig. 1) before being pooled and concentrated to 16 mg ml À1 using 3K Amicon Ultra spin filters.

Crystallization
Initial crystallization conditions were screened with a Phoenix crystallization robot (Art Robbins Instruments) using the sitting-drop vapour-diffusion method. A total of 480 inhouse conditions were screened in MRC plates with 60 ml reservoir solution per well; drop solutions were prepared by mixing 0.25 ml well solution and 0.25 ml protein solution at 16 mg ml À1 . Further optimization in Hampton Research 24-well hanging-drop plates using 500 ml reservoirs, a protein concentration of 8 mg ml À1 and 1 ml + 1 ml drops yielded thin needle-shaped and plate-like crystals from 12% PEG 5K MME, 0.1 M Bicine pH 8.5 (Table 1, Fig. 2).
The template for molecular replacement was generated by homology modelling using the structure of haemoglobin from yellow perch (PDB entry 1xq5; Center for Eukaryotic  Cod haemoglobin crystals. Structural Genomics, unpublished work) and BLAST sequences for the and chains of cod haemoglobin (GenBank codes ABV21458.1 and ACV69847.1, respectively). Homology models of the and chains were generated using the Schrö dinger software suite (https:// www.schrodinger.com/) and were assembled into a tetramer similar to PDB entry 1xq5. Inspection of electron-density maps was carried out in Coot (Emsley et al., 2010), followed by restrained positional refinement in REFMAC . Refinement statistics are listed in Table 3. To confirm the polypeptide sequence, cod haemoglobin was subjected to in-solution digestion and subsequent mass-spectrometric analysis.
The structure of cod haemoglobin has been deposited in the Protein Data Bank as entry 6hit.

Results and discussion
A new purification protocol was established by modifying the method described previously by Delatorre et al. (2000). The sample was loaded onto an anion-exchange chromatography column at high pH and was eluted by a gradual reduction of the pH. Cod haemoglobin at 16 mg ml À1 with a purity sufficient for crystallographic studies was obtained (Fig. 1).
Thin needle-shaped and plate-like crystals grew in bundles from 12% PEG 5K MME, 0.1 M Bicine pH 8.5 (Fig. 2). Optimization of the crystallization conditions, in an attempt to obtain larger single crystals, proved to be difficult. The optimization included a reduction in the protein and/or precipitant concentration, seeding and the use of additives. Single crystals were difficult to isolate and data could often not be processed because of multiple crystals or lattices. The best data were collected to 2.5 Å resolution from a crystal of approximately 0.3 Â 0.05 Â 0.05 mm in size on beamline ID23-1 at the ESRF.
The crystal volume-to-molecular weight ratio suggested the presence of eight haemoglobin molecules in the asymmetric unit. A homology model of tetrameric cod haemoglobin (made for pre-crystallographic studies) was used as a template for structure determination using molecular replacement. Phaser (McCoy et al., 2007) was able to identify two tetramers in the asymmetric unit [ Fig. 3(a)]. The solution was confirmed by running Phaser using PDB entry 1xq5 as the template. Autobuilding was carried out using the 'simulated annealing' option with the tetrameric homology model as the starting model instead of PDB entry 1xq5. The model was chosen to avoid fragmentation of the polypeptide sequences during autobuilding and to maintain the intrinsic order of the and chains, which are structurally very similar. The homologybased sequences were edited to correspond to the sequences obtained by mass spectrometry. Manual refitting of electrondensity maps and subsequent restrained positional refinement, including isotropic B-factor refinement, resulted in a final structure with R work = 23.23% and R free = 30.13%.
The cod haemoglobin shares 64% and 74% sequence identity with the yellow perch haemoglobin (PDB entry 1xq5) and chains, respectively, and the yellow perch tetramer superimposes on that of cod haemoglobin with r.m.s.d.s of 1.02 and 0.96 Å for the ABCD and EFGH tetramers, respectively [ Fig. 3(b)]. The two cod haemoglobin tetramers superimpose on each other with an r.m.s.d. of 0.51 Å . The r.m.s.d.s between individual monomers are of the order of 0.38-0.78 and 0.46-1.12 for the and chains, respectively. Differences in conformation are primarily located in the loop between residues 45-60 and the C-terminal loops. Differences in conformation are observed in the same regions between the cod and perch haemoglobins.   Table 2 Data-collection and processing statistics.
Values in parentheses are for the outer shell. The electron density is generally well defined for internal residues, but can be weaker for surface-exposed side chains. The quality of the electron density differs between polypeptide chains that are identical (i.e. chains A, C, E and G and chains B, D, F and H). In places where the electron density is weak, the conformation of the peptide is based on the conformation of other chains in which the electron density is better defined. This could be the explanation for the relatively large difference between R and R free .
Eight heme groups could be located in the electron density. The residues forming heme-binding sites are reasonably well defined, as are the heme propionate groups. The heme iron appears to be pentacoordinated in all subunits. Weak difference density in the vicinity of the iron in some of the heme groups may suggest a coordinated water or hydroxyl molecule at the sixth coordinating position, as observed in haemoglobin from, for example, perch (PDB entry 3bj3; Aranda et al., 2009). However, additional studies are needed to confirm this. The side chains of the histidines in the and units have conformations similar to that observed for perch haemoglobin, with distances of about 4.0-4.4 and 2.1-2.3 Å between the histidinyl N "2 atom and Fe for -chain residues His60 and His89, respectively, and 4 and 2.1-2.4 Å for -chain residues His64 and His93, respectively. Haemoglobins from cold-adapted species have been found to follow distinct autooxidation patterns (Vitagliano et al., 2004(Vitagliano et al., , 2008Aranda et al., 2009). The small differences in the distances and the amount of difference density observed close to the heme iron may suggest different states of autooxidation (Fig. 4). The electron density suggests that the tetramers may consist of different isoforms of the and subunits. This is not totally unexpected since heterogeneity in fish haemoglobins has been suggested previously (Sick, 1961;Andersen et al., 2009). For example, the electron density of -unit chain A, but not chain C, resembles the sequence (ACJ66341) reported by Andersen et al. (2009). However, refinement using this sequence did not improve the refinement statistics, and the R and R free values were about 5% higher. Similarly, the -unit chain D (and H) appears to have Ala at position 68 instead of Ile or Val which are usually found at this position. The cod haemoglobin structure presented in this work appears to have Ala at position 63 and Leu at position 56 in the chains, instead of Lys or Met, which are more common in cod sequences. Work by Andersen et al. (2009) suggests that smaller residues in these positions increase the oxygen affinity.
Blood samples were collected from a single individual fish with the aim of avoiding polymorphism owing to genetic variation. Although the resolution of the data is sufficient to   trace most of the polypeptide chains, it is not sufficient to specifically identify particular amino-acid residues. The difference electron-density maps suggest polymorphism in both the and the chains. However, the majority of the polypeptide chains could be traced with adequate certainty and can thus serve as a starting point for further analysis of adduct formation in Atlantic cod haemoglobin exposed to PAH contamination.