research communications
of an inferred ancestral bacterial pyruvate decarboxylase
aSchool of Biochemistry, University of Bristol, University Walk, Bristol BS8 1TD, England, bDepartment of Biology and Biochemistry, University of Bath, Claverton Down, Bath BA2 7AY, England, and cSchool of Science, Faculty of Science and Engineering, University of Waikato, Private Bag 3105, Hamilton 3240, New Zealand
*Correspondence e-mail: s.j.crennell@bath.ac.uk
Pyruvate decarboxylase (PDC; EC 4.1.1.1) is a key enzyme in homofermentative metabolism where ethanol is the major product. PDCs are thiamine pyrophosphate- and Mg2+ ion-dependent enzymes that catalyse the non-oxidative decarboxylation of pyruvate to acetaldehyde and carbon dioxide. As this enzyme class is rare in bacteria, current knowledge of bacterial PDCs is extremely limited. One approach to further the understanding of bacterial PDCs is to exploit the diversity provided by evolution. Ancestral sequence reconstruction (ASR) is a method of computational molecular evolution to infer extinct ancestral protein sequences, which can then be synthesized and experimentally characterized. Through ASR a novel PDC was generated, designated ANC27, that shares only 78% amino-acid sequence identity with its closest extant homologue (Komagataeibacter medellinensis PDC, GenBank accession No. WP_014105323.1), yet is fully functional. Crystals of this PDC diffracted to 3.5 Å resolution. The data were merged in P3221, with unit-cell parameters a = b = 108.33, c = 322.65 Å, and contained two dimers (two tetramer halves) in the The structure was solved by using PDB entry 2wvg as a model, and the final R values were Rwork = 0.246 (0.3671 in the highest resolution bin) and Rfree = 0.319 (0.4482 in the highest resolution bin). Comparison with extant bacterial PDCs supports the previously observed correlation between decreased tetramer interface area (and number of interactions) and decreased thermostability.
Keywords: ancestral sequence reconstruction; pyruvate decarboxylase; lyases; crystal structure; TPP-dependent enzymes.
PDB reference: inferred ancestral pyruvate decarboxylase, 5npu
1. Introduction
Pyruvate decarboxylase (PDC; EC 4.1.1.1) is a thiamine pyrophosphate- and Mg2+ ion-dependent enzyme that catalyses the non-oxidative decarboxylation of pyruvate to acetaldehyde and carbon dioxide. Bacterial PDCs are key enzymes in homofermentative metabolism where ethanol is the main product, and as such are of great interest for biotechnological applications, including second-generation bioethanol production for use as a renewable transport fuel. In recent years there has been growing interest in utilizing bacterial PDCs for high-temperature ethanol-production processes in thermophilic bacteria.
However, our current knowledge of bacterial PDCs is extremely limited, as so far only six bacterial PDCs have been described. These are PDCs from Acetobacter pasteurianus (ApPDC; PDB entry 2vbi; D. Gocke, C. L. Berthold, G. Schneider & M. Pohl, unpublished work), Gluconoacetobacter diazotrophicus (GdPDC; PDB entry 4cok; van Zyl, Schubert et al., 2014), Gluconobacter oxydans (GoPDC; van Zyl, Taylor et al., 2014) and Zymobacter palmae (ZpPDC; PDB entry 5euj; Buddrus et al., 2016), the extensively studied Zymomonas mobilis PDC (ZmPDC; PDB entries 1zpd, 2wva, 2wvg, 2wvh, 3oei and 4zp1; Dobritzsch et al., 1998; Meyer et al., 2010; Pei et al., 2010; Wechsler et al., 2015) and a PDC from Sarcina ventriculi (SvPDC; Lowe & Zeikus, 1992), which is the only Gram-positive species known to possess a PDC.
One approach to open up new opportunities for bacterial PDCs adapted to a variety of environments is to exploit the diversity provided by evolution. Ancestral sequence reconstruction (ASR), first envisaged by Pauling & Zuckerkandl (1963), is a method of computational molecular evolution to infer extinct ancestral protein sequences, which can then be synthesized and experimentally characterized (for a comprehensive review of state-of-the-art ASR methods, see Merkl & Sterner, 2016). ASR explores a sequence space that has already been screened over evolutionary time spans, thus reducing the nonfunctional space that would otherwise be included in protein libraries generated by random mutagenesis, for example. Thus, ASR has an advantage over random or purely computational approaches in that it limits the `design space' to proteins that are properly folded and have a demonstrable activity (Cole & Gaucher, 2011; Hobbs et al., 2012).
Many enzymes reconstructed using ASR show higher thermostability and kinetic activity when compared with modern enzymes (Akanuma et al., 2013; Groussin et al., 2015; Hobbs et al., 2012, 2015; Perez-Jimenez et al., 2011; Risso et al., 2013). Thus, ASR is a promising protein tool for the purpose of generating enzymes and proteins with favourable thermal properties.
In this study, we employed ASR to infer an ancestral bacterial PDC designated ANC27. Here, we present the
of the inferred ancestral PDC ANC27 at 3.5 Å resolution, show that the enzyme is fully functional and compare it with known extant bacterial PDCs.2. Materials and methods
2.1. Ancestral sequence reconstruction
2.1.1. PDC sequences
The ASR input was 25 amino-acid sequences of extant bacterial PDCs retrieved from GenBank or identified through a BLAST search based on >50% amino-acid sequence identity over >90% coverage to the reference ZpPDC (PDB entry 5euj). Full details of the strains used and the protein/gene accession numbers can be found in Supplementary Table S1.
2.1.2. Phylogenetic analysis and node-age estimates
Following the approach used by Hobbs et al. (2012), the PDC amino-acid sequences were aligned using Geneious (v.R7 7.1.7; Kearse et al., 2012) with the MUSCLE alignment option (eight iterations) and then refined using Gblocks (Castresana, 2000). ProtTest (Abascal et al., 2005) was used to determine the most appropriate model of amino-acid evolution (WAG + F+). This model and the Gblocks alignment were used in PhyML (v.3.0; Guindon et al., 2010) to build a phylogenetic guide tree based on (ML) phylogenies. This guide tree, together with the MUSCLE alignment, was then implemented in PRANK (Löytynoja & Goldman, 2010) to generate an amino-acid alignment based on phylogenetic information. The resulting alignment was analysed using GARLI 2.0 (Zwickl, 2010) while implementing the WAG model of evolution to find the best ML tree based on log-likelihood scores (see Supplementary Fig. S1). GARLI was also used to bootstrap the tree using 1024 replicates. This tree was rooted to the outgroup (PDC sequences with <60% amino-acid identity to ZpPDC, in this case SvPDC with 31% and Ktedonobacter racemifer PDC with 52%) in Geneious.
The best ML GARLI tree was aged using r8s (v.1.8; Sanderson, 2003; available at https://loco.biosci.arizona.edu/r8s/). The point of divergence of proteobacteria and firmicutes (3.19 billion years ago) was used as the calibration point, based on a robust prokaryotic phylogeny study using 32 protein sequences and molecular-divergence times estimated from geological calibration points (Battistuzzi et al., 2004). The resulting tree was visualized in FigTree (v.1.4.2; available at https://tree.bio.ed.ac.uk/software/figtree/). See Supplementary Fig. S2 for the chronogram.
2.1.3. Ancestral reconstruction
Three different methods of ancestral inference (amino-acid, nucleotide and codon inference) were performed using the PAML software (v.4.3; Yang, 2007) under the ML criterion. Pdc sequences were collated in Geneious and manually curated to include gaps as determined by the PRANK amino-acid sequence alignment. Nucleotide-sequence inference in BASEML used the REV/GTR nucleotide-substitution rate model (with rate variation +I and +G), which was determined to be the most appropriate model of evolution in JModelTest (v.2.1.5; Posada, 2008). GTR was also employed for codon inference in CODEML, while for amino-acid inference the WAG model was used. Ancestral gaps were inferred in PRANK using the PRANK amino-acid alignment and the best ML GARLI tree. Taken together, the inferred sequences for nodes of interest were compiled by generating a consensus sequence from all methods of inference. Any ambiguities in the resulting consensus amino-acid sequence were resolved taking the following into consideration: (i) physiochemical properties, (ii) the structural environment, (iii) the most common residues present in extant sequences and (iv) the residue as predicted by the codon-inference method (considered to be the most robust method; Hobbs et al., 2012).
2.2. Macromolecule production
The amino-acid sequence generated through ASR was backtranslated and synthesized by Eurofins, including XbaI and XhoI restriction sites for cloning into pET-28a(+) (Novagen). The resulting construct added a thrombin cleavage site, a 3× glycine linker and a hexahistidine tag to the C-terminus of ANC27, and was transformed into Escherichia coli BL21 (DE3) cells for expression.
For overexpression, the E. coli cells were grown in autoinducing Overnight Express TB (Novagen) medium supplemented with 100 µg ml−1 kanamycin and 5 mM thiamine chloride at 303 K for 16 h with shaking at 220 rev min−1.
The cells were harvested by centrifugation (4000g, 277 K) and resuspended in His-bind buffer (20 mM Tris pH 8, 300 mM NaCl, 20 mM imidazole). The cells were lysed by sonication on ice and the insoluble debris was removed by centrifugation (17 000g, 277 K, 30 min). The supernatant was loaded onto a 5 ml HisTrap HP (GE Healthcare) column for nickel-affinity The column was washed with ten column volumes of His-bind buffer before eluting the protein with increasing concentrations of His-elute buffer (20 mM Tris pH 8, 300 mM NaCl, 1 M imidazole) on an ÄKTAexplorer FPLC system (GE Healthcare), monitoring the eluted protein at 280 nm.
The eluted protein was buffer-exchanged into 50 mM MES [2-(N-morpholino)ethanesulfonic acid] pH 6.5, 20 mM MgSO4, 3 mM TPP using a Superdex 200 10/300 GL gel-filtration column (GE Healthcare) on an ÄKTAexplorer FPLC system and purified to >95% as determined by 12% SDS–PAGE analysis.
The enzyme-activity assays were performed as described by Raj et al. (2002), with the reaction mixture consisting of 0.15 mM NADH, 20 mM MgSO4, 3 mM thiamine pyrophosphate (TPP), 29 mM pyruvate and 10 U Saccharomyces cerevisiae alcohol dehydrogenase (Sigma–Aldrich) in 50 mM MES pH 6.5 at 303 K. The reduction in NADH concentration was followed spectrophotometrically at 340 nm. Kinetic properties were analysed using the nonlinear-fit model from the enzyme-kinetics module in SigmaPlot (Systat Software). The temperature optimum was determined as described by Gocke et al. (2009) by following the depletion of pyruvate at 320 nm. The temperature was determined using SYPRO Orange dye in thermal shift assays as described in Huynh & Partch (2015).
Information relating to the production of ANC27 PDC is summarized in Table 1.
|
2.3. Crystallization
Crystals of the purified ANC27 were obtained using the hanging-drop vapour-diffusion method at 291 K. The same approach was used for the crystallization of ANC27 as had been used for earlier PDC structures (Buddrus et al., 2016). A screen assessing 4 × 96 conditions was carried out in the presence of ligand, and those conditions that resulted in crystal growth were followed up with multiple fine screens to optimize conditions for crystal growth. These efforts did not result in crystals of better diffraction quality. Multiple crystals were screened for diffraction at the home source, and the crystals with the `best' diffraction were further screened at the Australian Synchrotron (AS) for data collection. Following these assessments, further optimization of conditions was carried out; however, the crystal quality did not improve. The data were collected from crystals grown from purified ANC27 that had been incubated with 2 mM pyruvate for 30 min at room temperature (293 K). 2 µl drops of the enzyme/pyruvate solution mixed in a 1:1 ratio with crystallization solution [0.15 M sodium citrate pH 5.5, 14%(w/v) PEG 3350] were placed onto cover slips and equilibrated against 400 µl reservoir solution. Crystals were looped out and soaked in cryoprotectant [10%(w/v) glycerol added to the crystallization buffer] before flash-cooling and storage in liquid nitrogen. Crystallization information is summarized in Table 2.
|
2.4. Data collection and processing
X-ray diffraction data were collected to 3.5 Å resolution on the MX2 beamline at the Australian Synchrotron, Melbourne, Australia. iMosflm (Battye et al., 2011), AIMLESS (Evans & Murshudov, 2013) and BALBES (Long et al., 2008) were used for data reduction, data scaling and phasing, respectively. The crystal belonged to P3221, with unit-cell parameters a = b = 108.333, c = 322.65 Å, and contained two dimers (tetramer halves) in the Data-collection and processing statistics are summarized in Table 3.
|
2.5. Structure solution and refinement
The structure was solved by BALBES using ZmPDC (PDB entry 2wvg; 71% amino-acid identity) as the starting model. The structure was refined by iterative cycles of manual building and modelling in Coot (Emsley et al., 2010) and in REFMAC5 (CCP4 suite; Winn et al., 2011; Potterton et al., 2018; Murshudov et al., 2011) and PHENIX (Adams et al., 2010). in REFMAC5 was restricted with (NCS) local restraints and TLS (translation, liberation, screw rotation) restraints, and included jelly-body and isotropic temperature factors under automatic weighting. The NCS restraints and TLS parameters were also applied in PHENIX. The quality of the final model was checked using MolProbity (Chen et al., 2010). The structure has been submitted to the Protein Data Bank and assigned PDB entry 5npu. Table 4 summarizes the structure-solution and statistics.
in
|
3. Results and discussion
3.1. ASR produced a fully functional bacterial PDC
ANC27 (with an inferred age of 1248 million years; Supplementary Fig. S2) was reconstructed using ASR. The three methods of inference (amino-acid, nucleotide and codon inference) resulted in <3% ambiguous sites, which were resolved as described in §2.1.3. The average posterior probability at each site was 0.6946 (confidence scores for amino-acid, codon and nucleotide inference were 0.89443, 0.43704 and 0.77432, respectively). The final amino-acid sequence showed an amino-acid identity of <79% compared with all extant sequences used in the inference (Supplementary Table S2).
Investigation of the kinetic properties of ANC27 showed a Vmax of 536 ± 13 U mg−1 and a Km for pyruvate of 3.6 ± 0.3 mM. The Km is comparable to data from previously characterized extant bacterial PDCs (Buddrus et al., 2016). However, the Vmax is higher than for any reported bacterial PDC. The highest previously reported Vmax was 181 U mg−1, determined for Z. mobilis PDC (at pH 6 and 30°C; Bringer-Meyer et al., 1986). Furthermore, the kcat of ANC27 is the highest reported for bacterial PDCs at 580 s−1. kcat/Km is very similar to those for GoPDC and GdPDC (Table 5).
‡van Zyl, Schubert et al. (2014). §Gocke et al. (2009). ¶Buddrus et al. (2016). ††van Zyl, Taylor et al. (2014). ‡‡Lowe & Zeikus (1992). §§Siegert et al. (2005). |
Comparing the thermal properties of ANC27 with those of extant PDCs (summarized in Table 6), ANC27 is one of the least thermostable and thermoactive bacterial PDCs currently known, only surpassing SvPDC, with which it shares only 33% amino-acid sequence identity.
‡van Zyl, Schubert et al. (2014). §van Zyl, Taylor et al. (2014). ¶Gocke et al. (2009). ††Buddrus et al. (2016). ‡‡Unpublished results. |
3.2. Crystal structure
The 2wvg; 71% amino-acid identity) as the starting model, with the correct solution confirmed by the presence of electron density for TPP, which was omitted from the search model. As described for extant bacterial PDCs, the of this inferred ancestral PDC is a homotetramer or dimer of dimers (Fig. 1). The domains can be assigned as follows: amino acids 1–191, PYR (pyrimidine binding, with 177–191 linker); 192–361, R (regulatory, with 341–361 linker); 362–563, PP (pyrophosphate binding) (Fig. 2). A dimer forms when TPP binds across the two subunits, with the pyrophosphate group binding to the PP domain from one subunit and the pyrimidine ring binding to the PYR domain from the second subunit, thus forming two active sites in the dimer. The final model comprises four chains of 563 amino acids with a molecular mass of 60.6 kDa, each with a TPP and an Mg2+ ion, together with one water in the active site of chain A and one PEG molecule between chains A and C. The final model contains four monomers in the each comprising part of a dimer formed with a symmetry-related monomer, with the dimers coming together in a less tight interaction to form dimer-of-dimers tetramers. A full tetramer would have a surface area of 65 130 Å2.
of ANC27 was determined by using ZmPDC (PDB entrySuperposition of the four monomers yielded an average root-mean-square deviation (r.m.s.d.) of 0.213 Å, indicating that the structures are very similar.
Overall, the electron-density map was of good quality, apart from the N-terminal methionine and the loop 340–360, which is the exposed linker between the R and PP domains. These regions seem to be very flexible and have not been included in the final model (more specifically chain A residues 344–358, chain B residues 343–359, chain C residues 343–357, chain D residues 340–358 and the N-terminal methionine in all chains). Compared with other crystal structures of bacterial PDCs ANC27 has much higher average B factors (Table 7). However, in all the PDCs the exposed linker between the R and PP domains is more mobile than the molecule as a whole.
|
The final model was refined to an R factor of 24.6% and an Rfree value of 31.86% using data between 81.1 and 3.5 Å resolution. 92.75% of the residues are located in favoured regions of the Ramachandran plot and a further 6.88% are in allowed regions.
3.3. Structural comparison
It has been noted previously that a trend for increasing tetramer interface areas and increasing numbers of interactions at interfaces (i.e. forming hydrogen bonds or salt bridges) can be observed as the thermoactivity and thermostability of the PDCs increase (Buddrus et al., 2016). The interactions between the monomers within a dimer, however, are constant across the thermostability range (45–65°C). This observation is supported by the data obtained from ANC27. Table 8 summarizes the interface areas and Table 9 summarizes the numbers of interactions made between different interfaces.
|
‡Neighbour, interactions between neighbouring monomers; diagonal, interactions between monomers diagonally across tetrameric centre. |
Using PROMALS3D (Pei et al., 2010) to generate a structure-based alignment, conserved regions were analysed. Domains are well conserved, but linker regions, especially the linker region between the R and PP domains, are less so. The R–PP linker appears to be extended in ANC27 (Supplementary Fig. S3). However, the diversity displayed by the extant PDCs in the length, sequence and structural positioning of this linker region suggests that it may play a subtle role in the thermostability and thermoactivity differences observed between these PDCs, as previously mentioned by van Zyl, Schubert et al. (2014). GdPDC is one of the least thermostable PDCs, but has one of the shortest linker regions. In ZmPDC this linker is four amino acids longer than the linker in the most thermostable PDC, ZpPDC. The ZpPDC linker is five amino acids shorter than the linker in ANC27.
3.4. The active site and TPP binding
Four monomers form a tetramer with pseudo-222 symmetry and create four active sites. There is good-quality electron-density evidence for a TPP molecule and an Mg2+ ion in each of the four chains in the structure. Each TPP binds as indicated by the domain nomenclature, with the pyrophosphate group binding to the PP domain of one monomer and the pyrimidine ring binding to the PYR domain of a second monomer in a dimer, thus creating the active site in a narrow cleft at the interfaces between the two monomers (Fig. 2).
As noted in PDCs from extant bacteria, TPP is held in a V shape by Ile416 from the first monomer (Fig. 3 and Supplementary Fig. S4). The O atoms of the pyrophosphate form hydrogen bonds to the backbone amide groups of Ile473 and Thr472, while the N1 atom on the pyrimidine ring forms a hydrogen bond to Glu50 (from the second monomer). These interactions are essential for the catalytic mechanism (Pei et al., 2010). Tyr471 and Glu474 from one monomer are involved in the binding of the pyrophosphate, while Thr72 from the second monomer is involved in TPP binding around the pyrimidine ring.
Through interactions with Asp27 and Glu474, a water molecule supports the active-site arrangement. This has also been observed in extant PDCs, even in the absence of substrate, and is thought to play a pivotal role in the organization of the substrate complex and of the hydrogen-bond network (Pei et al., 2010).
The Mg2+ ion anchors the pyrophosphate group of the TPP and forms interactions with the two O atoms on the diphosphate group of the TPP, the side-chain O atoms of Asp441 and Asn467 and the main-chain O atom of Gly470. In other PDC structures a water molecule completes the coordination sphere, but in this low-resolution structure this cannot be defined. In all other respects the conformation of TPP and the Mg2+ ion are the same in this and the other PDC structures (Supplementary Fig. S4).
3.5. Ancestral reconstruction
This ASR study used an unconventional approach in only using mesophilic species as the inference input to reconstruct a bacterial enzyme, as bacterial PDCs have so far only been found in mesophilic species. Other ASR studies described in the literature use a broader range of species with a wider growth-temperature range, including those by Hobbs et al. (2012) and Groussin et al. (2015).
It is possible that the mesophilic nature of ANC27 is caused by it simply being too young. On close examination of the study conducted by Gaucher et al. (2008) and their description of the progressive cooling of the environmental conditions correlating with a progressive decrease in the denaturing temperature of the bacterial elongation factor TU between 3.5 and 0.5 billion years ago, the denaturing temperature of 62°C for ANC27 fits well with their data. Their data suggest that in ancestors from 1000 to 2000 million years ago, a denaturing temperature of around 50°C is likely to be observed. However, earlier life (3–4 billion years ago) is likely to have experienced conditions similar to today's hot springs (60–80°C; Gaucher et al., 2008). There are also reports of fluctuating trends in the thermophily of reconstructed enzymes, in which not all ancestors exhibit the exceptional thermal properties observed for other ancestors (Hobbs et al., 2012), and evidence for moderately thermophilic ancestors that show no specific trends with regard to or efficiency (Oulavallickal, 2016). ANC27 maybe more comparable with these enzymes.
In summary, we present the
of an inferred ancestral bacterial PDC, ANC27, and a brief functional and structural comparison with known extant bacterial PDCs. Structural analysis supports the previously observed correlation between decreased oligomeric interface and salt bridges and decreased thermostability. Although the ancestral protein ANC27 did not possess the thermostability that we hoped for, the suggestion that the lack of inter-dimer interactions appears to be the cause of its fragility should guide future ancestral reconstruction protocols to achieve the desired characteristics.Supporting information
PDB reference: inferred ancestral pyruvate decarboxylase, 5npu
DNA sequence and Supplementary Figures and Tables. DOI: https://doi.org/10.1107/S2053230X18002819/or5003sup1.pdf
Acknowledgements
Special thanks to Konny Shim, University of Waikato, New Zealand for computational support.
Funding information
This work is based on a collaboration between the University of Bath, England and the University of Waikato, New Zealand supported by a Microbiology Society Research Visit Grant (RVG14-10). LB thanks the Biotechnology and Biological Sciences Research Council for a CASE studentship with TMO Renewables.
References
Abascal, F., Zardoya, R. & Posada, D. (2005). Bioinformatics, 21, 2104–2105. CrossRef CAS Google Scholar
Adams, P. D. et al. (2010). Acta Cryst. D66, 213–221. Web of Science CrossRef CAS IUCr Journals Google Scholar
Akanuma, S., Nakajima, Y., Yokobori, S.-I., Kimura, M., Nemoto, N., Mase, T., Miyazono, K.-I., Tanokura, M. & Yamagishi, A. (2013). Proc. Natl Acad. Sci. USA, 110, 11067–11072. CrossRef CAS Google Scholar
Battistuzzi, F. U., Feijao, A. & Hedges, S. B. (2004). BMC Evol. Biol. 4, 44. Google Scholar
Battye, T. G. G., Kontogiannis, L., Johnson, O., Powell, H. R. & Leslie, A. G. W. (2011). Acta Cryst. D67, 271–281. Web of Science CrossRef CAS IUCr Journals Google Scholar
Bringer-Meyer, S., Schimz, K.-L. & Sahm, H. (1986). Arch. Microbiol. 146, 105–110. CAS Google Scholar
Buddrus, L., Andrews, E. S. V., Leak, D. J., Danson, M. J., Arcus, V. L. & Crennell, S. J. (2016). Acta Cryst. F72, 700–706. CrossRef IUCr Journals Google Scholar
Castresana, J. (2000). Mol. Biol. Evol. 17, 540–552. CrossRef CAS Google Scholar
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12–21. Web of Science CrossRef CAS IUCr Journals Google Scholar
Cole, M. F. & Gaucher, E. A. (2011). Curr. Opin. Chem. Biol. 15, 399–406. CrossRef CAS Google Scholar
Dobritzsch, D., König, S., Schneider, G. & Lu, G. (1998). J. Biol. Chem. 273, 20196–20204. Web of Science CrossRef CAS PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Evans, P. R. & Murshudov, G. N. (2013). Acta Cryst. D69, 1204–1214. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gaucher, E. A., Govindarajan, S. & Ganesh, O. K. (2008). Nature (London), 451, 704–707. CrossRef CAS Google Scholar
Gocke, D., Graf, T., Brosi, H., Frindi-Wosch, I., Walter, L., Müller, M. & Pohl, M. (2009). J. Mol. Catal. B Enzym. 61, 30–35. Web of Science CrossRef CAS Google Scholar
Groussin, M., Hobbs, J. K., Szöllősi, G. J., Gribaldo, S., Arcus, V. L. & Gouy, M. (2015). Mol. Biol. Evol. 32, 13–22. CrossRef CAS Google Scholar
Guindon, S., Dufayard, J.-F., Lefort, V., Anisimova, M., Hordijk, W. & Gascuel, O. (2010). Syst. Biol. 59, 307–321. Web of Science CrossRef CAS PubMed Google Scholar
Hobbs, J. K., Prentice, E. J., Groussin, M. & Arcus, V. L. (2015). J. Mol. Evol. 81, 110–120. CrossRef CAS Google Scholar
Hobbs, J. K., Shepherd, C., Saul, D. J., Demetras, N. J., Haaning, S., Monk, C. R., Daniel, R. M. & Arcus, V. L. (2012). Mol. Biol. Evol. 29, 825–835. CrossRef CAS Google Scholar
Huynh, K. & Partch, C. L. (2015). Curr. Protoc. Protein Sci. 79, 28.9.1–28.914. CrossRef Google Scholar
Kearse, M., Moir, R., Wilson, A., Stones-Havas, S., Cheung, M., Sturrock, S., Buxton, S., Cooper, A., Markowitz, S., Duran, C., Thierer, T., Ashton, B., Meintjes, P. & Drummond, A. (2012). Bioinformatics, 28, 1647–1649. Web of Science CrossRef PubMed Google Scholar
Krissinel, E. & Henrick, K. (2007). J. Mol. Biol. 372, 774–797. Web of Science CrossRef PubMed CAS Google Scholar
Long, F., Vagin, A. A., Young, P. & Murshudov, G. N. (2008). Acta Cryst. D64, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Lowe, S. E. & Zeikus, J. G. (1992). J. Gen. Microbiol. 138, 803–807. CrossRef PubMed CAS Web of Science Google Scholar
Löytynoja, A. & Goldman, N. (2010). BMC Bioinformatics, 11, 579. Google Scholar
Merkl, R. & Sterner, R. (2016). Biol. Chem. 397, 1–21. CrossRef CAS Google Scholar
Meyer, D., Neumann, P., Parthier, C., Friedemann, R., Nemeria, N., Jordan, F. & Tittmann, K. (2010). Biochemistry, 49, 8197–8212. Web of Science CrossRef CAS PubMed Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Oulavallickal, T. (2016). PhD Thesis. University of Waikato, Hamilton, New Zealand. Google Scholar
Pauling, L. & Zuckerkandl, E. (1963). Acta Chem. Scand. 17, S9–S16. CrossRef CAS Google Scholar
Pei, J., Erixon, K. M., Luisi, B. F. & Leeper, F. J. (2010). Biochemistry, 49, 1727–1736. Web of Science CrossRef CAS PubMed Google Scholar
Perez-Jimenez, R., Inglés-Prieto, A., Zhao, Z.-M., Sanchez-Romero, I., Alegre-Cebollada, J., Kosuri, P., Garcia-Manyes, S., Kappock, T. J., Tanokura, M. & Holmgren, A. (2011). Nature Struct. Mol. Biol. 18, 592–596. CAS Google Scholar
Posada, D. (2008). Mol. Biol. Evol. 25, 1253–1256. CrossRef CAS Google Scholar
Potterton, L. et al. (2018). Acta Cryst. D74, 68–84. CrossRef IUCr Journals Google Scholar
Raj, K. C., Talarico, L. A., Ingram, L. O. & Maupin-Furlow, J. A. (2002). Appl. Environ. Microbiol. 68, 2869–2876. Web of Science CrossRef PubMed CAS Google Scholar
Risso, V. A., Gavira, J. A., Mejia-Carmona, D. F., Gaucher, E. A. & Sanchez-Ruiz, J. M. (2013). J. Am. Chem. Soc. 135, 2899–2902. CrossRef CAS Google Scholar
Sanderson, M. J. (2003). Bioinformatics, 19, 301–302. CrossRef CAS Google Scholar
Siegert, P., McLeish, M. J., Baumann, M., Iding, H., Kneen, M. M., Kenyon, G. L. & Pohl, M. (2005). Protein Eng. Des. Sol. 18, 345–357. CrossRef CAS Google Scholar
Wechsler, C., Meyer, D., Loschonsky, S., Funk, L. M., Neumann, P., Ficner, R., Brodhun, F., Muller, M. & Tittman, K. (2015). Chembiochem, 16, 2580–2584. CrossRef CAS Google Scholar
Winn, M. D. et al. (2011). Acta Cryst. D67, 235–242. Web of Science CrossRef CAS IUCr Journals Google Scholar
Yang, Z. (2007). Mol. Biol. Evol. 24, 1586–1591. CrossRef CAS Google Scholar
Zwickl, D. J. (2010). PhD Thesis. The University of Texas, Austin, Texas, USA. Google Scholar
Zyl, L. J. van, Schubert, W. D., Tuffin, M. I. & Cowan, D. A. (2014). BMC Struct. Biol. 14, 21. Google Scholar
Zyl, L. J. van, Taylor, M. P., Eley, K., Tuffin, M. & Cowan, D. A. (2014). Appl. Microbiol. Biotechnol. 98, 1247–1259. Web of Science PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.