research papers
Structural basis of regioselective tryptophan dibromination by the single-component flavin-dependent halogenase AetF
aDepartment of Chemistry, Bielefeld University, Universitaetsstrasse 25, 33615 Bielefeld, Germany
*Correspondence e-mail: hartmut.niemann@uni-bielefeld.de
The flavin-dependent halogenase (FDH) AetF successively brominates tryptophan at C5 and C7 to generate 5,7-dibromotryptophan. In contrast to the well studied two-component tryptophan halogenases, AetF is a single-component flavoprotein monooxygenase. Here, crystal structures of AetF alone and in complex with various substrates are presented, representing the first experimental structures of a single-component FDH. Rotational pseudosymmetry and XGXXG and GXGXXA. A large domain tightly binds the cofactor flavin adenine dinucleotide (FAD), while the small domain responsible for binding the nicotinamide adenine dinucleotide (NADP) is unoccupied. About half of the protein forms additional structural elements containing the tryptophan binding site. FAD and tryptophan are about 16 Å apart. A tunnel between them presumably allows diffusion of the active halogenating agent hypohalous acid from FAD to the substrate. Tryptophan and 5-bromotryptophan bind to the same site but with a different binding pose. A flip of the indole moiety identically positions C5 of tryptophan and C7 of 5-bromotryptophan next to the tunnel and to catalytic residues, providing a simple explanation for the regioselectivity of the two successive halogenations. AetF can also bind 7-bromotryptophan in the same orientation as tryptophan. This opens the way for the biocatalytic production of differentially dihalogenated tryptophan derivatives. The structural conservation of a catalytic lysine suggests a way to identify novel single-component FDHs.
complicated the phasing of one structure. AetF is structurally related to flavin-dependent monooxygenases. It contains two dinucleotide-binding domains for binding the ADP moiety with unusual sequences that deviate from the consensus sequences GKeywords: aetokthonotoxin; biocatalysis; bromination; enzymes; Rossmann fold; site selectivity; AetF; substrate binding.
1. Introduction
Flavin-dependent halogenases (FDHs) mediate the regioselective chlorination and bromination of many halogenated aromatic moieties in natural products (Neumann et al., 2008; Weichold et al., 2016; Agarwal et al., 2017; Phintha et al., 2020). Sometimes one aromatic moiety carries more than one halogen. Different routes exist for such multiple site-specific halogenations. In the biosynthesis of kutzneride, two FDHs act successively in a defined order, with 7-chlorotryptophan (7-Cl-Trp), the product of the first halogenase KtzQ, serving as the substrate of the second halogenase KtzR (Heemstra & Walsh, 2008). In other cases a single enzyme can install halogens on up to four aromatic carbons, but the structural basis of these multiple halogenations often remains elusive (Dorrestein et al., 2005; Pang et al., 2015; El Gamal et al., 2016). The mechanism of the FDH MalA is exceptionally well understood. During the biosynthesis of malbrancheamides, MalA regioselectively halogenates two positions of the substrate. There is no defined order and the product of the first halogenation does not need to reorient in the enzyme for the second halogenation to occur (Fraley et al., 2017).
Aetokthonotoxin (AETX) is a pentabrominated natural compound with two indole moieties, one of which carries two bromines and the other three. AETX was recently discovered as a cyanobacterial toxin that kills bald eagles and other animals in the USA (Breinlinger et al., 2021). The gene cluster for AETX biosynthesis encodes the FDHs AetA and AetF. AetF converts tryptophan to 5-bromotryptophan (5-Br-Trp) and 5-Br-Trp to 5,7-dibromotryptophan (5,7-Br2-Trp) (Breinlinger et al., 2021). AetA brominates 5-bromoindole (5-Br-indole) to 3,5-dibromoindole (3,5-Br2-indole), which it then converts to the final product 2,3,5-tribromoindole (2,3,5-Br3-indole) (Adak et al., 2022). AetA requires a separate flavin reductase to regenerate the reduced cofactor FADH2. It thus belongs to the two-component FDHs, which constitute the majority of currently known FDHs, including the well characterized tryptophan halogenases (Dong et al., 2005; Dachwitz et al., 2021). AetF is a single-component FDH that uses NADPH to reduce FAD (Adak et al., 2022). AetF is more closely related to single-component flavin-dependent monooxygenases than to any of the other characterized flavin-dependent tryptophan halogenases (Adak et al., 2022; Paul et al., 2021). The phenol brominase Bmp5 has previously been described as a single-component FDH (Agarwal et al., 2014). However, single-component FDHs remain poorly characterized in terms of substrate binding and regioselectivity. Docking of the substrate into an AlphaFold2 model of AetF allowed the identification of a catalytically essential lysine residue but provided no clear explanation for the regioselective dibromination of tryptophan (Jiang et al., 2022).
Here, we set out to unravel the structural basis of regioselective dibromination by AetF. We determined crystal structures of AetF, including structures with the two natural substrates tryptophan and 5-Br-Trp. These structures facilitate a better understanding of single-component FDHs in general.
2. Methods
2.1. Protein expression
The Escherichia coli codon-optimized C-terminally truncated variant aetF1–655 was synthesized and subcloned into a pET-24(+) vector between the BamHI and XhoI restriction sites by Twist Bioscience. The synthesized insert additionally contained an N-terminal His6 tag and Tobacco etch virus (TEV) protease cleavage site. The plasmid was transformed into E. coli BL21(DE3) cells via heat shock. An overnight preculture (37°C, 120 rev min−1) grown in LB medium (30 µg ml−1 kanamycin) was used to inoculate 2 l TB medium (30 µg ml−1 kanamycin) to an at 600 nm (OD600) of 0.1. The culture was incubated at 37°C and 90 rev min−1 until it reached an OD600 of 0.4, cooled to 18°C and expression was induced at an OD600 of 0.6–0.8 by adding 0.1 mM isopropyl β-D-1-thiogalactopyranoside (IPTG). The induced cultures were incubated for 20 h at 18°C and 90 rev min−1 before harvesting the cells via centrifugation (4500g, 30 min, 4°C). The cell pellets were washed with cooled phosphate-buffered saline (PBS) and frozen at −20°C.
2.2. Protein purification
The pellet from 2 l TB culture was resuspended in ∼70 ml cold lysis buffer (50 mM HEPES pH 7, 500 mM NaCl, 20 mM imidazole pH 7, 10 mM β-mercaptoethanol) and lysed via a SPCH ultrahigh-pressure cell homogenizer (Stansted Fluid Power) in three cycles at 1.17 MPa. Soluble protein was separated from insoluble compounds by centrifugation (30 000g, 30 min, 4°C). The supernatant was incubated with Co2+–NTA Agarose (Cube Biotech) for 60 min at 4°C. After washing with lysis buffer, the protein was eluted (50 mM HEPES pH 7, 500 mM NaCl, 250 mM imidazole pH 7, 10 mM β-mercaptoethanol) in 5 ml fractions and 5 mM EDTA was added immediately afterwards to reduce protein aggregation. His6-TEV-AetF1–655-containing fractions were pooled, concentrated (Pierce, 10 000 Da molecular-weight cutoff) and desalted using PD-10 desalting columns (GE Healthcare) equilibrated with (SEC) buffer (50 mM HEPES pH 7, 100 mM NaCl, 1 mM DTT). In order to remove the His6 tag, TEV protease (1 mg TEV protease per 50 mg protein) was added and incubated overnight at 20°C. In a subsequent Co2+–NTA the cleaved AetF1–655 was obtained in the flowthrough and washing fractions and was concentrated (Pierce, 10 000 Da molecular-weight cutoff). SEC was performed at 4°C with SEC buffer using a 16/60 Superdex 200 column (GE Healthcare) connected to an ÄKTA purifier system. The eluted protein was concentrated to 19.19 mg ml−1 and stored at −80°C. The protein had a bright yellow colour in all purification steps, indicating high FAD occupancy. UV–Vis spectroscopy (NanoDrop 1000, PEQLAB) confirmed an occupancy of close to 100% for FAD.
2.3. Crystallization
AetF1–655 was diluted to 6 mg ml−1 with SEC buffer. Initial crystallization trials comprised eight commercial screens at 4 and 20°C in MRC 2-Lens sitting-drop plates (SWISSCI) with two drop ratios (300 nl protein solution + 300 nl reservoir solution and 600 nl protein solution + 300 nl reservoir solution). Yellow spear tip-shaped crystals appeared within 2–4 days in ∼6% of the tested conditions. The crystals were harvested after 4–5 days. The best diffracting crystals grew at 4°C in condition A6 [30 mM CaCl2, 30 mM MgCl2, 0.1 M HEPES/MOPS pH 7.5, 20%(v/v) ethylene glycol, 10%(w/v) PEG 8000] of the Morpheus screen (Molecular Dimensions) and yielded the non-substrate-bound structure (empty AetF). AetF1–655 (6 mg ml−1) was co-crystallized with substrates under the same conditions and temperature. Instead of mixing protein and substrate beforehand, we dried 300 nl of 40 mM L-Trp (AppliChem GmbH), saturated 5-Br-Trp (Chem-Impex) or saturated 7-bromotryptophan (7-Br-Trp; provided by Nicolai Montua, OC III, Bielefeld University) on the plate prior to setting up the crystallization plate as described above. This resulted in 5-Br-Trp-bound AetF1–655 co-crystals with the same colour and morphology, but did not yield co-crystals with L-Trp or 7-Br-Trp. Therefore, L-Trp and 7-Br-Trp-bound AetF1–655 crystals were obtained by soaking non-substrate-bound crystals with L-Trp [250 nl 37.5 mM L-Trp, 25%(v/v) ethylene glycol] and 7-Br-Trp [150 nl sat. 7-Br-Trp, 25%(v/v)], respectively. All crystals were flash-cooled in liquid nitrogen without additional cryoprotectant.
2.4. Data collection and structure determination
X-ray diffraction data for the non-substrate-bound crystals (empty AetF) were collected at a wavelength of 0.97626 Å on beamline P13 operated by EMBL Hamburg at the PETRA III storage ring, DESY, Hamburg, Germany (Cianci et al., 2017). Measurements of 5-Br-Trp co-crystals and crystals soaked with L-Trp were carried out at a wavelength of 0.9184 Å on beamline ID30B (McCarthy et al., 2018) at the European Synchrotron Radiation Facility (ESRF), Grenoble, France. Data for AetF1–655 soaked with 7-Br-Trp (measured at 0.88560 Å) were collected on beamline ID23-1 (Nurizzo et al., 2006) at the ESRF. All data were collected at 100 K and the beamlines were controlled using MxCuBE (Gabadinho et al., 2010; Mueller et al., 2017). Diffraction data were indexed, integrated and scaled using XDS (Kabsch, 2010), treating Friedel pairs as different reflections for the bromine-containing structures. The structure of empty AetF was solved by (MR) via Phaser (McCoy et al., 2007) using a truncated model generated by a local installation of AlphaFold2 (Jumper et al., 2021). The final version of empty AetF was used to solve the isomorphous 5-Br-Trp-bound structure (AetF–5-Br-Trp) by rigid-body against the respective data in phenix.refine (Afonine et al., 2012; Liebschner et al., 2019). The same model was used to solve the L-Trp-bound and 7-Br-Trp-bound structures (AetF–Trp and AetF–7-Br-Trp) via MR by searching for four molecules per in P212121 for AetF–Trp and two molecules per in P43 for AetF–7-Br-Trp. All models were optimized by several rounds of model building in Coot (Emsley et al., 2010) and using phenix.refine (Afonine et al., 2012; Liebschner et al., 2019). Different strategies were tested and the best Rfree values were obtained by refining atomic coordinates, individual B factors and occupancies using TLS parameters (generated by phenix.refine) and (NCS) restraints and by optimizing both X-ray/stereochemistry and X-ray/ADP weights. H atoms were automatically added for the empty AetF structure and the complex with 5-Br-Trp. For the 5-Br-Trp-bound and 7-Br-Trp-bound structures (AetF–5-Br-Trp and AetF–7-Br-Trp) the Br atoms were refined as anomalous groups. The twin laws −k, −h, −l for the twinned AetF–7-Br-Trp structure (solved in P43) and −h, −l, −k for the twinned AetF–Trp structure (solved in P212121) were determined via twin in REFMAC5 (Kovalevskiy et al., 2018) and were used in all refinements. For all four structures, the resolution limit was determined with paired (Karplus & Diederichs, 2012) as implemented in PAIREF (Malý et al., 2020). Data-collection and are summarized in Tables 1 and 2. All structural illustrations were generated in PyMOL. The coordinates and structure factors of the final models were deposited in the Protein Data Bank with accession codes 8cjd (empty AetF), 8cje (AetF–Trp), 8cjf (AetF–5-Br-Trp) and 8cjg (AetF–7-Br-Trp). Diffraction images are available via the SBGrid Data Bank (Meyer et al., 2016) and additionally via the ESRF Data Portal for data collected at the ESRF. DOIs for the data sets are given in Table 1.
|
|
3. Results and discussion
3.1. Construct design
Initially, we worked with full-length AetF expressed as His6-TEV-AetF with a single alanine inserted between the TEV protease site and AetF, as found in pETM11 vectors. TEV protease digestion of this construct turned out to be impossible, although this short linker between the TEV site and the protein of interest had worked for many other proteins. This construct resulted in irreproducible crystals that diffracted to about 3 Å resolution. Attempts to solve the structure by failed. When an AlphaFold2 model became available, we modified the construct in two ways. Firstly, we noticed that the start methionine hardly protrudes from the well folded core, resulting in of TEV protease, preventing it from binding and cutting at its Insertion of an additional (SG)3 linker solved this problem and resulted in complete TEV protease digestion. Secondly, the AlphaFold2 model suggested that the C-terminus was unfolded, which could impair crystallization. Therefore, we deleted the last 13 amino acids, resulting in the His6-TEV-AetF1–655 construct described in this work. TEV protease-digested and purified AetF1–655 (hereafter referred to as AetF) exhibited exceptionally good crystallization properties, yielding crystals in 99 of the 1536 conditions tested. We collected data from crystals grown in ten different conditions. All tested crystals had the same unit-cell dimensions, suggesting that they have the same packing. We settled on condition A6 of the Morpheus screen (Gorrec, 2009) because these crystals showed the best diffraction. This illustrates how AlphaFold2 models can help to streamline construct design for crystallization.
3.2. from untwinned and twinned crystals in space groups P43 and P212121 both exhibiting pseudo-P43212 symmetry
P41 or P43 without signs of twinning or translational (tNCS). Empty AetF was solved in P43 by with two molecules per The structure of AetF co-crystallized with 5-Br-Trp was phased by the difference Fourier method. The data for AetF soaked with 7-Br-Trp scaled reasonably well in 422. These crystals turned out to be isomorphous to the non-substrate-bound and the 5-Br-Trp crystals, but were twinned with twin operator −k, −h, −l and a twin fraction of 20%. All data sets for AetF soaked with Trp scaled well in 422 and showed a larger with a roughly doubled volume. searching for two molecules per in all possible space groups of 422 gave two solutions in P43212 with TFZ values of 19.2 for the first chain in both solutions and 17.2 and 15.3 for the second chain of the respective solutions. However, there were severe clashes and the electron density was poor. A closer inspection of the data suggested a combination of and tNCS. The length of the tetragonal unit-cell c axis of Trp-bound AetF is roughly the same compared with the other AetF structures, while the a and b axes are roughly 1.4 (the square root of 2) times longer. phenix.xtriage revealed a small tNCS peak at 0.484, 0.489, 0.0 with a height of 12.3% of the origin peak. The cumulative intensity distribution from AIMLESS and the L-test in phenix.xtriage both indicated a high degree of Therefore, we assumed a pseudo-C-centred cell, with the real being P43, and a twofold twin operator around a. searching for four molecules in P43 produced four solutions, which only differed in the placement of the fourth chain. The TFZ values were 44.1, 32.3 and 52.6 for first three chains and 7.4, 7.1, 5.5 and 5.1 for the four different solutions of the fourth chain. The packing produced severe clashes and the electron density was very poor for two of the chains. Subsequent inspection of individual symmetry operators showed that a fourfold axis along c had slightly worse Rmeas and I/σ(I) values than each of the three twofold axes along a, b and c, suggesting that the true could be 222 with a twin operator emulating a fourfold axis, which is visible in the self-rotation functions (Supplementary Fig. S1). searching for four molecules per in all possible space groups of 222 resulted in a solution in P21212 (TFZ = 38.9), but again the electron density was poor. Explicitly searching in P212121 gave a unique solution for four molecules per with TFZ values of 19.5, 39.3, 41.1 and 101.6. without any rebuilding resulted in Rwork = 29.40%, Rfree = 34.64% without twin and Rwork = 19.71%, Rfree = 22.07% with twin suggesting P212121 to be the correct To scrutinize our choice of we solved the structure in P1. Phaser managed to place 15 of the 16 molecules and we manually placed the 16th molecule. Based on the P1 structure, Zanuda (Lebedev & Isupov, 2014) confirmed P212121 to be the most likely The refined twin fraction for twin operator −h, −l, −k is 45%. The packing of AetF is very similar in the small P43 and the larger P212121 unit cells. In P43 there are noncrystallographic twofold and 21 axes, resulting in a pseudo-P43212 packing (Fig. 1a). P212121 is a translationsgleiche of P43212. In the larger P212121 there are noncrystallographic twofold, 21 and 43 axes, again resulting in a pseudo-P43212 packing (Fig. 1b).
was straightforward for non-substrate-bound (empty AetF) and 5-Br-Trp-bound AetF (AetF–5-Br-Trp). These crystals were isomorphous to each other and scaled in space groups3.3. Only about half of AetF forms the two conserved nucleotide-binding domains, while the additional structural elements contain a catalytic lysine
AetF eluted as monomer from gel filtration and it is monomeric in the crystalline state. In all solved structures the complete sequence could be built except for the N-terminal (SG)3 linker and a disordered loop (residues 608 ± 2 to 630 ± 2). AetF retained a tightly bound FAD during purification and FAD is clearly visible in all chains. yielded occupancies of ∼1 for all FADs.
The AlphaFold2 model of AetF employed as a search model for MR represents a good structural model with an overall r.m.s.d. of 1 Å for 632 aligned Cα atoms. A DALI search (Holm, 2022) with our structure (Supplementary Table S1) returned many good hits with Z-scores above 20, including several ancestral flavin-containing monoxygenases (AncFMO2, AncFMO5, AncFMO3 and AncFMO1; Nicoll et al., 2020; Bailleul et al., 2021) and many other flavoprotein monooxygenases (FPMOs) such as pyrrolizidine alkaloid N-oxygenase from Zonocerus variegatus (ZvPNO; Kubitza et al., 2018), phenylacetone monooxygenase (Malito et al., 2004) and cyclohexanone monooxygenase (Mirza et al., 2009). While the latter two are Baeyer–Villiger monooxygenases and belong to FMPO B1, the structurally most similar enzymes and Bmp5, the only other known single-component FDH, belong to FMPO B2 (Paul et al., 2021), suggesting that AetF can also be assigned to B2 and therefore is a flavin-dependent monooxygenase (FMO). Because there is no experimental structure of Bmp5, we overlaid our AetF structure with the AlphaFold2 model of Bmp5 (Varadi et al., 2022). A pairwise alignment with the DALI server gave a Z-score of 27.6, an r.m.s.d. of 3.5 Å for 438 aligned residues and a sequence identity of 16% (Supplementary Table S1).
Group B FPMOs contain two domains with dinucleotide-binding Rossmann folds: a larger FAD-binding domain and a smaller NADP-binding domain (Paul et al., 2021). Both domains are present in AetF. Although we use the term domains here, the overall structure and the distribution of hydrophobic residues suggest that it might be more fitting to describe AetF as single-domain protein with several subdomains. A structural comparison with other FMOs such as Bmp5 and the ancestral AncFMO2 (DALI Z-score 26.5; PDB entry 6sf0) and with ZvPNO (Z-score 18.3; PDB entry 5nmx) is complicated by the fact that the dinucleotide-binding domains are not contiguous in sequence except for the small domain of ZvPNO. The small NADPH-binding domain is inserted into the large FAD-binding domain. In AetF, Bmp5 and the ancestral FMOs, the small domain is also split, containing an insertion that forms additional structural elements together with the C-terminus. In AetF, these extra structures comprise about half of the sequence (Fig. 2). Some structural elements that we assign to the insertion in the small domain [a β-strand (residues 177–184) and a β-strand followed by a short helix (residues 257–293)] are conserved between AetF and Bmp5 and may eventually be assigned to the small domain instead. In the C-terminal extension, the sequence 469-YRLXG-473 is conserved between AetF, Bmp5 and AncFMO2, with the tyrosine and arginine forming extensive conserved interactions with the large domain via hydrogen bonds and salt bridges, supporting our assumption that the whole protein may fold as a single unit.
AetF almost completely envelops the FAD, which explains the tight binding and high occupancy of FAD without any addition of cofactor (Fig. 3). The consensus sequences for dinucleotide binding in Rossmann folds are GXGXXG for FAD and pyridine dinucleotides (as found, for example, in ZvPNO) and sometimes GXGXXA for NADP (as found, for example, in AncFMO2) (Hanukoglu, 2015). In AetF these sequences deviate from the consensus, with 8-GFGFSA-13 for FAD and 154-TMGDSA-159 for NADP (Supplementary Fig. S2). Our AetF structure shows a large empty space where NADP is bound in ZvPNO and AncFMO2. Without the addition of NADP, significant difference density at the expected NADP binding site is visible, especially in the structure of AetF–5-Br-Trp, which could indicate NADP bound with low occupancy (Supplementary Fig. S3). However, this electron density is not sufficient to reliably model an NADP or even only a nicotinamide ribose monophosphate. Co-crystallization with NADP under the same conditions did not result in any crystals and soaking our crystals with NADP did not improve the electron density for the presumed NADP. This could be due to crystal packing, as the adenine of NADP from a structurally aligned AncFMO2 would clash with a symmetry mate of AetF in our AetF crystals. Hence, we assume that the reduction of FAD proceeds analogously to that in other FMOs by binding NADPH in this region. Thereby, the tightly bound FAD can be reduced without having to leave the protein.
All known two-component FDHs have a strictly conserved catalytic lysine in the active site that is most to likely to serve to polarize and activate hypohalous acid (Dong et al., 2005; Flecks et al., 2008; Andorfer et al., 2022). Based on docking tryptophan into the active site next to FAD in an AlphaFold2 model of AetF, Lewis and coworkers found the group of Lys258 to be located next to the substrate. Mutational analysis showed Lys258 to be essential for catalysis (Jiang et al., 2022). The catalytic lysine of AetF is located in the insertion within the NADP-binding domain in the region that is structurally similar in AetF and Bmp5. Lysine at this position is not generally conserved among FMOs. It is not present in the ancestral FMOs and ZvPNO (Nicoll et al., 2020; Bailleul et al., 2021; Kubitza et al., 2018). Interestingly, the phenol halogenase Bmp5 has an equivalent lysine, suggesting that Lys258 of AetF and Lys277 of Bmp5 may fulfil a similar function to that of the catalytic lysine in two-component FDHs.
In an attempt to locate the catalytic halide binding site, we co-crystallized AetF with Br− up to a concentration of 200 mM. Data sets collected at the bromine did not show strong anomalous signal and there was no anomalous difference density for bound bromide. It is conceivable that either the binding of NADP(H) or the reduction of FAD to FADH2 is required to form a functional bromide binding site in AetF. Further experiments will be necessary to settle this issue.
3.4. Tryptophan and 5-Br-Trp bind to the same site with a flipped indole moiety
Soaking AetF crystals with Trp and co-crystallization with 5-Br-Trp provided highly interpretable difference densities by the difference Fourier method or by MR, showing unique binding positions. After incorporation of the respective ligands, the occupancies refined to greater than 90%, except for one Trp with an occupancy of 65% (Fig. 4). For each ligand, there is no evidence of alternative binding poses in addition to the pose we modelled and the binding poses are conserved in all chains of the same structure. In the case of 5-Br-Trp, the position of the Br atom could be unambiguously determined based on the anomalous difference density.
As in two-component tryptophan halogenases (Dong et al., 2005; Moritzer et al., 2019), the indole moiety of Trp is mostly surrounded by hydrophobic and aromatic residues (for example Leu183, Leu196, Met216, Pro219, Phe372 and Phe524; Figs. 5a and 5b). The indole binding site is completed by the side chains of Glu200 and Thr373. The NH group of indole forms a hydrogen bond to the carbonyl of Met216. Overall, the indole binding site appears to be rather spacious. The backbone atoms of Trp are positioned by direct and water-mediated polar contacts with surrounding amino acids. The amino group of Trp forms hydrogen bonds to the side-chain O atom of Gln500 and the carbonyl groups of both Leu199 and Leu215. The Trp carboxylate forms a salt bridge with Lys587 and hydrogen bonds to the side chains of both Gln500 and Ser523. Additionally, a water molecule links the Trp carboxylate to the side chains of Asp516 and Lys587.
5-Br-Trp binds to the same site but with a different binding pose (Figs. 5c and 5d). The indole moiety of 5-Br-Trp lies in the same plane as that of Trp but is flipped around an axis formed by C3 and C6, the positions of which are almost unchanged, while N1, C2, C5 and C7 are rotated by ∼180° around this axis. The indole moiety of 5-Br-Trp is surrounded by the same hydrophobic (Leu183, Leu196, Met216, Pro219, Phe373 and Phe524) and polar (Glu200 and Thr373) residues as that of Trp, but its NH group now forms a hydrogen bond to the carbonyl of Leu196 instead of Met216. The backbone and Cβ atoms of 5-Br-Trp also move so that the polar interactions with the protein have to change. The side chain of Gln500, which binds both the amino and the carboxy group of Trp, rearranges and its amide group flips so that it can again form hydrogen bonds to both the amino and carboxy groups of 5-Br-Trp. The amino group of 5-Br-Trp now forms a hydrogen bond to the side chain of Asp516 instead of the carbonyl of Leu199. In addition, the group of 5-Br-Trp forms a salt bridge with Asp516. The contacts of the carboxylate of the substrate are more conserved, including the hydrogen bond to the side chain of Ser523, the salt bridge to Lys587 and the hydrogen bond to Lys587 via a bridging water previously described for Trp.
A comparison of the substrate-binding sites in the structures without substrate, with Trp and with 5-Br-Trp reveals no major changes except for the movement of Gln500 and Asp516 (Fig. 6). The main factor determining the flipped binding pose of 5-Br-Trp appears to be the snug fit of bromine into a preformed hydrophobic pocket surrounded by Leu183, Met217, Val220, Pro250, Leu254 and Phe372 (Fig. 7). This pocket is empty in structures without substrate and in the complex with Trp.
3.5. The positioning of the halogenated C atom relative to Glu200 and Lys258 might explain the regioselectivity of AetF
The flipped binding pose of 5-Br-Trp leads to an almost congruent position of the C atoms that will be halogenated in the two substrates. C5 of Trp and C7 of 5-Br-Trp are less than 0.5 Å apart when overlaying the two structures This suggests that C5 of Trp and C7 of 5-Br-Trp will be positioned productively relative to the catalytic residues (Fig. 8a). The well studied two-component tryptophan halogenases presumably achieve regioselectivity by positioning the C atom to be halogenated with the shortest distance to the catalytic lysine (Dong et al., 2005; Zhu et al., 2009; Moritzer et al., 2019). This is not the case in AetF. Averaged over all chains of the respective structures, the mean distance between the halogenated C atom and Nɛ of Lys258 is 9.0 Å for C5 of Trp and 8.9 Å for C7 of 5-Br-Trp. In both structures the mean distance between C6 and Nɛ of Lys258 is shorter, at 8.8 and 8.6 Å for Trp and 5-Br-Trp, respectively. However, Phe372 sterically blocks the C6 position, which might prevent halogenation of C6. Attempts to co-crystallize AetF with 6-Br-Trp yielded no crystals, while soaking AetF crystals with 6-Br-Trp led to empty active sites. This supports the assumption that a steric conflict with Phe372 might preclude the formation of 6-Br-Trp. The distance between the halogenated C atom and Nɛ of the catalytic lysine is much longer in AetF (∼9 Å) than in the two-component tryptophan halogenases PrnA, RebH, PyrH and Thal (∼4 Å). However, distances similar to those in AetF are not unprecedented in two-component FDHs, with distances of 6.8 Å in the structure of a D-Trp–Gly dipeptide bound to Thal and of ∼7.5 Å in MalA′ (Schnepel et al., 2023; Fraley et al., 2017).
All two-component tryptophan halogenases contain a catalytic glutamate that is not conserved in FDHs that halogenate phenol or pyrrole (Andorfer & Lewis, 2018). The glutamate may function as a general base to abstract the proton from the by stabilizing the positive charge of the or by polarizing the halogen of hypohalous acid (Dong et al., 2005; Andorfer & Lewis, 2018; Flecks et al., 2008). In AetF, Glu200, which is part of the indole-binding region, might perform an equivalent function. Thus, not only the distance to Nɛ of Lys258 but also that to the carboxylate of Glu200 may determine the site selectivity. It is worth mentioning that for Trp, C5 has the shortest distance of all C atoms to the carboxylate of Glu200 (Fig. 8b). For 5-Br-Trp, this distance is shortest for C7. At ∼3.5 Å, the distance between the halogenated C atom and the carboxylate of the possibly catalytic glutamate is almost identical in AetF and in the two-component FDHs PrnA, RebH, PyrH and Thal, hinting at a similar catalytic mechanism.
Our structures provide a clear-cut and simple explanation for the two successive regioselective brominations of Trp by AetF. The first step in the formation of 5,7-Br2-Trp by AetF is the conversion of Trp to 5-Br-Trp (Breinlinger et al., 2021). We found that Trp and 5-Br-Trp bind to the substrate-binding site in different orientations. The flip of the indole moiety results in either C5 or C7 being positioned most closely to Glu200 and Lys258. Trp binds such that C5 is positioned correctly for halogenation. Addition of bromine to C5 increases the affinity of the flipped binding pose. We do not know whether 5-Br-Trp rearranges while bound to AetF or whether it dissociates from and rebinds to the enzyme. However, it is clear that at least a fraction of 5-Br-Trp has to emerge from AetF for further processing by the tryptophanase AetE to 5-Br-indole, which is an intermediate in AETX biosynthesis (Adak et al., 2022).
The flip of the product upon the bromination of Trp on C5 by AetF differs from, for example, PrnA, which binds its product 7-Cl-Trp in the same orientation as the substrate Trp (Dong et al., 2005). In PrnA, the chlorine of 7-Cl-Trp is thus very close to the Nɛ atom of the catalytic lysine. Moreover, the strategy adopted by AetF for regioselective double halogenation clearly differs from those previously described for other enzymes. During the biosynthesis of kutzneride, the tryptophan 7-halogenase KtzQ and the tryptophan 6-halogenase KtzR act successively to produce 6,7-Cl2-Trp, with 7-Cl-Trp serving as a substrate for KtzR (Heemstra & Walsh, 2008). MalA, in contrast, regioselectively halogenates two positions of the substrate in the biosynthesis of malbrancheamides. There is no defined order as MalA produces roughly equal amounts of both singly halogenated intermediates during the reaction (Fraley et al., 2017). Structures of the closely related FDH MalA′ showed the nonhalogenated educt, both singly halogenated intermediates and the doubly halogenated product to bind almost identically (Fraley et al., 2017), suggesting that the product of the first halogenation does not need to reorient for the second halogenation to occur.
3.6. A tunnel connects the FAD and the distant tryptophan binding site
FMOs generally bind their substrates in the cleft between the large and the small dinucleotide-binding domains and in the direct vicinity of FAD (Romero et al., 2018). While at least some Baeyer–Villiger monooxygenases from B1 can bind substrate and NADP at the same time (Yachnin et al., 2012), in many FMOs of B2 the binding of substrate and NADP appear to be mutually exclusive, presumably because both occupy the same space (Eswaramoorthy et al., 2006; Alfieri et al., 2008; Cho et al., 2011; Wang et al., 2021). Accordingly, a previous attempt to dock the substrate Trp into an AlphaFold2 model of AetF used AetF in complex with FAD but lacking NADP because there was not sufficient space for both NADP and tryptophan to occupy the active site simultaneously (Jiang et al., 2022). The docking resulted in several models with Trp placed next to the isoalloxazine ring of FAD, but even the top ten poses differed substantially from each other.
Our structures now reveal that the actual substrate-binding site is distant from the FAD-binding site and does not overlap with the presumed NADP-binding site. The C atom of the substrate that is to be halogenated and the reactive C4a of flavin are ∼16 Å apart, excluding direct reaction of flavin with the substrate. This is unusual for FMOs but analogous to the situation in two-component FDHs. In two-component tryptophan halogenases the substrate and flavin are connected by a tunnel of ∼10 Å in length (Dong et al., 2005; Prakinee et al., 2022). The separation of the redox-active flavin and the substrate is possible because the flavin C4a hydroperoxide intermediate reacts with a halide ion to generate hypohalous acid (HOX), which subsequently diffuses through the tunnel to the substrate. Analysis with CAVER 3.0 (Chovancova et al., 2012; Pavelka et al., 2016) shows that in AetF a tunnel of approximately 16 Å in length also connects the FAD and the substrate (Fig. 9). We propose that AetF uses a similar mechanism and the tunnel allows the directed diffusion of HOX to the substrate. The Nɛ atom of the catalytic lysine of AetF (Lys258) points towards the tunnel and is located about halfway between the flavin C4a and the C atom of the substrate to be halogenated. Mutation of this lysine leads to a complete loss of halogenating activity, suggesting that it serves a role analogous to that of the catalytic lysine in two-component FDHs (Jiang et al., 2022). We note, as a caveat, that the tunnel shown here has a radius of only 0.90 Å at the narrowest point. Thus, it would be too narrow to allow diffusion of HOX. The tunnel connecting FAD and tryptophan in the tryptophan 6-halogenase Thal also has several bottlenecks, with the narrowest having a radius of only 0.98 Å (Prakinee et al., 2022). We assume that the side chains of Leu183 and Lys258, which have relatively large spatial freedom, can rearrange to widen the tunnel and thus allow the diffusion of HOX.
The tunnel in AetF further extends from the substrate to the surface, suggesting a route through which the substrate can access its binding site. Similar tunnels are present in ancestral FMO2, the protein in the PDB that has the highest structural similarity to AetF. The mammalian FMOs, on which the ancestral FMOs generated by ancestral sequence reconstruction are based, are membrane-binding enzymes and possess a tunnel system that presumably serves to guide multiple hydrophobic and hydrophilic substrates from the membrane or the cytosol towards the active site, where they can bind next to the flavin (Nicoll et al., 2020). In an overlay of AetF and ancestral FMO2 (PDB entry 6sf0), Trp from AetF is located in close proximity to the bound detergent CYMAL6 that delineates the membrane–enzyme interface and sits at the start of a tunnel through which hydrophobic substrates are likely to diffuse to the active site (Supplementary Fig. S4; Nicoll et al., 2020). The tunnel that provides access to the active site for various substrates in FMOs with broad substrate specificity may have evolved into a specific substrate-binding site in AetF.
3.7. Binding of 7-bromotryptophan to AetF opens a route to the biocatalytic production of various differentially dihalogenated tryptophans
The different binding poses of Trp and 5-Br-Trp suggested to us that AetF should be able to bind 7-Br-Trp in the same orientation as Trp. Despite 7-Br-Trp not being a natural substrate of AetF, soaking AetF crystals with 7-Br-Trp provided highly interpretable difference densities after MR and anomalous difference density unambiguously showed the position of the Br atom (Figs. 10a and 10b). As predicted, 7-Br-Trp binds in the same binding pose as Trp (Fig. 10c). The Br atom of 7-Br-Trp is located near the Br atom of 5-Br-Trp and occupies the previously described hydrophobic pocket (Fig. 10d). C5 of 7-Br-Trp points towards the tunnel and the catalytic amino acids Glu200 and Lys258. Based on this structure, we predict that AetF might be able to halogenate 7-Br-Trp regioselectively at C5. This would open the way for site-specific differential double halogenation of Trp. Our structures suggest that AetF can accept both 5-Br-Trp and 7-Br-Trp as substrates for a second halogenation at positions 7 or 5, respectively. Enzymatic generation of 5-Br-Trp and 7-Br-Trp is possible with two-component FDHs such as PyrH (tryptophan 5-halogenase) and PrnA or RebH (tryptophan 7-halogenase). While PyrH, PrnA and RebH catalyze chlorination and bromination (Dong et al., 2005; Yeh et al., 2005; Zhu et al., 2009), AetF can efficiently brominate and iodinate Trp (Jiang et al., 2022). The successive use of two FDHs should therefore yield six different dihalogenated tryptophan derivatives with two different halogens, namely 5-X,7-Y-Trp, with X and Y being Cl, Br or I (Supplementary Fig. S5). Halogenated tryptophan can be further derivatized using chemical cross-coupling (Büchler et al., 2019; Crowe et al., 2021). The envisaged site-specific differential double halogenation of tryptophan may open new routes for subsequent chemical synthesis.
3.8. Homologous sequences containing the catalytic lysine could be single-component FDHs with different substrate specificities
FDHs have potential as biocatalysts for environmentally friendly halogenation reactions (Büchler et al., 2019; Crowe et al., 2021) and AetF has already been shown to be a versatile catalyst (Jiang et al., 2022). Therefore, identifying new FDHs with altered substrate scope or halide specificity is of interest. Genome mining has already resulted in the identification of promising enzymes with new properties, but it has so far focused on two-component FDHs (Bayer et al., 2013; Neubauer et al., 2018; Gkotsi et al., 2019). Moore and coworkers performed extensive sequence-similarity analysis for AetF and found that it is not related to two-component tryptophan halogenases, to the only other known single-component FDH Bmp5 or to any other characterized flavin-dependent monooxygenases, suggesting AetF to be unique among known halogenases (Adak et al., 2022).
The presence of the catalytic lysine (Lys258 in AetF) might be a way to identify new halogenases among single-component FPMOs. While lysine at the position equivalent to Lys258 of AetF is not generally conserved among FMOs, it is conserved in the phenol halogenase Bmp5 (see above). In our hands, a pure sequence alignment did not result in correct alignment of the structurally equivalent catalytic lysines of AetF and Bmp5. Instead, the conservation becomes apparent in a structure comparison, for example an alignment of the AlphaFold2 model of Bmp5 with our structure of AetF. Other FMOs with a lysine in the same structural position as Lys258 of AetF might also be halogenases. To identify potential halogenases, we searched the AlphaFold/UniProt50 v.4 database (Bateman et al., 2023) for structural homologs of AetF using Foldseek (van Kempen et al., 2023). Among the top 50 hits (score ≥ 249; E-value ≤ 4.11 × 10−8), ten carry a lysine at the position equivalent to Lys258 of AetF (Supplementary Fig. S6). Thus, these ten hits might be novel single-component FDHs with an as yet unknown substrate specificity. If this prediction turns out to be true and substrates can be identified, these potential enzymes might become useful additions to the toolbox for biocatalytic halogenation.
4. Conclusion
The substrate-bound AetF structures nicely demonstrate that despite the high accuracy of AlphaFold2 protein models, experimental structures are still required. The actual substrate-binding site of AetF is distant from that anticipated based on analogy to known substrate-binding sites in other FMOs (Jiang et al., 2022). Our structures facilitate a better mechanistic understanding of single-component FDHs in general and they may allow the identification of novel single-component FDHs based on structural conservation of the catalytic lysine. The Trp-bound and 5-Br-Trp-bound structures provide a clear explanation of the regioselective double bromination by AetF. Finally, our structures allow predictions of how to apply wild-type AetF for the generation of differentially double-halogenated tryptophan and they provide the necessary basis for structure-based modification of AetF to modify its regioselectivity and alter its substrate scope.
5. Related literature
The following references are cited in the supporting information for this article: Franceschini et al. (2012) and Valentino et al. (2020).
Supporting information
Link https://doi.org/10.15785/SBGRID/1002
Raw data for PDB entry 8cjd.
Link https://doi.org/10.15785/SBGRID/1003
Raw data for PDB entry 8cje.
Link https://doi.org/10.15785/SBGRID/1004
Raw data for PDB entry 8cjf.
Link https://doi.org/10.15785/SBGRID/1005
Raw data for PDB entry 8cjg.
Link https://doi.org/10.15151/ESRF-DC-1089735712
Raw data for PDB entry 8cje.
Link https://doi.org/10.15151/ESRF-DC-1089719643
Raw data for PDB entry 8cjf.
Link https://doi.org/10.15151/ESRF-DC-1089739837
Raw data for PDB entry 8cjg.
Supplementary Tables and Figures. DOI: https://doi.org/10.1107/S2059798323004254/ag5042sup1.pdf
Acknowledgements
The synchrotron data were collected on beamline P13 operated by EMBL Hamburg at the PETRA III storage ring, DESY, Hamburg, Germany. We would like to thank Michael Agthe for assistance in using the beamline. We acknowledge the European Synchrotron Radiation Facility (ESRF) for the provision of synchrotron-radiation facilities and we would like to thank Andrew McCarthy for assistance and support in using beamline ID30B and David Flot for assistance and support in using beamline ID23-1. We thank Nicolai Montua (OC III, Bielefeld University) for providing 7-bromotryptophan. We thank Willy Keller and Marie Ruppel for expert technical assistance. Open access funding enabled and organized by Projekt DEAL.
References
Adak, S., Lukowski, A. L., Schäfer, R. J. B. & Moore, B. S. (2022). J. Am. Chem. Soc. 144, 2861–2866. CrossRef CAS PubMed Google Scholar
Afonine, P. V., Grosse-Kunstleve, R. W., Echols, N., Headd, J. J., Moriarty, N. W., Mustyakimov, M., Terwilliger, T. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2012). Acta Cryst. D68, 352–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Agarwal, V., El Gamal, A. A., Yamanaka, K., Poth, D., Kersten, R. D., Schorn, M., Allen, E. E. & Moore, B. S. (2014). Nat. Chem. Biol. 10, 640–647. Web of Science CrossRef CAS PubMed Google Scholar
Agarwal, V., Miles, Z. D., Winter, J. M., Eustáquio, A. S., El Gamal, A. A. & Moore, B. S. (2017). Chem. Rev. 117, 5619–5674. CrossRef CAS PubMed Google Scholar
Alfieri, A., Malito, E., Orru, R., Fraaije, M. W. & Mattevi, A. (2008). Proc. Natl Acad. Sci. USA, 105, 6572–6577. CrossRef PubMed CAS Google Scholar
Andorfer, M. C., Evans, D., Yang, S., He, C. Q., Girlich, A. M., Vergara-Coll, J., Sukumar, N., Houk, K. N. & Lewis, J. C. (2022). Chem. Catal. 2, 2658–2674. CrossRef CAS PubMed Google Scholar
Andorfer, M. C. & Lewis, J. C. (2018). Annu. Rev. Biochem. 87, 159–185. CrossRef CAS PubMed Google Scholar
Bailleul, G., Nicoll, C. R., Mascotti, M. L., Mattevi, A. & Fraaije, M. W. (2021). J. Biol. Chem. 296, 100221. CrossRef PubMed Google Scholar
Bateman, A., Martin, M., Orchard, S., Magrane, M., Ahmad, S., Alpi, E., Bowler-Barnett, E. H., Britto, R., Bye-A-Jee, H., Cukura, A., Denny, P., Dogan, T., Ebenezer, T., Fan, J., Garmiri, P., da Costa Gonzales, L. J., Hatton-Ellis, E., Hussein, A., Ignatchenko, A., Insana, G., Ishtiaq, R., Joshi, V., Jyothi, D., Kandasaamy, S., Lock, A., Luciani, A., Lugaric, M., Luo, J., Lussi, Y., MacDougall, A., Madeira, F., Mahmoudy, M., Mishra, A., Moulang, K., Nightingale, A., Pundir, S., Qi, G., Raj, S., Raposo, P., Rice, D. L., Saidi, R., Santos, R., Speretta, E., Stephenson, J., Totoo, P., Turner, E., Tyagi, N., Vasudev, P., Warner, K., Watkins, X., Zaru, R., Zellner, H., Bridge, A. J., Aimo, L., Argoud-Puy, G., Auchincloss, A. H., Axelsen, K. B., Bansal, P., Baratin, D., Batista Neto, T. M., Blatter, M., Bolleman, J. T., Boutet, E., Breuza, L., Gil, B. C., Casals-Casas, C., Echioukh, K. C., Coudert, E., Cuche, B., de Castro, E., Estreicher, A., Famiglietti, M. L., Feuermann, M., Gasteiger, E., Gaudet, P., Gehant, S., Gerritsen, V., Gos, A., Gruaz, N., Hulo, C., Hyka-Nouspikel, N., Jungo, F., Kerhornou, A., Le Mercier, P., Lieberherr, D., Masson, P., Morgat, A., Muthukrishnan, V., Paesano, S., Pedruzzi, I., Pilbout, S., Pourcel, L., Poux, S., Pozzato, M., Pruess, M., Redaschi, N., Rivoire, C., Sigrist, C. J. A., Sonesson, K., Sundaram, S., Wu, C. H., Arighi, C. N., Arminski, L., Chen, C., Chen, Y., Huang, H., Laiho, K., McGarvey, P., Natale, D. A., Ross, K., Vinayaka, C. R., Wang, Q., Wang, Y. & Zhang, J. (2023). Nucleic Acids Res. 51, D523–D531. PubMed Google Scholar
Bayer, K., Scheuermayer, M., Fieseler, L. & Hentschel, U. (2013). Mar. Biotechnol. 15, 63–72. CrossRef CAS Google Scholar
Breinlinger, S., Phillips, T. J., Haram, B. N., Mareš, J., Martínez Yerena, J. A., Hrouzek, P., Sobotka, R., Henderson, W. M., Schmieder, P., Williams, S. M., Lauderdale, J. D., Wilde, H. D., Gerrin, W., Kust, A., Washington, J. W., Wagner, C., Geier, B., Liebeke, M., Enke, H., Niedermeyer, T. H. J. & Wilde, S. B. (2021). Science, 371, eaax9050. CSD CrossRef PubMed Google Scholar
Büchler, J., Papadopoulou, A. & Buller, R. (2019). Catalysts, 9, 1030. Google Scholar
Cho, H. J., Cho, H. Y., Kim, K. J., Kim, M. H., Kim, S. W. & Kang, B. S. (2011). J. Struct. Biol. 175, 39–48. Web of Science CrossRef CAS PubMed Google Scholar
Chovancova, E., Pavelka, A., Benes, P., Strnad, O., Brezovsky, J., Kozlikova, B., Gora, A., Sustr, V., Klvana, M., Medek, P., Biedermannova, L., Sochor, J. & Damborsky, J. (2012). PLoS Comput. Biol. 8, e1002708. Web of Science CrossRef PubMed Google Scholar
Cianci, M., Bourenkov, G., Pompidor, G., Karpics, I., Kallio, J., Bento, I., Roessle, M., Cipriani, F., Fiedler, S. & Schneider, T. R. (2017). J. Synchrotron Rad. 24, 323–332. Web of Science CrossRef CAS IUCr Journals Google Scholar
Crowe, C., Molyneux, S., Sharma, S. V., Zhang, Y., Gkotsi, D. S., Connaris, H. & Goss, R. J. M. (2021). Chem. Soc. Rev. 50, 9443–9481. CrossRef CAS PubMed Google Scholar
Dachwitz, S., Widmann, C., Frese, M., Niemann, H. H. & Sewald, N. (2021). Amino Acids, Peptides and Proteins, Vol. 44, edited by M. Ryadnov & F. Hudecz, pp. 1–43. London: Royal Society of Chemistry. Google Scholar
Dong, C., Flecks, S., Unversucht, S., Haupt, C., van Pée, K.-H. & Naismith, J. H. (2005). Science, 309, 2216–2219. Web of Science CrossRef PubMed CAS Google Scholar
Dorrestein, P. C., Yeh, E., Garneau-Tsodikova, S., Kelleher, N. L. & Walsh, C. T. (2005). Proc. Natl Acad. Sci. USA, 102, 13843–13848. Web of Science CrossRef PubMed CAS Google Scholar
El Gamal, A., Agarwal, V., Diethelm, S., Rahman, I., Schorn, M. A., Sneed, J. M., Louie, G. V., Whalen, K. E., Mincer, T. J., Noel, J. P., Paul, V. J. & Moore, B. S. (2016). Proc. Natl Acad. Sci. USA, 113, 3797–3802. Web of Science CrossRef CAS PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Eswaramoorthy, S., Bonanno, J. B., Burley, S. K. & Swaminathan, S. (2006). Proc. Natl Acad. Sci. USA, 103, 9832–9837. Web of Science CrossRef PubMed CAS Google Scholar
Flecks, S., Patallo, E. P., Zhu, X., Ernyei, A. J., Seifert, G., Schneider, A., Dong, C., Naismith, J. H. & van Pée, K.-H. (2008). Angew. Chem. Int. Ed. 47, 9533–9536. Web of Science CrossRef CAS Google Scholar
Fraley, A. E., Garcia-Borràs, M., Tripathi, A., Khare, D., Mercado-Marin, E. V., Tran, H., Dan, Q., Webb, G. P., Watts, K. R., Crews, P., Sarpong, R., Williams, R. M., Smith, J. L., Houk, K. N. & Sherman, D. H. (2017). J. Am. Chem. Soc. 139, 12060–12068. Web of Science CrossRef CAS PubMed Google Scholar
Franceschini, S., van Beek, H. L., Pennetta, A., Martinoli, C., Fraaije, M. W. & Mattevi, A. (2012). J. Biol. Chem. 287, 22626–22634. Web of Science CrossRef CAS PubMed Google Scholar
Gabadinho, J., Beteva, A., Guijarro, M., Rey-Bakaikoa, V., Spruce, D., Bowler, M. W., Brockhauser, S., Flot, D., Gordon, E. J., Hall, D. R., Lavault, B., McCarthy, A. A., McCarthy, J., Mitchell, E., Monaco, S., Mueller-Dieckmann, C., Nurizzo, D., Ravelli, R. B. G., Thibault, X., Walsh, M. A., Leonard, G. A. & McSweeney, S. M. (2010). J. Synchrotron Rad. 17, 700–707. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gkotsi, D. S., Ludewig, H., Sharma, S. V., Connolly, J. A., Dhaliwal, J., Wang, Y., Unsworth, W. P., Taylor, R. J. K., McLachlan, M. M. W., Shanahan, S., Naismith, J. H. & Goss, R. J. M. (2019). Nat. Chem. 11, 1091–1097. Web of Science CrossRef CAS PubMed Google Scholar
Gorrec, F. (2009). J. Appl. Cryst. 42, 1035–1042. Web of Science CrossRef CAS IUCr Journals Google Scholar
Hanukoglu, I. (2015). Biochem. Mol. Biol. Educ. 43, 206–209. Web of Science CrossRef CAS PubMed Google Scholar
Heemstra, J. R. & Walsh, C. T. (2008). J. Am. Chem. Soc. 130, 14024–14025. Web of Science CrossRef PubMed CAS Google Scholar
Hekkelman, M. L., de Vries, I., Joosten, R. P. & Perrakis, A. (2023). Nat. Methods, 20, 205–213. CrossRef CAS PubMed Google Scholar
Holm, L. (2022). Nucleic Acids Res. 50, W210–W215. Web of Science CrossRef CAS PubMed Google Scholar
Jiang, Y., Snodgrass, H. M., Zubi, Y. S., Roof, C. V., Guan, Y., Mondal, D., Moneycutt, N. H., Lee, J. W., Lewis, R. D., Martinez, C. A. & Lewis, J. C. (2022). Angew. Chem. Int. Ed. 61, e202214610. Google Scholar
Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., Tunyasuvunakool, K., Bates, R., Žídek, A., Potapenko, A., Bridgland, A., Meyer, C., Kohl, S. A. A., Ballard, A. J., Cowie, A., Romera-Paredes, B., Nikolov, S., Jain, R., Adler, J., Back, T., Petersen, S., Reiman, D., Clancy, E., Zielinski, M., Steinegger, M., Pacholska, M., Berghammer, T., Bodenstein, S., Silver, D., Vinyals, O., Senior, A. W., Kavukcuoglu, K., Kohli, P. & Hassabis, D. (2021). Nature, 596, 583–589. Web of Science CrossRef CAS PubMed Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Karplus, P. A. & Diederichs, K. (2012). Science, 336, 1030–1033. Web of Science CrossRef CAS PubMed Google Scholar
van Kempen, M., Kim, S. S., Tumescheit, C., Mirdita, M., Lee, J., Gilchrist, C. L. M., Söding, J. & Steinegger, M. (2023). Nat. Biotechnol. https://doi.org/10.1038/s41587-023-01773-0. Google Scholar
Kovalevskiy, O., Nicholls, R. A., Long, F., Carlon, A. & Murshudov, G. N. (2018). Acta Cryst. D74, 215–227. Web of Science CrossRef IUCr Journals Google Scholar
Kubitza, C., Faust, A., Gutt, M., Gäth, L., Ober, D. & Scheidig, A. J. (2018). Acta Cryst. D74, 422–432. CrossRef IUCr Journals Google Scholar
Lebedev, A. A. & Isupov, M. N. (2014). Acta Cryst. D70, 2430–2443. Web of Science CrossRef IUCr Journals Google Scholar
Liebschner, D., Afonine, P. V., Baker, M. L., Bunkóczi, G., Chen, V. B., Croll, T. I., Hintze, B., Hung, L.-W., Jain, S., McCoy, A. J., Moriarty, N. W., Oeffner, R. D., Poon, B. K., Prisant, M. G., Read, R. J., Richardson, J. S., Richardson, D. C., Sammito, M. D., Sobolev, O. V., Stockwell, D. H., Terwilliger, T. C., Urzhumtsev, A. G., Videau, L. L., Williams, C. J. & Adams, P. D. (2019). Acta Cryst. D75, 861–877. Web of Science CrossRef IUCr Journals Google Scholar
Malito, E., Alfieri, A., Fraaije, M. W. & Mattevi, A. (2004). Proc. Natl Acad. Sci. USA, 101, 13157–13162. Web of Science CrossRef PubMed CAS Google Scholar
Malý, M., Diederichs, K., Dohnálek, J. & Kolenko, P. (2020). IUCrJ, 7, 681–692. Web of Science CrossRef PubMed IUCr Journals Google Scholar
McCarthy, A. A., Barrett, R., Beteva, A., Caserotto, H., Dobias, F., Felisaz, F., Giraud, T., Guijarro, M., Janocha, R., Khadrouche, A., Lentini, M., Leonard, G. A., Lopez Marrero, M., Malbet-Monaco, S., McSweeney, S., Nurizzo, D., Papp, G., Rossi, C., Sinoir, J., Sorez, C., Surr, J., Svensson, O., Zander, U., Cipriani, F., Theveneau, P. & Mueller-Dieckmann, C. (2018). J. Synchrotron Rad. 25, 1249–1260. Web of Science CrossRef CAS IUCr Journals Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
Meyer, P. A., Socias, S., Key, J., Ransey, E., Tjon, E. C., Buschiazzo, A., Lei, M., Botka, C., Withrow, J., Neau, D., Rajashankar, K., Anderson, K. S., Baxter, R. H., Blacklow, S. C., Boggon, T. J., Bonvin, A. M. J. J., Borek, D., Brett, T. J., Caflisch, A., Chang, C., Chazin, W. J., Corbett, K. D., Cosgrove, M. S., Crosson, S., Dhe-Paganon, S., Di Cera, E., Drennan, C. L., Eck, M. J., Eichman, B. F., Fan, Q. R., Ferré-D'Amaré, A. R., Christopher Fromme, J., Garcia, K. C., Gaudet, R., Gong, P., Harrison, S. C., Heldwein, E. E., Jia, Z., Keenan, R. J., Kruse, A. C., Kvansakul, M., McLellan, J. S., Modis, Y., Nam, Y., Otwinowski, Z., Pai, E. F., Pereira, P. J. B., Petosa, C., Raman, C. S., Rapoport, T. A., Roll-Mecak, A., Rosen, M. K., Rudenko, G., Schlessinger, J., Schwartz, T. U., Shamoo, Y., Sondermann, H., Tao, Y. J., Tolia, N. H., Tsodikov, O. V., Westover, K. D., Wu, H., Foster, I., Fraser, J. S., Maia, F. R. N. C., Gonen, T., Kirchhausen, T., Diederichs, K., Crosas, M. & Sliz, P. (2016). Nat. Commun. 7, 10882. Web of Science CrossRef PubMed Google Scholar
Mirza, I. A., Yachnin, B. J., Wang, S., Grosse, S., Bergeron, H., Imura, A., Iwaki, H., Hasegawa, Y., Lau, P. C. K. & Berghuis, A. M. (2009). J. Am. Chem. Soc. 131, 8848–8854. Web of Science CrossRef PubMed CAS Google Scholar
Moritzer, A. C., Minges, H., Prior, T., Frese, M., Sewald, N. & Niemann, H. H. (2019). J. Biol. Chem. 294, 2529–2542. CrossRef CAS PubMed Google Scholar
Mueller, U., Thunnissen, M., Nan, J., Eguiraun, M., Bolmsten, F., Milàn-Otero, A., Guijarro, M., Oscarsson, M., de Sanctis, D. & Leonard, G. (2017). Synchrotron Radiat. News, 30(1), 22–27. CrossRef Google Scholar
Neubauer, P. R., Widmann, C., Wibberg, D., Schröder, L., Frese, M., Kottke, T., Kalinowski, J., Niemann, H. H. & Sewald, N. (2018). PLoS One, 13, 1–21. Google Scholar
Neumann, C. S., Fujimori, D. G. & Walsh, C. T. (2008). Chem. Biol. 15, 99–109. Web of Science CrossRef PubMed CAS Google Scholar
Nicoll, C. R., Bailleul, G., Fiorentini, F., Mascotti, M. L., Fraaije, M. W. & Mattevi, A. (2020). Nat. Struct. Mol. Biol. 27, 14–24. CrossRef CAS PubMed Google Scholar
Nurizzo, D., Mairs, T., Guijarro, M., Rey, V., Meyer, J., Fajardo, P., Chavanne, J., Biasci, J.-C., McSweeney, S. & Mitchell, E. (2006). J. Synchrotron Rad. 13, 227–238. Web of Science CrossRef CAS IUCr Journals Google Scholar
Pang, A. H., Garneau-Tsodikova, S. & Tsodikov, O. V. (2015). J. Struct. Biol. 192, 349–357. Web of Science CrossRef CAS PubMed Google Scholar
Paul, C. E., Eggerichs, D., Westphal, A. H., Tischler, D. & van Berkel, W. J. H. (2021). Biotechnol. Adv. 51, 107712. Web of Science CrossRef PubMed Google Scholar
Pavelka, A., Sebestova, E., Kozlikova, B., Brezovsky, J., Sochor, J. & Damborsky, J. (2016). IEEE/ACM Trans. Comput. Biol. Bioinform. 13, 505–517. Web of Science CrossRef Google Scholar
Phintha, A., Prakinee, K. & Chaiyen, P. (2020). Enzymes, 47, 327–364, CrossRef CAS PubMed Google Scholar
Prakinee, K., Phintha, A., Visitsatthawong, S., Lawan, N., Sucharitakul, J., Kantiwiriyawanitch, C., Damborsky, J., Chitnumsub, P., van Pée, K.-H. & Chaiyen, P. (2022). Nat. Catal. 5, 534–544. CrossRef CAS Google Scholar
Romero, E., Gómez Castellanos, J. R., Gadda, G., Fraaije, M. W. & Mattevi, A. (2018). Chem. Rev. 118, 1742–1769. Web of Science CrossRef CAS PubMed Google Scholar
Schnepel, C., Moritzer, A. C., Gäfe, S., Montua, N., Minges, H., Niess, A., Niemann, H. H. & Sewald, N. (2023). ChemBioChem, 24, e202200569. CrossRef PubMed Google Scholar
Valentino, H., Campbell, A. C., Schuermann, J. P., Sultana, N., Nam, H. G., LeBlanc, S., Tanner, J. J. & Sobrado, P. (2020). J. Biol. Chem. 295, 11042–11055. CrossRef CAS PubMed Google Scholar
Varadi, M., Anyango, S., Deshpande, M., Nair, S., Natassia, C., Yordanova, G., Yuan, D., Stroe, O., Wood, G., Laydon, A., Žídek, A., Green, T., Tunyasuvunakool, K., Petersen, S., Jumper, J., Clancy, E., Green, R., Vora, A., Lutfi, M., Figurnov, M., Cowie, A., Hobbs, N., Kohli, P., Kleywegt, G., Birney, E., Hassabis, D. & Velankar, S. (2022). Nucleic Acids Res. 50, D439–D444. Web of Science CrossRef CAS PubMed Google Scholar
Wang, X.-J., Zhang, N., Teng, Z.-J., Wang, P., Zhang, W.-P., Chen, X.-L., Zhang, Y.-Z., Chen, Y., Fu, H.-H. & Li, C.-Y. (2021). Front. Microbiol. 12, 735793. CrossRef PubMed Google Scholar
Weichold, V., Milbredt, D. & van Pée, K.-H. (2016). Angew. Chem. Int. Ed. 55, 6374–6389. Web of Science CrossRef CAS Google Scholar
Yachnin, B. J., Sprules, T., McEvoy, M. B., Lau, P. C. K. & Berghuis, A. M. (2012). J. Am. Chem. Soc. 134, 7788–7795. Web of Science CrossRef CAS PubMed Google Scholar
Yeh, E., Garneau, S. & Walsh, C. T. (2005). Proc. Natl Acad. Sci. USA, 102, 3960–3965. Web of Science CrossRef PubMed CAS Google Scholar
Zhu, X., De Laurentis, W., Leang, K., Herrmann, J., Ihlefeld, K., van Pée, K.-H. & Naismith, J. H. (2009). J. Mol. Biol. 391, 74–85. Web of Science CrossRef PubMed CAS Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.