research communications
Crystal structures of Val58Ile tryptophan repressor in a domain-swapped array in the presence and absence of L-tryptophan
aDepartment of Chemistry, University of Copenhagen, DK-2100 Copenhagen, Denmark, bCenter for Molecular Protein Science, Lund University, SE-221 00 Lund, Sweden, cInstitute for Quantitative Biomedicine, Rutgers University, Piscataway, NJ 08854, USA, dDepartment of Biology, Lund University, SE-221 00 Lund, Sweden, and eChemistry Department, Princeton University, Princeton, NJ 08544, USA
*Correspondence e-mail: janina.sprenger@cfel.de
The crystal structures of domain-swapped tryptophan repressor (TrpR) variant Val58Ile before and after soaking with the physiological ligand L-tryptophan (L-Trp) indicate that L-Trp occupies the same location in the domain-swapped form as in native dimeric TrpR and makes equivalent residue contacts. This result is unexpected because the ligand binding-site residues arise from three separate polypeptide chains in the domain-swapped form. This work represents the first published structure of a domain-swapped form of TrpR with L-Trp bound. The presented structures also show that the protein amino-terminus, whether or not it bears a disordered extension of about 20 residues, is accessible in the large solvent channels of the domain-swapped crystal form, as in the structures reported previously in this form for TrpR without N-terminal extensions. These findings inspire the exploration of L-Trp analogs and N-terminal modifications as labels to orient guest proteins that cannot otherwise be crystallized in the solvent channels of crystalline domain-swapped TrpR hosts for potential diffraction analysis.
Keywords: crystalline protein gel; hostal system; fragment-based screening; ligand binding; molecular baits; domain swapping; Val58Ile tryptophan repressor.
1. Introduction
Tryptophan repressor (TrpR) was one of the earliest gene-regulatory proteins for which a high-resolution et al., 1988). The TrpR dimer (Fig. 1a) is formed from deeply intertwined monomers (Schevitz et al., 1985) and is related to domain-swapped proteins (Carey et al., 2007), even though it does not meet a proposed definition predicated on the existence of a stably folded monomeric structure (Liu & Eisenberg, 2002), which TrpR apparently does not have (Shao et al., 1997). Under certain conditions of crystal growth, TrpR dimers partially unfold and self-associate in a process akin to runaway domain swapping (Liu & Eisenberg, 2002), forming extended, higher-order domain-swapped structures connected by dimer-like nodes, each made up of four distinct polypeptide chains (Lawson et al., 2004; Fig. 1b). Formation of the extended domain-swapped structure depends on the conversion of two interhelical turns in each subunit of native dimeric TrpR into helical segments, resulting in the formation of a continuous, nearly straight α-helix of 47 residues. The resulting crystals of domain-swapped TrpR (ds-TrpR) have very large solvent channels and can be described as crystallinely ordered gels based on the chemical definition of gels, namely chain entanglement and high solvent content; IUPAC defines a gel as a nonfluid expanded throughout its whole volume by a fluid (Jones et al., 2008). The highly unusual feature of ds-TrpR relevant to considering it as a crystalline gel is the extraordinary degree of chain entanglement that extends throughout the entire crystal, promoting a robust as shown here, despite a very high solvent content that would ordinarily be associated with notably fragile protein crystals. To the best of the authors' knowledge no other protein crystal qualifies as a crystallinely ordered gel, not even other domain-swapped examples, which have less extensive swapping of their parts and lower solvent contents. The ability of crystalline ds-TrpR to incorporate 5-bromotryptophan, a derivative of the physiological TrpR ligand L-tryptophan (L-Trp), has been reported previously (Lawson et al., 2004), although the corresponding was not deposited, nor has any structure been reported for ds-TrpR with L-Trp.
was determined in complex with its DNA regulatory site (OtwinowskiTrpR crystals in the typical native dimeric form have previously been reported for a variant in which Val58 is replaced by isoleucine (Val58Ile), one of 19 TrpR mutants created by Arvidson et al. (1991) to explore the role of the identity of the residue at position 58, which forms part of the L-Trp-binding site. Crystals of dimeric Val58Ile TrpR were used for high-resolution crystallographic and neutron diffraction analyses of L-tryptophan binding (Lawson, 1996a; Daniels et al., 2003) due to their facile growth to a very large size. The Val58Ile TrpR dimer structure (Lawson, 1996a) is indistinguishable from that of the wild-type TrpR dimer (Lawson et al., 1988), and the Val58Ile variant has wild-type TrpR activity in vivo, except for being slightly less sensitive to the L-Trp analog 5-methyltryptophan (Arvidson et al., 1991, 1994). Structural analysis of Val58Ile TrpR suggested the basis for this alteration (Lawson, 1996b), indicating that the L-Trp-binding site can accommodate an extra methyl group on the L-Trp ligand or an extra methylene group on the Val58 side chain, but that when these extra groups are present on both the protein and the ligand, the structural adjustments that would be required to accommodate them may account for the reduced affinity of 5-methyltryptophan for Val58Ile TrpR.
During the course of studies to pursue further analysis of Val58Ile TrpR by neutron diffraction based on promising early results (Lawson, 1996a; Daniels et al., 2003), it was discovered that this mutant could be crystallized in the extended domain-swapped form under the conditions reported previously for wild-type TrpR. The very large solvent channels typical of ds-TrpRs are being analyzed for possible use as a host system to facilitate of guest proteins that cannot be crystallized (a so-called hostal system; Sprenger et al., 2021). This system potentially offers a novel approach to the incorporation of biomolecular guests into porous systems, an area of general interest in applied biotechnology today (Yan et al., 2015; Abe & Ueno, 2015; Hartje & Snow, 2018). Successful of guest proteins in any host system requires uniform guest orientation to achieve crystallographic occupancy. In this regard, the ds-TrpR system offers a possible advantage if the guest can be targeted to the L-Trp-binding site using a label based on L-Trp. The present work was thus carried out using the abundantly available crystals of Val58Ile ds-TrpR soaked with L-Trp to evaluate the ability of ds-TrpR to bind L-Trp and to define the ligand orientation, which are here documented for the first time for any ds-TrpR. The results show that L-Trp binds in the same location and orientation as in typical dimeric TrpRs and makes equivalent residue contacts, affirming its potential as a label to potentially orient guests.
In the course of this work, crystals were also obtained and a structure was solved for Val58Ile ds-TrpR with a 23-residue N-terminal extension including a His6 purification segment, a spacer segment and a protease-cleavage sequence. Electron density for the N-terminal extension is visible in the solvent channel, but only two of its residues could be modeled due to disorder. The crystallizability of ds-TrpR with a flexible extension accessible in the solvent channels suggests the N-terminus as another locus for the attachment of groups that might orient guest proteins within the channels. All Val58Ile ds-TrpR structures solved here, with and without the N-terminal extension and with and without L-Trp, share essentially identical overall structures with wild-type ds-TrpR.
2. Materials and methods
2.1. Protein expression and purification
A DNA fragment encoding the Val58Ile TrpR variant (Fig. 2) with an N-terminal extension for subsequent purification containing a His6 segment (MHHHHHH) followed by a spacer (SSGVDLGT) and a rhinovirus 3C protease-cleavage sequence (LEVLFQ↓GP, with the arrow indicating the cleavage position; Cordingley et al., 1989) was purchased from GenScript. The fragment was subcloned into plasmid pNIC28-Bsa4 (Savitsky et al., 2010) and the resulting plasmid was transformed into E. coli T7 Express (New England Biolabs). The bacteria were grown in 1 l lysogeny broth (Difco) containing kanamycin at 50 µg ml−1 in 5 l baffled Erlenmeyer flasks at 37°C with agitation (200 rev min−1) until the cultures reached an of 0.8 at 600 nm. Expression of Val58Ile TrpR was induced by the addition of isopropyl β-D-1-thiogalactopyranoside to 0.25 mM and growth of the culture was continued at 37°C for 4 h with shaking at 200 rev min−1. The cells were harvested by centrifugation at 8000g for 10 min at 4°C and the pellet was resuspended in ∼35 ml buffer A (50 mM sodium phosphate buffer with 300 mM NaCl, 20 mM imidazole pH 8.0) with the addition of one tablet of cOmplete EDTA-free protease inhibitor (Roche). The cells were broken by passage through a French press twice at 124 MPa; after centrifugation for 1 h at 100 000g at 4°C, the supernatant was filtered (Acrodisc syringe filter, 0.45 µm Supor membrane) and applied onto a HisTrap HP column (1 ml, GE Healthcare) pre-equilibrated with buffer A. Bound proteins were eluted with a linear gradient of 20–500 mM imidazole in buffer A over 20 column volumes. Fractions containing TrpR with the N-terminal extension were pooled and the concentration was determined using the absorbance at 280 nm with an extinction coefficient of 13 980 M−1 cm−1 for TrpR with the extension. To remove the extension, the pooled fractions were mixed with His-tagged 3C protease at an enzyme:TrpR mass ratio of approximately 1:100, and the digestion mixture was transferred to a dialysis tube (32 mm, mass cutoff 3500 Da; Spectrapor) and dialyzed into buffer A containing 1 mM dithiothreitol (DTT) and 0.5 mM EDTA (as disodium salt) at 6°C for at least 12 h. After an additional dialysis step for 3 h at 6°C against 1 l buffer A without DTT and EDTA, the TrpR–protease mixture was applied as previously onto an equilibrated HisTrap HP column. The flowthrough containing cleaved TrpR was collected and dialyzed overnight against 1 l crystallization buffer (100 mM sodium phosphate buffer, 200 mM NaCl pH 7.5) containing 0.5 mM EDTA at 6°C and additionally against 1 l crystallization buffer for 12 h. confirmed that the N-terminus of the cleaved protein includes the expected N-terminal Gly-Pro residues (data not shown). The sequences of all proteins are shown in Fig. 2.
2.2. Crystallization
2.2.1. Val58Ile ds-TrpR with L-Trp
Following the removal of the N-terminal extension by protease cleavage and before crystallization, purified Val58Ile TrpR was subjected to a final gel-filtration step using a Superdex 75 300/10 column with crystallization buffer. Pure fractions were pooled and concentrated to 3–8.5 mg ml−1 using 3000 Da Amicon Ultra-15 centrifugal filter units. Crystallization was performed using the hanging-drop vapor-diffusion technique according to previously reported conditions with a reservoir consisting of 27.5–35%(v/v) 2-propanol (iPrOH), 100 mM NaCl, 100 mM 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid (HEPES) pH 7.5 (Lawson et al., 2004). Crystals of Val58Ile TrpR grew within 1–2 days in drops consisting of 2–4 µl of a 1:1 mixture of protein and reservoir solutions. For X-ray diffraction, the crystals were soaked for 10 s in their reservoir solution to which ethylene glycol was added to 25%(v/v) as a cryoprotectant before cryocooling in liquid N2. To obtain the L-Trp-bound Val58Ile ds-TrpR structure, 1–2 day-old crystals were soaked for 30 min in cryosolution containing 10 mM L-Trp.
2.2.2. Val58Ile ds-TrpR with the N-terminal extension
Val58Ile ds-TrpR with the N-terminal extension (Fig. 2) was crystallized as described for the cleaved protein and purified identically except that proteolytic cleavage was not performed. No difference in protein purification nor in the growth or the behavior of the crystals was noted compared with the cleaved protein. Crystals of the uncleaved protein were harvested, cryocooled and measured as described for crystals of cleaved ds-TrpR.
2.3. X-ray data collection, processing and molecular replacement
2.3.1. Structure of Val58Ile ds-TrpR with and without L-Trp
X-ray diffraction data from crystals of the cleaved TrpR Val58Ile variant (50–100 µm in the longest dimension) were collected on beamline ID30A-3 at ESRF and data from 10 mM L-Trp-soaked crystals were collected on beamline P11 at DESY, Hamburg. After cryoprotectant soaking and cryocooling as described above, 4000 frames were collected with 0.1° rotation, with a detector distance corresponding to about 0.5 Å below the diffraction limit of the crystals suggested by the initial processing. Processing was performed using XDS (Kabsch, 2010). The data sets of the best-diffracting crystals extended to 2.03 and 2.45 Å for Val58Ile ds-TrpR and L-Trp-soaked Val58Ile ds-TrpR crystals, respectively. Structures for these two single-crystal data sets were determined by using MOLREP (Vagin & Teplyakov, 2010) with the domain-swapped wild-type TrpR structure as a search model (PDB entry 1mi7; Lawson et al., 2004). Alternating cycles of manual fitting and automated including translation–libration–screw (TLS) (for the L-Trp-bound structure) were carried out with Coot (Emsley et al., 2010) and phenix.refine (Afonine et al., 2012), respectively.
2.3.2. Structure of Val58Ile ds-TrpR with the N-terminal extension
Data were collected on beamline P13 at DESY, Hamburg. The crystals typically diffracted to 2.4–2.7 Å resolution, similar to the diffraction observed from crystals of cleaved ds-TrpR. The data sets after data reduction and processing with autoPROC (Vonrhein et al., 2011) showed identical unit-cell parameters and to cleaved ds-TrpR. One of the best-quality data sets from the uncleaved ds-TrpR Val58Ile crystals was used for with the (cleaved) ds-TrpR Val58Ile structure (PDB entry 6st6) as a search model in MOLREP (Vagin & Teplyakov, 2010) and subsequent with REFMAC5 (Murshudov et al., 2011). Further manual and automated was carried out as described for the cleaved structures above.
3. Results
3.1. determination of Val58Ile ds-TrpR
After protease cleavage, the Val58Ile TrpR protein contains an N-terminal Gly-Pro dipeptide preceding the original methionine residue, which in native wild-type TrpR is present as N-formylmethionine (fMet) and is largely processed off (Gunsalus & Yanofsky, 1980). Hereafter, the Val58Ile TrpR protein after protease cleavage is referred to as cleaved. Although the initiating fMet residue is mostly absent from native TrpR, by convention the numbering of the TrpR sequence begins with methionine as residue 1, and this convention is followed here for both the cleaved and uncleaved proteins for consistency with prior work. Additionally in this work, non-native residues prior to the original initiating methionine are given negative numbers beginning with -1 for the residue adjacent to the original methionine, as indicated in Fig. 2. The TrpR Val58Ile variant protein, including the N-terminal His6, spacer and protease cleavage sequences, contains 22 or 23 additional N-terminal residues depending on whether the initial fMet residue is processed off, which is unknown. The uncleaved protein is expected to include the initiating fMet residue, as inferred from of similar N-terminally extended proteins (unpublished observations). This Val58Ile TrpR protein with intact N-terminal extension is hereafter referred to as uncleaved.
Val58Ile TrpR both with and without the N-terminal extension, i.e., uncleaved and cleaved Val58Ile TrpRs, produced hexagonal bipyramidal crystals in 1–2 days under conditions identical to those that produced hexagonal bipyramidal crystals of wild-type TrpR in the domain-swapped form (Lawson et al., 2004), and with no detected differences in the crystallization behavior between cleaved and uncleaved proteins or between Val58Ile and wild-type TrpR. Crystals of Val58Ile ds-TrpR varied in size from ∼50 to ∼300 µm in the largest dimension depending on the protein concentration used in crystallization (3–8 mg ml−1), with smaller crystals growing at higher protein concentrations. L-Trp at 10 mM final concentration in cryoprotectant solution was used to soak 1–2 day-old crystals for 30 min.
The structures of free and L-Trp-soaked cleaved Val58Ile ds-TrpR were determined to 2.05 and 2.45 Å resolution, respectively, and refined to an Rwork/Rfree of 26.8/29.9% and 27.8/29.1%, respectively. The structures of cleaved Val58Ile ds-TrpR with and without L-Trp soaking have been deposited in the Protein Data Bank (PDB) as entries 6st6 and 6st7, respectively. Processing and are shown in Table 1, and Fig. 3(a) gives an overview of the structures and their prominent features. The structure of uncleaved TrpR is considered in a separate section below.
|
The final Rfree values of cleaved Val58Ile ds-TrpR models both with and without L-Trp soaking are slightly above the median of 24–26% reported by Shao et al. (2017) in an analysis of recently determined crystal structures at similar resolution. These Rfree values indicate that the structure models are basically correct; the slightly high R-factor values are likely to reflect difficulties in accurately modeling the extensive solvent regions (comprising 75% of the total crystal volume; Matthews coefficient VM = 5.11 Å3 Da−1), as well as the complex solvent composition, which includes water [∼50%(v/v)], iPrOH [∼25%(v/v)] and ethylene glycol [∼25%(v/v)], any of which may be partially ordered at the protein–solvent interface but could not be modeled. As suggested in the original study of wild-type ds-TrpR (Lawson et al., 2004), deviations from an ideal may reflect minor displacements along the highly extended polypeptide network. TLS to account for positional displacements of atoms in the crystal was applied, and resulted in improved agreement with the diffraction data for the L-Trp-soaked crystals. Diffraction data from crystals with and without L-Trp had high Wilson B factors after processing (>50 Å2), and the average atomic B factors (>70 Å2) for the refined structures are higher than expected for their nominal resolutions.
The structures of cleaved Val58Ile ds-TrpR with and without soaked L-Trp have only minor differences in overall structure (r.m.s.d. of 0.56 Å for all non-H protein atoms). The most prominent deviations (0.7–1.2 Å) are at Cα positions close to the L-Trp site and in the C-terminal region (residues 75–102). The Val58Ile substitution has no effect on the overall domain-swapped structure relative to the wild-type ds-TrpR structure (Lawson et al., 2004; PDB entry 1mi7), with an r.m.s.d. over all protein atoms of 0.36 Å. The structures reported here (Table 1) have improved geometry compared with the previously reported ds-TrpR structure but have slightly higher average B factors, whereas the R factors are in a similar range.
3.2. Interactions of the L-Trp ligand
At the expected position of the L-Trp-binding site, the structure of cleaved Val58Ile ds-TrpR without L-Trp soaking shows electron density that could be modeled as one iPrOH and two water molecules positioned to interact with Arg84 and Ser88 (Fig. 3b), similar to the previously reported ligand-free wild-type ds-TrpR (Lawson et al., 2004). Although the density somewhat resembles L-Trp, comparison with the unambiguous electron density for L-Trp obtained after soaking (Fig. 3c), and the orientations of the side chains implicated in L-Trp interactions, indicate that L-Trp is not present in the L-Trp-free form. In the L-Trp-soaked form distinct changes are observed in the side-chain conformations of Arg84, which comes within hydrogen-bonding distance of the L-Trp carboxylate, and of Thr81, which rotates to within hydrogen-bondimg distance of the indole ring N atom. The L-Trp ligand has an average B factor of ∼75 Å2, which is lower than the overall B factor of the structure (96 Å2; Table 1) and is in the range of the surrounding residues (∼70–90 Å2). The modeled iPrOH and two water molecules in the L-Trp site of the L-Trp-free structure have an average B factor of ∼65 Å2, which is slightly lower than the overall B factor of this structure (74 Å2; Table 1) and is in the range of the surrounding residues (∼40–70 Å2). The conclusion that L-Trp is absent from the ligand-binding site of the unsoaked ds-TrpR crystals is consistent with all prior evidence from both X-ray and NMR results on apo TrpR in both native dimeric and ds-TrpR forms (see, for example, Zhang et al., 1987; Hyde et al., 1991; Zhao et al., 1993; Lawson et al., 2004; Carey et al., 2012; Harish et al., 2017). L-Trp has never been observed in crystals or NMR structures of TrpR purified as the apoprotein, and assays of purified apo TrpR always confirm its characteristic DNA-binding affinity (Carey, 1988), affirming the absence of L-Trp.
Each L-Trp ligand in L-Trp-soaked cleaved Val58Ile ds-TrpR crystals has a position and surrounding environment equivalent to each of the two L-Trp ligands in symmetric, dimeric Val58Ile TrpR (PDB entry 1jhg; Lawson, 1996a; Fig. 3d), which in turn are identical to the position and environment of the L-Trp ligands in wild-type dimeric TrpR structures (PDB entries 1wrp and 2oz9; Lawson et al., 1988). In the dimeric structures each of the two equivalent L-Trp-binding sites is located at the interface between the two symmetrically intertwined chains, both of which contribute interactions to each ligand. From one chain the Arg84 guanidino group is positioned to interact with the L-Trp carboxylate, and the Thr81 hydroxyl group is positioned to interact with the L-Trp indole ring N atom; Ser88 has been reported to adopt different conformations and can be a direct or water-mediated hydrogen-bonding partner for the L-Trp α-amino group, and the Arg54 methylene groups pack against one `face' of the L-Trp indole ring. In the second chain of dimeric TrpR the backbone carbonyl O atoms of Leu41 and Leu43 are positioned to interact with the L-Trp α-amino group, with Asn40 as an additional potential hydrogen-bonding partner. All of these potential ligand interactions are maintained in Val58Ile ds-TrpR, except that the origin of Arg54 is a third protein chain rather than the first chain as in dimeric TrpR (Fig. 3e). In the domain-swapped form the long extended helix positions Arg54 away from the ligand-binding site formed by residues of its own chain, bringing it instead into an equivalent position at the ligand-binding site of the next dimer-like node of the extended array. Thus, in Val58Ile ds-TrpR the physiological ligand L-Trp occupies the equivalent binding site and engages in all of the same functional-group interactions as in native dimeric TrpR. This first reported structure of L-Trp bound to any domain-swapped TrpR reveals the remarkable finding that L-Trp binds identically in dimeric and ds-TrpRs despite the differences in the chain origins of the residues in their respective ligand-binding sites.
Given the long-term goal of using ds-TrpRs as crystalline hosts for proteins that may be modified with L-Trp-based labels to orient them in the ligand-binding site, it was of interest to evaluate how the binding site compares in its tolerance towards modified ligands. To this end, a comparison of binding sites was made between Val58Ile ds-TrpR with bound L-Trp and dimeric TrpR with bound 5-methyl L-Trp (5-MT; PDB entry 6f9k). This choice was influenced by the previous proposal, based on the dimeric structure of Val58Ile TrpR (PDB entry 1jhg; Lawson, 1996a), that dimeric TrpR can accommodate an extra methyl group on either the protein or the ligand but not both. The two structures were aligned by overlaying their local helical segments around residue 58, i.e., Arg54–Leu61. Fig. 4 shows that the local alignment is excellent, including the side chains of residues Arg54 and Leu61. The superposition shows that as in dimeric TrpR, Val58Ile ds-TrpR presents a steric clash with bound 5-MT, with a predicted closest approach distance of 2.5 Å.
3.3. Structure of uncleaved Val58Ile ds-TrpR
The structure of uncleaved Val58Ile ds-TrpR solved at 2.45 Å resolution (Table 1) shows electron density equivalent to residues -2 through 106 (22–131 in the numbering of the deposited structure) that could be assigned as in the cleaved protein. Additional electron density observed in the channels could be modeled by only two residues of the N-terminal extension (Gly -2 and Pro -1; Fig. 2). Both the L-Trp-soaked and unsoaked cleaved Val58Ile TrpR proteins contain these two residues after cleavage, but they are not visible in the electron density, with Met1 as the first residue that can be modeled in both cases. In the wild-type ds-TrpR structure previously deposited by Lawson et al. (2004) the first modeled residue is Ala2, although the protein contains fMet as the first due to its incomplete removal in vivo from the overproduced protein (Gunsalus & Yanofsky, 1980). Connected but weak electron density that is unique to the uncleaved protein is detected in the solvent channels originating around Gly -2 (Fig. 5), consistent with the presence of the extension in the channel; this density is absent from the corresponding structure of cleaved ds-TrpR. Thus, whether or not the N-terminal extension is present, the N-terminus points into the channel (Fig. 5), as was observed previously for wild-type ds-TrpR crystals (Lawson et al., 2004; Sprenger et al., 2021). The remainder of the uncleaved Val58Ile ds-TrpR protein structure is essentially identical to that of cleaved Val58Ile ds-TrpR, except for the additional two N-terminal residues. The structure of uncleaved Val58Ile ds-TrpR is not further considered here except to note the relevance of the N-terminus as a modification site for potentially orienting guest proteins, as discussed below.
4. Discussion
Substitution of isoleucine for Val58 does not alter the DNA- or L-Trp-binding activities of dimeric TrpR in vivo or in vitro (Arvidson et al., 1991, 1994), and its overall dimeric structure is the same as that of wild-type TrpR (Lawson, 1996a). The results presented here demonstrate that the structure of domain-swapped Val58Ile TrpR is also essentially identical to that of wild-type ds-TrpR. The results further indicate that the physiological ligand of dimeric TrpR, L-Trp, can occupy the expected ligand-binding site of Val58Ile ds-TrpR even though domain swapping replaces the native, intra-dimeric binding interface of dimeric TrpR with a binding interface created by three protein chains. Because the manner of L-Trp binding is otherwise indistinguishable from that of wild-type dimeric TrpR, and because Val58Ile ds-TrpR crystals are also indistinguishable from those of wild-type ds-TrpR, it is expected that L-Trp also binds in this manner to wild-type ds-TrpR. This conclusion is consistent with the previous report of 5-bromotryptophan incorporation into wild-type ds-TrpR crystals by soaking (Lawson et al., 2004), with the Br atom localized in the L-Trp-binding site.
Like wild-type ds-TrpR crystals, Val58Ile ds-TrpR crystals present solvent channels that form continuous, largely straight, pores throughout the crystal of ∼5 nm in diameter (Fig. 6a). These wide channels presumably facilitate diffusion of the ligands L-Trp or 5-bromotryptophan into the crystals to reach the binding site. The channels are also large enough to admit small proteins (horse heart cytochrome c and human calmodulin; Sprenger et al., 2021). The latter property has inspired the investigation of ds-TrpR as a crystalline host to enable the determination of structures of guest proteins that cannot themselves be crystallized, although to date the trial guest proteins cytochrome c and calmodulin have not been sufficiently ordered within the channels for structure solution. The availability of an apparently functional binding site for the native L-Trp ligand in Val58Ile ds-TrpR crystals, as shown in this work (Figs. 6a–6c) and inferred here for wild-type ds-TrpR, may offer the possibility of designing L-Trp-based or indole-based labels on guests that could lead to improved guest ordering in the channels of ds-TrpRs.
Many L-Trp analogs have been reported to bind to wild-type dimeric TrpR with affinities within approximately tenfold of that of L-Trp (Marmorstein et al., 1987). Many of those analogs present modifications of the α-substituents of L-Trp (3-β-indoleacrylic acid, indole-3-butyric acid, indole-3-acetic acid, indole-3-propionic acid, tryptaphol, tryptamine, indole, N-formyl-L-tryptophan, indoline, L-abrine, 5-methyltryptamine, L-tryptophan hydroxamate, L-indole-3-lactic acid, L-tryptophan methyl ester and L-tryptophanamide), suggesting that this location is suitable for derivatization by a linker moiety for connection to a guest protein. The solvent-accessibility of L-Trp in Val58Ile ds-TrpR is essentially equivalent to that in dimeric wild-type TrpR (Figs. 6c and 6d), suggesting that chemically modified functional groups of L-Trp are likely to be tolerated similarly in dimeric TrpR and ds-TrpR. In addition, the N-terminus of wild-type (Lawson et al., 2004) and Val58Ile ds-TrpR (Fig. 6a) points towards the solvent channels and thus presents an alternative site that may be more tolerant of and/or solvent-accessible for modifications to anchor guest proteins. Indeed, the presence of the apparently disordered N-terminal extension does not reduce the resolution of the overall structure compared with the extension-free structure. The fact that the crystallization of TrpR in a domain-swapped form tolerates the presence of a disordered extension of over 20 residues encourages further work to explore a range of N-terminal extensions to use as bait for guest proteins or as a site for fusion of or proteins that cannot otherwise be crystallized. These and other possibilities for using ds-TrpR as a crystalline host system have been discussed in more detail by Sprenger et al. (2021) and are being explored in ongoing work.
The evident similarity of L-Trp binding to dimeric TrpR and ds-TrpR suggests the possibility of employing dimeric TrpR in large-scale screening to identify L-Trp or indole-based analogs or even unrelated compounds that may be developed into potential labels that can be covalently attached to a guest protein and serve to orient guest proteins in ds-TrpR host crystals by binding at the L-Trp site. Screening of potential binding candidates in solution is not applicable to ds-TrpR because this form of the protein is presumably only highly populated in crystals. Fragment-based screening approaches such as those used in the identification of candidate drug leads (Shuker et al., 1996) typically employ high-throughput analysis of low-molecular-weight compounds (the fragments) in large libraries to identify those with weak binding activity. The fragment method is predicated on the expectation that the chemical coupling of two or more weakly binding fragments improves affinity through non-additive entropic effects (Green, 1966; Tsallis, 2009; Bronowska, 2011). Success in a broad fragment-screening approach using dimeric TrpR appears to have the potential to greatly expand the range of chemical functionalities suitable as labels for guest proteins entrained in ds-TrpR host crystals.
In recent years, high-throughput strategies have also been adapted for fragment screening with crystals, primarily in the context of drug discovery or inhibitor design (Chilingaryan et al., 2012; Patel et al., 2014), enabling direct application to ds-TrpR. Individual compounds from fragment libraries are soaked into crystals before diffraction, and data analysis is performed with efficient pipelines including XchemExplorer (Krojer et al., 2017), with identification of hits using PanDDA (Pearce et al., 2017). These methods can allow screening on the scale of a few hundred candidates. One advantage of the crystal-screening method is that the pose of the soaked compound is resolved, which in the case of ds-TrpR will be important for predicting the disposition of a prospective ligand-labeled guest protein in the channels. This advantage will be critical for the TrpR system in particular because of the surprising finding (Lawson & Sigler, 1988) that indolepropionic acid (IPA) co-crystallizes with dimeric TrpR in the same binding site as L-Trp but in a ∼180° flipped orientation, with the indole ring N atom taking the place of the missing α-amino group. The affinity of dimeric TrpR for IPA determined by equilibrium dialysis in solution is reported to be Kd = 9.74 µM, compared with 14.6 µM for L-Trp (Marmorstein et al., 1987). Given the high resolution of the IPA cocrystals (1.65 Å), this unexpected pose for a ligand whose structure and affinity are highly similar to those of L-Trp raises fundamental questions about the relationship between crystallographic occupancy and ligand binding, with the following practical implications that must be confronted for success in a hostal-type approach, and which are of general relevance for crystallography as well.
Although ligand-binding affinity confirmed in solution implies that crystallographic occupancy may be achievable, the reverse is not necessarily true: the observation of a localized ligand in the phase-separated L-Trp site of the L-Trp-free ds-TrpR structure here, which has a B factor similar to those of its surrounding residues, as does L-Trp in the L-Trp-soaked structure. Binding affinity is a continuous quantity that has no theoretical lower limit among potential ligands of a given target, only the practical limit of detectability. Solvents and solutes can credibly be considered to be ligands whose affinities under equilibrium conditions are immeasurably small, even if their pose and interactions with the target are resolved crystallographically. The overall conclusion is that measurable affinity in solution cannot necessarily be inferred from crystallographic observation of a well ordered ligand. Thus, the ability to use the TrpR dimer for screening of ligand affinity in solution, and ds-TrpR to screen for ligand pose in crystals, will be important complementary approaches to identify candidate labels for a hostal-based structure determination.
does not necessarily predict ligand affinity in solution at equilibrium. In the a ligand (from cocrystallization or soaking) will presumably orient itself to optimize any noncovalent bonding interactions that are possible in its local environment. Thus, localization to a crystallographic `binding' site can occur even if that site is only marginally favored over other sites and binding is immeasurably weak in solution. A clear example of this effect is illustrated by the common observation of components of crystallization solvents in seemingly well localized sites on protein surfaces, as for iPrOH in theFinally, the finding that the Val58Ile variant supports the formation of crystals in the domain-swapped array and has an apparently functional binding site for L-Trp suggests that other TrpR sequence variants may also be useful as crystalline hosts. Domain-swapped TrpR was first identified in a temperature-sensitive mutant, Leu75Phe (Lawson et al., 2004), that also alters the L-Trp-binding site, although its and its ability to bind L-Trp were not examined in the domain-swapped form. Altering the TrpR binding site thus offers another degree of freedom for designing potential labels to accommodate non-L-Trp-based ligands.
Footnotes
‡Current address: Center for Free-Electron Laser Science (CFEL), Deutsches Elektronen Synchrotron (DESY), Notkestrasse 85, 22607, Hamburg, Germany.
Acknowledgements
The authors declare no conflicts of interests. We thank Jens-Christian Navarro Poulsen for support with protein crystallization. The crystallography (MX) experiments to obtain the ligand-free ds-TrpR Val58Ile structure were performed on beamline ID30A-3 at the European Synchrotron Radiation Facility (ESRF), Grenoble, France. We are grateful to Dr Antoine Royant at ESRF for providing assistance in using beamline ID30A-3. We furthermore acknowledge DESY, Hamburg, Germany, a member of the Helmholtz Association HGF, for the provision of experimental facilities. Parts of this research were carried out at PETRA III, and we would like to thank Dr Eva Crosas for assistance in using beamline P11. Additional synchrotron MX data were collected on beamline P13 operated by EMBL Hamburg at the PETRA III storage ring (DESY, Hamburg, Germany). We would like to thank Dr David von Stetten for assistance in using the beamline.
Funding information
This work was supported by funding from the Villum Foundation (Villum Experiment Grant 17535 to LLL and JS), EU ÖKS–Interreg project `ESS & MAX IV: Cross Border Science and Society' to JS, MAX4ESSFUN grant LU001 to JS and Hanseatic League of Science grant UCPH-002 to LLL and JS. Travel to synchrotrons was supported by the Danish Ministry of Higher Education and Science through the Instrument Center DANSCATT (to LLL). JC gratefully acknowledges support from US National Science Foundation awards DBI13-58737 and DBI16-59726.
References
Abe, S. & Ueno, T. (2015). RSC Adv. 5, 21366–21375. Web of Science CrossRef CAS Google Scholar
Afonine, P. V., Grosse-Kunstleve, R. W., Echols, N., Headd, J. J., Moriarty, N. W., Mustyakimov, M., Terwilliger, T. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2012). Acta Cryst. D68, 352–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Arvidson, D. N., Grove Arvidson, C., Lawson, C. L., Miner, J., Adams, C. & Youderian, P. (1994). Nucleic Acids Res. 22, 1821–1829. CrossRef CAS PubMed Google Scholar
Arvidson, D. N., Shapiro, M. & Youderian, P. (1991). Genetics, 128, 29–35. CrossRef CAS PubMed Google Scholar
Bronowska, A. K. (2011). Thermodynamics: Interaction Studies – Solids, Liquids and Gases, edited by J. C. Moreno-Piraján, pp. 1–48. Rijeka: InTechOpen. Google Scholar
Carey, J. (1988). Proc. Natl Acad. Sci. USA, 85, 975–979. CrossRef CAS PubMed Google Scholar
Carey, J., Benoff, B., Harish, B., Yuan, L. & Lawson, C. L. (2012). Protein Sci. 21, 63–74. CrossRef CAS PubMed Google Scholar
Carey, J., Lindman, S., Bauer, M. & Linse, S. (2007). Protein Sci. 16, 2317–2333. Web of Science CrossRef PubMed CAS Google Scholar
Chen, V. B., Arendall, W. B., Headd, J. J., Keedy, D. A., Immormino, R. M., Kapral, G. J., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2010). Acta Cryst. D66, 12–21. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chilingaryan, Z., Yin, Z. & Oakley, A. J. (2012). Int. J. Mol. Sci. 13, 12857–12879. Web of Science CrossRef CAS PubMed Google Scholar
Cordingley, M. G., Register, R. B., Callahan, P. L., Garsky, V. M. & Colonno, R. J. (1989). J. Virol. 63, 5037–5045. CrossRef CAS PubMed Web of Science Google Scholar
Daniels, B. V., Myles, D. A. A., Forsyth, V. T. & Lawson, C. L. (2003). Acta Cryst. D59, 136–138. Web of Science CrossRef CAS IUCr Journals Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Green, N. M. (1966). Biochem. J. 101, 774–780. CrossRef PubMed CAS Google Scholar
Gunsalus, R. P. & Yanofsky, C. (1980). Proc. Natl Acad. Sci. USA, 77, 7117–7121. CrossRef CAS PubMed Google Scholar
Harish, B., Swapna, G. V., Kornhaber, G. J., Montelione, G. T. & Carey, J. (2017). Proteins, 85, 731–740. CrossRef CAS PubMed Google Scholar
Hartje, L. F. & Snow, C. D. (2018). Wiley Interdiscip. Rev. Nanomed. Nanobiotechnol. 11, e1547. Web of Science PubMed Google Scholar
Hyde, E. I., Ramesh, V., Frederick, R. & Roberts, G. C. (1991). Eur. J. Biochem. 201, 569–579. CrossRef PubMed CAS Google Scholar
Jones, R. G., Kahovec, J., Stepto, R., Wilks, E. S., Hess, M., Kitayama, T. & Metanomski, W. V. (2008). Compendium of Polymer Terminology and Nomenclature: IUPAC Recommendations 2008. Cambridge: Royal Society of Chemistry. Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Krojer, T., Talon, R., Pearce, N., Collins, P., Douangamath, A., Brandao-Neto, J., Dias, A., Marsden, B. & von Delft, F. (2017). Acta Cryst. D73, 267–278. Web of Science CrossRef IUCr Journals Google Scholar
Lawson, C. L. (1996a). Nat. Struct. Mol. Biol. 3, 986–987. CrossRef CAS Google Scholar
Lawson, C. L. (1996b). Biological Structure and Dynamics: Proceedings of the Ninth Conversation in the Discipline Biomolecular Stereodynamics, edited by M. H. Sarma & R. H. Sarma, pp. 83–90. New York: Adenine Press. Google Scholar
Lawson, C. L., Benoff, B., Berger, T., Berman, H. M. & Carey, J. (2004). Structure, 12, 1099–1108. Web of Science CrossRef PubMed CAS Google Scholar
Lawson, C. L. & Sigler, P. B. (1988). Nature, 333, 869–871. CrossRef CAS PubMed Google Scholar
Lawson, C. L., Zhang, R. G., Schevitz, R. W., Otwinowski, Z., Joachimiak, A. & Sigler, P. B. (1988). Proteins, 3, 18–31. CrossRef CAS PubMed Google Scholar
Liu, Y. & Eisenberg, D. (2002). Protein Sci. 11, 1285–1299. Web of Science CrossRef PubMed CAS Google Scholar
Marmorstein, R. Q., Joachimiak, A., Sprinzl, M. & Sigler, P. B. (1987). J. Biol. Chem. 262, 4922–4927. CrossRef CAS PubMed Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Otwinowski, Z., Schevitz, R. W., Zhang, R. G., Lawson, C. L., Joachimiak, A., Marmorstein, R. Q., Luisi, B. F. & Sigler, P. B. (1988). Nature, 335, 321–329. CrossRef CAS PubMed Google Scholar
Patel, D., Bauman, J. D. & Arnold, E. (2014). Prog. Biophys. Mol. Biol. 116, 92–100. Web of Science CrossRef CAS PubMed Google Scholar
Pearce, N. M., Krojer, T., Bradley, A. R., Collins, P., Nowak, R. P., Talon, R., Marsden, B. D., Kelm, S., Shi, J., Deane, C. M. & von Delft, F. (2017). Nat. Commun. 8, 15123. Web of Science CrossRef PubMed Google Scholar
Savitsky, P., Bray, J., Cooper, C. D. O., Marsden, B. D., Mahajan, P., Burgess-Brown, N. A. & Gileadi, O. (2010). J. Struct. Biol. 172, 3–13. Web of Science CrossRef CAS PubMed Google Scholar
Schevitz, R. W., Otwinowski, Z., Joachimiak, A., Lawson, C. L. & Sigler, P. B. (1985). Nature, 317, 782–786. CrossRef CAS PubMed Web of Science Google Scholar
Shao, C., Yang, H., Westbrook, J. D., Young, J. Y., Zardecki, C. & Burley, S. (2017). Structure, 25, 458–468. CrossRef CAS PubMed Google Scholar
Shao, X., Hensley, P. & Matthews, C. R. (1997). Biochemistry, 36, 9941–9949. CrossRef CAS PubMed Google Scholar
Shuker, S. B., Hajduk, P. J., Meadows, R. P. & Fesik, S. W. (1996). Science, 274, 1531–1534. CrossRef CAS PubMed Web of Science Google Scholar
Sprenger, J., Carey, J., Schulz, A., Drouard, F., Lawson, C. L., von Wachenfeldt, C., Linse, S. & Lo Leggio, L. (2021). Acta Cryst. D77, 471–485. CrossRef IUCr Journals Google Scholar
Tsallis, C. (2009). Eur. Phys. J. A, 40, 257–266. CrossRef CAS Google Scholar
Vagin, A. & Teplyakov, A. (2010). Acta Cryst. D66, 22–25. Web of Science CrossRef CAS IUCr Journals Google Scholar
Vonrhein, C., Flensburg, C., Keller, P., Sharff, A., Smart, O., Paciorek, W., Womack, T. & Bricogne, G. (2011). Acta Cryst. D67, 293–302. Web of Science CrossRef CAS IUCr Journals Google Scholar
Yan, E.-K., Cao, H.-L., Zhang, C.-Y., Lu, Q.-Q., Ye, Y.-J., He, J., Huang, L.-J. & Yin, D.-C. (2015). RSC Adv. 5, 26163–26174. Web of Science CrossRef CAS Google Scholar
Zhang, R. G., Joachimiak, A., Lawson, C. L., Schevitz, R. W., Otwinowski, Z. & Sigler, P. B. (1987). Nature, 327, 591–597. CrossRef CAS PubMed Google Scholar
Zhao, D., Arrowsmith, C. H., Jia, X. & Jardetzky, O. (1993). J. Mol. Biol. 229, 735–746. CrossRef CAS PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.