research papers
Structures of permuted halves of a modern ribose-binding protein
aDepartment of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
*Correspondence e-mail: birte.hoecker@uni-bayreuth.de
Periplasmic binding proteins (PBPs) are a class of proteins that participate in the cellular transport of various ligands. They have been used as model systems to study mechanisms in protein evolution, such as duplication, recombination and domain swapping. It has been suggested that PBPs evolved from precursors half their size. Here, the crystal structures of two permuted halves of a modern ribose-binding protein (RBP) from Thermotoga maritima are reported. The overexpressed proteins are well folded and show a monomer–dimer equilibrium in solution. Their crystal structures show partially noncanonical PBP-like fold type I conformations with structural deviations from modern RBPs. One of the half variants forms a dimer via segment swapping, suggesting a high degree of malleability. The structural findings on these permuted halves support the evolutionary hypothesis that PBPs arose via a duplication event of a flavodoxin-like protein and further support a domain-swapping step that might have occurred during the evolution of the PBP-like fold, a process that is necessary to generate the characteristic motion of PBPs essential to perform their functions.
Keywords: periplasmic binding proteins; ribose binding protein; Thermotoga maritima; flavodoxin-like fold; circular permutation; domain swapping; protein evolution.
PDB references: RBP-CPC, 7qsp; RBP-CPN, 7qsq
1. Introduction
Understanding the emergence of modern protein structures can be addressed by investigating the mechanisms that evolution might have employed. Some of the drivers for structural diversification are genetic mechanisms, such as mutation, duplication and recombination of domain-sized or even subdomain-sized protein fragments, offering the structural complexity needed for functions to evolve (Romero-Romero et al., 2021; Sikosek & Chan, 2014; Höcker, 2014; Ohta, 2000). Another mechanism expanding this repertoire is domain swapping. While domain swapping does not lead to a change in protein sequence, its influence on the structure by forming oligomers via exchange of structural elements within the topology of a protein also contributes to the emergence of functions (Bennett et al., 1995). Insights into these characteristics can shed light not only on the evolutionary history of proteins but also on our understanding of the determinants of protein folding in general.
One group of proteins that have been used for this purpose are periplasmic binding proteins (PBPs). They are involved in the cellular transport of a wide variety of small molecules such as et al., 2021; Felder et al., 1999). The structurally symmetric bilobal architecture of their fold has long been thought to originate from a duplication and fusion event of an individual lobe (Fukami-Kobayashi et al., 1999; Louie, 1993). While more detailed classifications of their fold exist (Scheepers et al., 2016), they can be structurally separated into PBP-like fold types I and II, with somewhat different arrangements of secondary-structure elements. It has been proposed that type II PBPs derive from a tandem domain swap of type I PBPs, leading to exchange of the (βα)5 elements between the lobes (Fukami-Kobayashi et al., 1999). Similar domain dislocation has previously been described in related protein folds such as the chemotaxis response regulator CheY (Paithankar et al., 2019), the receiver domain of cytokinin receptor CRE1 (Tran et al., 2021), the tryptophan synthase subunit TrpA (Michalska et al., 2020) and the uroporphyrinogen III synthase (Toledo-Patiño et al., 2019; Szilágyi et al., 2017).
amino acids, vitamins and ions (ChandravanshiTo investigate the structural flexibility of the α/β architecture found in type I PBPs, we separated and investigated the individual lobes of the ribose-binding protein from Thermotoga maritima (RBP; Cuneo et al., 2008). An established way to stabilize and isolate structural units within a given protein fold is the use of circular permutations (Huang, Nayak et al., 2011; Iwakura et al., 2000; Hennecke et al., 1999). Following this approach, two protein variants that structurally represent each lobe of RBP were created and characterized (Fig. 1). We successfully obtained crystal structures of both the N-terminal lobe (RBP-CPN) and the C-terminal lobe (RBP-CPC), observing a non-native swapping of elements in RBP-CPN. Our experiments also indicate dimerization of this lobe in solution, with the showing a rearrangement reminiscent of the antiparallel β-sheet observed in type II PBPs. The observed structural malleability and the propensity to rearrange secondary-structural elements furthermore suggest a possible mechanism for transition from the type I PBP-like fold to type II via domain dislocation.
2. Materials and methods
2.1. Construct designs with Rosetta
The RosettaRemodel protocol included in the Rosetta suite (release 2018.19; Huang, Ban et al., 2011) was used to sample possible loop conformations to connect the secondary-structure elements of the RBP lobes, leading to both the RBP-CPN and RBP-CPC sequences. The unliganded structure of T. maritima RBP (PDB entry 2fn9; Cuneo et al., 2008), trimmed to include only the residues of the respective lobe, was used as a template. The new termini for the permuted constructs were introduced at positions 1 and 263 for RBP-CPN, with a loop inserted between positions 105 and 244 (strand E and helix 9; Fig. 1a). For RBP-CPC the N-terminus was shifted to residue 128, and a loop was inserted to connect residue 243 to the new C-terminal stretch from 106 to 127 (strand D and helix 4; Fig. 1a). Flexibility of the input model was allowed for one additional residue on each side of the gap during loop closure. 1000 models of three- and four-residue loops were generated using parallelized processing with Open MPI and procedural seed generation. The top ten scoring models were relaxed using the relax algorithm provided in this version of Rosetta, and the total and per-residue scoring functions were used. The sequences of the best scoring models for both RBP-CPN and RBP-CPC were used as final constructs (Table 1). The per-residue energies of the relaxed models were compared with the unrelaxed of RBP and the obtained crystal structures of RBP-CPN and RBP-CPC using the score_jd2 application in the same version of Rosetta.
|
2.2. Cloning and protein purification
The gene fragments for full-length RBP as well as RBP-CPC were subcloned into empty linearized pET-21b(+) using NdeI/XhoI restriction sites. To prevent translation of the truncated sequence in wild-type RBP, an M142A mutation (Cuneo et al., 2008) was introduced via QuikChange site-directed mutagenesis. The resulting plasmids were verified by sequencing. Gene synthesis and cloning into pET-21b(+) for RBP-CPN were provided by Biocat. Transformant Escherichia coli BL21 (DE3) cells were grown in Terrific broth medium (TB) at 37°C to an OD600 of 1.2 in the presence of 100 µg ml−1 ampicillin. Protein expression was induced by the addition of 1 mM isopropyl β-D-1-thiogalactopyranoside and continued for 18 h at 20°C. The cells were harvested by centrifugation (5000g, 15 min), resuspended and lysed by sonication. To remove cell debris, the suspension was centrifuged again (40 000g, 1 h) and the supernatant was filtered through a 0.22 µm filter prior to immobilized metal ion (IMAC).
IMAC was performed on a Cytiva HisTrap 5 ml column previously equilibrated with buffer (20 mM MOPS, 500 mM NaCl, 10 mM imidazole pH 7.8). Elution was performed with a 40% step of elution buffer (20 mM MOPS, 500 mM NaCl, 600 mM imidazole pH 7.8). Fractions containing the protein of interest were pooled and concentrated for the step. was performed on a Cytiva Superdex 26/600 75 pg column with isocratic elution of buffer (20 mM Tris–HCl, 300 mM NaCl pH 7.8). Fractions containing protein were analyzed by SDS–PAGE and those containing the proteins of interest were pooled, flash-frozen in liquid nitrogen and stored at −20°C until further analysis.
2.3. Crystallization
Initial crystallization screens were set up using a Phoenix pipetting robot (Art Robbins Instruments) with commercially available sparse-matrix screens (Qiagen; JCSG Core I–IV Suites and The PEGs Suite and PEGs Suite II) in 96-well sitting-drop plates (3-drop Intelli-Plates, Art Robbins Instruments). Droplets were pipetted in 1:1, 1:2 and 2:1 ratios of protein:reservoir solution with a protein concentration of 30 mg ml−1 and were incubated at 293 K. Initial crystals of RBP-CPN appeared after 35 days in the following condition: 30% PEG 4000, 0.2 M lithium sulfate, 0.1 M Tris–HCl pH 8.5 (JCSG Core IV Suite) in the 1:1 ratio droplet. Subsequent optimization with Additive Screen (Hampton Research) yielded well diffracting cuboid-shaped crystals in the presence of the abovementioned initial hit solution supplemented with 4% 2,2,2-trifluoroethanol. Further cryoprotection was not needed.
RBP-CPC was crystallized in the same fashion with a protein concentration of 15 mg ml−1. Diffracting cuboid-shaped crystals were found after one month in 0.2 M magnesium acetate, 20% PEG 3350 (The PEGs Suite) in the 1:2 ratio droplet. Cryoprotection was ensured by transferring the crystal to 20% PEG 3000, 20% ethylene glycol, 0.2 M KNO3.
2.4. X-ray data collection, and model building
Crystals were manually mounted using cryo-loops on SPINE standard bases and were flash-cooled after cryoprotection if needed. Diffraction data were collected on BL14.1 at the BESSY II electron-storage ring operated by the Helmholtz-Zentrum Berlin (Mueller et al., 2015). Measurements were performed at 100 K in single-wavelength mode at 0.9184 Å with a Dectris PILATUS 6M detector in fine-slicing mode (0.1° wedges) using the MXCuBE beamline-control software (Gabadinho et al., 2010). Data were processed with XDSAPP2 (Sparta et al., 2016) employing XDS (Kabsch, 2010). Data quality was assessed by applying phenix.xtriage (Zwart et al., 2005). Resolution cutoffs were determined by applying the automated paired protocol PAIREF (Malý et al., 2020).
In both cases, phases were solved by 2fn9) as a search model with Phaser (McCoy et al., 2007). The resulting models were manually rebuilt with Coot (Emsley et al., 2010) and refined with phenix.refine (Afonine et al., 2018) in an iterative manner. Coordinates and structure factors were validated and deposited in the PDB (Berman et al., 2002) with accession codes 7qsq (RBP-CPN) and 7qsp (RBP-CPC).
using the respective lobe of RBP (PDB entry2.5. Far-UV circular dichroism
Far-UV M sodium phosphate pH 7.8, 50 mM sodium chloride. Samples were measured at a protein concentration of 10 µM in a 2 mm cuvette in a wavelength range from 195 to 260 nm with a bandwidth of 1 nm. After subtraction of the buffer signal, the measured ellipticity signal was converted to mean residue molar ellipticity ([Θ]) using [Θ] = Θ/(lCNr), where Θ is the ellipticity signal in millidegrees, l is the cell path in millimetres, C is the molar protein concentration and Nr is the number of amino acids per protein (Greenfield, 2006).
(CD) was measured on a Jasco J-710 spectropolarimeter equipped with a Peltier device (PTC-348 WI) to control the temperature at 20°C. Before the measurements, the protein samples were dialyzed overnight into 10 m2.6. Intrinsic fluorescence
Intrinsic fluorescence (IF) spectra were collected on a Jasco FP-6500 spectrofluorometer. Measurements were performed at 20°C controlled with a water bath (Julabo MB). Samples were dialyzed and the concentration was set as described previously for CD measurements. The excitation wavelength was set to 280 nm and emission was measured in the range 300–500 nm with a bandwidth of 1 nm. The raw signal was corrected for protein concentration and further normalized to relative fluorescence.
2.7. Size-exclusion chromatography–multi-angle light scattering
Size-exclusion chromatography–multi-angle M sodium phosphate pH 7.8, 50 mM sodium chloride, 0.02% sodium azide at room temperature. Measurements were run at a constant flow rate of 0.8 ml min−1 at protein concentrations of 0.5, 1.0 and 5 mg ml−1. The system setup was normalized and checked by measurement of a commercially available standardized BSA sample (2 mg ml−1; Pierce, catalogue No. 23209) before and after each series of measurements. Weight-averaged molar-mass determination was performed using the Zimm equation with the differential refractive-index signal as a source for the concentration calculations (the refractive-index increment dn/dc was set to 0.185). Analysis of the experiments was performed using the ASTRA version 7.3.2 software suite (Wyatt Technology).
(SEC-MALS) measurements were performed with a miniDAWN detector and an Optilab refractometer (Wyatt Technology) coupled to an analytical column (Superdex 75 Increase 10/300 GL). Centrifuged samples were run on the column connected to an ÄKTApure FPLC system (GE Healthcare Life Sciences) and equilibrated with 10 m2.8. Differential scanning calorimetry
−1, a temperature range of 10–130°C and a scan rate of 1.5°C min−1. All samples were prepared after exhaustive dialysis in 10 mM sodium phosphate pH 7.8, 50 mM sodium chloride. After proper instrument equilibration with at least two buffer–buffer scans, physical and chemical baselines were subtracted from protein–buffer scans and the data were normalized by protein concentration. Origin version 9.0 (OriginLab Corporation) was used for data analysis.
(DSC) endotherms were collected using a MicroCal PEAQ-DSC instrument (Malvern Panalytical) with protein concentrations of 0.5, 1.0 and 5 mg ml3. Results and discussion
3.1. Design of RBP-CPN and RBP-CPC
To assess how the individual lobes of a PBP-like fold behave, we chose the ribose-binding protein from T. maritima (RBP). Due to its thermophilic nature, it was considered to be a robust model system that could more readily tolerate this manipulation. In addition, it has previously been reported that this protein is expressed as a 21 kDa truncation (Cuneo et al., 2008), suggesting that at least some elements of this protein may exist in isolation. To isolate the two lobes of RBP, the elements that make up the individual two halves were linked together via an artificial loop (Table 1). The resulting constructs RBP-CPN (N-terminal lobe) and RBP-CPC (C-terminal lobe) represent the two symmetric lobes of the PBP-like fold (Figs. 1a and 1b). The specific intersections were determined by structural alignment of the of RBP from T. maritima in the absence of its ligand ribose (PDB entry 2fn9). RBP-CPN was designed to consist only of the βA–Eα1–4 elements, which are directly linked to α9. Similarly, RBP-CPC consists of the elements βF–Jα5–8 connected to α4 of RBP by permutation (Fig. 1a). To be consistent with the structure of the theoretical evolutionary precursor before duplication, the additional secondary-structural elements at the C-terminus of RBP (βK–L) responsible for the second crossover between the two lobes were removed.
We obtained computational models of each lobe with comparable total and per-residue energies to the trimmed input structures of full-length RBP. Comparison of the scores obtained from the Rosetta energy function of native RBP and the models show similar energies for all structures (Figs. 2a and 2b). The similarity of the per-residue energy of RBP to the corresponding values for the models indicates that at least energetically, the added loop residues are suitable. The per-residue energies further show a similar distribution. For most of the sequence of RBP-CPN, the residue energies of the are comparable to those of the model. Only the residues of the inserted loop (blue bracket in Fig. 2a) score lower in the compared to the computational model. However, the entire stretch after the inserted residues displays a higher energy (in Rosetta energy units; REU) than in the model. This is similarly reflected in both the structural rearrangement of the secondary-structure elements (Figs. 1d and 2c) and the per-residue r.m.s.d. in RBP-CPN (Fig. 2e). The observation is consistent with the dimerization interface being facilitated via swapping of the α4 element and disruption of the expected conformation at the C-terminus. While the deviation in r.m.s.d. for RBP-CPN would imply a disturbance of per-residue energies in the C-terminal stretch (Fig. 2e), the segment swap seems to compensate for it in canonical topology.
In contrast, a comparison of the scores of the RBP-CPC model and its resulting shows similar energies for all resolved residues (Fig. 2b). The per-residue energies of the designed loop are also comparable, even though their conformation in the crystal differs significantly from the model (yellow bracket in Fig. 2b). Apart from the residues around the stretch of missing density (Asp96–Met116), the predicted structure corresponds well to the obtained (Fig. 2d) and the per-residue r.m.s.d. values also indicate good agreement (Fig. 2f).
3.2. Both lobes are stable proteins with a tendency to form dimers
RBP-CPN and RBP-CPC could be expressed recombinantly in high yields in E. coli and purified to Far-UV CD spectra of both RBP-CPN and RBP-CPC show typical characteristics of a protein with an α/β-like structure and are comparable to that of full-length RBP (Fig. 3a). In addition, an initial hint about the correct formation of the in solution was obtained from the intrinsic fluorescence spectra. The emission maximum at 335 nm for both proteins as well as for RBP indicates that the aromatic residues are in a hydrophobic core and are buried from solvent, confirming that all proteins adopt a comparable compact structure (Fig. 3b). Another indication that the constructs appear to fold stably is the determination of thermal stability by (DSC). The DSC endotherms obtained for both RBP-CPN and RBP-CPC show a single and highly (Fig. 3c). The thermal unfolding appears to be irreversible, as no transition is observed upon cooling and the measurement of a second heating cycle. The permuted constructs show a lower thermostability than full-length RBP, with Tm values of 76.1 ± 0.4°C for RBP-CPC and 97.9 ± 0.9°C for RBP-CPN, in contrast to 108°C for RBP (Cuneo et al., 2008). There also appears to be a small dependence on protein concentration, with a shift to higher transition temperatures at higher protein concentrations (Fig. 3c).
Since the architecture of PBPs is likely to have originated from an ancestral dimer with the canonical binding site between the lobes, the question arises whether both variants can adopt a similar conformation. To investigate this, the oligomeric state of the proteins was determined in solution using SEC-MALS measurements (Fig. 3d). In the concentration range 0.5–5 mg ml−1, the determined molecular weight (MW) of RBP-CPN is approximately 27.5 kDa. This corresponds to a dimeric conformation, as it is about double the expected monomeric MW of 14.9 kDa. The shift from lower molecular weight at lower concentrations to higher molecular weight at higher concentrations indicates that the monomer–dimer equilibrium is dynamic and concentration-dependent. A similar pattern is observed for RBP-CPC. While the protein appears to be monomeric at low concentrations (0.5 mg ml−1), the MW shifts to 18.7 kDa at 1 mg ml−1 and to 22.4 kDa at 5 mg ml−1. This would correspond to a dynamic shift from a monomer (theoretical MW of 16.7 kDa) to a dimer (Table 2). These results are in agreement with the concentration-dependent thermostability observed in DSC measurements. Together, they explain the shift to higher temperatures during thermal unfolding, with possible stabilization of the overall fold by forming a defined dimer interface.
|
3.3. The structures of both RBP-CPN and RBP-CPC differ from their native counterparts
The PBP-like type I canonical fold consists of two lobes with a continuous, parallel β-sheet with five strands in the order 21345 plus an additional, noncontinuous β6 strand flanked by alternating α-helices on each side and one crossover between each lobe (Figs. 1a and 1b). In contrast to the expected single-lobed architecture, the crystal structures obtained for RBP-CPN and RBP-CPC deviate from the structure of full-length RBP.
RBP-CPC crystallized in the orthorhombic P212121, with two chains of the protein in the and was refined to a resolution of 1.36 Å (Table 3). While the N-terminal (αβ)4 elements in both chains are nearly identical to the core of the corresponding part in full-length RBP, the remaining elements differ from the canonical topology (Figs. 1b and 1c). While the core structure of α5–7 and βF–I in RBP-CPC is comparable to that of RBP, the following βJ strand and the synthetic loop are not resolved in the (Fig. 4a). However, the connecting α8 helix on the other side of this gap in the structure can unambiguously be seen (Fig. 1c). It remains unclear whether the inserted loop or the energetical frustration of missing elements on this terminal side of the protein interferes with the proper formation of βJ, or whether a preferential but unobserved swap of elements with an adjacent protein molecule results in the lack of density in this protein region (Fig. 4e). An alternative explanation could be the formation of an interface between two crystallographic dimers, as indicated by an analysis with the PISA server (Krissinel & Henrick, 2007). In this case, the C-terminal α8 would not originate from the same chain of the but from its corresponding symmetry mate. The resulting extended arrangement is facilitated by an interaction of the βI strand and the residue stretch 116′–120′ (Fig. 5a). This extension is similar to a continuation of the sheet via the antiparallel addition of a short, single stretch resembling a strand, with the residues of the designed loop (Val117–His121) participating in the interaction (Fig. 1c). With the α4 helix originating from the adjacent symmetry mate, it is also possible that there is a mixed population of both conformations, with the helix serving as a common structural anchor point. This could also explain the lack of density in the connecting area. A similar shuffling of elements can be observed with less ambiguity in the of RBP-CPN (Fig. 5b). This possible interaction could also explain the concentration-dependent observed in the SEC-MALS measurements (Fig. 3d). The central β-sheet as well as all α-helices appear to be well ordered, except for the loops close to the unresolved region and the termini. The r.m.s.d. of 0.5 Å over 135 Cα atoms of the resolved residues, however, indicates a high similarity between RBP-CPC and the corresponding elements of full-length RBP (Fig. 4c).
|
The case is different when looking at the N-terminal lobe. The N was solved in the monoclinic P21 at 1.79 Å resolution. The is composed of four chains, of which two pairs form a dimer via a segment swap. Unlike the interface of the two lobes in native PBPs, the dimer is located on the edge of the two central β-sheets (Fig. 4b). This extension of the sheet is mediated via each of the respective βE strands. In contrast to the rest of the central β-sheet, the two βE strands form an antiparallel stretch of the extended β-sheet. This change in direction of the C-terminal β-strand is not known to occur in PBP-like fold type I proteins, in which the central β-sheet always adopts a parallel conformation. In addition, this swap of the βDβE elements in their parallel–antiparallel arrangement forms the interface of the dimer (Fig. 1d). These structural rearrangements are also reflected by the significant difference in r.m.s.d. of 5.9 Å when comparing the structure of RBP-CPN with the equivalent half of the full-length RBP (Fig. 4d). This unusual rearrangement of elements indicates a high tolerance of this structural motif to variations in its topology. In agreement with other structures, such as the CheY-like fold (Paithankar et al., 2019), the TIM-barrel fold (Michalska et al., 2020) and other related folds (Lewis et al., 2000; Tran et al., 2021; Szilágyi et al., 2017), the isolated domains of a PBP-like type I protein show a high degree of malleability.
of RBP-CP4. Conclusions
The obtained crystal structures of the permuted constructs of both the N- and C-terminal lobes of RBP from T. maritima suggest the possibility that they could have existed in isolation of the full structural context. This corresponds to the idea that modern PBPs arose from a duplication event. Based on structural and sequence similarities, it has been proposed that this progenitor was an ancestral protein of the flavodoxin-like fold. The existence of the stable permuted halves clearly shows that the single lobe can exist on its own and can help inform on this evolutionary process.
However, the observed swapping of elements in RBP-CPN could also correspond to another event in the evolution of PBPs. It has previously been concluded that the evolution of the PBP-like fold involved domain swapping of the C-terminal helices, a step that was necessary to generate the characteristic hinge-bending motion of PBPs, with subsequent fusion of this proposed ancestral dimer (Fukami-Kobayashi et al., 1999). In addition, it has been proposed that the absence of the helix between β-strands D and E and helix 8 (Fig. 1b) may have been a necessary step for the swapping event that led to PBPs with the type II fold. This partially explains why we observe a dimer with an unusual segment swap in RBP-CPN, which lacks this helix. However, it appears that RBP-CPC, which still contains this corresponding helix 8, does not reliably form a dimer. However, the alternative interface involving the chain from a symmetry mate could partially explain the behavior observed in SEC-MALS measurements. The dynamic shift to higher molecular weight species can only be observed at high protein concentrations. Interestingly, however, the antiparallel stretch of residues 117′–119′ in RBP-CPC bears a resemblance to the continuation of the central β-sheet in RBP-CPN. The residues participating in the interaction with β4 are the additional residues introduced via the design. A reason for this could be the energetically frustrated surface of β4, which now lacks the corresponding β5 from RBP, that induces the switch of the designed loop into a more strand-like conformation to satisfy this hydrophobic surface.
Alternatively, a possible explanation may lie in the folding pathway of proteins with a flavodoxin-like fold. The folding mechanism of CheY, a well studied protein with a flavodoxin-like fold, suggests that there may be a universal subdomain intermediate in the folding pathway (Hills & Brooks, 2008). The N-terminal β1–3α1–2 elements appear to initially form a central triad followed by folding of the remaining elements. The permuted RBP lobes could follow a similar path. The corresponding elements could form a folded scaffold onto which the rest of the protein folds. This potentially stabilizes the protein to a point where the C-terminal elements can still adapt a structured conformation but provide sufficient flexibility for the unusual rearrangement that we have found.
The novel antiparallel stretch of the dimer-swapped β-sheets has not been observed before in proteins with the type I PBP-like fold, and the existence of this swap highlights the flexibility of this structural element. Additionally, the alleviation of the energetically frustrated hydrophobic surface achieved via the alternative interface in the structure of RBP-CPC could offer valuable insights into the mechanisms behind domain swapping in PBPs in general. More detailed sequence analysis and experiments would be required to obtain a clear picture of the transition from type I to type II PBPs. The malleability of this α/β architecture, which is also apparent in other folds (for example the Rossmann, flavodoxin and TIM-barrel-like folds), may be a reason for its frequent occurrence in modern proteins (Ferruz et al., 2021).
Acknowledgements
We acknowledge the allocation of synchrotron beamtime and financial support by HZB and thank the beamline staff at BESSY for support. The authors declare that they have no known competing financial interests or personal relationships that could have influenced the work reported in this paper. We thank all members of the Höcker Laboratory for their constructive suggestions to improve the research. Open access funding enabled and organized by Projekt DEAL.
Funding information
This work was supported by the European Research Council (ERC Consolidator Grant 647548 `Protein Lego' to BH), the VolkswagenStiftung (grant 94747 to BH) and by a fellowship from the Alexander von Humboldt and Bayer Science and Education Foundations (Humboldt–Bayer Research Fellowship for Postdoctoral Researchers to SRR).
References
Afonine, P. V., Poon, B. K., Read, R. J., Sobolev, O. V., Terwilliger, T. C., Urzhumtsev, A. & Adams, P. D. (2018). Acta Cryst. D74, 531–544. Web of Science CrossRef IUCr Journals Google Scholar
Bennett, M. J., Schlunegger, M. P. & Eisenberg, D. (1995). Protein Sci. 4, 2455–2468. CrossRef CAS PubMed Web of Science Google Scholar
Berman, H. M., Battistuz, T., Bhat, T. N., Bluhm, W. F., Bourne, P. E., Burkhardt, K., Feng, Z., Gilliland, G. L., Iype, L., Jain, S., Fagan, P., Marvin, J., Padilla, D., Ravichandran, V., Schneider, B., Thanki, N., Weissig, H., Westbrook, J. D. & Zardecki, C. (2002). Acta Cryst. D58, 899–907. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chandravanshi, M., Tripathi, S. K. & Kanaujia, S. P. (2021). FEBS Lett. 595, 2395–2409. CrossRef PubMed Google Scholar
Cuneo, M. J., Beese, L. S. & Hellinga, H. W. (2008). BMC Struct. Biol. 8, 50. Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Felder, C. B., Graul, R. C., Lee, A. Y., Merkle, H. P. & Sadee, W. (1999). AAPS PharmSci, 1, E2. Google Scholar
Ferruz, N., Michel, F., Lobos, F., Schmidt, S. & Höcker, B. (2021). Front. Mol. Biosci. 8, 715972. CrossRef PubMed Google Scholar
Fukami-Kobayashi, K., Tateno, Y. & Nishikawa, K. (1999). J. Mol. Biol. 286, 279–290. Web of Science CAS PubMed Google Scholar
Gabadinho, J., Beteva, A., Guijarro, M., Rey-Bakaikoa, V., Spruce, D., Bowler, M. W., Brockhauser, S., Flot, D., Gordon, E. J., Hall, D. R., Lavault, B., McCarthy, A. A., McCarthy, J., Mitchell, E., Monaco, S., Mueller-Dieckmann, C., Nurizzo, D., Ravelli, R. B. G., Thibault, X., Walsh, M. A., Leonard, G. A. & McSweeney, S. M. (2010). J. Synchrotron Rad. 17, 700–707. Web of Science CrossRef CAS IUCr Journals Google Scholar
Greenfield, N. J. (2006). Nat. Protoc. 1, 2876–2890. CrossRef PubMed Google Scholar
Hennecke, J., Sebbel, P. & Glockshuber, R. (1999). J. Mol. Biol. 286, 1197–1215. Web of Science CrossRef PubMed CAS Google Scholar
Hills, R. D. Jr & Brooks, C. L. III (2008). J. Mol. Biol. 382, 485–495. CrossRef PubMed Google Scholar
Höcker, B. (2014). Curr. Opin. Struct. Biol. 27, 56–62. Web of Science PubMed Google Scholar
Huang, P.-S., Ban, Y.-E. A., Richter, F., Andre, I., Vernon, R., Schief, W. R. & Baker, D. (2011). PLoS One, 6, e24109. CrossRef PubMed Google Scholar
Huang, Y.-M., Nayak, S. & Bystroff, C. (2011). Protein Sci. 20, 1775–1780. CrossRef PubMed Google Scholar
Iwakura, M., Nakamura, T., Yamane, C. & Maki, K. (2000). Nat. Struct. Biol. 7, 580–585. CrossRef PubMed Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Krissinel, E. & Henrick, K. (2007). J. Mol. Biol. 372, 774–797. Web of Science CrossRef PubMed CAS Google Scholar
Laskowski, R. A., Jabłońska, J., Pravda, L., Vařeková, R. S. & Thornton, J. M. (2018). Protein Sci. 27, 129–134. Web of Science CrossRef CAS PubMed Google Scholar
Lewis, R. J., Muchová, K., Brannigan, J. A., Barák, I., Leonard, G. & Wilkinson, A. J. (2000). J. Mol. Biol. 297, 757–770. Web of Science CrossRef PubMed CAS Google Scholar
Louie, G. V. (1993). Curr. Opin. Struct. Biol. 3, 401–408. CrossRef CAS Web of Science Google Scholar
Malý, M., Diederichs, K., Dohnálek, J. & Kolenko, P. (2020). IUCrJ, 7, 681–692. Web of Science CrossRef PubMed IUCr Journals Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
Michalska, K., Kowiel, M., Bigelow, L., Endres, M., Gilski, M., Jaskolski, M. & Joachimiak, A. (2020). Acta Cryst. D76, 166–175. Web of Science CrossRef IUCr Journals Google Scholar
Mueller, U., Förster, R., Hellmig, M., Huschmann, F. U., Kastner, A., Malecki, P., Pühringer, S., Röwer, M., Sparta, K., Steffien, M., Ühlein, M., Wilk, P. & Weiss, M. S. (2015). Eur. Phys. J. Plus, 130, 141–151. CrossRef Google Scholar
Ohta, T. (2000). Philos. Trans. R. Soc. London B, 355, 1623–1626. CrossRef Google Scholar
Paithankar, K. S., Enderle, M., Wirthensohn, D. C., Miller, A., Schlesner, M., Pfeiffer, F., Rittner, A., Grininger, M. & Oesterhelt, D. (2019). Acta Cryst. F75, 576–585. Web of Science CrossRef IUCr Journals Google Scholar
Romero-Romero, S., Kordes, S., Michel, F. & Höcker, B. (2021). Curr. Opin. Struct. Biol. 68, 94–104. PubMed Google Scholar
Scheepers, G. H., Lycklama a Nijeholt, J. A. & Poolman, B. (2016). FEBS Lett. 590, 4393–4401. Google Scholar
Sikosek, T. & Chan, H. S. (2014). J. R. Soc. Interface, 11, 20140419. CrossRef PubMed Google Scholar
Sparta, K. M., Krug, M., Heinemann, U., Mueller, U. & Weiss, M. S. (2016). J. Appl. Cryst. 49, 1085–1092. Web of Science CrossRef CAS IUCr Journals Google Scholar
Szilágyi, A., Györffy, D. & Závodszky, P. (2017). Proteins, 85, 46–53. PubMed Google Scholar
Toledo-Patiño, S., Chaubey, M., Coles, M. & Höcker, B. (2019). Biochemistry, 58, 4790–4793. PubMed Google Scholar
Tran, L. H., Urbanowicz, A., Jasiński, M., Jaskolski, M. & Ruszkowski, M. (2021). Front. Plant Sci. 12, 756341. CrossRef PubMed Google Scholar
Zwart, P. H., Grosse-Kunstleve, R. W. & Adams, P. D. (2005). CCP4 Newsl. Protein Crystallogr. 43, contribution 7. Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.