Engineering glycoside hydrolase stability by the introduction of zinc binding

The engineering of metal binding into a cellulase increases its temperature stability while maintaining its other catalytic properties.


Introduction
Concerns about climate, increasing energy demands and the limited long-term supply of fossil fuels, as well as ethical considerations, have promoted research on more sustainable lignocellulosic 'second-generation' biofuels. They rely on the deconstruction of cellulose and hemicellulose from feedstocks such as switchgrass into monosaccharides. One step in this process, saccharification, employs mixtures of thermally and chemically robust glycoside hydrolases (GHs). These mixtures typically make use of the combined activities of endoglucanases (EC 3.2.1.4), -glucosidases (EC 3.2.1.21) and cellobiohydrolases (EC 3.2.1.91 and EC 3.2.1.176; CBHs). The Carbohydrate-Active enZymes database (CAZy; http:// www.cazy.org) currently lists 153 different GH families based on amino-acid sequence similarities. This classification facilitates the grouping of GHs according to structural, mechanistic and evolutionary aspects independent of substrate specificities.
J30, an endoglucanase with activity against p-nitrophenyl cellobiose (pNPC), which produces cellobiose as its primary product, was identified and analyzed from metagenomic data originating from a switchgrass-adapted microbial community (Allgaier et al., 2010;D'haeseleer et al., 2013;Gladden et al., 2014). Its likely source is the moderately thermophilic bacterium Thermobacillus composti KWC4, and its catalytic optimum is 50 C. It is annotated as a GH family 9 member consisting of an N-terminal Ig-like domain and a C-terminal catalytic domain. Initial annotations, based on dbCAN, suggested the presence of a carbohydrate-binding module (CBM; for example, a CBM family 30 motif) within the Ig-like domain. However, the more recent dbCAN2 meta server (http://cys.bios.niu.edu/dbCAN2) does not identify a CBM. With 54% sequence identity, an uncharacterized family 9 GH from T. composti is the closest homolog of J30. Of those with solved structures, the leaf-branch compost CelG (LC-CelG; PDB entry 3x17; Okano et al., 2015) is most similar with 41% identity, followed by Cel9A from Alicyclobacillus acidocaldarius (AaCel9A; PDB entry 3ez8; Pereira et al., 2009) and Cel9A from Clostridium thermocellum (CtCel9A; PDB entry 1clc; M. B. Lascombe, H. Souchon, M. Juy & P. M. Alzari, unpublished work) with 32 and 31% identity, respectively. Among the less similar family members found in the Protein Data Bank (Berman et al., 2000) are crystal structures without an Ig-like domain and those carrying a C-terminally adjacent family 3 CBM, as summarized in greater detail elsewhere (Okano et al., 2015).
We have solved the structure of J30 to 1.7 Å resolution. The enzyme represents one of only three structures of a GH family 9 member free of structural Ca 2+ and Zn 2+ ions, the others being a chitobiase from Vibrio parahaemolyticus (PDB entry 3h7l; New York SGX Research Center for Structural Genomics, unpublished work) and a glucosaminidase from Photobacterium profundum (PpGlcNase; PDB entry 5dgq; Honda et al., 2016). Among the other members with solved structures, those exhibiting a considerably higher thermal tolerance than J30 contain one to four structural metal ions (Schubot et al., 2004;Kesavulu et al., 2012;Okano et al., 2015;Eckert et al., 2002;Brunecky et al., 2013;Chauvaux et al., 1995). In light of the stabilization of CtCel9A through metal ion binding (Chauvaux et al., 1995(Chauvaux et al., , 1990, we hypothesized that the introduction of a Zn 2+ ion could improve the thermal stability of J30. Using the CtCel9A crystal structure as a reference model and comparing the geometry of its Zn 2+ -binding site with the corresponding segments within J30, we created a mutant called J30 CCH (denoting the residue identities of the mutations: A98C, G114C and Y143H). The crystal structure of the triple mutant showed the successful introduction of a Zn 2+ ion at the intended binding site. Our circular-dichroism (CD) and differential scanning fluorimetry (DSF) data demonstrate a successful increase in the melting temperature of J30. An enzymatic analysis of the enzyme variants showed an increase in the catalytic optimum temperature by 5-10 C. Our data support the proposed role of structural metal ions in promoting thermal stability (Juy et al., 1992) and underline the suitability of metal introduction as a tool to engineer robust proteins in the context of biomass saccharification and beyond (Browner et al., 1994;Plegaria et al., 2015).

Molecular biology, protein expression and purification
The A98C, G114C and Y143H mutations were introduced at the DNA level into plasmid pBIL-J30 using the QuikChange Lightning Multi Site-Directed Mutagenesis Kit (Agilent Technologies, catalog No. 210515).
Wild-type J30 (J30 wt) and J30 CCH, each carrying a C-terminal polyhistidine tag, were produced in Escherichia coli NEB Express cells grown in TB medium with 50 mg ml À1 kanamycin and 2 mM MgSO 4 at 200 rev min À1 . The bacterial suspensions were incubated at 37 C before induction with 0.5 mM IPTG at an OD 600 nm of 4 (J30 wt) or 1 (J30 CCH) and at 20 C thereafter. The protein expressions lasted 17 h (J30 wt) or 19 h (J30 CCH) until an OD 600 nm of 20 (J30 wt) or 20.5 (J30 CCH) was attained.
The cells were lysed in a buffer consisting of 0.15 M NaCl, 25 mM HEPES pH 7.4 with 1 mM PMSF using an Avestin EmulsiFlex-C3 homogenizer. The proteins were purified using an Ä KTAexplorer 100 Air by nickel-affinity (5 ml HisTrap HP or FF) and size-exclusion chromatography (Superdex 200 10/300 GL, all from GE Healthcare Life Sciences). All of the media and buffers used to produce and isolate J30 CCH also contained 10 mM ZnCl 2 .

Crystallization
Initial sparse-matrix crystallization screening of wild-type J30 was conducted using the following screens: Berkeley (from Lawrence Berkeley National Laboratory; Pereira et al., 2017), Crystal Screen, Index, Natrix, PEG/Ion, PEGRx, SaltRx (all from Hampton Research) and MCSG-1 (from Microlytic) (Jancarik & Kim, 1991). They were set up using a Phoenix Robot (Art Robbins Instruments). At a concentration of 12 mg ml À1 purified enzyme, the protein was crystallized in 0.1 M trisodium citrate dihydrate pH 5, 15%(w/v) 2-propanol, 10%(w/v) PEG 10 000. Suitably sized crystals were obtained after 3 d of growth at room temperature using the sitting-drop vapor-diffusion method. J30 CCH crystals were prepared in the same manner. All drops consisted of 1 ml protein solution and 0.5 ml reservoir solution.

Data collection and processing
The crystals were transferred through 20%(v/v) glycerol and flash-cooled in liquid nitrogen. Native J30 data sets were collected on beamline 7-1 at the Stanford Synchrotron Radiation Lightsource (SSRL), SLAC National Accelerator Laboratory, and the J30 CCH diffraction data were recorded on the Berkeley Center for Structural Biology beamline 5.0.1 of the Advanced Light Source (ALS) at Lawrence Berkeley National Laboratory. All data were measured using an oscillation angle Á' of 1 and ADSC Quantum 315r detectors.
All data sets were processed using the -3dii option in xia2 (Winter, 2010;Kabsch, 2010;Evans, 2006). Phasing was carried out by molecular replacement using Phaser (McCoy et al., 2007) from the PHENIX suite of programs (Adams et al., 2010). CtCel9A (PDB entry 1clc; 31% sequence identity) served as a search model for J30 wt (top TFZ score of 19.7), which in turn was used to generate the J30 CCH maps. Model improvement employed alternating rounds of manual modelling using Coot (Emsley et al., 2010) and automated structure refinement by phenix.refine (Afonine et al., 2012). 2000 reflections in each data set were randomly selected for cross-validation. Later iterations included TLS refinement using TLS groups obtained from the TLSMD web server (Painter & Merritt, 2006a,b). All data-collection, phasing and refinement statistics are summarized in Table 1. The figures showing structural models were created using PyMOL (v.1.7.2.1; Schrö dinger).

Differential scanning fluorimetry
DSF assays comparing the thermal stability of J30 wt and J30 CCH were performed on an Applied Biosciences StepOnePlus Real-Time PCR System in triplicate on a 30 ml scale containing 0.5 mg ml À1 enzyme, 150 mM NaCl, 25 mM HEPES pH 7.4, 10 mM ZnCl 2 and SYPRO Orange Protein Gel Stain (Life Technologies, catalog No. S-6650) at 5Â. Fluorescence signals were recorded from 25 to 99 C.
The Zn 2+ -titration samples were prepared by a 1 h incubation of J30 CCH with a 150-fold stoichiometric excess of EDTA pH 7.4 and dialysis for 18 h against 0.15 M NaCl, 25 mM HEPES buffer pH 7.4. A 1 mg ml À1 enzyme solution was incubated with ZnCl 2 at various concentrations for 30 min before the addition of the dye. The DSF assays were carried out as above and at final concentrations of 0.5 mg ml À1 J30 CCH, 0-10 mM ZnCl 2 , 0.15 M NaCl, 25 mM HEPES pH 7.4 with SYPRO Orange Protein Gel Stain at 5Â.

Circular dichroism
Melting curves of J30 wt and J30 CCH from E. coli were recorded from 20 to 85 C with a Jasco J-815 CD spectrometer using 1 mm cuvettes. The assay concentrations were 0.1 mg ml À1 enzyme, 150 mM NaCl, 25 mM HEPES pH 7.4, 10 mM ZnCl 2 . Three accumulations were recorded in the wavelength range 200-250 nm applying 0.2 nm data pitches.

Enzyme-specificity assays
Reactions were set up on a 50 ml scale using 15 mg ml À1 enzyme, 0.1 M MES pH 6, 10 mM ZnCl 2 , and 0.25 mM 4-nitrophenyl -d-glucopyranoside (pNPG; Sigma, catalog No. N7006), 4-nitrophenyl -d-cellobioside (pNPC; TCI America, catalog No. N0867) or 4-nitrophenyl -d-cellotrioside (pNPG3; Carbosynth, catalog No. EN04796). Positive controls contained a final concentration of 3.6 M NaOH instead of enzyme. Negative controls contained buffer instead of enzyme, and were subtracted before internal standardization against the alkaline hydrolysis. Triplicate samples were incubated at 50 C in Applied Biosciences Veriti 96-Well Thermal Cyclers, cooled to 4 C, transferred to 96-well clear flat-bottom plates and mixed with 2%(w/v) sodium carbonate in a 1:1 volume ratio. The absorbance at an incident wavelength of 405 nm was measured on a Molecular Devices SpectraMax M2.
is the intensity of an individual measurement of the reflection and hI(hkl)i is the mean intensity of the reflection. ‡ R cryst = P hkl jF obs j À jF calc j = P hkl jF obs j, where F obs and F calc are the observed and calculated structure-factor amplitudes, respectively. § R free was calculated as R cryst using 2000 randomly selected reflections that were omitted from the structure refinement. } The root-mean-square deviations of bond angles and lengths were calculated based on the conformation-dependent library (Moriarty et al., 2014(Moriarty et al., , 2016). were carried out as described above, except that Beckman Coulter Biomek FX and NX P robots were used for all of the liquid handling in the 96-well format plates. Negative controls contained buffer instead of enzyme and were subtracted.

Overall fold of J30
The crystal structure of J30 was solved to a resolution of 1.7 Å by molecular replacement (Table 1). The 64 kDa enzyme contains an N-terminal Ig-like domain of approximately 80 amino-acid residues and a C-terminal catalytic domain of approximately 465 amino-acid residues ( Fig. 1 and Supplementary Fig. S1). The substrate-binding cleft assumes the shape of an open channel. While all of the secondarystructure elements within the N-terminal portion are -strands, the enzyme as a whole contains approximately 40% each of -helices and random coil.
The overall fold of the Ig-like domain largely resembles those of related structures, with some differences found in loop regions. It contains seven -strands grouped into sets of three (strands 1, 4 and 5) and four (strands 2, 3, 6 and 7) to form two strictly antiparallel -sheets. The latter sheet is bifurcated owing to Pro74 kinking -strand 7 into two halves.
The N-terminus is located at the end of a cavity near the catalytic domain. In both crystal structures presented here a citrate ion occupies the space where the closest homologs of J30 within GH family 9, including CbhA from C. thermocellum (CtCbhA; PDB entry 1ut9), have additional upstream amino acids that contribute to an eighth -strand at the interface to the catalytic domain ( Supplementary Fig. S1; Pereira et al., 2009;Schubot et al., 2004;Okano et al., 2015). The absence of this strand, and the associated hydrogen bonds that would connect it to -strands 1 and 7 (J30 numbering) as well as to the catalytic domain, may contribute to the lower thermal tolerance of J30. The interface between the Ig-like domain and the catalytic domain is stabilized by hydrophobic interactions, a salt bridge between Asp47 and Lys510, and seven hydrogen bonds between Glu4 and Arg431, Ile8 and Arg447, Asp42 and Asn395, Ala44 and Ala509, Asp47 and Tyr397, and His51 and both Gly396 and Arg398. Their relative arrangement is similar to other GH family 9 members with an Ig-like domain, such as LC-CelG, AaCel9A and CtCel9A (Okano et al., 2015;Pereira et al., 2009).
The predominant architectural element of the catalytic domain is an (/) 6 -barrel structure composed of 12 -helices arranged in an alternating pattern as is typical within GH family 9 (Juy et al., 1992) as well as other families (Parsiegla et al., 1998) Crystal structure of J30 as a cartoon representation with (a) and without (b) the protein surface. J30 wt contains an N-terminal Ig-like domain and a C-terminal catalytic domain, as is common among GH family 9 members. The side chains of the catalytic amino-acid residues Asp147 and Glu523 are depicted in teal. Also highlighted are the side chains corresponding to the residues mediating Zn 2+ coordination in the triple mutant J30 CCH. (a) Residues Asp147 and Glu523 are positioned at the active site and in the center of the substrate-binding channel. (b) The active site (red asterisk) and Zn 2+ -introduction site (purple asterisk) are separate sites in proximity to one another. Within the catalytic domain, the (/) 6 -barrel structure (bottom left) and the four-stranded -sheet (bottom right) are turned into perspective.
-helices (2, 6, 8, 11, 13 and 18; arranged clockwise in Fig. 1) framed by an outer ring of six -helices of opposite direction (5, 7, 9, 12, 14 and 1). The outer helices are tilted at an angle relative to the inner helices. While some loops between the barrel helices contain extended sections of random coil as well as additional secondary-structure features, the loops between any inner helix and its adjacent outer helix are mostly short (between the pairs 6/7, 8/9, 11/12 and 13/14; Supplementary  Fig. S1). As is typical for these (/) 6 -barrels, the circle closes with the most C-terminal -helix 18 neighboring -helix 1.
-Helix 10 can be considered a continuation of -helix 9, with the helices being separated only by a kink caused by Phe339. In the catalytic domain, the more extended segments between barrel helices contain five additional -helices, one 3 10 -helix and six short -strands forming two separate sheets. Apart from the helices of the (/) 6 -barrel, -helix 17 is the only conserved helix.
The substrate-binding channel extends from a region near the N-terminal ends of the inner barrel helices (corresponding to where the nonreducing end of the bound substrate would be) towards a more unstructured portion of J30 flanked mostly by random coil (Fig. 1). It runs at an angle relative to the barrel structure, coming closest to secondary-structure elements at the barrel helices 13 and 18 as well as the conserved -helix 17. Most of the residues expected to directly bind substrate or catalyze the reaction are positioned in random-coil sections of the enzyme, with only Tyr151 (-helix 2), Arg469 (15), Tyr519 (17) and Tyr528 (18) found on secondary-structure elements ( Supplementary Fig. S1). Many of the secondary-structure elements listed above appear to provide a scaffold behind the residues that directly interact with substrates.

The structure of the triple mutant J30 CCH
The crystal structure of J30 CCH was solved to a resolution of 1.46 Å . The electron density is consistent with the presence of a Zn 2+ ion in the predicted site (Fig. 2b). Metal coordination is mediated by the side chains of Cys98, Cys114, His115 and His143, and effectively replaces one hydrogen bond formed by His115 and Tyr143 in the wild-type enzyme (Fig. 2a). Among the GH family 9 structures with a homologous Zn 2+ -binding site, CtCel9A, AaCel9A (Pereira et al., 2009), Cel9M from Clostridium cellulolyticum (PDB entry 1ia7; Parsiegla et al., 2002) and CelT from C. thermocellum (PDB entry 2yik; Kesavulu et al., 2012) share the same mode of chelation, whereas LC-CelG features an aspartate residue at the position equivalent to Cys114 of J30 CCH (Supplementary Fig. S1; Okano et al., 2015). In general, histidine, acidic and particularly cysteine side chains are the common ligands of structural Zn 2+ ions (Pace & Weerapana, 2014). While CtCbhA contains a hydrogen bond between the side chains of a tyrosine and a histidine residue, similar to J30 (Schubot et al., 2004), some GH family 9 members, including an endoglucanase from Perinereis brevicirris (PDB entry 4zg8), have a serine taking the role of the histidine (Arimori et al., 2013;Sakon et al., 1997;Mandelman et al., 2003;Petkun et al., 2015). Other structures lack the hydrogen bond entirely because a phenylalanine replaces the tyrosine (Khademi et al., 2002;Brunecky et al., 2013), or there is a different backbone fold, as is the case in PpGlcNase (Honda et al., 2016) and the chitobiase from V. parahaemolyticus. The Zn 2+ ion is located at a distance of 14 Å from the center of the active site towards what would correspond to the reducing end of the bound substrate. The root-mean-square deviation between the backbone C, N and C atoms of the crystal structures of the wild-type and triplemutant enzymes is 0.094 Å , i.e. they are virtually identical. In the crystal structure of J30 CCH we observe a lower average

Figure 3
A cellotetraose molecule from the CtCbhA mutant E795Q (PDB entry 1rq5) fitted into the active site of the crystal structure of J30 wt. The individual rings are marked +2 to À2 according to their relative position with respect to the glycosidic bond that would be cleaved by the action of the side chains of Asp147 and Glu523 (labeled in red). Dashes denote hydrogen bonds immediately involved in the nucleophilic attack of the hydrolyzing water molecule (red sphere). The engineered Zn 2+ site is at the bottom right and is shown with gray labels. The residue-label font sizes reflect the proximity to the viewer. isotropic B factor than in the crystal structure of J30 wt (Table 1). The local atomic displacement parameters of the residues neighboring residues 98, 114 and 143 show a similar trend, suggesting stabilization of the mutated site through the introduction of the Zn 2+ ion.

The modes of substrate binding and catalysis
With the most prominent secondary-structure elements and key residues conserved, the question of the underlying details of substrate binding by J30 is raised. We have superposed the crystal structures of J30 wt and the inactive CtCbhA mutant E795Q, which was co-crystallized with cellotetraose (PDB entry 1rq5; Schubot et al., 2004), in an effort to model the ligand into the active site of J30 (Fig. 3). Glu523 of J30 corresponds to Glu795 of CtCbhA ( Supplementary Fig. S1).
The highly conserved acidic side chains of Asp147 and Glu523 are directly involved in the enzymatic catalysis. Glu523 acts as a proton donor, while Asp147 deprotonates the depicted water molecule for its nucleophilic attack (Davies & Henrissat, 1995). This model is in agreement with GH family 9 members following the inverting reaction mechanism with regard to the anomeric C atom of the ring in the À1 position, as Asp147 and Glu523 interact with the substrate from opposite sides. The nucleophilic attack is aided by a network of hydrogen bonds involving the highly conserved Asp144 and the moderately conserved Tyr151. The arrangement is nearly identical to that found in AaCel9A (Pereira et al., 2009) and is similar to the well described geometry within cellulase E4 from Thermomonospora fusca (Sakon et al., 1997; Fig. 3).
The carbohydrate ring in the +1 position is stacked between the side chains of Trp226 and Tyr519. All GH9 family members with solved structures contain aromatic residues in these positions. This ring is further held in position through hydrogen bonds involving His467 and Arg469, both of which are highly conserved among the same structures except for the less closely related PpGlcNase and the chitobiase from V. parahaemolyticus. The same conservation pattern is found for Tyr306, Trp352 and Trp408 that interact with the rings in the À1 and À2 positions through hydrogen bonds (Tyr306 and Trp352) or ring stacking (Trp408). The side chain of the latter residue is stabilized through a moderately conserved hydrogen bond to Tyr528. The hydrogen bond between the main-chain N atom of Gly304 and the carbohydrate moiety in the À1 position is only shared between J30 and its closest homologs LC-CelG, CtCel9A, AaCel9A and CtCbhA.
Figs. 2 and 3 together illustrate that the separation of the Zn 2+ -introduction site from the active site is a result of the side chain of His143 pointing in the opposite direction to Asp144 and Asp147. As this orientation of the His143 side chain is similar to that of Tyr143 in J30 wt, a negative impact of metal ion binding on the catalytic activity of the enzyme would not be expected. In contrast, stabilization of the stretch of random coil containing His143, Asp144 and Asp147 could be explained by the triple mutation and introduction of Zn 2+ through connection of the three loops containing Ala98, Gly114 and His115, and Tyr143 through the metal ion.

The influence of metal introduction on the thermal tolerance of J30
In differential scanning fluorimetry assays, the enzyme variants are seen to unfold differently (Fig. 4a). Based on the inflection points of the curves, the T m of J30 CCH (70.1 C) is 11.2 C higher than that of the wild-type enzyme (58.9 C). That this increase is owing to the presence of the Zn 2+ ion was confirmed by its removal using EDTA, which reduced the T m of the triple mutant to the same level as that of J30 wt (Fig. 4b). Reintroduction of the ion into J30 CCH fully restores the thermal tolerance of the enzyme. CD melting curves support these observations (Fig. 4c). The local minima at 208 and 222 nm typical of -helical content are consistent with the structure of the catalytic domain. Their decreases in amplitude with increasing temperatures probably reflect a loss of structural integrity of the catalytic domain. Based on the DSF data, the J30 CCH mutant has a modest 5 mM affinity for Zn 2+ (Fig. 4b;Kochań czyk et al., 2015). However, incorporation of the Zn 2+ ion still stabilizes the protein and contributes to the difference in thermal tolerance between the wild-type and mutant enzymes.

Enzymatic specificity of J30 wt and J30 CCH
J30 has been reported to hydrolyze the cellotriose analog pNPC but not the cellobiose analog pNPG (Gladden et al., 2014), and previous enzymatic studies have revealed the substrate specificity of AaCel9A (Eckert et al., 2002(Eckert et al., , 2009. We examined the substrate specificities of J30 wt and J30 CCH towards 4-nitrophenyl glucosides containing one, two or three sugar moieties, and compared the release of free p-nitrophenolate (pNP) with substrate hydrolysis by 3.6 M NaOH (representing 100%; Fig. 5). Both variants had the same activity profiles across the three tested substrates, confirming the previous results, and are similar to AaCel9A: they are unable to hydrolyze pNPG and produce more detectable pNP from pNPC than from the cellotetraose analog pNPG3. While the triple mutant produced slightly more pNP from pNPG3 than the wild-type enzyme at 50 C, both ultimately appear to favor the hydrolysis of pNPG3 to cellobiose and noncleavable pNPG over the production of cellotriose and pNP, thereby paralleling the experimental data for AaCel9A (Eckert et al., 2002).

Catalytic profiling of J30 wt and J30 CCH
The rate of product formation in enzymatic catalysis, and in a hydrolysis reaction in particular, is inherently dependent on the temperature and pH. The nature of these relationships in the case of J30 wt and J30 CCH was studied using pNPC as a substrate (Fig. 6). Both enzyme variants show a catalytic optimum around pH 6, which is consistent with the reaction mechanism. The incorporation of the Zn 2+ ion increases the temperature optimum of J30 for pNPC turnover by 5-10 C, and its influence on catalysis is merely physical and not chemical. These findings are confirmed by the DSF and CD data, and are consistent with prior characterization of the wild-type J30 enzyme (Gladden et al., 2014).

Conclusions
The positive effect of Ca 2+ binding on the thermal tolerance of the cellulase CtCel9A has been shown previously, as has the correlation of a Zn 2+ -binding site with enzyme activity and structural integrity (Chauvaux et al., 1990(Chauvaux et al., , 1995  Enzyme substrate comparison. J30 wt and J30 CCH follow a distinct activity pattern on selected glucoside derivatives. The absorbance of pNP reflects the hydrolysis of its -glucosides with one (pNPG), two (pNPC) or three (pNPG3) glucose moieties. 100% refers to positive controls using 3.6 M NaOH.
in stability is likely to be a result of the net exchange of a hydrogen bond between the side chains of His115 and Tyr143 (J30 wt) for four coordinating bonds between the Zn 2+ ion and the side chains of Cys98, Cys114, His115 and His143 (J30 CCH; Fig. 2). This stabilization may in part result from three separate sections of the polypeptide chain interacting via the Zn 2+ ion. The structural and biochemical data strongly suggest that the metal-binding site is purely structural, yet it is also proximal to the active-site residues Asp144 and Asp147 and the region of the substrate cleft where the reducing end binds.
In contrast to its closest homologs, J30 does not contain bound Ca 2+ ions. Three Ca 2+ -binding sites are found in CtCel9A, two of which, in varying combinations, are found in AaCel9A, CtCbhA and LC-CelG. Two of the Ca 2+ sites are found in loop regions with backbone geometries different to those in J30, corresponding to J30 residues 175-198 and 471-475 (Juy et al., 1992). The third site is located in a region of conserved backbone fold, and is found in the crystal structures of all four homologs; it is referred to as 'B' in Chauvaux et al. (1995). In J30, the side-chain amino group of Lys308 forms hydrogen bonds to the Gly353 O atom and the Glu313 and Glu314 carboxyl groups at precisely the same site. Hence, the introduction of a Ca 2+ ion into J30 by point mutations in this region may work in a similar manner to the Zn 2+ example, but would be likely to require loop remodeling in the case of the other two sites.
Unlike the incorporation of disulfide bonds to cross-link otherwise more loosely connected sections of a polypeptide chain, our approach is not limited to secreted proteins. We have taken advantage of a Zn 2+ -binding site seen in homologs of the target enzyme, and as a result have improved the thermal stability and the temperature optimum of the enzyme. This strategy is likely to be proven to be promising in the future in a more general context: the de novo introduction of metal ion binding into proteins. Such an approach is undoubtedly more challenging when there are no metal ionbinding homologs that can be used as a reference. In particular, recent efforts show how interaction networks beyond the geometry of the immediate site to be designed need to be considered (Guffy et al., 2016). However, computational approaches for protein redesign have improved considerably in the last decade (Simons et al., 1999;Huang et al., 2016), and could enable the creation of thermally and chemically more robust proteins that are useful for medical and biotechnological applications. The increases in both the thermal tolerance and the optimal catalytic temperature of J30 address bottlenecks in biomass deconstruction. In this step of biofuel production, the robustness of the enzymes translates directly into the cost of the final product (Blanch et al., 2011).