research papers
Homochiral and racemic MicroED structures of a peptide repeat from the ice-nucleation protein InaZ
aDepartment of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, University of California Los Angeles, Los Angeles, CA 90095, USA, bDepartment of Biological Chemistry, UCLA-DOE Institute for Genomics and Proteomics, University of California Los Angeles, Los Angeles, CA 90095, USA, and cHoward Hughes Medical Institute, Departments of Physiology and Biological Chemistry, University of California Los Angeles, Los Angeles, CA 90095, USA
*Correspondence e-mail: jrodriguez@mbi.ucla.edu
The ice-nucleation protein InaZ from Pseudomonas syringae contains a large number of degenerate repeats that span more than a quarter of its sequence and include the segment GSTSTA. Ab initio structures of this repeat segment, resolved to 1.1 Å by microfocus X-ray crystallography and to 0.9 Å by the cryo-EM method MicroED, were determined from both racemic and homochiral crystals. The benefits of racemic protein crystals for by MicroED were evaluated and it was confirmed that the phase restriction introduced by crystal centrosymmetry increases the number of successful trials during the ab initio phasing of the electron diffraction data. Both homochiral and racemic GSTSTA form amyloid-like protofibrils with labile, corrugated antiparallel β-sheets that mate face to back. The racemic GSTSTA protofibril represents a new class of amyloid assembly in which all-left-handed sheets mate with their all-right-handed counterparts. This determination of racemic amyloid assemblies by MicroED reveals complex amyloid architectures and illustrates the racemic advantage in macromolecular crystallography, now with submicrometre-sized crystals.
Keywords: amyloid; racemic; electron diffraction; ice nucleation; intermolecular interactions; co-crystals; electron crystallography; structural biology.
1. Introduction
Expressed by a subset of microorganisms, ice-nucleation proteins are capable of stimulating ice formation in supercooled water (Green & Warren, 1985). The Gram-negative microbe Pseudomonas syringae is sold commercially as Snomax® for its ice-nucleating activity (Green & Warren, 1985; Cochet & Widehem, 2000). The ice-nucleation protein InaZ is produced by P. syringae and localized to its outer membrane (Green & Warren, 1985; Wolber et al., 1986). The sequence of InaZ is approximately 1200 residues in length, over half of which includes degenerate octapeptide repeats. A subpopulation of degenerate repeats share the consensus motif GSTXT(A/S), where X represents an unconserved amino acid (Supplementary Fig. S1; Green & Warren, 1985; Warren et al., 1986). These repeats are shared by other Ina proteins and may collectively contribute to ice nucleation (Green & Warren, 1985; Kobashigawa et al., 2005; Han et al., 2017).
Despite the crystallographic determination of structures of other ice-binding proteins (Davies, 2014; Garnham, Campbell & Davies, 2011), InaZ remains recalcitrant to crystallization. Models of full-length InaZ have proposed it to have a β-helical (Garnham, Campbell, Walker et al., 2011; Graether & Jia, 2001) or solenoid-like fold rich in stacked β-strands (Cochet & Widehem, 2000; Pandey et al., 2016). These features are shared by amyloid filaments: their tightly mated β-sheets form fibrils that can cross-link, cluster and be functional (Nelson et al., 2005; Sawaya et al., 2007; Fitzpatrick et al., 2017; Eisenberg & Jucker, 2012; Maury, 2009). Functional amyloid assemblies appear across the tree of life (Wasmer et al., 2008; Hughes et al., 2018; Maury, 2009; Tayeb-Fligelman et al., 2017) and can contain low-complexity regions with degenerate repeats (Hughes et al., 2018).
Success in determining amyloid structures was first achieved by crystallizing short segments that stabilize the cores of fibrils through a motif known as the steric zipper (Nelson et al., 2005; Sawaya et al., 2007). However, the propensity of elongated β-strands to twist or kink can limit crystal growth, sometimes yielding nanocrystals that pose a challenge for (Rodriguez et al., 2015). These limits have recently been overcome in part by the development of the cryo-electron microscopy (cryo-EM) method, electron microdiffraction (MicroED; Shi et al., 2013). MicroED yields high-resolution structures from protein crystals no thicker than a few hundred nanometres (Shi et al., 2016; Rodriguez et al., 2017). Because of this, MicroED has helped in determining the structures of a number of amyloid protofibrils (Rodriguez et al., 2015; Krotee et al., 2017) at atomic resolution, some ab initio (Sawaya et al., 2016; Gallagher-Jones et al., 2018).
Racemic crystallography further facilitates the crystallization of proteins and ; Yeates & Kent, 2012; Patterson et al., 1999), including ice-binding proteins (Pentelute et al., 2008). Mixing left-handed (L) and right-handed (D) enantiomers of a macromolecule improves its likelihood of crystallization and facilitates structural analysis (Yeates & Kent, 2012; Wukovitz & Yeates, 1995). Crystallographic phases are restricted for data from centrosymmetric crystals, making the associated with the determination of their structure more tractable (Yeates & Kent, 2012). This is advantageous for by (Hauptman, 1986), where phases must be computed from measured intensities alone (Hauptman, 1986, 2001; Sheldrick, 2008). Accordingly, various polypeptide structures have been determined by racemic X-ray crystallography, including those of ester insulin, plectasin and an antifreeze protein (Pentelute et al., 2008; Avital-Shmilovici et al., 2013; Mandal et al., 2009, 2012). While the benefits of racemic crystallography are evident for X-ray diffraction (Matthews, 2009), questions remain about the potential for exploiting these benefits in MicroED.
(Matthews, 2009Hypothesizing that the repeat segments of the ice-nucleation protein InaZ may form amyloid-like assemblies, we set out to interrogate the structure of GSTSTA from both homochiral and racemic crystals by MicroED. In doing so, we also assessed the fidelity of MicroED data in racemic et al., 2016).
By comparing the structures of homochiral and racemic GSTSTA, we gauge the effect of racemic self-assembly on protofibril architecture. With these structures of a core repeat in the InaZ protein, we begin an atomic-level investigation of peptide segments derived from ice-nucleation proteins (Pandey2. Methods
2.1. Sequence analysis of ice-nucleation proteins
The sequence of the ice-nucleation protein InaZ from P. syringae was screened for the existence of hexameric degenerate repeat motifs that contained one or more threonine residues (Supplementary Fig. S1). The repeats were then evaluated for their propensity for amyloid fibril formation by ZipperDB (Supplementary Fig. S1). For each, a Rosetta energy score was calculated. A single repeat, GSTSTA, was chosen from this list of hexameric sequences. This segment appears five times identically in the sequence of InaZ, first at residue 707, and is part of a group with the consensus motif GSTXT(A/S) that appears 59 times in the InaZ sequence.
2.2. Synthesis, purification, characterization and crystallization of L- and D-enantiomers of the InaZ-derived peptide GSTSTA
The L-enantiomer of GSTSTA was purchased from GenScript with 98% purity. The D-enantiomer of GSTSTA was synthesized by solid-phase peptide synthesis and was purified using a Waters Breeze 2 HPLC System in reversed phase buffered with trifluoroacetic acid (Supplementary Fig. S2). The two enantiomers were qualified by ESI-MS on a Waters LCT Premier. The spectrum of the L-enantiomer showed an [M+H]+ peak of 523.30 g mol−1 (expected 523.22 g mol−1) and a dimer [M+M+H]+ peak of 1045.6 g mol−1 (expected 1045.44 g mol−1). The spectrum of the D-enantiomer showed an [M+H]+ peak of 523.24 g mol−1 (expected 523.22 g mol−1) and a dimer [M+M+H]+ peak of 1045.49 g mol−1 (expected 1045.44 g mol−1) (Supplementary Fig. S2).
Crystals of L-GSTSTA were grown as follows. Lyophilized peptide was weighed and dissolved in ultrapure water at concentrations of between 82 and 287 mM, assuming a 1:1 ratio of peptide to trifluoroacetic acid (TFA) in the lyophilized powder. Crystals were grown at room temperature by the hanging-drop method in a high-content 96-well Wizard screen. Of the many crystallization trials that yielded crystals, those obtained from a condition consisting of 0.1 M CHES buffer pH 9.5, 10%(w/v) PEG 3000 were used for microfocus X-ray data collection. Another promising condition was optimized by the hanging-drop method in 24-well plates. This condition consisted of 0.1 M McIlvaine (citrate–phosphate) buffer pH 4.2, 12.5%(w/v) PEG 8000, 0.1 M sodium chloride and was used to grow crystals of L-GSTSTA in batch.
Crystals of racemic GSTSTA were grown as follows. Lyophilized powders of L-GSTSTA and D-GSTSTA were separately weighed and dissolved in ultrapure water so that the concentrations of the two enantiomers matched. Crystal formation was screened at concentrations ranging from 82 to 123 mM after accounting for TFA. Control trays containing only L- or D-GSTSTA were prepared simultaneously alongside racemic screens. All three trays were stored and monitored at room temperature, with crystal formation observed in various conditions. Images of every well were collected after 3 h, one day, three days, five days and seven days, and crystal formation was monitored over time. A condition consisting of 0.1 M imidazole pH 8.0, 10%(w/v) PEG 8000 produced the best crystals.
Crystals were batch grown for data collection by MicroED. Lyophilized L-GSTSTA peptide was weighed and dissolved in 0.1 M McIlvaine (citrate–phosphate) buffer pH 4.2, 12.5%(w/v) PEG 8000, 0.1 M sodium chloride to an effective final concentration of 123 mM, mimicking the final concentration of a hanging drop in the 24-well optimization. Lastly, the solution was seeded with crystal needles extracted from crystals grown in the 24-well optimization described above. Batch crystals of racemic GSTSTA were grown from lyophilized L-GSTSTA and D-GSTSTA that had been separately weighed and dissolved in 0.1 M imidazole buffer pH 8.0 containing 10%(w/v) PEG 8000 to a final concentration of 50 mM for each after accounting for the mass contributed by TFA.
2.3. Microfocus X-ray data collection and structure determination
Crystals of L-GSTSTA were harvested from a 96-well hanging drop using MiTeGen loops and flash-cooled in liquid nitrogen. No additional cryoprotectant was used other than the PEG 3000 that was already present in the mother liquor. 72 diffraction images were collected with an oscillation range of 5° from a single crystal; 40 of these were indexed and integrated. Crystals of racemic GSTSTA were harvested from a 96-well hanging drop using MiTeGen loops and flash-cooled in liquid nitrogen. No additional cryoprotectant was used other than the PEG 8000 that was already present in the buffer. 144 diffraction images were collected with an oscillation range of 2.5° from a single crystal; 64 of these were indexed and integrated.
Diffraction data were collected from both homochiral and racemic GSTSTA crystals under cryogenic conditions (100 K) on beamline 24-ID-E at the Advanced Photon Source (APS) equipped with an ADSC Q315 CCD detector using a 5 µm beam with wavelength 0.979 Å. Signal was only limited by the edge of our detector at approximately 1.1 Å; as such, higher resolution data could perhaps be achieved by modifying the experimental geometry and/or adjusting the energy of the X-ray beam in the experiment. Data from both homochiral and racemic crystals were reduced in XDS (Kabsch, 2010) and yielded ab initio solutions using SHELXT and SHELXD (Sheldrick, 2015). The phases obtained from these coordinates produced maps of sufficient quality for subsequent model building in Coot (Emsley et al., 2010). The resulting models were refined against the measured data using PHENIX (Adams et al., 2010).
2.4. MicroED data collection and structure determination
Crystals were prepared for MicroED data collection following a variation of the procedures detailed in Rodriguez et al. (2015) as follows. Following a 1:2 dilution in ultrapure water, crystals were applied onto glow-discharged grids of type (PELCO easiGlow) 300 mesh Cu 1/4. Excess liquid was blotted off onto filter paper wetted with 4 µl ultrapure water to avoid salt-crystal formation. Grids were plunge-frozen into liquid ethane using a Vitrobot (FEI). Grids were then initially stored in liquid nitrogen before being transferred to a liquid-nitrogen-cooled Gatan 626 cryo-holder for insertion and manipulation within the electron microscope.
MicroED data were collected from three submicrometre-thick needle crystals of L-GSTSTA and two submicrometre-thick needle crystals of racemic GSTSTA. Briefly, frozen hydrated crystals of either L-GSTSTA or racemic GSTSTA were visually inspected in overfocused diffraction mode on a cryocooled FEI Tecnai F20 microscope operated at 200 kV (Janelia Research Campus). The diffraction patterns used for were collected on a TVIPS TemCam-F416 CMOS detector in rolling-shutter mode. For L-GSTSTA, diffraction patterns were collected during unidirectional rotation with 2 s exposures. For racemic GSTSTA, diffraction patterns were collected during unidirectional rotation with 3 s exposures. A rotation rate of 0.30° s−1 and rotation angles ranging from −63° to 72° were used for both. Beam intensity was held constant, with an average dose rate of 0.003–0.005 e− Å−1 s−1 or ∼0.01 e− Å−2 per image, corresponding to a total dose of ∼1–3 e− Å−2 per data set. Data were recorded at an effective camera length of 730 mm, which is the equivalent of a sample-to-detector distance of 1156 mm in a corresponding lenseless system. All diffraction was performed using a circular selected area aperture of ∼1 µm2 in projection.
Diffraction movies were converted to the SMV file format using TVIPS tools as described previously (Hattne et al., 2015). Indexing and integration were performed in XDS. Partial data sets from three L-GSTSTA crystals were sorted and merged in XSCALE. Intensities from a total of 196 diffraction images were merged. An ab initio solution was achieved using SHELXD (Sheldrick, 2015). To achieve a complete data set from racemic GSTSTA crystals, the integrated diffraction intensities from partial data sets of two different crystals were sorted and merged in XSCALE. Intensities from a total of 145 diffraction images were merged. An ab initio solution was achieved using SHELXD and SHELXT (Sheldrick, 2015). Although XDS accurately differentiated the Laue classification for the racemic GSTSTA data, SHELXT, which does not rely on user input for space-group selection, ensured a correct solution for the racemic data. SHELXT selected P21/c as the racemic a choice corroborated by the that were present in the data. The phases obtained from the L-GSTSTA and racemic GSTSTA coordinates produced by SHELX were used to generate maps of sufficient quality for subsequent model building in Coot (Emsley et al., 2010). The resulting models were refined with PHENIX (Adams et al., 2010), using electron scattering form factors, against the measured data.
2.5. Analysis of homochiral and racemic GSTSTA structures
In the analysis of the hydrogen-bonding and assembly interactions of each L-GSTSTA structure, an assembly of four strands, composed of two pairs in mating sheets, was used to find all unique hydrogen bonds, while racemic GSTSTA required an assembly of 12 strands composed of three strands from a pair of mating sheets and six more strands related by translation along the protofibril axis to achieve a unique set of hydrogen bonds. Hydrogen bonds were tabulated from this structure using HBPLUS (McDonald & Thornton, 1994).
Distances between strands along a sheet were calculated as differences between α carbons of one strand and its neighbor along the same sheet. These distances were calculated for both GSTSTA and GNNQQNY using PDB entry 1yjp (Sawaya et al., 2016). The angle between a strand and its corresponding sheet was calculated against the plane formed by α carbons along that sheet.
2.6. Analysis of phases in structures determined by MicroED and X-ray crystallography
To analyze the distribution of phases associated with reflections measured from racemic crystals by both X-ray and electron diffraction, data reduction was performed in P1) and refined in PHENIX against a model encompassing the entire of four strands. This model was obtained by applying all symmetry operations on the of the P21/c structure. in P1 allowed symmetry to be broken, no longer restricting phases to 0 or 180°, as the phases changed in the case where coordinates deviated from their symmetry-related positions. The resulting set of reflections and phases were analyzed in MATLAB. We plotted the observed and calculated magnitudes of each reflection against each other and the set fitted by linear regression. For each measured magnitude, the associated phases were plotted and showed a Histograms were drawn using these data to evaluate phase distributions; the standard deviation of these was computed by merging the distributions around 0 and 180° using a modulo operation.
1 (2.7. Analysis of paired reflections in MicroED and X-ray crystallographic data
Merged data sets collected by either MicroED or microfocal X-ray crystallography were paired for homochiral and racemic crystals of GSTSTA. MicroED data .mtz files were scaled against their corresponding X-ray counterparts, where corresponding reflections were paired and missing reflections were ignored within a single .mtz file. This was achieved using custom scripts and the RSTATS program, which scaled and compared common reflections between corresponding data sets. The corresponding distributions of Fourier magnitudes were then analyzed using MATLAB, in which a best-fit line was determined for each of the paired data sets. Zones were visualized using the HKLVIEW program, in which either h, k or l were selectively set to zero.
3. Results
3.1. Identification, synthesis and crystallization of amyloid-forming ice-nucleation protein (INP) segments
With the goal of characterizing the structural properties of degenerate repeats in INPs, we identified a group of hexapeptides within the set of InaZ repeats and evaluated their amyloid-forming propensities (Goldschmidt et al., 2010; Supplementary Fig. S1). We ranked the hexapeptides based on their predicted propensity for amyloid zipper formation, their repeated appearance in INP sequences and whether they contained polar residues, including threonine (Supplementary Fig. S1). We chose to further investigate a segment whose sequence, GSTSTA, appears identically five times within InaZ at residues 707–712, 755–760, 803–808, 851–856 and 899–904. For simplicity, we numbered the segment 707–712.
We evaluated the crystallization potentials of synthesized L- and D-enantiomers of GSTSTA (Supplementary Fig. S2) compared with that of their by performing high-throughput crystallization trials and monitoring crystal growth. Most crystals appeared within two weeks of the start of each trial. In some conditions crystallization was observed as early as 3 h after the start of the trial. Racemic mixtures produced a greater number of successful crystallization conditions across a broad variety of trials (Supplementary Fig. S3). The number of successful conditions that were identified for racemic mixtures outpaced those identified for each alone (Supplementary Fig. S3), which is consistent with previous predictions (Yeates & Kent, 2012). In conditions in which both racemic and single-enantiomer crystals grew, racemic crystals appeared sooner (Supplementary Fig. S3). Minor differences in the speed of crystal appearance and the total number of conditions with identifiable crystals were also seen between L- and D-enantiomers. Fewer conditions were found to produce D-enantiomer crystals across all trials (Supplementary Fig. S3). These differences may have been a consequence of subtle inequities in the amount of residual trifluoracetic acid (TFA) associated with each in lyophilized powders. These effects may have been magnified by the relatively high concentrations of peptide required for crystallization of these segments (∼100–150 mM).
The crystallization conditions chosen for L-GSTSTA) and racemic GSTSTA (DL-GSTSTA) yielded a high density of well ordered microcrystals, each with a unique powder diffraction pattern, indicating they had formed distinct structures (Supplementary Fig. S4). Microcrystals were optimized from these conditions for microfocal X-ray diffraction; unoptimized batch conditions yielded nanocrystal slurries that were directly suitable for MicroED. Since the powder diffraction patterns of homochiral GSTSTA crystals were identical for both enantiomers (Supplementary Fig. S4), we focused our investigation on the L-enantiomer.
of homochiral GSTSTA (3.2. Ab initio of L-GSTSTA
We optimized crystals of L-GSTSTA for microfocal X-ray diffraction, starting from dense needle clusters and ending with single needles (Supplementary Fig. S5). Crystals grown in batch were monodisperse rods of 1–10 µm in length and 100–500 µm in width; these diffracted to approximately 0.9 Å resolution by MicroED (Fig. 1). X-ray diffraction from a single crystal of L-GSTSTA yielded a 91.7% complete data set to approximately 1.1 Å resolution (Supplementary Table S1), while data sets from three crystals of L-GSTSTA obtained by MicroED were merged to achieve a data set with an overall completeness of 86.4% at 0.9 Å resolution. It is important to note that the X-ray data in this case were limited by the detector geometry, which could be adjusted to facilitate slightly higher resolution. Atomic structure solutions were determined for L-GSTSTA from both microfocal X-ray and MicroED data by (Sheldrick, 2008; Supplementary Fig. S6).
After 50 000 trials, SHELXD yielded correlation figures of merit (CFOMs) of greater than 80 for both X-ray diffraction and MicroED data (Supplementary Fig. S6; Sawaya et al., 2016). The initial L-GSTSTA solution with the highest CFOM shows 33 atoms for the X-ray data set and 36 atoms for the MicroED data set (Fig. 2a and Supplementary Fig. S7). During the number of atoms in the X-ray structure increased to 36 peptide atoms and one bound water (Supplementary Fig. S7). The final solution achieved from the 0.9 Å resolution MicroED data also contained 36 atoms in the peptide chain and one water molecule (Fig. 2a).
3.3. Ab initio of racemic GSTSTA from centrosymmetric crystals
Like the ). These were immediately suitable for MicroED and diffracted to approximately 0.9 Å resolution (Fig. 1). Data from a single crystal obtained by X-ray diffraction produced a 93.7% complete data set at 1.1 Å resolution, while MicroED data from two nanocrystals of racemic GSTSTA were merged to reach an overall completeness of 77.4% at 0.9 Å resolution (Supplementary Table S1). Initial atomic structure solutions for racemic GSTSTA were obtained by (Sheldrick, 2008; Fig. 2b and Supplementary Fig. S7).
crystals of GSTSTA, crystals of racemic GSTSTA started as dense needle clusters and were optimized to single needles, and diffracted as single crystals on a microfocal X-ray source (Supplementary Fig. S5). Batch crystals of racemic GSTSTA were also rod-shaped and were several micrometres in length and a few hundred nanometres in thickness (Fig. 1As with L-GSTSTA, solutions for the racemic crystals yielded correlation figures of merit (CFOMs) of greater than 80 after 50 000 trials (Supplementary Fig. S6). A comparison of the racemic GSTSTA and L-GSTSTA data sets indicated that a higher number of potentially correct solutions were found for the racemic GSTSTA data (Supplementary Fig. S6). The MicroED data show a distribution of CFOM values that is shifted towards higher values, even when truncated to 1.1 Å resolution to match the resolution of the X-ray data sets. However, the most dramatic shift in this distribution is evident at 0.9 Å resolution (Supplementary Fig. S6).
Initial solutions with the highest CFOM show a total of 35 peptide atoms and four waters for the structure determined from X-ray data, and a total of 36 peptide atoms and one water for that determined by MicroED (Fig. 2b). During the number of peptide atoms in the X-ray structure increased to 36 (Supplementary Fig. S7), while the MicroED structure gained two waters (Fig. 2b). Linear regression of observed to calculated structure factors for the MicroED data shows an R value of 0.94 and a slope of 0.97 for data reduced in P1 (Fig. 3c). These values are in good agreement with those obtained by microfocal X-ray diffraction (Supplementary Fig. S11c) and indicate a good fit between model and measurement for the racemic GSTSTA structure.
3.4. Paired comparison of Fourier magnitudes measured by X-ray crystallography or MicroED
A comparison between the X-ray and MicroED data sets for homochiral crystals of GSTSTA shows that these two types of measurement are in close agreement (Supplementary Figs. S8 and S9), although slightly higher merge errors are observed in the MicroED data across resolution bins (Supplementary Table S2). A direct comparison of Fourier magnitudes for paired reflections between these data sets is fitted by a line with a slope of 0.921 and an R value of 0.826 (Supplementary Fig. S8). In contrast, the comparison between X-ray and MicroED data for racemic GSTSTA shows a greater difference between the two sets and a lower R value for the best-fit line comparing the Fourier magnitudes of paired X-ray and MicroED reflections (Supplementary Figs. S8 and S10). This difference is likely to be owing to a lack of isomorphism between the unit cells of the racemic GSTSTA crystals used for MicroED data collection versus X-ray data collection. The unit-cell parameters for racemic GSTSTA crystals obtained by MicroED and microfocal X-ray crystallography were a = 15.23, b = 9.29, c = 21.06 Å, α = 90.0, β = 108.2, γ = 90.0° and a = 14.03, b = 9.22, c = 20.77 Å, α = 90.0, β = 104.5, γ = 90.0°, respectively (Supplementary Table S1).
3.5. Phase restriction in centrosymmetric crystals evaluated by MicroED
Data from racemic GSTSTA crystals obtained by MicroED and reduced in the centrosymmetric P21/c satisfy with imposed phases of 0 or 180°. The of data from the same crystals reduced in P1 results in similar residuals to those obtained for P21/c (Supplementary Table S1). The phases that result from of this structure against data reduced in P1 appear to be bilaterally distributed around 0 and 180° (Fig. 3a). Collapse of this bimodal phase distribution around nπ yields a standard deviation of 34.3° (Fig. 3). When the same procedure is applied to data collected from racemic GSTSTA crystals by X-ray diffraction, a similar trend appears: a normal distribution around nπ with a standard deviation of 34.4° (Supplementary Fig. S11). Bragg reflections that appear in disallowed regions of phase space (90 and 270°) for both MicroED and X-ray diffraction data are generally weakest (Fig. 3 and Supplementary Fig. S11). This suggests that the primary source of phase error in MicroED data, as with X-ray diffraction, may come from noisy or weak reflections.
3.6. Structure of L-GSTSTA
L-GSTSTA assembles into antiparallel in-register β-sheets that mate to form a protofibril (Fig. 4a and Supplementary Figs. S12 and S13). The sheets are buckled, compressing the fibril along its length with strands spaced approximately 4.6 Å apart (Fig. 4a and Supplementary Fig. S14), closer than the typical 4.7–4.8 Å spacings seen in amyloid protofibrils (Sawaya et al., 2007). This spacing equates to half of the L-GSTSTA cell edge along the a axis: approximately 9.2 Å (Supplementary Table S1). To accommodate this compression, the strands tilt approximately 17° with respect to the fibril axis in alternating directions along a sheet, allowing the to lie askew from the fibril axis (Supplementary Fig. S14) while maintaining hydrogen bonding along the protofibril axis (Supplementary Table S3). Side chains between neighboring sheets tightly interdigitate to create a close packing within the fibril (Supplementary Fig. S12); the inter-sheet distances range from 5 to 7 Å. The interface created at the fibril core is small, with 229 Å2 of buried surface area, but shows a relatively high degree of shape complementarity (Sc = 0.75; Lawrence & Colman, 1993). The L-GSTSTA protofibril appears tightly restrained within the as shown by a mean B factor of 0.92 Å2. The modeled water molecule also appears to be well ordered, particularly in the structure of L-GSTSTA determined by MicroED, where it has a B factor of 3.28 Å2. The single coordinated water is hydrogen-bonded to Ser708, the C-terminus of a symmetry-related strand and the backbone of Thr709 in the mating sheet (Fig. 4, Supplementary Figs. S12 and S13, and Supplementary Table S4).
3.7. Structure of racemic GSTSTA
In crystals of racemic GSTSTA, homochiral strands stack to form single-enantiomer antiparallel β-sheets (Fig. 4b and Supplementary Fig. S13). Like the homochiral L-GSTSTA sheets, the racemic GSTSTA sheets are buckled, with adjacent strands spaced 4.6 Å apart along each sheet (Fig. 4b). In the structure of racemic GSTSTA these sheets pack with alternating whereby each racemic GSTSTA protofibril is composed of one L-GSTSTA sheet and one D-GSTSTA sheet (Fig. 4b). The packing of D-GSTSTA sheets against their L-GSTSTA mates in the racemic fibril differs from that seen in the homochiral fibrils of L-GSTSTA. An alignment of the two protofibrils shows D-GSTSTA sheets displaced by approximately 5.3 Å compared with their corresponding L counterparts in the homochiral fibril (Fig. 4 and Supplementary Fig. S15). As a result of this displacement, the sheets are spaced farther apart (7–8 Å) in the racemic GSTSTA protofibril (Fig. 4 and Supplementary Fig. S12).
The longer spacing between sheets in the racemic GSTSTA protofibril is associated with bridging waters at its core (Supplementary Fig. S12). These waters make extensive contacts along the protofibril, with each hydrogen-bonding to at least one residue (Fig. 4, Supplementary Fig. S13 and Supplementary Table S4). Notably, the racemic GSTSTA structure shows a distinct rotamer for Ser710, which appears bound to an ordered water, unlike its equivalent residue in the homochiral structure (Fig. 4, Supplementary S15 and Supplementary Table S4). One water (water 1; Supplementary Table S4) links Ser708 and Thr711 on the same D sheet while also coordinating Ser708 of the adjacent L sheet. This water is isolated from the other waters found within the structure. A small network of waters near the protofibril core links the carboxylate of one strand to Thr711 of a symmetry-related strand (Fig. 4, Supplementary Table S4). As in the structure of homochiral L-GSTSTA, the peptide atoms and bound waters in racemic GSTSTA show low B factors.
4. Discussion
Ice nucleation by P. syringae is linked to the expression of surface proteins, including InaZ (Wolber et al., 1986). While full-length InaZ and InaZ fragments help to nucleate ice (Green & Warren, 1985; Kobashigawa et al., 2005), individual InaZ repeats do not (Han et al., 2017). However, at the high concentrations required for crystallization, GSTSTA repeats self-assemble into a protofibrillar structure of corrugated β-sheets (Supplementary Fig. S14). Similar structures are formed by both racemic GSTSTA and L-GSTSTA, and both contain ordered waters bridging tightly packed antiparallel β-sheets (Fig. 4, Supplementary Figs. S12 and S13). These waters may play a role in helping to stabilize the GSTSTA protofibril or could act as bridges or templates for solvent ordering at low temperatures. While we have no evidence to suggest that GSTXT(A/S) repeats facilitate the formation of amyloid-like InaZ protofibrils, our structures of GSTSTA present an opportunity to analyze the interactions between polar residues in InaZ repeats and ordered solvent molecules at atomic resolution.
The structures of entantiomerically pure and racemic GSTSTA present a platform for the comparison of homochiral and racemic amyloid protofibrils (Supplementary Fig. S16). To evaluate the packing of each GSTSTA protofibril, we look at the categorization of strand packing in amyloid fibrils through homosteric zipper classes, which were first proposed by Sawaya et al. (2007) and later by Stroud (2013). Many of these classes have been experimentally observed in amyloid crystals (Nelson et al., 2005; Sawaya et al., 2007). Homochiral GSTSTA forms a class 8 zipper in which two in-register, antiparallel β-sheets meet, related by a 180° rotation normal to the protofibril growth axis (Sawaya et al., 2007; Stroud, 2013). The racemic GSTSTA structure resembles a class 8 zipper but is distinct in that two sheets of opposite handedness come together to form the protofibril (Supplementary Fig. S16). Because of this similarity to a class 8 zipper, we label this arrangement class 8 bar (Supplementary Fig. S16).
The increased propensity for crystallization by racemic mixtures could be exploited to facilitate the growth of amyloid crystals. The symmetry present in racemic amyloid crystals would have to accommodate the packing of homochiral protofibrils into the racemic structure or allow the formation of racemic protofibrils (Yeates & Kent, 2012), as is the case with GSTSTA. Our experiments in high-throughput crystallographic trials of GSTSTA confirm the expected higher propensity for crystallization of racemic mixtures (Yeates & Kent, 2012; Supplementary Fig. S3), yielding a high number of conditions that contain submicrometre-sized crystals suitable for MicroED. The facile determination of ab initio structures from these crystals demonstrates how MicroED combined with solid-phase polypeptide synthesis (Dawson et al., 1994; Merrifield, 1986) can expand the reach of racemic crystallography to submicrometre-sized crystals.
Supporting information
Supplementary Tables and Figures. DOI: https://doi.org/10.1107/S2052252518017621/fq5004sup1.pdf
Acknowledgements
This project was inspired by conversations on racemic crystallography with Todd Yeates (UCLA). We thank Dan Shi (HHMI, Janelia Research Campus), David Boyer and Daniel Anderson (UCLA) for their assistance in data collection, Johan Hattne (UCLA) for helpful discussions and Janak Dadhaniya for assistance with figures.
Funding information
The following funding is acknowledged: US Department of Energy (grant No. DE-FC02-02ER63421); Howard Hughes Medical Institute – Janelia Research Visitor Program; National Institutes of Health, National Institute of General Medical Sciences (grant No. P41 GM103403); National Science Foundation, Office of Integrative Activities (grant No. DMR 1548924); Arnold and Mabel Beckman Foundation (award to Jose A. Rodriguez); Searle Scholars Program (award to Jose A. Rodriguez); Pew Charitable Trusts (award to Jose A. Rodriguez); QCB Collaboratory (award to Marcus Gallagher-Jones); National Institutes of Health, Division of Biomedical Research Workforce (award No. GM007185 to Calina Glynn); Howard Hughes Medical Institute (award to Tamir Gonen).
References
Adams, P. D., Afonine, P. V., Bunkóczi, G., Chen, V. B., Davis, I. W., Echols, N., Headd, J. J., Hung, L.-W., Kapral, G. J., Grosse-Kunstleve, R. W., McCoy, A. J., Moriarty, N. W., Oeffner, R., Read, R. J., Richardson, D. C., Richardson, J. S., Terwilliger, T. C. & Zwart, P. H. (2010). Acta Cryst. D66, 213–221. Web of Science CrossRef CAS IUCr Journals Google Scholar
Avital-Shmilovici, M., Mandal, K., Gates, Z. P., Phillips, N. B., Weiss, M. A. & Kent, S. B. H. (2013). J. Am. Chem. Soc. 135, 3173–3185. CAS Google Scholar
Cochet, N. & Widehem, P. (2000). Appl. Microbiol. Biotechnol. 54, 153–161. CrossRef CAS Google Scholar
Davies, P. L. (2014). Trends Biochem. Sci. 39, 548–555. CrossRef CAS Google Scholar
Dawson, P. E., Muir, T. W., Clark-Lewis, I. & Kent, S. B. (1994). Science, 266, 776–779. CrossRef CAS Google Scholar
Eisenberg, D. & Jucker, M. (2012). Cell, 148, 1188–1203. Web of Science CrossRef CAS PubMed Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501. Web of Science CrossRef CAS IUCr Journals Google Scholar
Fitzpatrick, A. W. P., Falcon, B., He, S., Murzin, A. G., Murshudov, G., Garringer, H. J., Crowther, R. A., Ghetti, B., Goedert, M. & Scheres, S. H. W. (2017). Nature (London), 547, 185–190. Web of Science CrossRef CAS PubMed Google Scholar
Gallagher-Jones, M., Glynn, C., Boyer, D. R., Martynowycz, M. W., Hernandez, E., Miao, J., Zee, C.-T., Novikova, I. V., Goldschmidt, L., McFarlane, H. T., Helguera, G. F., Evans, J. E., Sawaya, M. R., Cascio, D., Eisenberg, D. S., Gonen, T. & Rodriguez, J. A. (2018). Nat. Struct. Mol. Biol. 25, 131–134. CAS Google Scholar
Garnham, C. P., Campbell, R. L. & Davies, P. L. (2011). Proc. Natl Acad. Sci. USA, 108, 7363–7367. Web of Science CrossRef CAS PubMed Google Scholar
Garnham, C. P., Campbell, R. L., Walker, V. K. & Davies, P. L. (2011). BMC Struct. Biol. 11, 36. Google Scholar
Goldschmidt, L., Teng, P. K., Riek, R. & Eisenberg, D. (2010). Proc. Natl Acad. Sci. USA, 107, 3487–3492. CrossRef CAS Google Scholar
Graether, S. P. & Jia, Z. (2001). Biophys. J. 80, 1169–1173. CrossRef CAS Google Scholar
Green, R. L. & Warren, G. J. (1985). Nature (London), 317, 645–648. CrossRef CAS Google Scholar
Han, Y. J., Song, H., Lee, C. W., Ly, N. H., Joo, S.-W., Lee, J. H., Kim, S.-J. & Park, S. (2017). Int. J. Biol. Macromol. 94, 634–641. CrossRef CAS Google Scholar
Hattne, J., Reyes, F. E., Nannenga, B. L., Shi, D., de la Cruz, M. J., Leslie, A. G. W. & Gonen, T. (2015). Acta Cryst. A71, 353–360. Web of Science CrossRef IUCr Journals Google Scholar
Hauptman, H. (1986). Science, 233, 178–183. CrossRef PubMed CAS Web of Science Google Scholar
Hauptman, H. A. (2001). Twentieth Century Harmonic Analysis – A Celebration, edited by J. S. Byrnes, pp. 163–171. Dordrecht: Springer. Google Scholar
Hughes, M. P., Sawaya, M. R., Boyer, D. R., Goldschmidt, L., Rodriguez, J. A., Cascio, D., Chong, L., Gonen, T. & Eisenberg, D. S. (2018). Science, 359, 698–701. CrossRef CAS Google Scholar
Kabsch, W. (2010). Acta Cryst. D66, 125–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kobashigawa, Y., Nishimiya, Y., Miura, K., Ohgiya, S., Miura, A. & Tsuda, S. (2005). FEBS Lett. 579, 1493–1497. CrossRef CAS Google Scholar
Krotee, P., Rodriguez, J. A., Sawaya, M. R., Cascio, D., Reyes, F. E., Shi, D., Hattne, J., Nannenga, B. L., Oskarsson, M. E., Philipp, S., Griner, S., Jiang, L., Glabe, C. G., Westermark, G. T., Gonen, T. & Eisenberg, D. S. (2017). Elife, 6, e19273. CrossRef Google Scholar
Lawrence, M. C. & Colman, P. M. (1993). J. Mol. Biol. 234, 946–950. CrossRef CAS PubMed Web of Science Google Scholar
Mandal, K., Pentelute, B. L., Tereshko, V., Thammavongsa, V., Schneewind, O., Kossiakoff, A. A. & Kent, S. B. H. (2009). Protein Sci. 18, 1146–1154. Web of Science CrossRef PubMed CAS Google Scholar
Mandal, K., Uppalapati, M., Ault-Riché, D., Kenney, J., Lowitz, J., Sidhu, S. S. & Kent, S. B. H. (2012). Proc. Natl Acad. Sci. USA, 109, 14779–14784. Web of Science CrossRef CAS PubMed Google Scholar
Matthews, B. W. (2009). Protein Sci. 18, 1135–1138. Web of Science CrossRef PubMed CAS Google Scholar
Maury, C. P. J. (2009). J. Intern. Med. 265, 329–334. CrossRef CAS Google Scholar
McDonald, I. K. & Thornton, J. M. (1994). J. Mol. Biol. 238, 777–793. CrossRef CAS PubMed Web of Science Google Scholar
Merrifield, B. (1986). Science, 232, 341–347. CrossRef CAS Google Scholar
Nelson, R., Sawaya, M. R., Balbirnie, M., Madsen, A. Ø., Riekel, C., Grothe, R. & Eisenberg, D. (2005). Nature (London), 435, 773–778. Web of Science CrossRef PubMed CAS Google Scholar
Pandey, R., Usui, K., Livingstone, R. A., Fischer, S. A., Pfaendtner, J., Backus, E. H. G., Nagata, Y., Fröhlich-Nowoisky, J., Schmüser, L., Mauri, S., Scheel, J. F., Knopf, D. A., Pöschl, U., Bonn, M. & Weidner, T. (2016). Sci. Adv. 2, e1501630. CrossRef Google Scholar
Patterson, W. R., Anderson, D. H., DeGrado, W. F., Cascio, D. & Eisenberg, D. (1999). Protein Sci. 8, 1410–1422. Web of Science CrossRef PubMed CAS Google Scholar
Pentelute, B. L., Gates, Z. P., Tereshko, V., Dashnau, J. L., Vanderkooi, J. M., Kossiakoff, A. A. & Kent, S. B. H. (2008). J. Am. Chem. Soc. 130, 9695–9701. Web of Science CrossRef PubMed CAS Google Scholar
Rodriguez, J. A., Eisenberg, D. S. & Gonen, T. (2017). Curr. Opin. Struct. Biol. 46, 79–86. Web of Science CrossRef CAS PubMed Google Scholar
Rodriguez, J. A., Ivanova, M. I., Sawaya, M. R., Cascio, D., Reyes, F. E., Shi, D., Sangwan, S., Guenther, E. L., Johnson, L. M., Zhang, M., Jiang, L., Arbing, M. A., Nannenga, B. L., Hattne, J., Whitelegge, J., Brewster, A. S., Messerschmidt, M., Boutet, S., Sauter, N. K., Gonen, T. & Eisenberg, D. S. (2015). Nature (London), 525, 486–490. CrossRef CAS Google Scholar
Sawaya, M. R., Rodriguez, J., Cascio, D., Collazo, M. J., Shi, D., Reyes, F. E., Hattne, J., Gonen, T. & Eisenberg, D. S. (2016). Proc. Natl Acad. Sci. USA, 113, 11232–11236. Web of Science CrossRef CAS PubMed Google Scholar
Sawaya, M. R., Sambashivan, S., Nelson, R., Ivanova, M. I., Sievers, S. A., Apostol, M. I., Thompson, M. J., Balbirnie, M., Wiltzius, J. J. W., McFarlane, H. T., Madsen, A. Ø., Riekel, C. & Eisenberg, D. (2007). Nature (London), 447, 453–457. CrossRef CAS Google Scholar
Sheldrick, G. M. (2008). Acta Cryst. A64, 112–122. Web of Science CrossRef CAS IUCr Journals Google Scholar
Sheldrick, G. M. (2015). Acta Cryst. C71, 3–8. Web of Science CrossRef IUCr Journals Google Scholar
Shi, D., Nannenga, B. L., de la Cruz, M. J., Liu, J., Sawtelle, S., Calero, G., Reyes, F. E., Hattne, J. & Gonen, T. (2016). Nat. Protoc. 11, 895–904. Web of Science CrossRef CAS PubMed Google Scholar
Shi, D., Nannenga, B. L., Iadanza, M. G. & Gonen, T. (2013). Elife, 2, e01345. Web of Science CrossRef PubMed Google Scholar
Stroud, J. C. (2013). Acta Cryst. D69, 540–545. CrossRef IUCr Journals Google Scholar
Tayeb-Fligelman, E., Tabachnikov, O., Moshe, A., Goldshmidt-Tran, O., Sawaya, M. R., Coquelle, N., Colletier, J.-P. & Landau, M. (2017). Science, 355, 831–833. Google Scholar
Warren, G., Corotto, L. & Wolber, P. (1986). Nucleic Acids Res. 14, 8047–8060. CrossRef CAS Google Scholar
Wasmer, C., Lange, A., Van Melckebeke, H., Siemer, A. B., Riek, R. & Meier, B. H. (2008). Science, 319, 1523–1526. CrossRef PubMed CAS Google Scholar
Wolber, P. K., Deininger, C. A., Southworth, M. W., Vandekerckhove, J., van Montagu, M. & Warren, G. J. (1986). Proc. Natl Acad. Sci. USA, 83, 7256–7260. CrossRef CAS Google Scholar
Wukovitz, S. W. & Yeates, T. O. (1995). Nat. Struct. Mol. Biol. 2, 1062–1067. CrossRef CAS Google Scholar
Yeates, T. O. & Kent, S. B. H. (2012). Annu. Rev. Biophys. 41, 41–61. Web of Science CrossRef CAS PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.