research communications
Crystallization and 1.6 Å resolution of an acylated GLP-1/GIP analogue peptide
aDepartment of Chemical Engineering, Imperial College London, London SW7 2AZ, United Kingdom, bDiscovery Chemistry Research and Technology, Lilly Corporate Center, Eli Lilly and Company, Indianapolis, IN 46285, USA, cSynthetic Molecule Design and Development, Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, IN 46221, USA, and dInstitute for Molecular Science and Engineering, Imperial College London, London SW7 2AZ, United Kingdom
*Correspondence e-mail: [email protected], [email protected]
With the meteoric rise in interest in GLP-1 and GIP analogue in recent years, there is a drive for the use of alternative purification techniques to alleviate processing bottlenecks and reduce the cost of peptide manufacturing. However, a lack of reported crystal structures for this class of has hindered molecular-scale understanding of GLP-1/GIP analogue peptide crystallization, particularly related to acylated This paper therefore reports what is believed to be the first of a GLP-1 and GIP analogue lipopeptide. Crystals obtained using a microseed matrix-screening protocol diffracted to ≤1.6 Å resolution in P43, with unit-cell parameters a = b = 64.66, c = 11.42 Å. Model building and the resultant structural analysis reveals that the predominantly helical peptide forms a uniquely porous spiral composed of clockwise-ascending monomers in a square pattern, with aromatic C⋯H—π interactions around Phe22 forming the primary crystal contact between neighbouring square motifs.
Keywords: GLP-1; peptides; crystallization; diabetes; obesity.
PDB reference: acylated GLP-1/GIP analogue peptide, 9tb1
1. Introduction
Therapeutic represent a unique and fast-growing sector of the global pharmaceutical market (Muttenthaler et al., 2021
) owing to their advantages over both small-molecule drugs (improved specificity and toxicity profiles; Rossino et al., 2023
) and biologics [simpler synthesis and development, lower immunogenicity, improved stability and storage (Wang et al., 2022
), as well as potential for oral delivery, as shown by therapeutics such as Rybelsus (Drucker, 2020
)]. Currently, peptides are used to treat a variety of chronic diseases, such as diabetes, cancer and various autoimmune disorders (Muttenthaler et al., 2021
). Of these, the class of GLP-1 and GIP analogue peptides, used for the treatment of type 2 diabetes and obesity, have seen explosive growth in the past five years. Multiple blockbuster peptides such as semaglutide (Knudsen & Lau, 2019
; sold as Ozempic, Wegovy and Rybelsus) have emerged in this class, and advancements in upstream manufacturing have enabled efficient synthesis techniques on industrially relevant scales (Frederick et al., 2021
). However, the separation and purification of these peptides are almost entirely achieved via chromatographic methods (Al Musaimi & Jaradat, 2024
), which contribute significantly to the high costs of production and therefore drive up costs to patients. As a result, purification is often the bottleneck to peptide manufacturing (Isidro-Llobet et al., 2019
), and there is therefore a drive to explore alternative purification methodologies beyond chromatography.
Crystallization is one such alternative to chromatographic separations, and is already widely employed for the purification of small-molecule pharmaceuticals, offering high degrees of purification, lower operating costs and improved stability of the crystalline product (Roque et al., 2020
). In this context, crystallization is used as a preparative technique, where the isolation of a crystalline phase is sufficient and diffraction-quality crystals are not required. However, outside of insulin (dos Santos et al., 2017
), peptides are not commonly purified by crystallization. This is due in part to the prevalence of flexible sections of the peptide, coupled with the comparatively smaller size of peptides compared with macromolecules such as proteins. As a result, highly flexible portions of the peptide are inherently present on the surface of the peptide. The presence of highly entropic chains on the surface of the peptide has been identified as the most deleterious feature to crystallization (Price et al., 2009
), making peptides notoriously difficult to crystallize. This issue is exacerbated by the common addition of long, flexible conjugate groups (commonly fatty acids or polyethylene glycols) to the peptides in order to enhance their bioavailability (Wijesinghe et al., 2022
), which acts as a further detriment to crystallization. In the case of fatty acid-conjugated peptides, while limited exceptions do exist (Ho et al., 2008
), the resultant class of `lipopeptides' is broadly perceived as a noncrystallizing class of molecule (Castelletto & Hamley, 2018
; Pilz et al., 2023
).
Substantial research efforts have been dedicated to the crystallization of short peptides (Verma et al., 2022
; Verma, Bade et al., 2023
; Verma, Mitchell et al., 2023
; Guo et al., 2023
) as well as some model larger peptides such as insulin (Schlichtkrull et al., 1957
; Link & Heng, 2021
; Link & Heng, 2022
; Ferreira et al., 2022
) and vancomycin (Kim et al., 2011
; Li et al., 2023
). Additionally, crystallographic analysis has been performed on a variety of peptides for the exploration of subjects such as amyloid fibrils (Yoo et al., 2016
; Samdin et al., 2023
) and peptide antibiotics (Pfeffer et al., 1991
). For the family of GLP-1 and/or GIP analogue peptides, receptor-complexed crystal structures have been elucidated for GLP-1, GIP and exendin-4 (Underwood et al., 2010
; Parthier et al., 2007
; Runge et al., 2008
), as well as some analogues (including notable therapeutic peptides such as semaglutide; Lau et al., 2015
; Oddo et al., 2018
; Evers et al., 2018
). However, crystal structures for non-receptor-complexed peptides in this family are significantly more sparse. Crystallization has been demonstrated for GLP-1 (Kim & Haren, 1995
) and the analogue glucagon-cex (Li et al., 2007
), and a crystal structure has been reported for a separate glucagon analogue (Sturm et al., 1998
). However, for conjugated GLP-1 peptides such as semaglutide and tirzepatide, which fall under the category of lipopeptides owing to the fatty-acid conjugation, there is a distinct lack of reported crystallization or crystal structures for the unbound (not complexed with the GLP-1 or GIP receptor) molecule (Knudsen & Lau, 2019
).
In the present study, crystallization was initially explored in the context of assessing the feasibility of crystallization as a preparative strategy for a GLP-1/GIP lipopeptide, with single-crystal X-ray diffraction employed primarily to verify the nature of the crystalline phase and to gain insight into crystal packing and stability. However, the structure obtained from the diffraction experiments is believed to be the first reported crystal structure for an unbound (non-receptor-complexed) GLP-1/GIP analogue, motivating this work as a standalone crystallographic study.
2. Materials and methods
2.1. GG-353
GG-353 is a 42-amino-acid lipopeptide consisting of a 39-amino-acid backbone which incorporates sequence components based on the GLP-1 and GIP hormones. GG-353 is synthesized by solid-phase or solution-phase techniques; its sequence is presented in Fig. 1
. The backbone contains multiple non-natural amino-acid residues, which are listed below including their corresponding three-letter codes.
|
| | Figure 1 Molecular sequence of GG-353. Natural amino acids are represented by their single-letter codes; non-natural amino acids are shown in detail and labelled with their three-letter code and residue number. |
| | Figure 2 Aligned sequences for GG-353 compared with exendin-4, GLP-1 (7–37) and glucagon-cex. Non-natural amino acids have been replaced with their closest proteinogenic analogue (Ala2, Phe6, Leu13, Lys16, Lys17, Ala20 and Glu24) and are denoted in lower case. Matched residues to GG-353 are represented by a period, fully conserved residues are highlighted in blue and absent residues are represented by a dash. |
GG-353 was provided as a sodium salt by an industrial collaborator at a purity of ≥95% (with the identity and purity confirmed by the supplier via HPLC), and used as received without further purification. It was delivered as a white lyophilized powder and stored at −20°C prior to use. The purity and identity of the raw material were further verified in-house via LC/MS with a Shimadzu Nexera LCMS-2050, details of which can be found in the supporting information.
2.2. Crystallization
To make stock solutions, GG-353 powder was removed from the freezer and allowed to equilibrate at room temperature for at least 30 min before use, before being solubilized at concentrations of 70 and 100 mg ml−1 in purified deionized water provided by a PURELAB Chorus 1 (ELGA LabWater, High Wycombe, UK), followed by filtration through a 0.2 µm pore-size regenerated cellulose membrane filter (Sartorius Minisart RC4) to remove any solid contaminants. GG-353 was found to be readily soluble in water (≥200 mg GG-353 per millitre of water), so no pH adjustment or buffer preparation was required at this stage. The pH as prepared was determined using a Mettler Toledo LE420 electrode to be 7.4 at room temperature. The solutions were stored at 4°C when not in use, and allowed to warm to room temperature for ≥30 min before use in screens.
2.2.1. Initial screening
Initial crystallization attempts were performed using the commercially available IndexHT (referred to here as `Index') and PEG/IonHT (`PEGIon') kits from Hampton Research, as well as the MemGold2 HT-96 (`MemGold2') kit from Molecular Dimensions. Crystallization experiments were performed in a 96-well sitting-drop configuration using SWISSCI MRC 2 Lens crystallization plates. Each reservoir was initially filled with 100 µl reservoir solution from the kits. Sitting drops were dispensed using an SPT Labtech Mosquito LCP (Melbourn, UK), with each drop containing 100 nl GG-353 stock solution and 100 nl reservoir (precipitant) solution. The plates were sealed and incubated at 298 K.
2.2.2. Microseed matrix screening
Crystals generated from subsequent unseeded sitting-drop vapour-diffusion optimization experiments were used to create a seed stock of GG-353 crystals. Optimization of the initial hit condition from screening proved to be a difficult task owing to issues with poor reproducibility. A detailed description of attempts to optimize these conditions is given elsewhere (Mitchell et al., 2026
). Briefly, the crystals used for the seed stock were grown via sitting-drop vapour diffusion at the 4 µl scale, which comprised 2 µl 70 mg ml−1 GG-353 stock solution in water and 2 µl precipitant/reservoir solution containing 0.1 M bis-Tris pH 5.3, 0.2 M ammonium acetate, 25%(w/v) PEG 3350. The methodology for creating the seed stock was similar to that outlined elsewhere (D'Arcy et al., 2014
) and is given in detail in the supporting information.
To incorporate the seed stock into crystallization screens, an identical procedure was followed to that of the initial screening, but with the drops consisting of 100 nl 70 mg ml−1 GG-353 stock solution, 80 nl reservoir solution and 20 nl seed stock. Unseeded experiments (100 + 100 nl) were also carried out in parallel to be able to compare seeded and unseeded experiments directly. The plates were again sealed and incubated at 298 K; this temperature was chosen as optimization experiments of the unseeded crystallization conditions (which are covered in detail elsewhere; Mitchell et al., 2026
) showed a preference for this temperature over the more conventionally used 293 K.
2.3. Data collection and analysis
Prior to crystallographic data collection, single crystals were isolated from the drop and flash-cooled in liquid nitrogen. For crystals obtained in precipitant solutions which did not contain a suitable cryoprotectant, the crystals were first soaked in a 70:30 mixture of precipitant solution:ethylene glycol before being cooled.
Single-crystal X-ray diffraction studies were performed on the I24 tuneable microfocus beamline at Diamond Light Source (Oxford, UK) using a CdTe EIGER2 9M detector set to a wavelength of 0.62 Å. 360° of data were collected for each crystal, with an oscillation of 0.1° and exposure of 0.05 s per diffraction image. The transmission was fixed at 20%, with a beam size of 7 × 7 µm and a crystal-to-detector distance of 193.9 mm. Data were collected at 100 K.
Diffraction data were processed using the built-in data-processing pipeline at Diamond Light Source. The data were indexed and reduced using fast_dp (Winter & McAuley, 2011
), xia2 (Winter, 2010
) and autoPROC (Vonrhein et al., 2011
). If the indexed, reduced data were of sufficient quality, they were automatically sent for model building and refinement via molecular replacement using the MrBUMP pipeline (Keegan & Winn, 2008
). Diffraction data and the resulting density maps and structures were also processed manually using the CCP4i2 (Potterton et al., 2018
) and CCP4 Cloud (Krissinel et al., 2022
) software; data reduction was performed using DIALS (Winter et al., 2018
) with merging and analysis from AIMLESS (Evans & Murshudov, 2013
), and molecular replacement was carried out with Phaser (McCoy et al., 2007
) with the initial model generated by MrBUMP. Model building was performed using Buccaneer (Cowtan, 2006
) and ModelCraft (Bond & Cowtan, 2022
), model refinement was performed using REFMAC (Murshudov et al., 1997
) and manual model building, particularly, the incorporation of non-natural amino acids into the peptide sequence, was performed using Coot (Emsley & Cowtan, 2004
). Model visualization and validation were carried out using CCP4MG (McNicholas et al., 2011
) and Iris (Rochira & Agirre, 2021
), respectively.
The incorporation of non-natural amino acids was for the most part a simple process, as α-methylalanine, ornithine, α-methylleucine, (2-fluoro)-α-methylphenylalanine and D-glutamic acid were included in the monomer libraries available in Coot and thus could be added via mutation of the naturalized sequence. Addition of the fatty acid conjugated to the side-chain amine on Lys17 was performed by defining the entire conjugate molecule (AEEA-AEEA-γGlu-C18 diacid) as a ligand using AceDRG and connecting the ligand to Lys17 with a defined amide bond via AceDRG.
3. Results and discussion
3.1. Initial screening
After 17 days, crystals were observed (see Fig. 3
a) in condition G6 of the Index screen [0.1 M bis-Tris pH 5.5, 0.2 M ammonium acetate, 25%(w/v) PEG 3350] at GG-353 stock concentrations of both 70 and 100 mg ml−1. Crystals grew as hexagonal and rod-shaped plates inside a pre-existing coacervate phase, which was determined via UV microscopy to be comparatively richer in GG-353 than the bulk phase.
| | Figure 3 (a) Hexagonal and rod-like crystals of GG-353 observed during initial crystallization screening. (b) Hexagonal crystals of GG-353 grown in unseeded optimization experiments. The green line (short diagonal) represents approximately 80 µm. |
The conditions discovered from initial screening were subsequently optimized (details of this optimization can be found elsewhere; Mitchell et al., 2026
), with regular hexagonal plate crystals observed to grow up to 80 µm along their short diagonal (Fig. 3
b). However, little-to-no diffraction was observed during single-crystal X-ray studies. It was also observed that the crystals were exceptionally fragile owing to their habit, and behaved in a gel-like manner, indicating a high solvent content, which may be the reason for the lack of diffraction (McPherson & Cudney, 2014
).
3.2. Microseed matrix screening
Microseed matrix screening increased the number of hits observed during sparse-matrix screening significantly over the duration of the seeded experiments. As can be seen in Fig. 4
, while a further two hits were discovered in the unseeded screens (for a total of three over the 288 conditions studied), seeding resulted in ten unique conditions which were conducive to crystal growth. Images of all crystals observed during microseed matrix-screening experiments, including those produced without seeding, are provided in the supporting information. The reservoir/precipitant conditions for each hit are given in Table 1
.
| ||||||||||||||||||||||||||||||||||||
| | Figure 4 Comparison of unseeded and seeded screening results using Index, PEGIon and MemGold2 sparse-matrix screens. Crystalline hits are highlighted in green, and the time taken to observe crystals (in days) is given within the highlighted cell. The reservoir/precipitant conditions for each hit are given in Table 1 |
These conditions can be broadly divided into two categories: (i) conditions which resulted in the growth of hexagonal plate-like crystals as previously observed and (ii) conditions which resulted in the observation of new crystal habits. In conjunction with seeding, crystals were obtained in a variety of buffers and buffer pHs (4.5–7), salts and PEG molecular weights (400–6000); a vast increase when compared with the hits obtained from unseeded experiments, which were limited to a single buffer (bis-Tris pH 5.5) and molecular-weight PEG (3350). Similarly to protein crystallization (D'Arcy et al., 2014
), the use of microseed matrix screening is a material-sparse and efficient method for uncovering and optimizing crystallization conditions for peptides.
Of particular interest were the new crystal habits uncovered during MMS. While hexagonal crystals of a suitable size for single-crystal analysis were obtained from seeded conditions (as well as two new unseeded conditions), subsequent diffraction experiments resulted in a lack of improvement, with little-to-no diffraction observed. However, the crystals obtained in condition A11 of MemGold2, while initially appearing as a poor candidate for optimization, owing to the dense clusters of acicular crystals (Fig. 5
), exhibited a dramatic improvement in diffraction quality. It was also noted while harvesting these crystals for analysis that they exhibited improved mechanical integrity, signifying that the solvent content was significantly reduced and evidencing a change in polymorph via MMS. The improved mechanical integrity also suggested that these crystals produced by seeding may be a more viable candidate for further optimization, especially with the view of attempting to scale towards batch crystallization for purification purposes. It is unknown whether the seed material acts as a `classical' seed which grows upon addition to a supersaturated solution, or via epitaxial jumps (Stura et al., 1999
) whereby a new polymorph nucleates and grows on the surface of the seed crystal due to favourable epitaxy between the two polymorphs. Unlike the crystals from which the seed stock was produced, these acicular crystals also exhibited strong birefringence under cross-polarized light (see Fig. 5
). Unfortunately, the lack of diffraction of the initial crystals prevented a true understanding of potential polymorphism.
| | Figure 5 Clusters of fine acicular crystals exhibiting strong birefringence under polarized light [reservoir condition: 0.1 M sodium citrate pH 4.5, 0.2 M ammonium phosphate monobasic, 0.1 M ammonium sulfate, 32%(v/v) PEG 400]. Postulated positions of seed material are indicated with arrows. |
3.3. Single-crystal X-ray data collection
Where of suitable size, crystals were harvested from both unseeded and seeded conditions for single-crystal diffraction experiments. For the plate-like crystals, the degree of diffraction was either very poor or absent, indicating that the diffraction quality of the plate-like crystals had not notably improved through MMS. The crystals from PEGIon condition B3 showed the best improvement, with diffraction to ∼10 Å resolution but low overall completeness (∼30%). Initial indexing of the diffraction data gave a = b = 168.21, c = 180.33 Å, α = β = 90, γ = 120°. The volume of the was excessively large compared with the molecular weight of GG-353; Matthews coefficient analysis suggested that there were an enormous number (∼250–500) of monomers in the Ultimately, the poor quality of the diffraction data obtained prevented any meaningful interpretation.
However, a crystal from MemGold2 condition A11 diffracted to a relatively high resolution (∼1.6 Å as determined by a fitted CC1/2 > 0.3 cutoff in DIALS). A single 360° sweep was found to provide a full (≥99.8% complete) dataset, with the automatic data-reduction protocols suggesting space group P41, resulting in ambiguity between the P41/P43 enantiomorphic pair. Subsequent initial map inspection revealed that the true was P43, which was then fixed for subsequent manual reprocessing of the diffraction data. Calculation of the Matthews coefficient resulted in an estimated solvent content of ∼49%, corresponding to one monomer in the asymmetric unit. The data-reduction statistics for the dataset collected are given in Table 2
; per-shell statistics can be found in the supporting information.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
3.4. Structure solution
Automated model building initially gave a poor solution to the experimentally determined electron-density map, due to the aforementioned enantiomorphic ambiguity of the With the data re-indexed in the true (P43), it was then possible to obtain a much better model fit. Among the tested search models in MrBUMP, the template derived from PDB entry 2qkh (60% sequence identity) gave the best solution, with a TFZ score of 4.5 and a LLG of 28.0, and was subsequently refined. The maps were of sufficient quality to manually build all non-natural amino acids (positions 2, 6, 13, 16, 20 and 24); many of these modifications were visible directly from the difference density maps. However, attempts to build the model out past Ser32 were largely unsuccessful owing to a lack of continuous electron density (see Fig. 6
). Similarly, attempts to model the conjugate molecule on Lys17 were also not possible. This was reasoned to be due to high conformational flexibility for both the amidated C-terminus (Ser33–Ser39) and the conjugate molecule. This result is in line with those obtained from cryo-EM studies for tirzepatide (which contains an identical amidated C-terminal composition and is similarly acylated) in complex with both the GLP-1 and GIP receptors (Zhao et al., 2022
), which similarly were not able to resolve the lipid chain, as well as solution NMR studies of exendin-4 and GLP-1 analogues (Evers et al., 2017
, 2018
), which both show significant flexibility as per the ensemble. For the amidated C-terminal residues (Ser33–Ser39), the theory of high conformational flexibility corroborates with a notable increase in the isotropic B factor for Pro31 (39.2 Å2) and Ser32 (71.2 Å2) compared with the average B factor for the rest of the chain (21.8 Å2). For the conjugate molecule, patches of difference density were observed in the vicinity of Lys17, but a lack of clear connection and the observed conformational flexibility of the side chain on Lys17 (as shown by partial occupancy of the side chain and the two resultant alternate conformations (see Fig. 6
) prevented meaningful interpretation of these difference maps. Additionally, the components present in the crystallization cocktail (namely PEG 400 and PEG 3350) would present similar conformational flexibility and could also be part of the crystal structure, so the resultant ambiguity further prevented building of the conjugate molecule. Ultimately, it was not possible to prove the existence of the lipid chain or the amidated C-terminal residues Ser33–Ser39, and therefore the intactness of the peptide, solely via crystallographic data.
| | Figure 6 (a) Final molecular-replacement model fit to the experimentally obtained electron-density map via diffraction data. (b) Resolution of additional atoms in the nonstandard amino acids Aib2 and 9DT6. |
As a result, the conjugate molecule was removed from model building, and where sufficiently long continuous sections of density were observed, triethylene glycol molecules were added as ligands to represent either the conjugate molecule or any PEG molecules which may be part of the A total of two triethylene glycol molecules were added as ligands to represent density in the `corners' of the (see Fig. 7
): one in the vicinity of Lys17 and another on the opposite side of the peptide chain. Due to the geometry of the crystal structure, the second ligand is unlikely to be part of the conjugate molecule, and is postulated to instead be a fragment of a PEG molecule present in the crystallization liquor (either PEG 400 or PEG 3350). Further refinement led to final R and Rfree values of 0.208 and 0.245, respectively. The full list of are provided in Table 3
.
| ||||||||||||||||||||||||||||||||||||||
| Figure 7 (a) Positions of triethylene glycol `ligands' fitted to the density map and their corresponding positions in the crystal lattice. (b) Position of triethylene glycol 1 in the density map. (c) Position of triethylene glycol 2 in the density map. |
It is worth noting that the for the GG-353 structure appear to be slightly anomalous at first glance, notably the higher-than-expected Rfree value given the resolution of the data, and the relatively high percentage of Ramachandran outliers. The heightened Rfree value can be attributed to the aforementioned positive density, especially in the vicinity of Lys17, which could not be explained by the model. As a result, it was deemed more appropriate to leave these patches of density unmodelled rather than attempting to spuriously fit more ligands to `improve' the model. Additionally, inspection of the Ramachandran outliers using Iris showed that the only Ramachandran outlier was Gly30. Manual inspection of the model and map revealed that the density was clear in the vicinity of Gly30 and no other conformation was possible. To attempt to improve the model statistics, paired via PAIREF (Malý et al., 2020
) was attempted at resolutions of 2.5–1.5 Å, as AIMLESS had estimated resolution limits of 2.00 Å [based on overall 〈I/σ(I)〉 > 2] to 1.59 Å [based on 〈I/σ(I)〉 > 1 along the l axis]. The resultant statistics (namely R and Rfree) appeared `better' at the suggested resolution cutoff of 2.5 Å, owing to the reduced noise in the maps and the higher values of R and Rfree associated with lower resolution data, but this came at the penalty of a less detailed map; for example, the alternative conformations of Ile27 were no longer visible. As such, paired was not pursued further and the resolution of the data was kept at 1.59 Å. It can be argued that the value of CC1/2 in the highest resolution shell at this resolution (see Table 2
) is low, making the resolution choice difficult to justify. However, literature dedicated to resolution cutoffs in crystallographic analysis indicate that a statistic which can be reliably used to indicate resolution cutoff does not exist (Karplus & Diederichs, 2012
; Dubach & Guskov, 2020
) and that `optimal' values for CC1/2 may lie between 0.1 and 0.5 (Dubach & Guskov, 2020
).
As expected from analysis of other GLP-1 analogue peptides, GG-353 contains an α-helical motif running from Gly4 to Glu28, with the remaining sections being randomly coiled. However, the crystal structure of GG-353 is quite interesting; monomers of GG-353 form square motifs within the crystal lattice, with the N-terminus of one monomer lying next to the amidated C-terminus of the next (see Fig. 8
). These square motifs are helical in and of themselves, forming a clockwise-ascending spiral structure, with the helical core of each monomer stacking to form square channels running through the crystal lattice. As a result, the crystal lattice is inherently porous, with the square pores being approximately 30 × 30 Å in area and extending infinitely down the c axis of the crystal. This type of crystal packing is reminiscent of that of a macrocyclic β-sheet peptide (Yoo et al., 2016
). It is worth noting, however, that these porous regions would also contain the disordered sections of the peptide, namely the amidated C-terminal residues and the conjugate molecule. As such, the accessible volume of the pore would be significantly lower than the total volume available, and any species which diffuse into the pores in the crystal lattice (e.g. water, salts) would also interact with the disordered, flexible regions of the peptide within the lattice. Due to the the pores are not identical; each pore either contains conjugate molecules or amidated C-terminal residues (Ser33–Ser39), but not both (see Fig. 8
). As the conjugate molecule is primarily composed of the aliphatic chain of the fatty acid, it is expected that the hydrophobicity of the pores containing the amidated C-terminal residues would be lower than that of the pores containing the fatty-acid conjugate molecule. It is also worth noting that while the conjugate molecule and C-terminal residues were omitted from the final model, it was possible to fit them into the crystal structure without atomic clashes, reinforcing the idea that flexible sections of the peptide are still present in the crystal structure.
| Figure 8 Experimentally determined crystal structure of GG-353. (a) Square channels formed down the c axis of the with channel formation facilitated by hydrogen-bonding interactions between terminal residues. Positions of attachment of the flexible fatty-acid conjugate molecule onto Lys17 are indicated with red arrows; positions of attachment of flexible amidated C-terminal residues are indicated with black arrows. (b) Pore formation is facilitated by stacking of the helical peptide portions to form clockwise-ascending spirals. GG-353 monomers are coloured by crystal symmetry. (c) Hydrophobicity of a GG-353 molecule superimposed into the crystal packing, with hydrophobic regions (beige) buried within the crystal structure and hydrophilic regions (blue) exposed to solvent channels. (d) C⋯H—π interactions around Phe22 (with Ile23, Leu26 and Ile27) forming the primary crystal contact between square motifs in the lattice. |
As shown in Fig. 8
(d), the primary crystal contact between neighbouring square motifs exists in the form of C⋯H—π interactions around Phe22 between neighbouring chains, which simultaneously act to bury the hydrophobic section of GG-353 within the and reduce the exposure of hydrophobic groups to solvent. In this region, Phe22 interacts with other hydrophobic residues, namely Ile23, Leu26 and Ile27, both within and between chains. The occurrence of this crystal contact is in line with results postulated by Kim and Haren, who observed that the addition of aromatic compounds (e.g. phenol) to crystals of insulinotropin [GLP-1(7–37)] resulted in the transition to an amorphous state, and postulated that this was due to the disturbance of lattice interactions in this hydrophobic region around Phe22 (Kim & Haren, 1995), which this supports. These hydrophobic interactions about Phe residues have also been observed in the 3 Å resolution of another glucagon analogue (Sturm et al., 1998
). In contrast, the stacking of helices along the c axis is facilitated by salt bridges, such as those between the side-chain amine of Orn16 with the side-chain carboxyl group of Asp15, as well as the N-terminus with the side-chain carboxyl group of Glu21. This is bolstered by hydrogen-bonding interactions along the chain, such as those between the main-chain amide NH of Phe22 with the side-chain carboxyl group of D-Glu24, the side-chain amine of Orn16 with the side-chain hydroxyl group of Ser11, and the main-chain carboxyl of Gly30 with the main-chain amide NHs of Glu3 and Gly4.
This crystal structure is therefore unique; unlike proteins, which are large enough to be able to sequester flexible or hydrophobic regions within their tertiary structure, smaller peptides such as GG-353 are inherently solvent-exposed throughout the chain. As such, the crystal lattice itself accounts for these otherwise deleterious features by burying them within the crystal contacts (as shown by Phe22) or by allowing them to occupy void spaces within the lattice (as in the case of the amidated C-terminal Ser33–Ser39 residues and fatty-acid conjugate molecule on Lys17). The difficulty in obtaining a crystal structure for GG-353 may therefore lie in its low crystallization propensity, as a result of its inherent disorder in solution. This theory corroborates with previous experimental crystallization screening of glucagon-cex, which does not contain a fatty acid and was found to crystallize readily from solution (Li et al., 2007
), as well as a similar glucagon analogue which was similarly not acylated (Sturm et al., 1998
).
4. Conclusions
Herein, the of a noncomplexed GLP-1/GIP analogue lipopeptide is reported for the first time. The use of a single round of microseed matrix screening was found to be a powerful tool to overcome diffraction issues, which conventional unseeded optimization approaches could not. It is recommended that similarly to proteins, microseed matrix screening should be attempted as soon as feasibly possible for as it is a simple but powerful optimization method. The use of microseed matrix screening was also found to improve important crystal-quality attributes for the goal of purification, signifying its potential utility both for crystallographic and bulk-scale crystallization purposes.
In terms of the peptide forms a unique composed of square pores which facilitate flexible sections of the peptide to remain disordered. Analysis of the crystal-packing interactions reveals that while the stacking of helical sections of neighbouring is mediated by hydrogen bonding, the primary crystal contact between the square pores is aromatic C⋯H—π interactions of Phe22. This serves as a proof-of-concept for the feasibility of crystallization of acylated or otherwise conjugated GLP-1 analogue as well as providing insight into critical residues for the formation of important crystal contacts. Therefore, the results reported herein warrant further investigation into developing crystallization as a purification process for GLP-1 analogue (lipo)peptides, particularly related to transitioning from the nanolitre-scale vapour-diffusion experiments performed to batch or continuous crystallization setups on millilitre or litre scales. The dramatic improvement in diffraction quality via microseed matrix screening also suggests that further optimization of the new seeded condition would be fruitful in further improving crystal quality.
Supporting information
PDB reference: acylated GLP-1/GIP analogue peptide, 9tb1
LS/MS analysis, seed preparation protocol, crystal images, per-shell data reduction statistics and Ramachandran analysis, including Supplementary Tables and Figures. DOI: https://doi.org/10.1107/S2053230X26001937/va5069sup1.pdf
Acknowledgements
The authors gratefully acknowledge Dr Marc Morgan and Dr Sam Horrell of the Centre for Structural Biology at Imperial College London, and Sherry Guo of Eli Lilly and Company, for their assistance with sample preparation for single-crystal X-ray diffraction experiments, as well as Dr Maxim Bird of Imperial College London for performing LC/MS and pH analysis. The authors would like to thank Diamond Light Source for beamtime (proposal mx31800) and the staff of beamlines I04 and I24 for assistance with crystal testing and data collection. Finally, the authors thank Dr Jeremy Merritt, Dr Jon Selbo, Dr Jing Teng and Dr Sofiane Saouane of Eli Lilly and Company for their useful insight into crystallization techniques.
Conflict of interest
The authors declare no conflicts of interest.
Data availability
The associated structure for GG-353 has been deposited within the PDB (PDB entry 9tb1).
Funding information
HMM gratefully acknowledges financial support from Eli Lilly and Company and the Engineering and Physical Sciences Research Council (EPSRC) of the UK via Prosperity Partnership (grant Nos. EP/T005556/1 and EP/T518207/1).
References
Al Musaimi, O. & Jaradat, D. M. M. (2024). Separations, 11, 233. CrossRef Google Scholar
Bond, P. S. & Cowtan, K. D. (2022). Acta Cryst. D78, 1090–1098. Web of Science CrossRef IUCr Journals Google Scholar
Castelletto, V. & Hamley, I. W. (2018). Methods Mol. Biol. 1777, 3–21. CrossRef PubMed Google Scholar
Cowtan, K. (2006). Acta Cryst. D62, 1002–1011. Web of Science CrossRef CAS IUCr Journals Google Scholar
D'Arcy, A., Bergfors, T., Cowan-Jacob, S. W. & Marsh, M. (2014). Acta Cryst. F70, 1117–1126. Web of Science CrossRef IUCr Journals Google Scholar
dos Santos, R., Carvalho, A. L. & Roque, A. C. A. (2017). Biotechnol. Adv. 35, 41–50. CrossRef PubMed Google Scholar
Drucker, D. J. (2020). Nat. Rev. Drug Discov. 19, 277–289. CrossRef PubMed Google Scholar
Dubach, V. R. & Guskov, A. (2020). Crystals, 10, 580. CrossRef Google Scholar
Emsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126–2132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Evans, P. R. & Murshudov, G. N. (2013). Acta Cryst. D69, 1204–1214. Web of Science CrossRef CAS IUCr Journals Google Scholar
Evers, A., Bossart, M., Pfeiffer-Marek, S., Elvert, R., Schreuder, H., Kurz, M., Stengelin, S., Lorenz, M., Herling, A., Konkar, A., Lukasczyk, U., Pfenninger, A., Lorenz, K., Haack, T., Kadereit, D. & Wagner, M. (2018). J. Med. Chem. 61, 5580–5593. CrossRef PubMed Google Scholar
Evers, A., Haack, T., Lorenz, M., Bossart, M., Elvert, R., Henkel, B., Stengelin, S., Kurz, M., Glien, M., Dudda, A., Lorenz, K., Kadereit, D. & Wagner, M. (2017). J. Med. Chem. 60, 4293–4303. CrossRef PubMed Google Scholar
Ferreira, J., Sárkány, Z., Castro, F., Rocha, F. & Kuhn, S. (2022). J. Cryst. Growth, 582, 126516. CrossRef Google Scholar
Frederick, M. O., Boyse, R. A., Braden, T. M., Calvin, J. R., Campbell, B. M., Changi, S. M., Coffin, S. R., Condon, C., Gowran, O., McClary Groh, J., Groskreutz, S. R., Harms, Z. D., Humenik, A. A., Kallman, N. J., Klitzing, N. D., Kopach, M. E., Kretsinger, J. K., Lambertus, G. R., Lampert, J. T., Maguire, L. M., Moynihan, H. A., Mullane, N. S., Murphy, J. D., O'Mahony, M. E., Richey, R. N., Seibert, K. D., Spencer, R. D., Strege, M. A., Tandogan, N., Torres Torres, F. L., Tsukanov, S. V. & Xia, H. (2021). Org. Process Res. Dev. 25, 1628–1636. CrossRef Google Scholar
Gallwitz, B. (2022). Front. Endocrinol. 13, 1004044. CrossRef Google Scholar
Guo, M., Jones, M. J., Goh, R., Verma, V., Guinn, E. & Heng, J. Y. Y. (2023). Cryst. Growth Des. 23, 1668–1675. Web of Science CrossRef CAS PubMed Google Scholar
Ho, D. N., Pomroy, N. C., Cuesta-Seijo, J. A. & Privé, G. G. (2008). Proc. Natl Acad. Sci. USA, 105, 12861–12866. CrossRef PubMed Google Scholar
Isidro-Llobet, A., Kenworthy, M. N., Mukherjee, S., Kopach, M. E., Wegner, K., Gallou, F., Smith, A. G. & Roschangar, F. (2019). J. Org. Chem. 84, 4615–4628. PubMed Google Scholar
Karplus, P. A. & Diederichs, K. (2012). Science, 336, 1030–1033. Web of Science CrossRef CAS PubMed Google Scholar
Keegan, R. M. & Winn, M. D. (2008). Acta Cryst. D64, 119–124. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kim, Y. & Haren, A. M. (1995). Pharm. Res. 12, 1664–1670. CrossRef PubMed Google Scholar
Kim, Y.-N., Lee, J.-Y. & Kim, J.-H. (2011). Process Biochem. 46, 2068–2073. CrossRef Google Scholar
Knudsen, L. B. & Lau, J. (2019). Front. Endocrinol. 10, 155. CrossRef Google Scholar
Krissinel, E., Lebedev, A. A., Uski, V., Ballard, C. B., Keegan, R. M., Kovalevskiy, O., Nicholls, R. A., Pannu, N. S., Skubák, P., Berrisford, J., Fando, M., Lohkamp, B., Wojdyr, M., Simpkin, A. J., Thomas, J. M. H., Oliver, C., Vonrhein, C., Chojnowski, G., Basle, A., Purkiss, A., Isupov, M. N., McNicholas, S., Lowe, E., Triviño, J., Cowtan, K., Agirre, J., Rigden, D. J., Uson, I., Lamzin, V., Tews, I., Bricogne, G., Leslie, A. G. W. & Brown, D. G. (2022). Acta Cryst. D78, 1079–1089. Web of Science CrossRef IUCr Journals Google Scholar
Lau, J., Bloch, P., Schäffer, L., Pettersson, I., Spetzler, J., Kofoed, J., Madsen, K., Knudsen, L. B., McGuire, J., Steensgaard, D. B., Strauss, H. M., Gram, D. X., Knudsen, S. M., Nielsen, F. S., Thygesen, P., Reedtz-Runge, S. & Kruse, T. (2015). J. Med. Chem. 58, 7370–7380. Web of Science CrossRef CAS PubMed Google Scholar
Li, B., Yao, T., Tao, T., Han, D., Yang, H., Gao, Z. & Gong, J. (2023). Ind. Eng. Chem. Res. 62, 21760–21769. CrossRef Google Scholar
Li, P., Rogers, T., Smiley, D., DiMarchi, R. D. & Zhang, F. (2007). Acta Cryst. F63, 599–601. CrossRef IUCr Journals Google Scholar
Link, F. J. & Heng, J. Y. Y. (2021). CrystEngComm, 23, 3951–3960. CrossRef Google Scholar
Link, F. J. & Heng, J. Y. Y. (2022). Cryst. Growth Des. 22, 3024–3033. CrossRef PubMed Google Scholar
Malý, M., Diederichs, K., Dohnálek, J. & Kolenko, P. (2020). IUCrJ, 7, 681–692. Web of Science CrossRef PubMed IUCr Journals Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658–674. Web of Science CrossRef CAS IUCr Journals Google Scholar
McNicholas, S., Potterton, E., Wilson, K. S. & Noble, M. E. M. (2011). Acta Cryst. D67, 386–394. Web of Science CrossRef CAS IUCr Journals Google Scholar
McPherson, A. & Cudney, B. (2014). Acta Cryst. F70, 1445–1467. Web of Science CrossRef IUCr Journals Google Scholar
Mitchell, H. M., Gardner, S., Guinn, E. J., Teng, J. & Heng, J. Y. Y. (2026). In preparation. Google Scholar
Murshudov, G. N., Vagin, A. A. & Dodson, E. J. (1997). Acta Cryst. D53, 240–255. CrossRef CAS Web of Science IUCr Journals Google Scholar
Muttenthaler, M., King, G. F., Adams, D. J. & Alewood, P. F. (2021). Nat. Rev. Drug Discov. 20, 309–325. CrossRef PubMed Google Scholar
Neidigh, J. W., Fesinmeyer, R. M., Prickett, K. S. & Andersen, N. H. (2001). Biochemistry, 40, 13188–13200. Web of Science CrossRef PubMed CAS Google Scholar
Oddo, A., Mortensen, S., Thøgersen, H., De Maria, L., Hennen, S., McGuire, J. N., Kofoed, J., Linderoth, L. & Reedtz-Runge, S. (2018). Biochemistry, 57, 4148–4154. CrossRef PubMed Google Scholar
Parthier, C., Kleinschmidt, M., Neumann, P., Rudolph, R., Manhart, S., Schlenzig, D., Fanghänel, J., Rahfeld, J.-U., Demuth, H.-U. & Stubbs, M. T. (2007). Proc. Natl Acad. Sci. USA, 104, 13942–13947. CrossRef PubMed Google Scholar
Pfeffer, S., Höhne, W., Branner, S., Wilson, K. & Betzel, C. (1991). FEBS Lett. 285, 115–119. CrossRef PubMed Google Scholar
Pilz, M., Cavelius, P., Qoura, F., Awad, D. & Brück, T. (2023). Biotechnol. Adv. 67, 108210. CrossRef PubMed Google Scholar
Potterton, L., Agirre, J., Ballard, C., Cowtan, K., Dodson, E., Evans, P. R., Jenkins, H. T., Keegan, R., Krissinel, E., Stevenson, K., Lebedev, A., McNicholas, S. J., Nicholls, R. A., Noble, M., Pannu, N. S., Roth, C., Sheldrick, G., Skubak, P., Turkenburg, J., Uski, V., von Delft, F., Waterman, D., Wilson, K., Winn, M. & Wojdyr, M. (2018). Acta Cryst. D74, 68–84. Web of Science CrossRef IUCr Journals Google Scholar
Price, W. N. II, Chen, Y., Handelman, S. K., Neely, H., Manor, P., Karlin, R., Nair, R., Liu, J., Baran, M., Everett, J., Tong, S. N., Forouhar, F., Swaminathan, S. S., Acton, T., Xiao, R., Luft, J. R., Lauricella, A., DeTitta, G. T., Rost, B., Montelione, G. T. & Hunt, J. F. (2009). Nat. Biotechnol. 27, 51–57. PubMed Google Scholar
Rochira, W. & Agirre, J. (2021). Protein Sci. 30, 93–107. Web of Science CrossRef CAS PubMed Google Scholar
Roque, A. C. A., Pina, A. S., Azevedo, A. M., Aires–Barros, R., Jungbauer, A., Di Profio, G., Heng, J. Y. Y., Haigh, J. & Ottens, M. (2020). Biotechnol. J. 15, 1900274. CrossRef Google Scholar
Rossino, G., Marchese, E., Galli, G., Verde, F., Finizio, M., Serra, M., Linciano, P. & Collina, S. (2023). Molecules, 28, 7165. CrossRef PubMed Google Scholar
Runge, S., Thøgersen, H., Madsen, K., Lau, J. & Rudolph, R. (2008). J. Biol. Chem. 283, 11340–11347. CrossRef PubMed Google Scholar
Samdin, T. D., Jones, C. R., Guaglianone, G., Kreutzer, A. G., Freites, J. A., Wierzbicki, M. & Nowick, J. S. (2023). Chem. Sci. 15, 285–297. CrossRef PubMed Google Scholar
Schlichtkrull, J., Ekholm, R., Norman, A., Noer, B. & Reio, L. (1957). Acta Chem. Scand. 11, 439–460. CrossRef Google Scholar
Stura, E. A., Charbonnier, J.-B. & Taussig, M. J. (1999). J. Cryst. Growth, 196, 250–260. Web of Science CrossRef CAS Google Scholar
Sturm, N. S., Lin, Y., Burley, S. K., Krstenansky, J. L., Ahn, J.-M., Azizeh, B. Y., Trivedi, D. & Hruby, V. J. (1998). J. Med. Chem. 41, 2693–2700. CrossRef PubMed Google Scholar
Underwood, C. R., Garibay, P., Knudsen, L. B., Hastrup, S., Peters, G. H., Rudolph, R. & Reedtz-Runge, S. (2010). J. Biol. Chem. 285, 723–730. CrossRef PubMed Google Scholar
Verma, V., Bade, I., Karde, V. & Heng, J. Y. (2023). Pharmaceutics, 15, 1288. CrossRef PubMed Google Scholar
Verma, V., Mitchell, H., Errington, E., Guo, M. & Heng, J. Y. Y. (2023). Chem. Eng. Technol. 46, 1271–1278. CrossRef Google Scholar
Verma, V., Mitchell, H., Guo, M., Hodnett, B. K. & Heng, J. Y. Y. (2022). Faraday Discuss. 235, 199–218. CrossRef PubMed Google Scholar
Vonrhein, C., Flensburg, C., Keller, P., Sharff, A., Smart, O., Paciorek, W., Womack, T. & Bricogne, G. (2011). Acta Cryst. D67, 293–302. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wang, L., Wang, N., Zhang, W., Cheng, X., Yan, Z., Shao, G., Wang, X., Wang, R. & Fu, C. (2022). Sig. Transduct. Target. Ther. 7, 48. CrossRef Google Scholar
Wijesinghe, A., Kumari, S. & Booth, V. (2022). PLoS One, 17, e0255753. CrossRef PubMed Google Scholar
Winter, G. (2010). J. Appl. Cryst. 43, 186–190. Web of Science CrossRef CAS IUCr Journals Google Scholar
Winter, G. & McAuley, K. E. (2011). Methods, 55, 81–93. Web of Science CrossRef CAS PubMed Google Scholar
Winter, G., Waterman, D. G., Parkhurst, J. M., Brewster, A. S., Gildea, R. J., Gerstel, M., Fuentes-Montero, L., Vollmar, M., Michels-Clark, T., Young, I. D., Sauter, N. K. & Evans, G. (2018). Acta Cryst. D74, 85–97. Web of Science CrossRef IUCr Journals Google Scholar
Yoo, S., Kreutzer, A. G., Truex, N. L. & Nowick, J. S. (2016). Chem. Sci. 7, 6946–6951. CrossRef PubMed Google Scholar
Zhao, F., Zhou, Q., Cong, Z., Hang, K., Zou, X., Zhang, C., Chen, Y., Dai, A., Liang, A., Ming, Q., Wang, M., Chen, L. N., Xu, P., Chang, R., Feng, W., Xia, T., Zhang, Y., Wu, B., Yang, D., Zhao, L., Xu, H. E. & Wang, M. W. (2022). Nat. Commun. 13, 1057. CrossRef PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.

journal menu
access



