research papers
Analysis and validation of overall N-glycan conformation in Privateer
aYork Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom, and bOncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
*Correspondence e-mail: r.joosten@nki.nl, jon.agirre@york.ac.uk
The N-glycosylation provide key structural and functional contributions to a glycoprotein. These contributions are dependent on the composition and overall conformation of the The Privateer software allows structural biologists to evaluate and improve the atomic structures of including N-glycans; this software has recently been extended to check glycan composition through the use of glycomics data. Here, a broadening of the scope of the software to analyse and validate the overall conformation of N-glycans is presented, focusing on a newly compiled set of glycosidic linkage torsional preferences harvested from a curated set of glycoprotein models.
inKeywords: glycobiology; validation; Privateer; N-glycans; torsion angles.
1. Introduction
Post-translational modifications (PTMs) are covalent modifications of proteins that occur after the nascent polypeptide has left the ribosome. PTMs may induce significant changes in the structure and function of the protein (Xin & Radivojac, 2012). A fundamental and abundant PTM is N-glycosylation, in which an oligosaccharide moiety is attached to the N atom of an asparagine side chain in the target protein. The oligosaccharide is subsequently trimmed and modified according to the available cellular enzymes: glycoside glycosyl and oligosaccharyl The resulting oligosaccharide, or N-glycan, may end up having anything from a complex to a minimal composition, leading to a specific 3D conformation of the mature glycoprotein (Shental-Bechor & Levy, 2009). N-Glycosylation is key to all sorts of interactions, including those with cell-surface receptors (Petrescu et al., 2006; Rudd et al., 2004) or even other parts of the same glycoprotein, as shown in studies of the dynamics of SARS-CoV-2 spike, where conformational changes in the Asn165 glycan push up the receptor-binding domain of the spike (Casalino et al., 2020).
Understanding the complex structure of N-glycans. Producing a correct 3D structure of a glycoprotein at a good enough resolution can be vital in understanding how some biological processes unfold. Alas, working with in software for X-ray crystallography and electron cryo-microscopy has historically been all but straightforward: many carbohydrate modelling, and validation processes relied on software written primarily for proteins and (Atanasova et al., 2020), and libraries of restraints had become outdated or were incorrect (Agirre, 2017). While recent efforts have aimed to address this situation (Atanasova et al., 2022; Joosten et al., 2022), carbohydrate methodology still trails that designed for proteins.
is challenging due to the various stereochemical and regiochemical possibilities exhibited byObtaining a glycoprotein structure at a high enough resolution can generally be considered to be more difficult than with a glycan-free protein. Two main issues are routinely identified as problematic when it comes to obtaining higher resolutions: heterogeneity and mobility, both of which translate into poorer experimental data. Owing to these complications, the Protein Data Bank (PDB; Berman et al., 2000) contains models that include incorrect nomenclature (Lütteke et al., 2005), impossible linkages (Crispin et al., 2007) and improbably high-energy conformations of that deviate from the low-energy chair conformation of six-membered rings (Agirre, Davies et al., 2015): in general, a 4C1 chair for D-pyranosides and a 1C4 chair for L-pyranosides. Ring conformations (Cremer & Pople, 1975) and their energetics (Davies et al., 2012) are discussed in detail elsewhere. Using models with incorrect glycochemistry in downstream analyses or molecular simulations will cause misrepresentation and misinterpretation, while also perpetuating these errors. Software packages such as pdb-care and CARP (Lütteke et al., 2005), and more recently Privateer (Agirre, Iglesias-Fernández et al., 2015; Bagdonas et al., 2020), can be utilized for the identification and rectification of these model errors, therefore allowing future data libraries to be as accurate and representative as possible.
In this study, torsion angles (dihedral angles) in curated structures of N-glycan-forming pyranosides were collected in order to create accurate torsional libraries for use in the Privateer validation software. Previous torsional databases such as GlyTorsionDB (Lütteke et al., 2005) and its associated link-checking tool (CARP) incorporate potentially flawed models from the PDB, as they pre-dated the introduction of ring conformation into the routine validation of glycan structures (Agirre, Iglesias-Fernández et al., 2015); therefore, a survey of the PDB was completed with each PDB entry being analysed and validated using Privateer to ensure that the N-glycans were well fitted to the electron density without any conformational errors. Also, in order to avoid the presentation of data on multiple torsional plots and to allow the easy identification of standout (outlier) linkage conformations, a Z-score is calculated for each linkage, with standout linkages being highlighted in orange on glycan diagrams that follow the third edition of the Standard Symbol Nomenclature for (SNFG; Varki et al., 2015). Furthermore, in recognition that not every standout linkage conformation will be the consequence of a modelling mistake, a collection of verified cases where the interaction between glycan and protein residues has caused an unusual conformation is presented. Finally, a similar study was completed using PDB-REDO (van Beusekom, Touw et al., 2018) to analyse whether modern techniques can lead to less frequent errors in the N-glycan models.
2. Materials and methods
2.1. Data-set collection and validation
A local PDB mirror (August 2021) was created for this study. The PDB mirror was then scanned for proteins containing glycosylated amino-acid residues. Of the Privateer: the software calculates ring conformation using the Cremer–Pople algorithm (Cremer & Pople, 1975) and then compares the detected ring conformation with the minimal energy conformation stored in an internal database. The data set was filtered to include only with a (RSCC) higher than 0.80 [RSCC (equation 1) is a measure of the local agreement between a portion of an atomic model and the observed electron-density map that surrounds it] and which had been deemed diagnostically correct by Privateer, i.e. no nomenclature errors, no unphysical puckering amplitude and all pyranosides in their minimal energy conformations (a chair in all analysed cases). Privateer checks that the anomeric and absolute stereochemistry in the structure matches that encoded in the three-letter code (for example that a monosaccharide modelled as MAN is perceived to be α-D-mannose), that the ring conformation matches the lowest energy pucker, which is a 4C1 chair for most D-pyranosides, with special cases such as 1C4 for the mannose moiety in tryptophan mannosylation (Akkermans et al., 2022; Frank et al., 2020), including puckering amplitude (Cremer & Pople, 1975).
contained within these chains, the conformations of the six-membered rings (pyranosides) were validated usingNo resolution cutoffs were explicitly applied, although some filtering is implicit in requiring a minimum Privateer deemed correct; only these were used in the study. A further 8511 showed a high-energy ring conformation, which normally requires manual assessment. A total of 2421 showed geometry and/or nomenclature errors.
as the accumulation of model-error components at low resolutions makes it harder to obtain high values. A total of 68 541 were analysed, 57 569 of whichFor the PDB-REDO comparison, the equivalent were taken from the so-called `conservatively optimized' models in the PDB-REDO databank (van Beusekom, Touw et al., 2018), i.e. models that were re-refined without any torsional restraints for but were not subjected to N-glycan rebuilding procedures (van Beusekom et al., 2019).
Example linkages present in diverse using the third edition of the SNFG (Varki et al., 2015), which Privateer implements. The definition of φ and ψ for N-acetyl-β-D-glucosamine (GlcNAc, or NAG in the PDB Chemical Component Dictionary) linked to asparagine, plus all 1–2, 1–3 and 1–4 glycosidic bonds, and additionally ω, which covers 1–6 bonds such as in α-D-mannose–1,6–α-D-mannose and α-L-fucose–1,6–N-acetyl β-D-glucosamine, is shown in Fig. 2. While completing this study, a large array of different linkages were identified; however, only a small number had enough independent observations to enable meaningful data extraction. Indeed, only approximately 10% of protein models deposited in the PDB contain one or more carbohydrate groups, while around 6% are N-glycosylated (Agirre, 2017). We set the minimum number of required observations to 50 and introduced a mechanism for Privateer to report which linkages could not be validated due to insufficient data (see below). A table of the linkages investigated in this study is given as Table 1, as well as the commonly used abbreviations associated with them.
are shown in Fig. 1
|
2.2. Implementation in Privateer
To assess the normality of the torsion angles between N-glycans, a Z-score system was implemented using similar methods to the Tortoize (van Beusekom, Joosten et al., 2018) and WHAT_CHECK (Hooft et al., 1997) software. The Z-score is based on how common a certain (φ, ψ) combination is compared with a reference set of the same glycosidic linkages calculated from high-quality structure models. To calculate the Z-scores, torsional data from each linkage were split into two-dimensional bins with a 2° bin spacing and formed into a database. The Z-score is calculated as described by Hooft et al. (1997) and shown in equation (2).
inLet k be a particular glycosidic linkage, for example BMA402–NAG401 in a PDB file, under scrutiny and zk be its Z-score for the φ/ψ torsion pair measured on the structure; l is the linkage type (Man–β1,4–GlcNAc in this case), ckl is the number of data points of that linkage (where c is a count) in the 2° × 2° bin corresponding to the φ/ψ torsion pair in the database, 〈cl〉 is the average number of data points for that linkage across all bins and σ〈cl〉 is the corresponding standard deviation of the number of data points for linkage l in the database, again across all bins. As derived from the formula, positive Z-scores indicate that the φ/ψ torsion pair is well represented in the database and thus normal, whereas negative Z-scores indicate the opposite. Also, the scores are normalized to make the results comparable between different linkages. Detailed results and their interpretation are discussed in the next section.
After scoring every glycosidic linkage, a global Z-score may be calculated by simply averaging the Z-scores of all N-glycan linkages. In addition to this, comparison to a reference set of PDB entries with N-glycans allowed the calculation of a relative `quality Z-score', which is an additional parameter that can be used as a measure of glycan normality. The reference set was chosen following a set of criteria: crystallographic structures and reflections from the wwPDB with Rfree < 0.25 and reported resolution ≤ 2.50 Å, with longer than four pyranosides and with a composition backed up by a GlyConnect ID (Alocci et al., 2019). As a result, 510 structures were chosen containing 59 unique glycan structures. The resolution range covered by the data set was 1.12–2.50 Å, and the Rwork and Rfree values were in the ranges 0.10–0.23 and 0.12–0.25, respectively.
To provide a visual means of highlighting those linkages with an unusual Z-score, the SNFG (Varki et al., 2015) vector engine within Privateer (McNicholas & Agirre, 2017) was modified to create an orange background behind the linkages. Linkages for which insufficient data could be collected for validation are marked with a grey background. This representation was used in the figures presented in this study. The representation was also extended to cover the in so that interesting or problematic models can quickly be identified. We note that an orange background does not automatically mean that there is a modelling mistake, but rather that the linkage is worth inspecting.
3. Results and discussion
The number of N-glycosylated structures in the PDB is growing steadily (Scherbinina & Toukach, 2020; Agirre, 2017), supported by the introduction of carbohydrate structure modelling and validation tools such as pdb-care (Lütteke & von der Lieth, 2004), the N-glycan building module in Coot (Emsley & Crispin, 2018) and Privateer (Agirre, Iglesias-Fernández et al., 2015). However, as the resolvability of pyranosides in N-glycans decreases the further the are from the asparagine residue (Atanasova et al., 2020), the abundance of the data collected here dwindles for linkages that form the antennae of the As stated previously, we set a cutoff of 50 data points in order to guarantee the reliability of the Z-score calculation, and this necessarily means that some glycosidic linkages are not yet included in the analysis performed by the Privateer software. Scripts for reproducing and extending this work are included in the relevant section here, meaning that the torsion library can be regenerated in future when more data are available.
The torsional data that we harvested are plotted in Fig. 3. A first close inspection of the graphs reveals a straightforward correspondence between the most frequent linkage conformations for every link type and their calculated energy minimum or minima in the Disac3-DB section of the Glyco3D 2.0 database (Pérez et al., 2015) and GlycoMapsDB (Frank et al., 2007). The mean linkage torsion angles and respective standard deviations of this PDB survey are shown in Supplementary Tables S1 and S2. Supplementary Table S1 shows the values implemented into Privateer. A comparative plot of quality Z-scores for the curated data set versus the rest of the PDB is available in Supplementary Fig. S1. Low-quality Z-scores (Z < −2) may indicate serious problems with the overall quality of in the structure model. High-quality Z-scores (Z > 2), particularly in low-resolution structure models, may indicate over-restraining of torsions in model and may warrant further inspection, as previously shown for proteins (Sobolev et al., 2020).
3.1. GlcNAc–asparagine bond
Investigations of the torsion-angle data set between the asparagine (ASN) amino-acid side chain and GlcNAc (NAG) highlight a perhaps unsurprising trend. The φ torsion-angle data set has a greater standard deviation (σ = 25.3°) when compared with the ψ torsion angle (σ = 22.1°). This is most likely due to the ψ torsion angle referring to a C—N bond which has a bond order of greater than one, analogous to a peptide bond. Indeed, the mean value of ψ is 178.5°, which is very similar to the 180° torsion angle expected for a peptide bond. Such a bond has limited torsional freedom. The φ torsion angle refers to a single bond which has more rotational freedom, leading to the increased spread of torsional data for φ.
Correct modelling of the protein–sugar linkage torsion angle is particularly important to establish a good basis for other N-glycan tree. Two main conformations for NAG-ASN exist (Fig. 4), in which the conformation with a negative φ angle (Fig. 4a) is the most abundant and the other conformation (Fig. 4b), which is much more infrequent due to the additional CH–π interaction (Trp431) that is required to stabilize it, is flagged up as an outlier by Privateer. The arrangement shown in Fig. 4(b), found in a fungal GH3 β-glucosidase, is conserved across homologous structures.
to be modelled further down the3.2. Glycosidic linkages between pyranosides
N-Glycans exhibit common structures, as shown in Fig. 1. The similarity of these conformations explains the consistency in the types of linkages seen in various and allows this quantitative study. N-Glycosylated chains attach to the residue with a NAG sugar through a β-linkage. Attached to this initial NAG sugar through a β-1,4 linkage is an additional NAG sugar. This initial NAG-1,4-NAG linkage is abundant in the PDB and hence contains a large number (n = 3800) of validated data points. As evident by the two-dimensional histogram (Fig. 3), most NAG-1,4-NAG linkages contain torsion angles around φ ≃ −80° and ψ ≃ −130°.
Often, a BMA sugar is attached to the second NAG sugar through a β-1,4 linkage. This BMA-1,4-NAG linkage may theoretically have slightly more conformational variability than NAG-1,4-NAG due to its position further down the glycan tree; however, the spread of data (standard deviation) is similar for both NAG-1,4-NAG and BMA-1,4-NAG. In addition to this, in the complex tree a FUC sugar can be attached to the initial NAG through an α-1,6 linkage. The FUC-1,6-NAG linkage exhibits a large standard deviation around both torsion angles, particularly around the ψ angle. This could partially be the result of FUC being a terminal residue at this position in the glycan, but the FUC-1,3-NAG linkage, in which the FUC is also a terminal residue connected to the same NAG, has less spread in the observed torsion angles. A key difference, however, is the presence of a third torsion angle, ω, that gives more flexibility to the FUC-1,6-NAG linkage. This additional flexibility also leads to less well defined experimental data and thus more room for modelling errors.
Attachment of additional mannose sugars onto the N-glycan chain can often increase the amount of branching and the size of the chain (see Fig. 1a). The most common attachment onto the currently terminal BMA sugar is MAN-1,3-BMA; indeed, this is shown in our data set of validated (n = 781), with the positional isomer MAN-1,6-BMA being almost as frequent (n = 702). Interestingly, the MAN-1,3-BMA linkage exhibits standard deviations (ψ: σ = 22.6°) which are similar to those of NAG-1,4-NAG (ψ: σ = 22.8°). However, the MAN-1,6-BMA linkage torsion angles do not exist in a singular cluster and hence exhibit a larger standard deviation (ψ: σ = 33.3°). Again, this additional spread may be caused by the presence of a third torsion angle in the linkage.
Certain N-acetyl group in NAG, which makes the placement of the monomer in relatively poor density less error-prone. The large standard deviation of MAN-MAN linkages causes similar challenges to MAN-BMA linkages in torsional restraint application. As well as this, no apparent cluster was observed for the SIA-GAL linkage, most likely due to the very low number of deposited and curated linkages available in the data set. The values that φ can adopt appear to be determined by the anomeric form involved in the glycosidic linkage: for D-pyranosides this means −180° < φ < 0° for β-anomers and 0° < φ < 180° for α-anomers. The inverse is true for L-pyranosides.
have further monosaccharide attachments such as a variety of MAN-MAN, NAG-MAN and SIA-GAL linkages. Interestingly, the torsion-angle spread for all MAN-MAN linkages (1,2, 1,3 and 1,6) is far greater than the torsion-angle spread for NAG-MAN torsion-angle data, despite having a similar data-set size and existing in a similar area of the protein. A reason for this may be theUsing this large torsion-angle data set, an investigation of torsion-angle spread with glycan chain length and branching was conducted, although no meaningful trend was identified between glycan chain length and torsion-angle standard deviation. Despite this, this large data set can be incorporated into software packages such as Privateer to improve the accuracy of glycoprotein models.
3.3. PDB-REDO analysis
With the increasingly commonplace solution of protein complexes with high-resolution data, it is imperative that model-building software can depict the conformation and position of N-glycans accurately. Through the comparison of N-glycan torsion angles of proteins deposited in the PDB and the PDB-REDO databank, the applicability and necessity of modern techniques can be assessed. Comparisons between torsion angles in N-glycans deposited in the PDB and the PDB-REDO databank highlight an interesting relationship between structure resolution and torsion-angle accuracy, as shown in Table 2.
|
The PDB-REDO models used in this study had no torsional restraints applied during PDB-REDO are not influenced by the potentially flawed torsional restraints applied before the model was initially deposited in the PDB. This application of consistent techniques without torsional restraints leads to a data set which naturally has a larger spread than the PDB. To assess whether the PDB and PDB-REDO data sets are significantly different, a series of t-tests were performed and are summarized in Table 3.
Therefore, the torsion angles calculated by
|
For the NAG-1,4-NAG and BMA-1,4-NAG linkages, both mean torsion angles were deemed to be significantly different (p < 0.05) in the PDB and PDB-REDO data sets by the t-test. For the MAN-1,6-BMA linkage, while the φ angle was deemed to be significantly different, the ψ angle was not significantly different. Interestingly, both data sets showed no significant difference between both torsion angles for MAN-1,6-MAN and MAN-1,3-MAN linkages. While the PDB-REDO models had many occurrences in which the torsion angles were not statistically similar to those in the PDB data set, the torsion angles in both data sets are within one standard deviation of each other for every linkage. While it is impossible to automatically determine whether the glycosidic linkages in a deposited structure were restrained to certain values, we know that PDB-REDO does not apply torsional restraints. Hence, in the absence of potential bias towards torsion restraint targets, it is likely that the PDB-REDO databank represents a more realistic distribution of N-glycan glycosidic torsion angles and could be used as an alternative data source for validation in Privateer. A future update of Privateer will allow users to analyse their structures against either the PDB or PDB-REDO torsional sets.
The application of consistent highlights the correction that PDB-REDO applies to the initially skewed MAN-1,6-BMA linkage. The data set of linkages originating from the PDB has numerous instances like this in which PDB-REDO corrects the torsion angles to more reasonable values. This powerful correction is another interesting and useful feature that PDB-REDO facilitates.
techniques was also shown to improve outliers which had no physical basis for occurring (little clear interaction with residues or other ligands). Fig. 53.4. Outlier analysis
This analysis of N-glycan torsion angles deposited in the PDB reveals clusters of abundant torsion angles, as shown in Fig. 3. Perhaps due to the inherent variability in the environment surrounding in N-glycans, these torsion-angle clusters are spread over a large range in most cases. Outliers were quantified as any linkage which had a Z-score which was lower than −1. The Z-score reported here depends on the number of φ/ψ pairs relative to the database (Fig. 3) and not the deviation from the mean. The limit of −1 was chosen to highlight linkages that are uncommon in the database. Examining these linkages in further detail may highlight the cause of this. As always, surprising cases may either be chemically interesting to look at or be wrong. Here, we present one example of each.
3.4.1. Electrostatic interactions
Repulsive and attractive electrostatic interactions are crucial for the functionality and stability of proteins (Law et al., 2006). These interactions are facilitated by both positively charged (lysine and arginine) and negatively charged (glutamic acid and aspartic acid) amino-acid side chains. Similarly, these amino acids can affect the positions of contained within N-glycans via varying degrees of electrostatic interactions.
Fig. 6 depicts an N-glycan (PDB entry 4j0m; She et al., 2013) with MAN-1,2-MAN torsion angles that are highly deviated from the mean. Since this glycan has been validated using Privateer (all including those involved in the linkage, were in low-energy chair conformations) and has an of greater than 0.80, indicating a good fit to electron density, it can be assumed that these torsion angles are a direct result of external factors. Upon examination of the area surrounding the glycan, it becomes evident that a network of electrostatic interactions could be affecting the conformation of the N-glycan chain. The proximity of the linkage to arginine, histidine and asparagine side chains may cause the observed deviation. Furthermore, this highlights how linkages further down a glycan tree can also be subject to interactions with protein residues. These interactions may also explain why MAN-MAN linkage torsion angles are less concentrated on one pair of values than the more constrained NAG-NAG linkage.
3.4.2. High-energy ring-conformation anomalies may distort a linkage
Fig. 7 shows a glycan stabilized by CH–π interactions with phenylalanine side chains (PDB entry 5gsq; Chen et al., 2017). While the fit to electron density is reasonable for the first few pyranosides (which show no issues in the validation report), the MAN-1,3-BMA and the terminal MAN residue are highlighted in orange in the Privateer SNFG representation: the link has a Z-score of −1.32, indicating a large deviation, and the ring of the terminal mannose is in a 1S3 conformation, which is wholly unexpected for a pyranoside that is part of an N-glycan and therefore is marked as worthy of inspection (orange). Examination of the electron-density map around the MAN-1,3-BMA pair reveals that the fit to the observed data is poor for the MAN residue; against incomplete density usually results in high-energy ring conformations without the inclusion of torsion restraints (Agirre, 2017). The distortion of the ring conformation in pyranosides has been reported to have a knock-on effect on linkages (Agirre et al., 2017); hence, we believe this is the most probable explanation for this outlier.
4. Conclusions
In this study, a large number and range of N-glycan linkage torsion angles were collected from both the PDB and the PDB-REDO databank after being curated using Privateer. The collected data, released and articulated through the Privateer software, will provide a strong foundation for future model building, and validation software. The comparisons between the PDB and PDB-REDO models presented here assessed the importance of modern techniques. The differences in the torsion angles between the validated PDB and PDB-REDO data sets are minimal. However, in certain cases the application of a consistent technique can alleviate errors in the model-building process. Furthermore, the absence of torsional restraints in PDB-REDO perhaps allows a more realistic spread of torsional values to be observed. It is also important to note valid rationalizations for linkage torsion angles deviating from the calculated mean. Electrostatic and steric interactions play a large role in protein folding in general and can cause or stabilize the skewed N-glycan linkage torsions exhibited in certain Therefore, it is highly likely that these electrostatically charged or sterically bulky amino acids play a role in overall N-glycan conformation.
5. Availability and open research data
All scripts, data and graphics associated with this work have been uploaded to Zenodo (https://doi.org/10.5281/zenodo.7356467). The Privateer source code is available from GitHub (https://github.com/glycojones/privateer). Binaries will be released as an update to CCP4 8.0.
Supporting information
Link https://doi.org/10.5281/zenodo.7356467
Associated scripts, data and graphics
Link https://github.com/glycojones/privateer
Privateer source code
Supplementary Tables and Figure. DOI: https://doi.org/10.1107/S2059798323003510/qe5003sup1.pdf
Acknowledgements
This work is based on a proof of concept as part of the MChem course of the Department of Chemistry at the University of York. The authors would like to thank Alex Ascham, Sarah Hargan, Eleanor Charnley, Alex Muriel, Charles Kingston, Conor MacDonald, Eve Tipple, Andrew Harvey, Thomas Hartshorn, Alex Bentley, Jordan Wilson, Rachel Napier, Luke Julyan, Catrin Ellis, Will Ashwood and James Jones for their work in the previous study.
Funding information
Jordan Dialpuri is funded by the Biotechnology and Biological Sciences Research Council (BBSRC; grant No. BB/T0072221). Haroldas Bagdonas is funded by The Royal Society (grant No. RGF/R1/181006). Mihaela Atanasova is funded by the UK Engineering and Physical Sciences Research Council (EPSRC; grant No. EP/R513386/1). Lucy Schofield's work was funded by The Royal Society through a summer studentship. Robbie Joosten is supported by the European Union's Horizon 2020 research and innovation programme under grant agreement No. 871037 (iNEXT-Discovery) and by CCP4. Jon Agirre is a Royal Society University Research Fellow (award No. UF160039).
References
Agirre, J. (2017). Acta Cryst. D73, 171–186. Web of Science CrossRef IUCr Journals Google Scholar
Agirre, J., Ariza, A., Offen, W. A., Turkenburg, J. P., Roberts, S. M., McNicholas, S., Harris, P. V., McBrayer, B., Dohnalek, J., Cowtan, K. D., Davies, G. J. & Wilson, K. S. (2016). Acta Cryst. D72, 254–265. Web of Science CrossRef IUCr Journals Google Scholar
Agirre, J., Davies, G., Wilson, K. & Cowtan, K. (2015). Nat. Chem. Biol. 11, 303. Web of Science CrossRef PubMed Google Scholar
Agirre, J., Davies, G. J., Wilson, K. S. & Cowtan, K. D. (2017). Curr. Opin. Struct. Biol. 44, 39–47. Web of Science CrossRef CAS PubMed Google Scholar
Agirre, J., Iglesias-Fernández, J., Rovira, C., Davies, G. J., Wilson, K. S. & Cowtan, K. D. (2015). Nat. Struct. Mol. Biol. 22, 833–834. Web of Science CrossRef CAS PubMed Google Scholar
Akkermans, O., Delloye-Bourgeois, C., Peregrina, C., Carrasquero-Ordaz, M., Kokolaki, M., Berbeira-Santana, M., Chavent, M., Reynaud, F., Raj, R., Agirre, J., Aksu, M., White, E. S., Lowe, E., Ben Amar, D., Zaballa, S., Huo, J., Pakos, I., McCubbin, P. T. N., Comoletti, D., Owens, R. J., Robinson, C. V., Castellani, V., del Toro, D. & Seiradake, E. (2022). Cell, 185, 3931–3949. CrossRef CAS PubMed Google Scholar
Alocci, D., Mariethoz, J., Gastaldello, A., Gasteiger, E., Karlsson, N. G., Kolarich, D., Packer, N. H. & Lisacek, F. (2019). J. Proteome Res. 18, 664–677. Web of Science CrossRef CAS PubMed Google Scholar
Atanasova, M., Bagdonas, H. & Agirre, J. (2020). Curr. Opin. Struct. Biol. 62, 70–78. Web of Science CrossRef CAS PubMed Google Scholar
Atanasova, M., Nicholls, R. A., Joosten, R. P. & Agirre, J. (2022). Acta Cryst. D78, 455–465. Web of Science CrossRef IUCr Journals Google Scholar
Bagdonas, H., Ungar, D. & Agirre, J. (2020). Beilstein J. Org. Chem. 16, 2523–2533. Web of Science CrossRef CAS PubMed Google Scholar
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucleic Acids Res. 28, 235–242. Web of Science CrossRef PubMed CAS Google Scholar
Beusekom, B. van, Joosten, K., Hekkelman, M. L., Joosten, R. P. & Perrakis, A. (2018). IUCrJ, 5, 585–594. Web of Science CrossRef PubMed IUCr Journals Google Scholar
Beusekom, B. van, Touw, W. G., Tatineni, M., Somani, S., Rajagopal, G., Luo, J., Gilliland, G. L., Perrakis, A. & Joosten, R. P. (2018). Protein Sci. 27, 798–808. Web of Science PubMed Google Scholar
Beusekom, B. van, Wezel, N., Hekkelman, M. L., Perrakis, A., Emsley, P. & Joosten, R. P. (2019). Acta Cryst. D75, 416–425. Web of Science CrossRef IUCr Journals Google Scholar
Casalino, L., Gaieb, Z., Goldsmith, J. A., Hjorth, C. K., Dommer, A. C., Harbison, A. M., Fogarty, C. A., Barros, E. P., Taylor, B. C., McLellan, J. S., Fadda, E. & Amaro, R. E. (2020). ACS Cent. Sci. 6, 1722–1734. CrossRef CAS PubMed Google Scholar
Chen, C.-L., Hsu, J.-C., Lin, C.-W., Wang, C.-H., Tsai, M.-H., Wu, C.-Y., Wong, C.-H. & Ma, C. (2017). ACS Chem. Biol. 12, 1335–1345. CrossRef PubMed Google Scholar
Clauset, A., Shalizi, C. R. & Newman, M. E. J. (2009). SIAM Rev. 51, 661–703. Web of Science CrossRef Google Scholar
Cremer, D. & Pople, J. A. (1975). J. Am. Chem. Soc. 97, 1354–1358. CrossRef CAS Web of Science Google Scholar
Crispin, M., Stuart, D. I. & Jones, E. Y. (2007). Nat. Struct. Mol. Biol. 14, 354. CrossRef PubMed Google Scholar
Crispin, M., Yu, X. & Bowden, T. A. (2013). Proc. Natl Acad. Sci. USA, 110, E3544–E3546. Web of Science CrossRef CAS PubMed Google Scholar
Davies, G. J., Planas, A. & Rovira, C. (2012). Acc. Chem. Res. 45, 308–316. Web of Science CrossRef CAS PubMed Google Scholar
Emsley, P. & Crispin, M. (2018). Acta Cryst. D74, 256–263. Web of Science CrossRef IUCr Journals Google Scholar
Ferrara, C., Grau, S., Jäger, C., Sondermann, P., Brünker, P., Waldhauer, I., Hennig, M., Ruf, A., Rufer, A. C., Stihle, M., Umaña, P. & Benz, J. (2011). Proc. Natl Acad. Sci. USA, 108, 12669–12674. Web of Science CrossRef CAS PubMed Google Scholar
Frank, M., Beccati, D., Leeflang, B. R. & Vliegenthart, J. F. G. (2020). iScience, 23, 101371. CrossRef PubMed Google Scholar
Frank, M., Lütteke, T. & von der Lieth, C.-W. (2007). Nucleic Acids Res. 35, 287–290. Web of Science CrossRef PubMed CAS Google Scholar
Hooft, R. W., Sander, C. & Vriend, G. (1997). Comput. Appl. Biosci. 13, 425–430. CAS PubMed Web of Science Google Scholar
Imberty, A. & Perez, S. (1995). Protein Eng. Des. Sel. 8, 699–709. CrossRef CAS Web of Science Google Scholar
Joosten, R. P., Nicholls, R. A. & Agirre, J. (2022). Curr. Med. Chem. 29, 1193–1207. Web of Science CrossRef CAS PubMed Google Scholar
Law, M. J., Linde, M. E., Chambers, E. J., Oubridge, C., Katsamba, P. S., Nilsson, L., Haworth, I. S. & Laird-Offringa, I. A. (2006). Nucleic Acids Res. 34, 275–285. CrossRef PubMed CAS Google Scholar
Lütteke, T., Frank, M. & von der Lieth, C.-W. (2005). Nucleic Acids Res. 33, D242–D246. PubMed Google Scholar
Lütteke, T. & von der Lieth, C.-W. (2004). BMC Bioinformatics, 5, 69. Google Scholar
McNicholas, S. & Agirre, J. (2017). Acta Cryst. D73, 187–194. Web of Science CrossRef IUCr Journals Google Scholar
Nnamchi, C. I., Parkin, G., Efimov, I., Basran, J., Kwon, H., Svistunenko, D. A., Agirre, J., Okolo, B. N., Moneke, A., Nwanguma, B. C., Moody, P. C. E. & Raven, E. L. (2016). J. Biol. Inorg. Chem. 21, 63–70. Web of Science CrossRef CAS PubMed Google Scholar
Pérez, S., Sarkar, A., Rivet, A., Breton, C. & Imberty, A. (2015). Methods Mol. Biol. 1273, 241–258. PubMed Google Scholar
Petrescu, A.-J., Wormald, M. R. & Dwek, R. A. (2006). Curr. Opin. Struct. Biol. 16, 600–607. Web of Science CrossRef PubMed CAS Google Scholar
Ramirez-Escudero, M., Miguez, N., Gimeno-Perez, M., Ballesteros, A. O., Fernandez-Lobato, M., Plou, F. J. & Sanz-Aparicio, J. (2019). Sci. Rep. 9, 17441. PubMed Google Scholar
Rudd, P. M., Wormald, M. R. & Dwek, R. A. (2004). Trends Biotechnol. 22, 524–530. Web of Science CrossRef PubMed CAS Google Scholar
Scherbinina, S. I. & Toukach, P. V. (2020). Int. J. Mol. Sci. 21, 7702. Web of Science CrossRef PubMed Google Scholar
She, J., Han, Z., Zhou, B. & Chai, J. (2013). Protein Cell, 4, 475–482. CrossRef CAS PubMed Google Scholar
Shental-Bechor, D. & Levy, Y. (2009). Curr. Opin. Struct. Biol. 19, 524–533. PubMed CAS Google Scholar
Sobolev, O. V., Afonine, P. V., Moriarty, N. W., Hekkelman, M. L., Joosten, R. P., Perrakis, A. & Adams, P. D. (2020). Structure, 28, 1249–1258.e2. Web of Science CrossRef CAS PubMed Google Scholar
Varki, A., Cummings, R. D., Aebi, M., Packer, N. H., Seeberger, P. H., Esko, J. D., Stanley, P., Hart, G., Darvill, A., Kinoshita, T., Prestegard, J. J., Schnaar, R. L., Freeze, H. H., Marth, J. D., Bertozzi, C. R., Etzler, M. E., Frank, M., Vliegenthart, J. F., Lütteke, T., Perez, S., Bolton, E., Rudd, P., Paulson, J., Kanehisa, M., Toukach, P., Aoki-Kinoshita, K. F., Dell, A., Narimatsu, H., York, W., Taniguchi, N. & Kornfeld, S. (2015). Glycobiology, 25, 1323–1324. CrossRef CAS PubMed Google Scholar
Xin, F. & Radivojac, P. (2012). Bioinformatics, 28, 2905–2913. Web of Science CrossRef CAS PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.