lead articles
Combining X-rays, neutrons and electrons, and NMR, for precision and accuracy in structure–function studies
aDepartment of Chemistry, University of Manchester, Manchester, M13 9PL, United Kingdom
*Correspondence e-mail: john.helliwell@manchester.ac.uk
The distinctive features of the physics-based probes used in understanding the structure of matter focusing on biological sciences, but not exclusively, are described in the modern context. This is set in a wider scope of holistic biology and the scepticism about `reductionism', what is called the `molecular level', and how to respond constructively. These topics will be set alongside the principles of accuracy and precision, and their boundaries. The combination of probes and their application together is the usual way of realizing accuracy. The distinction between precision and accuracy can be blurred by the predictive force of a precise structure, thereby lending confidence in its potential accuracy. These descriptions will be applied to the comparison of cryo and room-temperature protein crystal structures as well as the solid state of a crystal and the same molecules studied by small-angle X-ray scattering in solution and by in vivo; a new iodoplatin for radiation therapy predicted from studies of platin crystal structures; and the field of colouration of as an effective assay of function, i.e. their colouration, when unbound and bound to a protein. The complementarity of probes, as well as their combinatory use, is then at the foundation of real (biologically relevant), probe-artefacts-free, structure–function studies. The foundations of our methodologies are being transformed by colossal improvements in technologies of X-ray and neutron sources and their beamline instruments, as well as improved electron microscopes and NMR spectrometers. The success of protein structure prediction from gene sequence recently reported by CASP14 also opens new doors to change and extend the foundations of the structural sciences.
on a sample grid. Examples will include: time-resolved X-ray Laue crystallography of an enzyme Michaelis complex formed directly in a crystal equivalent toKeywords: X-rays; neutrons; electrons; NMR; structure and function.
1. Introduction
W. Lawrence Bragg (1968) wrote on X-ray crystallography that: `I have often been asked `Why are you always showing and talking about models? Other kinds of scientists do not do this'.' Bragg answered that `The investigator seeks a structural plan, a map that shows all the atoms in their relative positions in space. No other branch of science is so completely geographical; a list of spatial coordinates is all that is needed to tell the world what has been discovered'.
So, we can see atoms with crystallography. We now must also ask: when are the atoms that we see in our electron density, nuclear density or electrostatic potential maps in the correct positions to explain a function? We need either an assay of function or a complementary technique providing a consensus or a predictive force from our e.g. site-directed mutagenesis for a protein where we can change the function such as the rate of an enzyme reaction based on its structure). These considerations drive to the core words in science of precision and accuracy (Fig. 1). When there is a correct match between one method's measurements and a model interpretation, this equals precision. When a second method is used its measurements lead to a refined model, which can also be precise in the same way. But when both methods agree then we reach accuracy. There are other common words in measurement science to be mentioned. These are systematic error and random error. With any one method both types of error occur. Systematic errors one strives to remove completely if possible. So, in the use for example of an electronic area detector, it must be calibrated for dealing with its systematic spatial distortions or systematic non-linearity of response. These calibrations may also change with time, such as weeks to months, and recalibration to avoid systematic errors is needed. Another core point was made by the IUCr Working Party on `Expression of Uncertainty in Measurement', and on types of error (Schwarzenbach et al., 1995): `a measurement result is considered complete only when accompanied by a quantitative statement of its uncertainty'. I discuss this more in Section 4.2.
(These terms have been considered from the charge density point of view by Sanjuan-Szklarz et al. (2020), the central issue being the use, in general, of spherical electron densities rather than the physically more reasonable aspherical electron densities. As their Fig. 6 nicely shows, the truth is not known and so accuracy could, even should, be replaced by a reference structure, and the neutron is their reference. There is a commonality of that approach with the one I describe here for protein crystallography, but since it is at a lower, i.e. less good, diffraction resolution the reasons to make the neutron protein the reference instead are: the structural completeness with experimentally determined hydrogens for the ionizable amino acids, the use of physiological relevant temperature and the absence of X-ray or electron radiation damage. That said, as Joachim Frank emphasized (see below), the crystal itself is not the `native state'.
It is important, indeed vital, to retain a big picture. So, in biology we need the hierarchies of organization and thereby of length scales of molecules, macromolecules, viruses and e.g. an animal, and even further, into whole populations and ecosystems. In similar vein we seek to understand the structural dynamics from femtosecond to second. Structural dynamics can include conformational change, atomic vibration or bond making and breaking. To this end we isolate molecules or complexes which in size reach a limit at the several hundred Å scale with e.g. a virus particle. The method of electron tomography can be used to view molecules in situ, such as the pioneering study of coronavirus particles (Almeida & Tyrrell, 1967).
into the complete organism,Crystallography's landscape of determining structures now coexists with DeepMind's prediction of a protein 3D structure from a gene sequence as seen in CASP14, Critical Assessment of Structure Prediction of Proteins [for a short resume see Helliwell (2020)]. At CCP4 2021 McCoy reported that all models were good for (McCoy, 2021). A key question for the field of protein crystallography is: will DeepMind share its protein fold prediction software or provide a webserver for uploading gene sequences?
Flexible proteins and complexes are resistant to crystallization and electron cryo-microscopy (cryoEM) has undergone great strides in resolution capability in recent years [see Kühlbrandt (2014) for an overview]. For cryoEM, freezing a single molecular complex is quite possibly more favourable than freezing a >1 µm protein crystal given the possibility of cryo-artefacts. In its results cryoEM shows quite a range of resolutions across a structure, but so does crystallography with B factors, some very large to the point that such a part of a structure shows no significant density.
These considerations bring me to a more detailed scrutiny of our probes of the structure of matter. Vainshtein's (1964) book provides an excellent survey of our diffraction probes: X-rays, electrons and neutrons. In particular, neutrons give the nuclear positions, X-rays the atomic electron density and electrons the atomic electrostatic potential. Such an overview has to my knowledge not been attempted since, which led me to consider writing this article.
2. Pairwise comparisons of probes
2.1. Element identification
The K or L give X-ray diffraction an exquisite sensitivity to identifying a metal atom, especially bimetallic cases of neighbouring The example with the most X-ray wavelengths, 11, to investigate a zinc-substituted gallium phosphate involving partial occupancy is that of Helliwell et al. (2010). By tuning to each of the zinc and gallium K absorption edges a very precise was determined.
of atoms for X-rays steadily increases for increasing The resonance effects (`anomalous dispersion') provided via the X-ray wavelength tuning to and around an individual elementThe nuclear scattering for different isotopes using neutrons also provides an exquisite sensitivity with contrast variation involving for example hydrogen and deuterium. Likewise, there are element-to-element variations but these are much less marked than when using X-rays. A particularly attractive feature of the neutron scattering factors is that deuterium is as good a scatterer for neutrons as carbon. This is exploited in neutron macromolecular crystallography to determine the protonation states of ionizable amino-acid residues (aspartic acid, glutamic acid, histidine and lysine). Fig. 2 shows the nuclear density map for a single and doubly protonated histidine (Ahmed et al., 2007).
Electrons also show a steadily increasing variation of the electrostatic potential with
They are much more sensitive to hydrogens than X-rays. For a biological molecule known from chemical analysis to contain an electron-dense metal centre, the location of those metal atoms can be found. However, to my knowledge, it is untested whether electrons can be used to determine bimetallic cases of closely similar atomic number.Both electrons and X-rays will very likely change the et al., 2000) is applicable.
of a metal. With X-rays there is the important exception however of a femtosecond-time-range pulsed X-ray laser where the `diffract before destroy' approach (Neutze2.2. Sample scattering power
Perhaps surprisingly this was not introduced as an equation until the work of Andrews et al. (1988) – see their Table 3 comparing microcrystal diffraction of both chemical and protein microcrystallography. A different approach was adopted by Henderson (1995), namely that for naturally occurring biological material, electrons at present provide the most information for a given amount of radiation damage. The study involved comparing X-rays, neutrons and electrons. This result concurs with the Vainshtein (1964) approach of comparing the three probes and where electron scattering is the strongest, so the very smallest crystals can be used. The obvious utility of the Henderson (1995) analysis encouraged the development of cryoEM for single particles of biological macromolecules. Once a crystal is grown, or should I say growable to above a size of ∼0.1–1 µm, then the issue changes. Namely, that electrons are overtaken by X-rays in their utility for structure analysis. Furthermore, if the crystal grows to ∼1 mm3 or more then neutrons become viable and are by far the best because they yield a complete structure with hydrogens, at physiological temperature and free of radiation damage. Table 1 compares the fundamentals of X-rays and electrons, listing their pros and cons. Since neutrons are the ideal probe for crystallography, their use is purely one of practical challenges and these are compared with X-rays and electrons in Table 2. There is one fundamental aspect of neutron crystallography which is that analysis of a neutron protein requires an X-ray crystal structure.
‡In protein electron crystallography the is discussed by Gemmi et al. (2019) and, as they summarize, is so far the method applied. |
|
2.3. The important role of NMR
NMR can determine protein structures in solution, without crystallization, and so is the only way forward for e.g. an intrinsically disordered protein. But the role of NMR has three additional advantages when combined with a Firstly, the NMR structure is actually an ensemble-of-structures fit to the measurements. This means that where the protein polypeptide chain is particularly flexible, and the X-ray electron density has disappeared, the NMR ensemble fit still shows the range of positions for that portion. Secondly, the relatively static core of a protein as seen by the X-ray crystallography in the NMR data still shows dynamics. This was highlighted by Wüthrich & Wagner (1975) with respect to aromatic side chains such as phenylalanine where distinctly different NMR spectra could be seen showing three possibilities: spinning, 180° flip or static. Thirdly, the application of NMR to a crystal allows for both the ordered and disordered structures in the sample to be studied simultaneously. The book NMR Crystallography (Harris et al., 2009) surveys a large number of examples of this; ch. 27 by Middleton (2009) is devoted to structural biology applications. Fig. 27.2 of Middleton shows the effects of sample preparation in solid-state NMR, comparing polycrystalline and nanocrystalline, which are `practically identical', and thirdly, lyophilized ubiquitin protein which shows poor resolution indicating structural heterogeneity. Middleton (2009) also points out that where a crystal is available but does not diffract, or shows limited diffraction, then NMR crystallography could be applied to study protein–ligand binding on soaking with ligand. Lewandowski et al. (2015) used solid-state NMR to investigate protein and solvent motions of nano- and microcrystalline, fully hydrated, protein GB1 at temperatures from 105 to 280 K. GB1 is `a small globular protein specifically binding to antibodies'. Their study describes a hierarchical change in dynamic behaviour (see especially their Fig. 4) showing `a unified description of the essential conformational energy surface, relating the amplitude and activation of solvent, sidechain, and backbone motions in a hierarchical distribution, as well as unambiguous identification of NMR line broadening at cryogenic temperatures'. Stepping outside protein examples, Bryce (2017) provides a topical review of the use of NMR crystallography across a wide range of materials spanning inorganics and organics as well as a protein. The IUCr established a Commission on NMR Crystallography and Related Methods in 2014 in recognition of its importance (https://www.iucr.org/iucr/commissions/nmr-crystallography).
2.4. Diverse further frontiers in pursuit of structural accuracy
There are diverse further important frontiers in the analytical characterization of 3D structures. These include
X-ray absorption spectroscopy and infrared spectroscopy.It is important to recall that, such as during my doctorate more than 40 years ago, the amino-acid sequence of one's protein was not necessarily known and special collaborations were made to determine the sequence. This is generally now all done via gene sequencing not B factors (atomic displacement parameters) inflate accordingly or truncating the side chain at that place where density is no longer visible. A complementary approach would be to measure the mass of the pure, final protein before crystallization. Indeed, has grown impressively in its power and scope these last decades (see e.g. Liko et al., 2016).
Post-translational modifications can occur and in one's electron density maps the missing electron density of amino-acid side chains can cause anxiety with respect to interpretation; debates such as on CCP4bb show the community view split between letting theA milestone example of finding out the damage caused by excessive X-ray dose to acquire crystallography diffraction data was the work of Yano et al. (2005): X-ray absorption spectroscopy was used with a much lower dose to study reduction by X-ray-generated electrons of the manganese ions in the oxygen evolving complex of photosystem II (PSII) (Yano et al., 2005). Specific X-ray damage to protein disulfides was shown much earlier, following a suggestion by Greg Petsko, by Helliwell (1988) and coworkers. This is now an extensive field of enquiry as measured for instance by the succession of radiation damage conferences, published in issues of the Journal of Synchrotron Radiation.
Infrared spectroscopy offers a variety of possible measurements to provide accuracy based on a precise e.g. Barth, 2007). It can be applied to both static and time-resolved studies. These latter include taking the difference spectra between two structural states. As with colour changes, which feature pivotally in two of my case studies below, such difference spectra can be very informative and can confirm and complement what is seen with 3D structural probes.
(seeIn Section 5.6 I describe the use of solution small-angle X-ray scattering (SAXS) as a complement to cryocrystallography with a double aspect, confirming that the solution and solid-state crystal structures of a colouration protein are the same and, as well, that the cryo and room-temperature structures are the same. SAXS, and its neutron equivalent SANS, have very wide-ranging applications and are extensively described in recent books (Svergun et al., 2013; Lattman et al., 2018).
The connection between the molecular level of detail and the cellular level is bridged by the microscopies. The thickness of individual cells is problematic for electrons due to their being absorbed but not for X-rays. This yields a spatial resolution beyond the capability of light microscopy. A comparison of the capabilities of electron and X-ray microscopies is described by Du & Jacobsen (2018).
To end this section I choose an example from before protein crystallography had been brought to fruition by Max Perutz and John Kendrew. This is the case of sickle cell anaemia and haemoglobin (Pauling et al., 1949). In the opening of their discussion Pauling et al. (1949) stated `On the nature of the difference between sickle cell anaemia hemoglobin and normal hemoglobin: Having found that the electrophoretic mobilities of sickle cell anaemia hemoglobin and normal hemoglobin differ, we are left with the considerable problem of locating the cause of the difference. It is impossible to ascribe the difference to dissimilarities in the particle weights or shapes of the two in solution: a purely frictional effect would cause one species to move more slowly than the other throughout the entire pH range and would not produce a shift in the Moreover, preliminary velocity ultracentrifuge and free diffusion measurements indicate that the two have the same sedimentation and diffusion constants. The most plausible hypothesis is that there is a difference in the number or kind of ionizable groups in the two Such changes were later shown by Max Perutz to have occurred and created sticky patches causing two haemoglobin molecules to hold together, distorting an erythrocytic cell and impeding blood flow. This is explained as a PDB-101 (Protein Data Bank, https://pdb101.rcsb.org/motm/41; see the section entitled Troubled Haemoglobins). This example shows the importance of 3D structure as well as the measurements made by Pauling et al. (1949).
3. Criticisms of the reductionist approach
Overall there is scepticism of what we `atomic level structuralists' do for biology. Dame Ottoline Leyser, Professor of Plant Development at the University of Cambridge, was quoted recently (Turney, 2019): `The defining feature of biology during the past few decades has been figuring out details of the parts. But biological systems don't think they have parts'. Scepticism of the role of reductionism in understanding biology is a theme in Ernst Mayr's book What Makes Biology Unique? (Mayr, 2007). Mayr even argues against the relevance of the discovery of the DNA double helix to understanding biology. As a counterpoint, a founding father of quantum mechanics, the physicist Erwin Schrödinger, posed the question `What is life?' in his influential book What is Life?: the Physical Aspect of the Living Cell, in effect applying the physical sciences to this central question of biology (Schrödinger, 1943).
Then we have Sir Paul Nurse, awarded the Nobel Prize in Physiology or Medicine in 2001 for his work on the cell cycle of fission yeast [interview by Ireland (2014)]: `I work on the model organism fission yeast as I have done for 40 years. I'm taking synthetic and systems biology approaches to global cellular controls – that is, looking at the whole cell, particularly the cell cycle and cell shape, which are my areas of interest. Rather than get bogged down in detailed molecular descriptions of everything, I'm asking bigger questions like `how does a cell know how big it is?', which has always fascinated me.'
So, where do I think these strong views stand today against our reductionist research in the molecular sciences within which crystallography is a key player, as are the microscopies and spectroscopies? More to the point, how can the crystallographer respond constructively, maybe only partly, to the concerns of the holistic biologists?
The issue of relevance starts when considering the protein crystalline state or the solution state of a protein but imagined placed inside the biological cell. NMR provides atomically detailed results in solution and of course protein crystallography provides atomic details for a protein in the solid state. The protein crystal is a curious hybrid of solid state, being an ordered lattice, with a large percentage of the crystal volume in solution, namely the solvent channels that run through the crystal.
Studies of the structure and function of an enzyme in the crystalline state were to my mind greatly facilitated by the invention of the flow cell (Wyckoff et al., 1967). I describe a case study of the application of the flow cell in Section 5.2. This is a powerful approach to the issue of the relevance of solid-state results. It answers with a resounding yes the question of whether these results are relevant to enzyme function. Flow cell results overcome then the objections of the NMR solution-state spectroscopists to the crystallographer's results in the protein crystal.
Weaknesses in the armoury of crystallography remain such as crystallization conditions, to a greater or lesser degree, taking one's results away from biological functioning conditions. As Lin (2018) recently stated: `Structure based drug design requires accurate structural information with the native conformation of a protein. However, scientists frequently select the protein that is suitable for crystallization but far from the physiological condition. It has been well known that protein conformation may be sensitive to the crystallization solution, and the different crystallization conditions or buffer selection can be a source of irreproducibility. It is a big challenge to find crystallization conditions that are close to the physiological conditions of a protein. To guide drug design with the native conformation of a protein, the developments of the proprietary crystallization screen kits with soft [`soft' in this context means to have a relatively non-perturbing impact on the protein structure] features for the protein to maintain its native conformation for drug target protein crystallization and new methods that can help select the best protein sample for crystallization trials are required.'
This caveat of Lin (2018) is perhaps more relevant than the one about having a lattice for a contacts in macromolecular crystals are rather few, whereas other possible effects of non-physiological crystallization conditions cannot be similarly dismissed as they affect all of the atoms in the crystallized protein.
A similar concern is the question about the strict relevance to biology of crystallography results now predominantly based on X-ray diffraction data measured at cryo temperature (Halle, 2004); see Section 4.1 below. This has been compounded by observations of specific X-ray damage to the crystallized protein. Conducting crystallography at physiologically relevant temperatures has then become an objective.
Neutron macromolecular crystallography (nMX), whilst pursuing protein structures with protonation states experimentally determinable, has also automatically yielded room-temperature structures. Given also that projects succeed in getting beamtime only where all other methods have failed, X-ray-, electron- or NMR-based, it is clear that in structural biology there is a strategic importance of this method. nMX has seen a sustained growth of the instruments, the software and methods at leading neutron sources (e.g. see Niimura & Podjarny, 2011; Cuypers et al., 2013; Blakeley & Podjarny, 2018; Hoogerheide et al., 2020), maximization of the potential of perdeuteration (Haertlein et al., 2016), and application to resolving problematic protein structure–function studies (Helliwell, 2021).
The new femtosecond-range X-ray lasers yield X-ray diffraction data at room temperature and before the onset of radiation damage, the `diffract before the sample is destroyed' approach (Neutze et al., 2000). Synchrotron facilities are now also adopting the X-ray laser methods for delivery of streams of micron-sized samples and thereby also yielding results at physiological temperatures, albeit not free of radiation damage like the X-ray lasers. The use of streams of micron-sized crystals may be susceptible to variability within samples of the biological molecules being studied.
So, the physical methods of crystallography, microscopy and spectroscopy continue to strive for, and do clearly deliver, functionally relevant structural results. While many biologists have embraced molecular structure shall we ever convince holistic biologists? Our studies rarely take us directly to the whole organism. In Section 5.6 I give a case study where that all-important bridge to whole animals is achieved, the colouration of the lobster shell.
4. Structural artefacts
4.1. Cryo versus room temperature
Halle (2004) provided a landmark paper from the physical chemists' point of view: `In biomolecular cryocrystallography: structural changes occur during flash-cooling: conformational switching of solvent-exposed side chains and weak ligand binding are likely artefacts…. Also the sites for molecular recognition, ligand binding or chemical catalysis, usually involve the solvent where consequent cryo-artefacts are expected to be most pronounced.' My own laboratory's 1997 study (Deacon et al., 1997) was cited by Halle (2004) where we had noted in our Section 3.9 that several amino-acid side chains (of the concanavalin A protein) had adopted largely different conformations; most of the changes were in poorly determined residues in the room-temperature structure, especially in the flexible loop regions; the number of detected solvent sites had more than doubled, to 319 at 110 K as compared with 149 at 293 K; and in particular there were 20 non-matching waters in the room-temperature structure which were connected to the movement of side chains by of the low-energy conformations of Asp82, Ser117, His121 (in a loop region), Lys135 and Thr196. But there were no major differences in either the protein or solvent structure around the saccharide-binding sites. Vigilance is needed for such artefacts. A distinction should of course be made between structural details that change (at the two temperatures) and things that become visible at the cryo temperature which were not visible before.
Is there then a science of what sort of structural artefacts can occur in proteins? In a recent review Fischer (2021) discusses practical aspects of preparing, acquiring and analysing X-ray crystallography data at room temperature. There is one, especially interesting, pioneering series of crystal structures of ribonuclease studied at nine temperatures (PDB codes 1rat through to 9rat) from 98 to 320 K (Tilton et al., 1992). These were undertaken to scrutinize protein structure differences with temperature. Unfortunately the bound waters are not in these coordinates' files and the structure factors likewise are unavailable. This illustrates the nature of the field of macromolecular crystallography which is still developing, i.e. crystal structures undertaken at 37°C should become more usual. Furthermore, can thermophilic proteins and extremophilic proteins be studied at these organisms' temperatures, i.e. respectively, between 60 and 80°C and >80°C? Radiation damage by X-rays will presumably become the limiting factor but which can be circumvented by the use of neutrons or by NMR. As to feasibility, within an undergraduate physics project at York University in the mid-1980s, we showed that X-ray diffraction data could be recorded from a crystal of phenol insulin (Derewenda et al., 1989) up to 50°C; at about 55°C the diffraction pattern disappeared but returned on cooling.
4.2. Bond distance and bond angle artefacts
Electron crystallography is a developing method with increasing modern applications and scope in chemical (Gemmi et al., 2019) and biological crystallography (see e.g. Clabbers & Abrahams, 2018; Gemmi et al., 2019). But as Vainshtein (1964) pointed out it is unreliable when a crystal becomes too large, even at 1 µm, giving false distances and angles due to the multiple scattering of the electrons. In the modern era the study of Palatinus et al. (2015) illustrates the point well: working with a Ni2S nanocrystal of 0.1 µm they compared kinematical and dynamical treatments of the data analysis. The dynamical treatment was essential whereby: `The maximum distance to the corresponding atomic position in the reference structure decreased from 0.042 Å for kinematical to 0.020 Å for dynamical The application domain of electron crystallography will have to be carefully monitored according to sample size.
In protein crystallography a hidden artefact is inappropriate numbers of decimal places on non-covalent interatomic distances being presented in the figures of publications and where no ) started addressing this when he introduced the overall diffraction precision index (DPI) [see equation (1) below] which is widely applicable to studies made across a range of diffraction resolutions. The overall DPI for a study can be extended to individual atoms via equation (2) (Gurusaran et al., 2014). What chance is there of reaching precision, let alone accuracy, if the error on the positions of a protein's atoms is rarely considered?
has been given on those distances. This absence of a formalism to calculate a presumably led to the problem in the first place. Cruickshank (1999Cruickshank's form of the DPI is shown in (1), where σ(x, Bavg) is the DPI for an atom with an average B factor, Ni is the number of fully occupied atoms of type i, p = (nobs − nparams) is the number of observations minus the number of parameters for the atoms in the model, C is the completeness of the diffraction data, R is the R factor (or Rfree), dmin is the diffraction resolution and k is ∼1.0:
For low-resolution structures, the number of parameters may exceed the number of diffraction data. Then p = nobs − nparams is negative, so that σ(x) is imaginary. Cruickshank (1999) circumvented this difficulty empirically by replacing p with nobs and R with Rfree.
Since these papers were published there does seem to have been the good practice of non-covalent distances in the drawings of biological publications being at least quoted to an appropriate number of decimal places, usually guessed at previously, presumably.
4.3. Sample artefacts/variations
At the International Symposium for Diffraction Structural Biology 2019 in Osaka, Joachim Frank, recipient of the Nobel Prize in Chemistry 2017, in his plenary lecture described the advantage of cryoEM is that, unlike a crystal, the molecular complex is in its natural state. His objection is to the crystal which involves the regular packing of those molecules, thereby trapping single states when there might be multiple states (Chen et al., 2015). CryoEM offers a big advantage in having the capability to discern multiple states. This also seemed to me subject to the comment `true but', i.e. trading being free of a for possible cryo-artefacts, which as yet is a topic incompletely understood. In any case, protein crystallographers like myself, during the course of research studies, will have encountered the situation where from one crystal to another of the same protein the diffraction data do not agree well enough to merge them together. This has been contained as a problem by using the dendrogram classification approach (Foadi et al., 2013) but it also gives insights, by analysing different crystal clusters of a dendrogram, into the structural variations possible [e.g. see Fig. 6 of Foadi et al. (2013)]. The cryoEM approach works where multi-macromolecular complexes of a large size do not crystallize and it is a very powerful method of `seeing atoms' as well as resolving multiple states.
4.4. Challenges to structure precision
In both chemical and protein crystallography post-publication peer review shows that vigilance is required before a database entry can be trusted. Through the efforts of different groups of crystallographers in post-publication peer review of articles with their database downloaded files it has become apparent that there are numerous examples where corrections to crystal structures have been needed. These have been both at the level of improving the precision of deposited structures and of correcting inaccuracies on asserted structure–function relationships; see e.g. Rupp et al. (2016) for biological crystal structures. But it has not been the rule in biological crystallography for journal referees (and editors) to scrutinize articles with data with a PDB validation report. So it is perhaps unsurprising that database entries contain errors. Efforts have been made to improve the situation by at least describing the data science skills needed for biological X-ray crystallography referees (Helliwell, 2018). One benefit of such scrutiny is to point out to the editor that authors' results often have unexplained density, which can then remain unmodelled and often uncommented on in a publication. The various aspects that can challenge structure precision described above can be further circumvented if the underpinning raw data are available (Helliwell et al., 2017).
The divergence of the PDB from the Cambridge Structural Database (CSD) has also led to concerns by the CCDC (Cambridge Crystallographic Data Centre) itself about the quality of ligand structures in the PDB (Liebeschuetz et al., 2012): `The good, the bad and the twisted: a survey of ligand geometry in protein crystal structures'. This has been reiterated by Jaskolski et al. (2021) in a study of nearly 100 coronavirus main protease crystal structures deposited in the PDB, who concluded with a list of problems associated with reliability of the deposited structures, and offered a list of procedural weaknesses of these database depositions.
The situation in chemical crystallography is much better because at least for IUCr journals in this area refereeing of articles with data with a checkcif report is done. In spite of this leadership for rigour by the IUCr, this is not always followed amongst other chemical journals. As Schwalbe (2018) remarked for high-resolution chemical crystallography: `Problems can arise in crystallographic databases with errors and omissions in the representation of data that impede searches, and with errors in the actual data. While the Cambridge Crystallographic Data Centre with its Improvement Projects has solved many of the first category of problems, errors in atomic coordinates and other crystallographic data are surprisingly common. Although modern software warns of many types of error, such errors appear even in recently deposited Crystallographic Information Files. Richard Marsh found many examples of missed symmetry in assignment of the such errors are now waning. Hydrogen atoms are commonly placed in calculated positions. Particularly for OH and NH groups involved in hydrogen bonds, occupancy factors may need to be reduced to 0.5 or the hydrogen atom positions may require amendment. (For) imidazole derivatives sometimes only the consideration of bond distances and angles at the heteroatom can distinguish between OH or NH and unprotonated O or N. (There are cases of) mis-positioned hydrogen atoms in dihydrogen phosphates and water aggregates as well as mis-identified elements.'
Expanding on this, Clegg (2021) describes a lack of reliability of chemical crystal structures, namely `examples of poor experiments, misinterpretation of data, scientific bias and preconceived ideas, incompetence and even deliberate fraud'.
So, before one can even consider
precision of a physical probe, let alone benchmarks towards accuracy, the reliability as measured by an absence of calculational errors is needed. This lack of reliability across biological and chemical analysis also suggests that teaching schools are needed more than ever and more widely.Within the physics of diffraction, authors of a study generally focus on their `static' model coordinates' file derived from the processed `Bragg reflection intensities' via their model et al. (2018), Meisburger et al. (2020), and for a detailed example see de Klijn et al. (2019).
The diffuse scattering outside the Bragg spots is generally not only uninterpreted but also uncommented on in article narratives. Where this is true these are missed opportunities to connect the structures and their dynamics. For overviews, and descriptions of the benefits of analysing the diffuse scattering, see Wall4.5. Prediction-of-structures artefacts or a more precise check on our molecular structural reality?
The success of DeepMind in CASP14 [for a summary see Helliwell (2020)] raises interesting possibilities, pro and con. It predicts a protein's fold based on its amino-acid sequence remarkably well. So, does DeepMind's prediction mean that the deep learning's collective wisdom of all the experimental protein crystal structures in the PDB at the time of CASP14, i.e. its learning set of structures, achieves the collective precision of all of those? Does this collective precision then reach close to accuracy, as I have defined it in my introduction? More detail is awaited on how DeepMind's algorithms work. But from the DeepMind blog https://deepmind.com/blog/article/AlphaFold-Using-AI-for-scientific-discovery the public at large can learn that DeepMind's methods `relied on deep neural networks that are trained to predict properties of the protein from its genetic sequence. The properties our networks predict are: (a) the distances between pairs of amino acids and (b) the angles between chemical bonds that connect those amino acids. The first development is an advance on commonly used techniques that estimate whether pairs of amino acids are near each other.'
The success of DeepMind in CASP14 stirred me to reflect on our foundations of protein crystallography (Helliwell, 2020); I have written: `it is the detailed protein structure that determines function, not the protein fold per se, but DeepMind are clearly entering the area of positioning atoms in detail but not really with a clear level of precision at the atom by atom detailed level. Secondly, multi-domain proteins are currently outside of this achievement. Thirdly, where a structure is known, I note that structural dynamics studies start, i.e., in effect, while there may be one fold, there isn't one structure in function terms! In our experimental arena, kinetic crystallography is set to thrive with an expansion of X-ray lasers, the ESRF EBS (Extremely Brilliant Source), Diamond II etc. building on many earlier developments. In addition diffuse scattering is all there for the measuring and interpreting, i.e. structural dynamics again. Onto more specific topics: a third of all proteins are metalloproteins. Predicting where a metal might bind and which metal element it is, is not solved. And not just metal ions but all sorts of other ligands too will still have to be done experimentally.'
It was gratifying how DeepMind carefully acknowledged the experimentally determined structures of the protein crystallographers. A worldwide effort had gone into the synchrotron X-ray sources in the 1980s and 1990s and onwards to develop the multiple-wavelength method (Okaya & Pepinsky, 1956) for protein phase determination. A resume of that work at the SRS (Synchrotron Radiation Source) Daresbury, ESRF BM14, Elettra and at CHESS, amongst others, was given by Cassetta et al. (1999) and also the general phasing idea involving selenomethionine (Hendrickson et al., 1990) has been incredibly valuable.
5. Case studies
5.1. Element identification
The two case studies in this section show that the precision of a et al., 2009). Without this it was shown in a retrospective, i.e. post-publication, that reanalyses of the metal sites were necessary in a large proportion of cases and which led to improvements in existing PDB depositions (Grime et al., 2020). This study used ion beam analysis through particle-induced X-ray emission (PIXE), which quantitatively identified the metal atoms in 30 previously structurally characterized proteins. Over half of these metals had been misidentified in the deposited structural models and by using the correct metal their structural models were improved.
can be challenged not only in the standard uncertainties of the bond distances and angles but also in being sure of the identity of metal atoms. In protein crystallography in particular, synchrotron radiation can allow a fairly rapid and effective check of the content of a new protein crystal on the beamline using the spectrum. This is well applied at the ESRF for example (Leonard5.1.1. The location of manganese and calcium ion cofactors in pea lectin crystals by use of and tunable synchrotron X-radiation
From the outset of harnessing synchrotron radiation in crystallography the tunability was exploited to identify which metal atom was which. The study by Einspahr et al. (1985) used the enhancement of the of the manganese ions with a wavelength near the Mn K on the tunable, focused X-ray spectrometer for protein crystallography at the Daresbury Synchrotron Radiation Source. Accurate identification and location of the Mn2+ and Ca2+ ions was possible, based on the large relative difference in f ′′ anomalous components of the ions in the anomalous difference Fourier maps; such was not the case, based on the relative difference in atomic numbers of the ions, in the native electron density map.
5.1.2. Determination of zinc incorporation in the Zn-substituted gallophosphate ZnULM-5 by multiple-wavelength techniques at 11 X-ray wavelengths
The most X-ray wavelengths in an element identification study to my knowledge is that of Helliwell et al. (2010). The location of isomorphously substituted zinc over eight crystallographically different gallium sites was determined in a single-crystal study of the gallophosphate ZnULM5 involving the measurement of single-crystal diffraction data sets around the K edges of both Ga and Zn, as well as two reference data sets away from each (see Fig. 3). With these it was possible to selectively exploit dispersive differences of each metal atom type in turn, which allowed the major sites of Zn incorporation to be identified. As the crystal was non-centrosymmetric, with P21212, it was also possible to use anomalous differences to corroborate the results obtained from the dispersive differences. These results were obtained firstly from difference Fourier maps, calculated using a phase set from the refined structure from data measured at the Zr K edge. Also, refined dispersive and anomalous occupancies, on an absolute scale, could be obtained using the program MLPHARE (Collaborative Computational Project, Number 4, 1994), allowing estimates for the Zn incorporation of approximately 22 and 18 at.% at the M1 and M3 sites, respectively, to be obtained. JANA2006 (Petricek et al., 2014) also allowed the ready determination of standard uncertainties on the occupancy parameters, which for M1 and M3 were 20.6 (3) and 17.2 (3) at.%, respectively.
5.2. Realizing ultimate precision via transfer of ligand charge density results to protein as ligand binder
Whilst a charge density study of a protein et al. (2014) of sunitinib malate, an inhibitor of tyrosine kinases and approved as a drug in 2006. As the authors remark: `To obtain a deeper understanding of the interactions that are present in molecular complexes, i.e. beyond geometrical considerations following standard determination, analysis of charge-density distribution is desirable.' This approach has considerable potential to indirectly extend the precision of a protein to an ultimate level.
especially of functional or medicinal importance, is generally not possible, a charge density study of the ligand on its own may be viable. Furthermore, its electrostatic charge density distribution can be considered as a direct description of the protein's binding site if it is known that the ligand binds well in a competitive assay with the protein's natural substrate. A pioneering example of this kind is that of Malińska5.3. Enzyme catalysis in the crystal: hydroxymethylbilane synthase (Helliwell et al., 1998)
Utilizing the flow cell concept of Wyckoff et al. (1967) the structure of the catalytically active, reduced form of the enzyme hydroxymethylbilane synthase (HMBS, Lys59Gln mutant) was studied by Laue dffraction at the ESRF as the substrate, porphobilinogen (PBG), was fed to an immobilized crystal in a flow cell of our design [Fig. 4(a)]. Laue diffraction data were measured at a variety of time points up to 4 h. For another time point, with the substrate supply to a crystal in the flow cell having been stopped at 4 h, monochromatic data were collected at 12 h 30 min on ESRF BM14 (i.e. about 8 h after the substrate supply was stopped). The difference Fourier electron density maps showed that extended electron density appeared in the active-site region of the dipyrrole cofactor. Fig. 4(b) shows the reduced cofactor [PDB codes 2ypn (Nieh et al., 1999) and 1ah5 (Hädener et al., 1999)], proximal to the oxidized C2 ring position (PDB code 1pda; Louie et al., 1992). Most significantly this electron density was adjacent to and above the side chain of Asp84, which is known to play a pivotal role throughout the catalytic reaction cycle. That the electron density was not visible in the 12 h case showed that the product of the enzyme-catalysed reaction had been released into the crystal solution channel. By the time of the conclusion of this experiment, after 12 h, the crystals had changed from colourless to red. This was a significant, complementary observation to the changes in the enzyme active site. Subsequent to these experiments the human HMBS enzyme has been studied by static X-ray crystallography in the apo and ES2 forms by trapping of this state (Pluta et al., 2018; Kallio et al., 2021; Sato et al., 2021). The two approaches, time-resolved and static crystallography, yield complementary views of the intermediate structures, ES2. The static, trapped intermediate, crystal structures yield the most detail. The use of site-directed mutants of course does risk structural artefacts compared with the native enzyme structure. Overall this is an incredibly interesting enzyme with its complex mechanism fundamental to life. Namely, the addition of different metals to the cyclic ring forms haem, chlorophylls and vitamin B12. In humans, deficiency in the activity of this enzyme is directly associated with the dominant hereditary disease acute intermittent porphyria, more commonly known as the `madness of King George'. This is another direct link from the molecular level to a holistic biology, in this case a medical condition.
5.4. The predictive force of precise crystal structures – an iodoplatin for tumour therapy (Tanley & Helliwell, 2014)
That K edges would allow one to vary the of X-rays into a tumour pre-loaded with the iodoplatin. Cisplatin administration has previously been studied in this way involving radiation therapy using a synchrotron source and the Pt K edge alone (Biston et al., 2004; see also https://www.esrf.fr/UsersAndScience/Publications/Highlights/2004/Imaging/Ima10). To undertake such a dual K-edge radiation therapy involving iodinated cisplatin or carboplatin under patient conditions would build upon those experiments.
studies have predictive force is an implicit and explicit statement of exploring structure and function relationships. In trying to connect with the whole living animal the following example involving anti-tumour therapy is a good one. In an extensive theme of platin binding to proteins we unexpectedly came across an iodoplatin. This arose because of the chemical conversion of cisplatin and carboplatin cocrystallized with a model protein (hen egg white lysozyme) crystallized under sodium iodide conditions. The theme involved over 30 different crystal structures. It showed that the platins chemically bound through the platinum atom to a protein's amino acids (notably histidine) besides their nucleic acid structural chemistry, the former being side effects and the latter the actual anti-cancer therapy. One of our structures in particular, an iodoplatin bound to the protein, showed how a dual photon energy approach targeting the iodine and the platinum5.5. The use of protein powder diffraction and crystallography for the characterization of insulin polymorphs and analogues
The characterization of pharmaceutical samples which are absorbed by a patient in bulk form, such as insulin crystalline slurries, can only be done by powder diffraction. Karavassili et al. (2017), focusing on X-ray protein powder diffraction, reviewed research findings on human insulin microcrystals exhibiting upon physicochemical modifications of their environment (i.e. pH, ligand binding). Four new biologically active types of human insulin crystals were identified, and their structures successfully determined by a combination of powder and single-crystal diffraction measurements. Such research continues for pharmaceutical products containing microcrystals with improved activity and stability for patient benefit.
On the single-crystal side, seeking to improve a medical treatment, the high-resolution et al., 2021). The key molecular level comparisons between this of glulisine and of previous insulin crystal structures showed that a unique position of the glutamic acid, not present in other fast-acting analogues, pointed inwards rather than to the outside surface. This reduces interactions with neighbouring molecules and so increases preference of the more-active-for-patients dimer form, giving then a better understanding of the behaviour of glulisine. An unexpected finding was that the glulisine formulation is documented as a zinc-free insulin analogue for its rapid absorption action. Insulin crystallography has shown that zinc is pivotal for hexamer formation. The new glulisine showed zinc bound in the same way as in native insulin, by three histidine amino acids. This finding must mean that traces of zinc ions are present in the commercial, as-supplied, formulation solution. A further optimization for glulisine is now clear, that of finally removing the zinc.
of a fast-acting human insulin analogue, glulisine, has been determined (GillisThe full scope of protein powder diffraction, including to larger proteins, is described by Margiolaki (2019).
5.6. Determining the molecular mechanism of the colouration of a live animal, the lobster
In this effort to rise to the challenges laid down by the holistic biologists, Ottoline Leyser, Ernst Mayr and Paul Nurse, I describe the molecular mechanism of the colouration of a live animal, the study of the lobster shell being a particularly valid connection between the molecular level and an animal.
Spanning more than 50 years the methods of biochemistry, biological crystallography, spectroscopy, solution X-ray scattering and microscopy have been applied to study the molecular basis of the colouration of the live lobster. The blue colouration of the carapace of the European and American lobsters is provided by a multi-molecular carotenoprotein, α-crustacyanin. The European lobster (Homarus gammarus) crustacyanin has been the most extensively studied. Its α-crustacyanin complex extracted from the lobster carapace is a 16-mer of different protein subunits, each binding the carotenoid, astaxanthin (AXT). The biological purpose of the coloured shell is unknown although it may be a means of camouflage against predators such as octopus. A breakthrough in the structural studies came from the determination of the of β-crustacyanin comprising the protein subunits A1 with A3 with two shared bound astaxanthins (Cianci et al., 2002). This was based on diffraction data measured at 100 K. The crystal was blue at room temperature and blue at cryo temperature from which we can conclude that the 3D structure of the β-crustacyanin is free of cryo-artefacts, at least in terms of the protein astaxanthin molecular interactions. Furthermore solution X-ray scattering (SAXS) measurements of the β-crustacyanin showed an excellent agreement with the predicted SAXS curve from the cryo (Chayen et al., 2003). So, we can conclude that the crystalline state and the solution state structures of β-crustacyanin agree. The importance of colour as a simple but effective assay for a real-life connection is illustrated in Fig. 5 whereby the mechanism of change in colouration on cooking a lobster is explicitly shown and can be compared directly with the colour of a pure astaxanthin crystal, which matches the cooked lobster, and the live lobster whose colour matches the colours of the crustacyanin crystals.
6. Technological innovations matching the modern needs
Research and development of technology directions are highly exciting for structural molecular sciences researchers these days. These technology developments are truly extensive and include: new ultra-brilliant SR (synchrotron radiation) sources known as `ultimate storage rings' (MAX IV, PETRA III, NSLS II, the ESRF Extremely Bright Source, with other upgrade plans for ALBA, Diamond, PETRA IV and the Swiss Light Source for example); the X-ray lasers [Linac Coherent Light Source (LCLS), FERMI, SACLA] are into their second generation with the LCLSII and the EuroXFEL; the new neutron sources JPARC, SNS and ESS as well as the ILL's upgrade programmes; new
and electron crystallography capabilities; new higher-field NMR machines (1.2 GHz). These developments are accompanied by continued highly impressive detector developments such as those by the commercial companies.The case studies I have described in this article as examples from the fields of structural biology and chemistry are being built upon with experiments with these new technologies. The use of the flow cell approach of Wyckoff et al. (1967) is now reaching a much wider range of projects because crystal samples can now be as small as 1 µm (Schmidt, 2013). This means the diffusion time for the reactants to initiate the catalysis by an enzyme is orders of magnitude improved. This makes it feasible to use faster enzymes than hitherto for time-resolved crystallography. There is also a wide adoption of the serial crystallography approach at the new synchrotron beamlines, not only for time-resolved crystallography, but also for room-temperature determination.
The development of neutron sources and instruments is not only speeding up throughput of projects, but also widening their scope via larger unit-cell capabilities for neutron macromolecular crystallography, and thereby the molecular-weight ceiling is raised. CryoEM is also being enhanced by widening its scope to smaller molecular-weight systems. One can expect an overlap of capabilities of cryoEM with neutron studies as well as with X-rays.
7. Conclusions
I have brought to a focus in this article the views of several biologists who are sceptical of the molecular level approach. By quoting them I hope I have shown that their objections can be replied to constructively, if not exactly rebutted, by the crystallographer. Forming a constructive response is as much a state of mind and philosophy, and some might declare it unnecessary. But I think it is important, as important as explaining to the public and school children what we do. With this outlook and with our careful choice of projects, both societally relevant as well as curiosity driven, the investment in our field represented by our technological resources can continue to improve.
By adopting an overview of the probes of the structure of matter that we deploy I hope that I have also shown just how synergistic the one is with the other. Each probe however has at least one overriding advantage. X-ray
analysis offers direct ways to solve the structure in the first place by overcoming the Neutrons yield structures complete with hydrogens, and for biological crystallography this means with deuteriums. Electrons offer the chance to work with ultra-small samples, down to the single molecule. There are then ways to cross-check the crystalline state structural results are meaningful by solution scattering or time-resolved crystallography. NMR can be applied to determine the extent of disorder in a crystal as well as for structure and dynamics determination in solution. CryoEM yields precise structures for multi-macromolecular complexes and their multiple states. Arising from `small-molecule' chemical crystallography I have described the exciting changes deriving from the charge density structures as being transferrable, and indeed with direct relevance to molecular recognition of a ligand binding to a biological structure. The expansion of capability of materials powder diffraction into the protein powder diffraction domain is I think also a remarkable development. An unexpected new resource is the prospect of protein structure prediction of the DeepMind kind, which we are eager to learn more about.When coupled together, the precision of the methods used in structural science achieves accuracy. Unlike in physics training though, there is a middle ground between precision and accuracy where we can achieve predictive force from a precise structure which is in effect a form of accuracy. In similar vein the reductionist approach reaches towards the whole organism and in its physiological state.
All these aspects open new doors to change and extend the foundations of the structural molecular sciences.
Acknowledgements
I heartily thank: my PhD students, postdocs and collaborators; the STFC Daresbury Laboratory; the Universities of Manchester, York, Keele and Oxford; the synchrotron radiation and neutron facilities; the software developers CCP4, COOT, Phenix and JANA as well as my colleagues within the Daresbury Laue analysis software development spanning two decades; the University of Manchester
Unit; Diamond Light Source; the ESRF, the Institut Laue–Langevin and the EMBL in Grenoble; the funding agencies: SERC, Swedish Research Council, NATO, the Wallenberg Foundation, the EU, BBSRC, EPSRC, The Wellcome Trust, The Leverhulme Trust and The British Council; the databases especially the wwPDB and the CSD which our laboratory has used extensively these last 30 years. I am grateful to: Dr Jacqui Gulbis, Laboratory Head in the Division of Structural Biology, The Walter and Eliza Hall Institute of Medical Research, Parkville Victoria, Australia, Professor Alice Brink of the Department of Chemistry, The University of the Free State, Bloemfontein, South Africa and Professor Emmanuel Saradakis of the Institute of Nanoscience and Nanotechnology, National Centre for Scientific Research `Demokritos', Athens, Greece for their comments. I am grateful to a referee for drawing to my attention the powers of infrared spectroscopy.Biographical information
John R. Helliwell, DSc (Physics, University of York), DPhil (Molecular Biophysics, Oxford University) is Emeritus Professor of Chemistry at The University of Manchester, where he served as Professor of Structural Chemistry from 1989 to 2012. He is a researcher in the fields of crystallography, biophysics, structural biology, structural chemistry and data science. He has published over 200 scientific research papers and several books.
References
Ahmed, H. U., Blakeley, M. P., Cianci, M., Cruickshank, D. W. J., Hubbard, J. A. & Helliwell, J. R. (2007). Acta Cryst. D63, 906–922. Web of Science CrossRef CAS IUCr Journals Google Scholar
Almeida, J. D. & Tyrrell, D. A. J. (1967). J. Gen. Virol. 1, 175–178. CrossRef CAS PubMed Google Scholar
Andrews, S. J., Papiz, M. Z., McMeeking, R., Blake, A. J., Lowe, B. M., Franklin, K. R., Helliwell, J. R. & Harding, M. M. (1988). Acta Cryst. B44, 73–77. CSD CrossRef CAS Web of Science IUCr Journals Google Scholar
Bartalucci, G., Coppin, J., Fisher, S., Hall, G., Helliwell, J. R., Helliwell, M. & Liaaen-Jensen, S. (2007). Acta Cryst. B63, 328–337. Web of Science CSD CrossRef IUCr Journals Google Scholar
Barth, A. (2007). Biochim. Biophys. Acta, 1767, 1073–1101. Web of Science CrossRef PubMed CAS Google Scholar
Biston, M. C., Joubert, A., Adam, J. F., Elleaume, H., Bohic, S., Charvet, A. M., Estève, F., Foray, N. & Balosso, J. (2004). Cancer Res. 64, 2317–2323. Web of Science CrossRef PubMed CAS Google Scholar
Blakeley, M. P. & Podjarny, A. D. (2018). Emerging Topics in Life Sciences (Feb. 16), 2, 39–55. CAS Google Scholar
Bragg, W. L. (1968). Sci. Am. 219, 58–70. CrossRef CAS PubMed Google Scholar
Bryce, D. L. (2017). IUCrJ, 4, 350–359. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Cassetta, A., Deacon, A. M., Ealick, S. E., Helliwell, J. R. & Thompson, A. W. (1999). J. Synchrotron Rad. 6, 822–833. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chayen, N. E. (1998). Acta Cryst. D54, 8–15. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chayen, N. E., Cianci, M., Grossmann, J. G., Habash, J., Helliwell, J. R., Nneji, G. A., Raftery, J., Rizkallah, P. J. & Zagalsky, P. F. (2003). Acta Cryst. D59, 2072–2082. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chen, B., Kaledhonkar, S., Sun, M., Shen, B., Lu, Z., Barnard, D., Lu, T.-M., Gonzalez, R. L. & Frank, J. (2015). Structure, 23, 1097–1105. Web of Science CrossRef CAS PubMed Google Scholar
Cianci, M., Rizkallah, P. J., Olczak, A., Raftery, J., Chayen, N. E., Zagalsky, P. F. & Helliwell, J. R. (2002). Proc. Natl Acad. Sci. USA, 99, 9795–9800. Web of Science CrossRef PubMed CAS Google Scholar
Clabbers, M. T. B. & Abrahams, J. P. (2018). Crystallogr. Rev. 24, 176–204. Web of Science CrossRef CAS Google Scholar
Clegg, W. (2021). IUCrJ, 8, 4–11. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760–763. CrossRef Web of Science IUCr Journals Google Scholar
Cruickshank, D. W. J. (1999). Acta Cryst. D55, 583–601. Web of Science CrossRef CAS IUCr Journals Google Scholar
Cuypers, M. G., Mason, S. A., Blakeley, M. P., Mitchell, E. P., Haertlein, M. & Forsyth, V. T. (2013). Angew. Chem. 125, 1056–1059. CrossRef Google Scholar
Deacon, A., Gleichmann, T., Kalb (Gilboa), A. J., Price, H., Raftery, J., Bradbrook, G., Yariv, J. & Helliwell, J. R. (1997). Faraday Trans. 93, 4305–4312. Google Scholar
Derewenda, U., Derewenda, Z., Dodson, E. J., Dodson, G. G., Reynolds, C. D., Smith, G. D., Sparks, C. & Swenson, D. (1989). Nature, 338, 594–596. CrossRef CAS PubMed Web of Science Google Scholar
Du, M. & Jacobsen, C. (2018). Ultramicroscopy, 184, 293–309. Web of Science CrossRef CAS PubMed Google Scholar
Einspahr, H., Suguna, K., Suddath, F. L., Ellis, G., Helliwell, J. R. & Papiz, M. Z. (1985). Acta Cryst. B41, 336–341. CrossRef CAS Web of Science IUCr Journals Google Scholar
Emsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126–2132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Fischer, M. (2021). Q. Rev. Biophys. 54, e1, 1–15. Google Scholar
Foadi, J., Aller, P., Alguel, Y., Cameron, A., Axford, D., Owen, R. L., Armour, W., Waterman, D. G., Iwata, S. & Evans, G. (2013). Acta Cryst. D69, 1617–1632. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gemmi, M., Mugnaioli, E., Gorelik, T. E., Kolb, U., Palatinus, L., Boullay, P., Hovmöller, S. & Jan Pieter Abrahams, J. P. (2019). ACS Central Sci. 5, 1315–1329. CrossRef CAS Google Scholar
Gillis, R. B., Solomon, H. V., Govada, L., Oldham, N. J., Dinu, V., Jiwani, S. I., Gyasi-Antwi, P., Coffey, F., Meal, A., Morgan, P. S., Harding, S. E., Helliwell, J. R., Chayen, N. E. & Adams, G. G. (2021). Sci. Rep. 11, 1737. CrossRef PubMed Google Scholar
Grime, G. W., Zeldin, O. B., Snell, M. E., Lowe, E. D., Hunt, J. F., Montelione, G. T., Tong, L., Snell, E. H. & Garman, E. F. (2020). J. Am. Chem. Soc. 142, 185–197. CrossRef CAS PubMed Google Scholar
Gurusaran, M., Shankar, M., Nagarajan, R., Helliwell, J. R. & Sekar, K. (2014). IUCrJ, 1, 74–81. CrossRef CAS PubMed IUCr Journals Google Scholar
Hädener, A., Matzinger, P. K., Battersby, A. R., McSweeney, S., Thompson, A. W., Hammersley, A. P., Harrop, S. J., Cassetta, A., Deacon, A., Hunter, W. N., Nieh, Y. P., Raftery, J., Hunter, N. & Helliwell, J. R. (1999). Acta Cryst. D55, 631–643. Web of Science CrossRef IUCr Journals Google Scholar
Haertlein, M., Moulin, M., Devos, J. M., Laux, V., Dunne, O. & Forsyth, V. T. (2016). Methods Enzymol. 566, 113–157. Web of Science CrossRef CAS PubMed Google Scholar
Halle, B. (2004). Proc. Natl Acad. Sci. USA, 101, 4793–4798. Web of Science CrossRef PubMed CAS Google Scholar
Harris, R. K., Wasylishen, R. E. & Duer, M. J. (2009). NMR Crystallography. Chichester, UK: John Wiley. Google Scholar
Helliwell, J. R. (1988). J. Cryst. Growth, 90, 259–272. CrossRef CAS Web of Science Google Scholar
Helliwell, J. R. (2018). Crystallogr. Rev. 24, 263–272. Web of Science CrossRef CAS Google Scholar
Helliwell, J. R. (2020). IUCr Newsl. Vol 28, No. 4, p. 6. Google Scholar
Helliwell, J. R. (2021). Research Features, 134, 150–153. Google Scholar
Helliwell, J. R., McMahon, B., Guss, J. M. & Kroon-Batenburg, L. M. J. (2017). IUCrJ, 4, 714–722. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Helliwell, J. R., Nieh, Y. P., Raftery, J., Cassetta, A., Habash, J., Carr, P. D., Ursby, T., Wulff, M., Thompson, A. W., Niemann, A. C. & Alfons Hädener, M. (1998). Faraday Trans. 94, 2615–2622. CrossRef CAS Google Scholar
Helliwell, M., Helliwell, J. R., Kaucic, V., Zabukovec Logar, N., Teat, S. J., Warren, J. E. & Dodson, E. J. (2010). Acta Cryst. B66, 345–357. CrossRef IUCr Journals Google Scholar
Henderson, R. (1995). Q. Rev. Biophys. 28, 171–193. CrossRef CAS PubMed Web of Science Google Scholar
Hendrickson, W., Horton, J. R. & LeMaster, D. M. (1990). EMBO J. 9, 1665–1672. CrossRef CAS PubMed Google Scholar
Hoogerheide, D. P., Forsyth, V. T. & Brown, K. A. (2020). Phys. Today, 73, 36–42. CrossRef CAS Google Scholar
Ireland, T. (2014). Biologist, 61, 32–35. Google Scholar
Jaskolski, M., Dauter, Z., Shabalin, I. G., Gilski, M., Brzezinski, D., Kowiel, M., Rupp, B. & Wlodawer, A. (2021). IUCrJ, 8, 238–256. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Kallio, J. P., Bustad, H. J. & Martinez, A. (2021). iScience, 24, 102152. PubMed Google Scholar
Karavassili, F., Valmas, A., Fili, S., Georgiou, C. D. & Margiolaki, I. (2017). Biomolecules, 7, 63. Web of Science CrossRef Google Scholar
Klijn, T. de, Schreurs, A. M. M. & Kroon-Batenburg, L. M. J. (2019). IUCrJ, 6, 277–289. CrossRef PubMed IUCr Journals Google Scholar
Kühlbrandt, W. (2014). Science, 343, 1443–1444. Web of Science PubMed Google Scholar
Lattman, E. E., Grant, T. D. & Snell, E. H. (2018). Biological Small Angle Scattering, an IUCr Monograph. Oxford University Press. Google Scholar
Leonard, G., Solé, V. A., Beteva, A., Gabadinho, J., Guijarro, M., McCarthy, J., Marrocchelli, D., Nurizzo, D., McSweeney, S. & Mueller-Dieckmann, C. (2009). J. Appl. Cryst. 42, 333–335. Web of Science CrossRef CAS IUCr Journals Google Scholar
Lewandowski, J. R., Halse, M. E., Blackledge, M. & Emsley, L. (2015). Science, 348, 578–581. Web of Science CrossRef CAS PubMed Google Scholar
Liebeschuetz, J., Hennemann, J., Olsson, T. & Groom, C. R. (2012). J. Comput. Aided Mol. Des. 26, 169–183. Web of Science CrossRef CAS PubMed Google Scholar
Liko, I., Allison, T. M., Hopper, J. T. S. & Robinson, C. V. (2016). Curr. Opin. Struct. Biol. 40, 136–144. CrossRef CAS PubMed Google Scholar
Lin, Y. (2018). Exp. Opin. Drug. Discov. 13, 691–695. Web of Science CrossRef Google Scholar
Louie, G. V., Brownlie, P. D., Lambert, R., Cooper, J. B., Blundell, T. L., Wood, S. P., Warren, M. J., Woodcock, S. C. & Jordan, P. M. (1992). Nature, 359, 33–39. CrossRef PubMed CAS Google Scholar
Malińska, M., Jarzembska, K. N., Goral, A. M., Kutner, A., Woźniak, K. & Dominiak, P. M. (2014). Acta Cryst. D70, 1257–1270. Web of Science CSD CrossRef IUCr Journals Google Scholar
Margiolaki, I. (2019). International Tables for Crystallography, Vol. H. ch. 7.1, pp. 718–720. Chester: IUCr. Google Scholar
Mayr, E. (2007). What Makes Biology Unique? Cambridge University Press. Google Scholar
McCoy, A. (2021). In preparation. Google Scholar
Meisburger, S. P., Case, D. A. & Ando, N. (2020). Nat. Commun. 11, 1271. CrossRef PubMed Google Scholar
Middleton, D. (2009). NMR Crystallography, ch. 27, edited by R. K. Harris, R. E. Wasylishen & M. J. Duer. Chichester, UK: John Wiley. Google Scholar
Neutze, R., Wouts, R., van der Spoel, D., Weckert, E. & Hajdu, J. (2000). Nature, 406, 752–757. Web of Science CrossRef PubMed CAS Google Scholar
Nieh, Y. P. (1997). PhD thesis, University of Manchester, UK. Google Scholar
Nieh, Y. P., Raftery, J., Weisgerber, S., Habash, J., Schotte, F., Ursby, T., Wulff, M., Hädener, A., Campbell, J. W., Hao, Q. & Helliwell, J. R. (1999). J. Synchrotron Rad. 6, 995–1006. Web of Science CrossRef CAS IUCr Journals Google Scholar
Niimura, N. & Podjarny, A. (2011). Neutron Protein Crystallography, an IUCr Monograph. Oxford University Press. Google Scholar
Okaya, Y. & Pepinsky, R. (1956). Phys. Rev. 103, 1645–1647. CrossRef CAS Web of Science Google Scholar
Palatinus, L., Petříček, V. & Corrêa, C. A. (2015). Acta Cryst. A71, 235–244. Web of Science CrossRef IUCr Journals Google Scholar
Pauling, L., Itano, H. A., Singer, S. J. & Wells, I. C. (1949). Science, 110, 543–548. CrossRef PubMed CAS Web of Science Google Scholar
Petricek, V., Dusek, M. & Palatinus, L. (2014). Z. Kristallogr. 229, 345–352. CAS Google Scholar
Petsko, G. (1985). Methods Enzymol. 114, 141–146. CrossRef CAS PubMed Google Scholar
Pluta, P., Roversi, P., Bernardo-Seisdedos, G., Rojas, A. L., Cooper, J. B., Gu, S., Pickersgill, R. W. & Millet, O. (2018). Biochim. Biophys. Acta, 1862, 1948–1955. CrossRef CAS Google Scholar
Rupp, B., Wlodawer, A., Minor, W., Helliwell, J. R. & Jaskolski, M. (2016). FEBS J. 283, 4452–4457. CrossRef CAS PubMed Google Scholar
Sanjuan-Szklarz, W. F., Woińska, M., Domagała, S., Dominiak, P. M., Grabowsky, S., Jayatilaka, D., Gutmann, M. & Woźniak, K. (2020). IUCrJ, 7, 920–933. Web of Science CSD CrossRef CAS PubMed IUCr Journals Google Scholar
Sato, H., Sugishima, M., Tsukaguchi, M., Masuko, T., Iijima, M., Takano, M., Omata, Y., Hirabayashi, K., Wada, K., Hisaeda, Y. & Yamamoto, K. (2021). Biochem. J. 478, 1023–1042. CrossRef CAS PubMed Google Scholar
Schmidt, M. (2013). Adv. Condens. Matter Phys. 2013, 1–10. Web of Science CrossRef Google Scholar
Schrödinger, E. (1943). What is Life? The Physical Aspect of the Living Cell, based on lectures delivered under the auspices of the Dublin Institute for Advanced Studies at Trinity College, Dublin, in February. Reprinted by Cambridge University Press, 2012. Google Scholar
Schwalbe, C. H. (2018). Crystallogr. Rev. 24, 217–235. CrossRef CAS Google Scholar
Schwarzenbach, D., Abrahams, S. C., Flack, H. D., Prince, E. & Wilson, A. J. C. (1995). Acta Cryst. A51, 565–569. CrossRef CAS Web of Science IUCr Journals Google Scholar
Svergun, D. I., Koch, M. H. J., Timmins. P. A. & May, R. P. (2013). Small Angle X-ray and Neutron Scattering from Solutions of Biological Macromolecules, an IUCr Text. Oxford University Press. Google Scholar
Tanley, S. W. M. & Helliwell, J. R. (2014). Acta Cryst. F70, 1127–1131. Web of Science CrossRef IUCr Journals Google Scholar
Tilton, R. F. Jr, Dewan, J. C. & Petsko, G. A. (1992). Biochemistry, 31, 2469–2481. CrossRef PubMed CAS Web of Science Google Scholar
Turney, J. (2019). The Puzzle of Life. Times Higher Educational Suppl. 21st Feb., pp. 44–45. Google Scholar
Vainshtein, B. (1964). Structure Analysis by Electron Diffraction. Oxford: Pergamon Press. Google Scholar
Wall, M. E., Wolff, A. M. & Fraser, J. S. (2018). Curr. Opin. Struct. Biol. 50, 109–116. Web of Science CrossRef CAS PubMed Google Scholar
Wüthrich, K. & Wagner, G. (1975). FEBS Lett. 50, 265–268. PubMed Google Scholar
Wyckoff, H. W., Doscher, M., Tsernoglou, D., Inagami, T., Johnson, L. N., Hardman, K. D., Allewell, N. M., Kelly, D. M. & Richards, F. M. (1967). J. Mol. Biol. 27, 563–578. CrossRef CAS PubMed Web of Science Google Scholar
Yano, J., Kern, J., Irrgang, K.-D., Latimer, M. J., Bergmann, U., Glatzel, P., Pushkar, Y., Biesiadka, J., Loll, L., Sauer, K., Messinger, J., Zouni, A. & Yachandra, V. K. (2005). Proc. Natl Acad. Sci. USA, 102, 12047–12052. CrossRef PubMed CAS Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.