research papers
Squeezing the most from every crystal: the fine details of data collection
aStructural Genomics Consortium, Oxford University, Roosevelt Drive, Oxford OX3 7DQ, England, and bDiamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, England
*Correspondence e-mail: frank.vondelft@sgc.ox.ac.uk
Modern synchrotron beamlines offer instrumentation of unprecedented quality, which in turn encourages increasingly marginal experiments, and for these, as much as ever, the ultimate success of data collection depends on the experience, but especially the care, of the experimenter. A representative set of difficult cases has been encountered at the Structural Genomics Consortium, a worldwide structural genomics initiative of which the Oxford site currently deposits three novel human structures per month. Achieving this target relies heavily on frequent visits to the Diamond Light Source, and the variety of crystal systems still demand customized data collection, diligent checks and careful planning of each experiment. Here, an overview is presented of the techniques and procedures that have been refined over the years and that are considered synchrotron best practice.
Keywords: data collection; data-collection strategy; structural genomics.
1. Introduction
Recent decades have seen an astonishing improvement in terms of synchrotron performance, beamline instrumentation and data-processing software. Protein crystallography is now a mainstream research technique and the rate of structure deposition in the Protein Data Bank keeps increasing (Abad-Zapatero, 2012). Nevertheless, despite much automation, data collection remains a nontrivial experiment that requires diligent planning and careful checks, while the and of third-generation synchrotrons pose a whole additional set of challenges (Chavas et al., 2012; Dauter et al., 2010; Fodje et al., 2012; Gabadinho et al., 2010; González et al., 2008; McCarthy et al., 2009). The availability of ever smaller and brighter beams has boosted the field on the one hand, but it has also increased the requirement of the experimenter to make the correct choices at the beamline. The experiment potentially remains tricky and still requires experience and quick, accurate decisions.
The validity of this truism can be well observed in large research laboratories dedicated to structure solution, such as the Structural Genomics Consortium (SGC). The Oxford site of this international initiative currently deposits three (historically six) novel human protein crystal structures per month, and its productivity depends crucially on frequent visits to the state-of-the-art beamlines at the Diamond Light Source and, before 2009, to the Swiss Light Source. In terms of operational efficiency, while the cornerstone of the whole gene-to-structure process is to work in parallel, we have consistently found that if diffracting crystals have been identified then the most effective route to reducing additional biochemical effort is to approach data collection with extreme diligence in order to obtain the best data possible.
Thus, the output of a structural genomics project is not sustained by indiscriminately collecting vast amounts of data and hoping that their sheer number will eventually lead to the desired number of structures, contrary to the popular sense invoked by the term `high-throughput' that is often used in the same context. As is frequently pointed out, the collection of diffraction data is the last real experiment and errors made at this stage may make structure solution unnecessarily difficult or even impossible. Indeed, by focusing on collecting few but high-quality data sets, we found that not only is the data analysis itself accelerated, but also the average amount of wet laboratory work required per project is greatly reduced; the time consumed by attempting to evaluate large quantities of mediocre data can never be overestimated.
Here, we provide an overview of the methods and routines that we have applied and refined over eight years of data collection at both the Diamond Light Source and the Swiss Light Source. This topic has been discussed extensively in the literature (Dauter, 1999, 2010; Evans, Axford, Waterman et al., 2011; Pflugrath, 2004; Rupp, 2010; Mueller et al., 2007) and much of this will be obvious to experienced synchrotron users; certainly not everything is crucial to every situation, but we have consistently found that shortcuts taken by impatient operators have come back to haunt them and their projects.
2. Homework
By far the most reliable route to good data is to have a good crystal, so effort spent in finding one before coming to the synchrotron generally pays off. Crystal morphology is not a predictor of diffraction quality, and one should therefore always explore all crystals grown in different crystallization conditions. Moreover, there are many ways to generate variant crystals, e.g. crystallization in the presence of different ligands (Vedadi et al., 2006), crystallization of different constructs (Gileadi et al., 2007), seeding (D'Arcy et al., 2007), surface mutagenesis (Derewenda, 2010), lysine methylation (Walter et al., 2006), diversified screening approaches such as the `silver bullets' screen (McPherson & Cudney, 2006) and in situ proteolysis (Wernimont & Edwards, 2009). Post-crystallization procedures such as dehydration or annealing can in some cases lead to impressive improvements in diffraction quality (Heras & Martin, 2005; Sanchez-Weatherby et al., 2009). Crystals should be mounted with great care and adhering to some simple rules will always be beneficial for the quality of a data set without any additional overheads (Pflugrath, 2004). A few points are worth emphasizing.
Modern synchrotrons, in combination with sample-changing robotics, allow the rapid screening of many crystals at the beamline, but at SGC the policy remains that all crystals have to be first characterized for diffraction quality (although not necessarily space group) on our in-house rotating-anode generator to allow them to be prioritized for the synchrotron; unscreened crystals generally receive low priority (Figs. 1a and 1b). We have found this to be crucial as it allows us to focus our efforts and attention during the limited period of a beamtime on the actual collection of data. State-of-the-art rotating-anode generators often allow a realistic assessment of the quality of protein crystals: the diffraction observed for crystals larger than 100 µm on the Rigaku FR-E generator at the SGC (300 µm beam size) is typically between 0.5 and 2 Å poorer in resolution than that observed at the synchrotron, and spectacular improvements are only observed for crystals smaller than 50 µm (Figs. 1c and 1d). Additionally, pre-screening helps to exclude salt crystals, to identify issues with freezing or to identify crystal problems early on. Samples are prioritized taking account of both project importance and in-house screening results, so generally it is clear what kind of data-collection approach will be required for each project and how much time it will require even before the synchrotron visit starts. This procedure ensures that the time at the synchrotron is used as efficiently as possible.
3. Mechanical alignment of the beam with the spindle axis
Modern beamlines are remarkable machines, and we have found beamline support and maintenance to be consistently excellent worldwide, not only at our usual `home' synchrotrons. Nevertheless, `drift' is an unavoidable phenomenon for any system where movements of 10 µm or less are important, and since these are the typical experimental dimensions at modern synchrotrons, both the beam and the rotation axis can easily drift from the intersection point marked on the screen. It is therefore crucial that the user understands the beamline and takes responsibility for being alert to whether the beam is in fact hitting the crystal at all times and knowing how to diagnose and correct this both before and during the experiment. Even for the Diamond beamlines, whose stability we are very familiar with, we set aside time at the start of each visit for a start-up routine to check some critical features together with the beamline scientists; if nothing else, this reminds us as users how to control the beamline and diagnose it once we are on our own during the rest of the visit.
On-axis viewing systems (Perrakis et al., 1999) offer the extraordinary benefit that users are able to directly visualize the crystal-to-beam intersection. The beam position can easily be checked by mounting an X-ray sensitive screen to verify that the beam is actually at the position of the crystal, and many beamlines have motorized YAG screens that can be rapidly placed in the beam, even while the sample is on the goniometer. What should also be investigated is how different slit or aperture settings affect both the shape and the position of the beam; even if the viewing software draws outlines of the apparent beam, these will necessarily only be approximations of the actual beam profile. It is advisable to repeatedly review the position of the beam over the course of a visit, especially when working with small samples; when beams drift, this happens on the timescale of hours, and if the beam position is not regularly checked this can easily lead to false-negative results. It is also important to be familiar with how to correct for such drifts when they occur.
Equally important is to assess the sphere of confusion: the alignment of the rotation axis and its rotational accuracy. A large sphere of confusion will cause the sample to `wobble' around the beam, and assessing it requires a well defined point mounted on the goniometer; typically, beamlines have sharp needles mounted on standard bases available and we have also found it useful to carry our own. On a well aligned rotation axis, the tip of the object (needle) should stay in the marked beam centre for an entire 360° oscillation, but mechanical limitations or a slight offset of the axis may prevent this. The latter is easily corrected by the beamline controls, but the former is a feature of the instrument and this rotation-dependent deviation may need to be taken into account when planning a particular diffraction experiment, especially for very small crystals and experiments that require a full 360° rotation (§4).
4. Alignment of sample to beam
The most fundamental aspect of the experiment is that the beam must properly intersect with the sample for the entire oscillation. This is not a problem when the crystals are much larger than the beam, but with small crystals and small beams it is easy to get wrong. When aligning visually, the main obstacle to aligning the sample is parallax owing to the shape of the liquid surrounding the crystal in the loop; the only reliable view of the crystal is perpendicular to the loop, and identifying this orientation should be the first step during the alignment of each sample. The quickest way to do so is to find the `side-on' direction, since it is most readily distinguished; from there, a 90° oscillation brings up the `face-on' view. Any alignment should be confirmed by viewing the loop from the other side, i.e. by applying a 180° rotation.
However, the side-on orientation presents a fundamental problem whenever the crystal does not extend between both surfaces of the liquid in the loop or when it is thinner than the thickness of the loop material itself. In this case, diffraction must be used to locate the crystal by scanning X-rays across a region of interest and measuring the diffraction strength (Aishima et al., 2010; Hilgart et al., 2011). Low doses are recommended for such scans in order to avoid unnecessary X-ray damage, since only a small fraction of the photons required for an image in a properly measured data set are sufficient to identify the strongest diffraction. While automated analyses are helpful when available, we recommend visual inspection of the diffraction pattern, especially for low-dose exposures. This is easily performed by opening an image in a diffraction-image viewer, selecting an obvious Bragg spot, zooming in to view the actual pixel counts and then finding the image with the highest number of counts in the spot.
Grid scans also offer the opportunity to characterize diffraction throughout the crystal; in our experience, crystals very commonly diffract inhomogeneously, especially those that are poorly ordered and thus diffract to low resolution, regardless of their morphological appearance. Crystals of membrane proteins appear to be particularly prone to this phenomenon (Evans, Axford & Owen, 2011), and even when large crystals can be generated the use of the microfocus beam of I24 remains informative.
Visual alignment encounters two other common obstacles. (i) Loops that are significantly bent away from the rotation axis will present further parallax, and to view these reliably may require manual intervention to straighten the loop or else the use of kappa goniometry where available (see §6.4). (ii) Surface ice obscures the crystal, but also degrades the diffraction pattern with strong powder diffraction rings. We have been able to remove it routinely by blowing it away with a stream of liquid nitrogen using a Cry-Ac cryosurgery device (Brymill, Connecticut, USA; Fig. 2; James Holton, personal communication). Be aware, though, that the cold gas stream may damage the beamline equipment and the use of such a device should be discussed with the beamline scientists beforehand. Crystal `washers', which flood the crystal with a stream of liquid nitrogen on demand, are also becoming more widespread at beamlines (Diamond Light Source, personal communication).
Use of very small crystals on beamlines with a sphere of confusion of similar magnitude to either the crystal or the beam requires special attention: either angular ranges with the largest offset need to be avoided or else several re-alignments may need to be performed in the course of a 360° data set either side of a particular oscillation range (we use scans of 90°). Care also has to be taken when setting up helical data collection (Evans, Axford, Waterman et al., 2011; Flot et al., 2010; see §6.5): for such scans on thin rods on a microfocus beamline the end points need to be determined with the utmost care and diffraction-based alignment (Flot et al., 2010; Hilgart et al., 2011; Song et al., 2007) is strictly required, since even small parallax may be sufficient to offset the apparent position of the crystal.
Another source of misalignment is more prosaic, namely defective sample mounts, where either the pin is loose in the base or the loop itself is loose in the pin; certainly this is worth investigating before instigating a more wide-ranging diagnosis of potential beamline failures. Incidence of this problem can be avoided by diligent pin management in the home laboratory, but for the inevitable exceptions, if a particularly valuable sample is involved, creative application of Plasticine may temporarily alleviate the problem.
5. Documentation of experiments
The need for rigorous documentation of diffraction experiments is easily overlooked during preparation for synchrotrons, yet it is of the utmost importance, since when it is neglected inconsistent practices will lead over time to a set of widely divergent notes that are difficult to search and are usually significantly incomplete. We have found it invaluable to develop a single documentation convention that has to be followed by every group member. Mostly, this merely requires only a little thought from an experienced synchrotron user in the group, and if the conventions are properly disseminated people are perfectly willing to adopt them.
The most basic aspect concerns files and directories, which can be judiciously named to be self-documenting. The use of arbitrary names is common (`test1', `crystal3', `thing'), but this practice should be strongly discouraged since such naming will at best be informative to one individual, but will mostly mean the experiment cannot be reconstructed by anybody, including the creator after a few weeks. At SGC Oxford we adopted a naming convention that maps onto that of the CCP4 program suite (Winn et al., 2011): each harvested crystal is given a unique crystal ID from a given project (e.g. BBOX1-x023 for project BBOX1) and each data set is given a unique data-set ID (BBOX1-d005), where a data set comprises one or more passes collected consecutively in time. These are combined for image names: BBOX1-d001-x003, BBOX1-d002-x047 etc. All of the data files are stored in a uniformly organized file system, which it is advisable to keep separate from where processing occurs: this greatly facilitates the management of disc space, since it makes it easier to locate the far larger data directories for deletion (after backup). Such practice is not encouraged by the default behaviour of the common data-integration packages, yet is nevertheless easy to perform in all of them.
Rigorous use of database infrastructure, where available, provides even greater benefits. SGC Oxford, which comprises around 80 researchers from diverse backgrounds ranging from cell biology, (bio)chemistry, informatics, engineering to crystallography, has benefited enormously from a central database (BeeHive; Molsoft LLC, La Jolla, California, USA) and an electronic laboratory journal (Contur ELN; Contur Software AB, Sweden). Every step of a project from target selection to structure deposition can be rapidly reconstructed through the query engine, because this has been the only formal experimental record accepted in the laboratory from the start. The consistent utility that this infrastructure has provided, not least for real-time decision-making, prompts us to submit that such systems should be a strict requirement in any large operation, whether industry or academic efforts such as the SGC; even only a subset of functions would provide enormous benefit to smaller research groups. Freely available database systems are available, including a publicly hosted server of PIMS (Morris et al., 2011), and many synchrotrons offer database support systems such as ISPyB, which record all details of a diffraction experiment (Delagenière et al., 2011). However, the usefulness of such a system stands and falls with the willingness of the head of the laboratory to enforce both its use and the consistency of the corresponding naming conventions.
6. Data collection
Even when the crystals have been pre-screened in-house, it is very important to reassess at least several of the best crystals before investing the time to set up and execute data collection. Modern sample changers are extremely reliable and we cannot confirm anecdotal reports that dismounting crystals can lead to loss of diffraction: in more than 650 diffracting projects, crystals that diffracted before being returned to liquid nitrogen, invariably still diffracted just as well when replaced on the goniometer. Indeed, rather than introducing risks, a willingness to remove cryopreserved crystals from the goniometer once tested, allows synchrotron time to be used much more efficiently.
Another potential pitfall is presented by the revolutionary speed of modern detectors, especially pixel-array detectors, which makes it possible to collect many more data sets in the allotted beamtime than in the past. This capability is a powerful tool, but we also observe that, at least for unphased structures, meticulous planning of the diffraction experiment cannot be dispensed with and cannot be compensated for by `data-set roulette' i.e. the collection of data as quickly as possible from as many crystals as possible in the hope that one is good enough. Moreover, data processing can become very time-consuming or even impossible when crystals are suboptimal, suffer from radiation damage or are poorly aligned, and ultimately no time is saved.
Finally, it is important to become closely familiar with the necessary software, experimental requirements and strategy options before arriving at the synchrotron, because once there there is always great time pressure, and the ability to generate and understand real-time diagnostics often makes the difference between success and failure. Historical data sets from other projects in the laboratory are an excellent resource for learning the software.
6.1. Initial characterization
Once a sample is properly aligned we record two diffraction images that are 90° apart, since frequently diffraction appears to be good in one crystal orientation but is problematic in the perpendicular direction. For this reason, careful visual inspection of diffraction is important using an image viewer such as ADXV (https://www.scripps.edu/~arvai/adxv.html ) or ALBULA (https://www.dectris.com/software_albula.html ) to identify diffraction pathologies such as split spots, multiple lattices, smeary spots, ice rings or anisotropic diffraction.
Although beamline staff can advise on typical exposure times and transmission settings for their beamline, this will not necessarily allow adequate characterization of a given crystal and, especially when diffraction is weak, higher fluxes should be explored to see whether reflections start to appear at higher resolution or whether there is just an increase in background intensity. For a semi-quantitive assessment, the program iMosflm (Battye et al., 2011) provides the invaluable facility of a graphical view of I/σ(I) per resolution bin when integrating even only a single image (Fig. 3), provided that the program succeeds in indexing the diffraction. This exercise has the added advantage of allowing a visual comparison between the predicted and observed diffraction, which is easy to overlook when relying on nongraphical (scripted) integration.
If attempts at indexing fail, it may be better to search for a better crystal; on the other hand, if there are no better crystals it is certainly worth proceeding with data collection, since frequently data processing does then succeed when all images of the data set are available.
6.2. Beam size and shape
An ideal diffraction experiment would have the beam illuminate only the crystal and no more, since all unnecessarily illuminated material will contribute background scattering and degrade the measured signal. If the beamline provides the option to change the beam size easily, this should be explored, including a semi-quantitive assessment of the resolution limits detailed in §6.1. In particular, one needs to be aware how the beam size is related to the beam since beamlines achieve changes in beam size either by cutting the beam, which reduces the or by refocusing, which preserves the the exposure time or transmission may need to be adjusted to compensate accordingly, sometimes by several orders of magnitude.
Appropriate tailoring of beam size and shape can also be critical for obtaining good data from inhomogeneous crystals once a volume of good diffraction has been identified (§4). Additionally, one can consider adjusting the beam size as the crystal rotates and presents a changing profile to the beam, and the exposure times need to be adjusted too; this has not been our standard practice, although in extreme cases it may significantly improve data quality and thus be worth the effort of setting up.
6.3. Geometric strategy: oscillation range
Although a 180° oscillation will always yield a set of images in which all accessible indices have appeared at least once, and although this can be achieved within seconds on bright beamlines with fast detectors (Owen et al., 2012), it is critical to realise that the crystal may have deteriorated well before the experiment is over. The consistent undermining factor is radiation-induced decay and the fact that the lifetime of the crystal is in general unknown beforehand. Hence, a sensible data-collection strategy is to use the minimal angular range required to collect a complete data set for the probable Laue group, as this maximizes the likelihood of measuring all of the required reflections before radiation damage dominates. If anomalous signal is to be measured, it is always beneficial to maximize the observational multiplicity by collecting as large an oscillation range as practically feasible, and even multiple revolutions at different crystal offsets (Debreczeni et al., 2003; §6.4), adjusting the dosage strategy accordingly (§6.5).
When targeting high resolution it is the counting statistics of the weak reflections that is limiting (Holton, 2009), and it suffices to ensure that all reflections are measured. However, one should always collect a larger angular range than that prescribed by the geometric strategy in case the crystal does survive the minimal range, because increased multiplicity will yield improved data accuracy and signal to noise (Garman, 1999). If the later images of the data set do end up overly compromised by radiation damage, they can be removed during the final data processing.
All data-processing packages can calculate such strategies, and it is recommended to work with the package that one is most familiar with. A more sophisticated strategy is provided by the program BEST (Bourenkov & Popov, 2010), which is most typically available via the automated EDNA software (Incardona et al., 2009); it not only calculates the geometric strategy but also provides recommendations for exposure time and predicts the outcome of the experiment.
In some cases, especially for crystals with low symmetry, the crystal orientation means that no single oscillation range will yield complete data. In these instances, it is necessary to change the crystal orientation (§6.4) and to collect data for a second oscillation. Strategy programs do allow this to be calculated, but making use of the information requires dedication and patience, unless the beamline is set up appropriately, e.g. with the STAC software (Brockhauser et al., 2011); also, the crystal must be able to survive additional irradiation.
6.4. Overlap strategy: oscillation angle
Another important consideration is the oscillation angle per image, in order to avoid spatial overlap of diffraction spots on the detector. The availability of pixel area detectors makes it feasible to use fine slicing (Pflugrath, 1999) by default, and apart from the potential gains in data quality and signal to noise (Mueller et al., 2012), this also greatly reduces the risk of spot overlap.
However, even fine slicing will not help when a sufficiently long unit-cell axis is oriented perpendicular to the beam: in such cases the geometry ensures that there will be spatial overlap even for still images, with no guarantee that any algorithm can deconvolute the overlapped intensities reliably. This particular problem is easy to spot with an interactive program such as iMosflm, as it displays the percentage of overlaps for a given oscillation angle as long as the mosaicity has been properly estimated.
This long-axis problem can only be circumvented by reorienting the crystal: this is straightforward with a goniometer that supports κ offsets (e.g. a kappa motor), although many modern beamlines do not have this facility. Alternatively, if one has previous knowledge of how the crystal morphology is related to the unit-cell axes, one can try to mount the crystal in a favourable orientation (§2); some manufacturers (e.g. Hampton Research, California, USA) sell special bendable loops for this purpose. However, in the absence of all of this (and with a bit of practice) it is very feasible to bend nylon loops while they are mounted on the goniometer (Fig. 4; Dauter, 1999). This naturally cannot achieve any angular accuracy, yet the procedure has proved itself to be adequate for a significant number of projects in which reorientation was crucial to success but where kappa goniometry was not available. Non-nylon sample mounts are much less amenable to this procedure.
6.5. Dosage strategy: transmission and exposure time
At third-generation synchrotrons, where
is no longer limiting, the specific question of how much X-ray exposure to apply per image or unit of oscillation is almost entirely one of how much radiation damage will be tolerated by the crystal, or indeed the atoms with when measuring anomalous signal.General radiation damage is characterized by the fading of high-resolution reflections over the course of data collection (Owen et al., 2006), and the highest resolution reflections observed on an initial image, usually taken with high seldom survive for the duration of the data set. Thus, the initially estimated resolution is rarely achieved for the whole data set, particularly for smaller crystals (Holton, 2009). Additional technical but also scientific problems arise from site-specific radiation damage of certain chemical groups in the crystal, and such effects may start much earlier and may restrict the usefulness of the final structure (Weik et al., 2000; Ravelli & McSweeney, 2000) or else the anomalous signal. There is now extensive literature on this topic (Garman & Weik, 2013), including software that generates estimates of the total dose that the crystal will tolerate and therefore how to divide it up in the oscillation range (Paithankar & Garman, 2010; Bourenkov & Popov, 2010; see §6.3). However, such estimates are fully reliable only when the whole experiment is very carefully parameterized, which in general is not realistic (Krojer & von Delft, 2011). We therefore resort to an empirical approach, collecting an exploratory data set typically either from one end of the crystal or from a similar crystal of slightly less good quality and judging the radiation tolerance of the crystal from the data-processing statistics.
A rigorous implementation of this approach exists at the beamlines of the ESRF synchrotron as an automated option in the MxCube software (Brockhauser et al., 2012; Leal et al., 2011). If such a proper parameterization is not available, a useful estimate of the lifetime can nevertheless be gained by considering a few statistics and after how many images (or seconds) they reach a certain threshold.
The observed lifetime must of course be corrected for the transmission setting used for the data set. Estimates of radiation tolerance have proven to be approximately transferable between beamlines for similarly shaped crystals of the same
if nominal beamline and the size of the beam are taken into account.One route to avoid gauging radiation tolerance a priori is `dose slicing', whereby the same oscillation range is remeasured repeatedly at low dose, with each pass receiving a fraction of the total tolerated dose. By integrating each pass individually, damage can be assessed with statistics from all reflections and only earlier passes are merged. Theoretically, this is no different to a single high-dose pass, and is also not much slower with modern zero-noise continuous-readout detectors. This approach is standard for systematic studies of radiation damage (Kmetko et al., 2006; Owen et al., 2006), but limitations remain for routine practical use, including the ability of integration programs to process images with very few photons, much increased disk-space requirements and the lack of well established cutoff criteria.
For crystals that are significantly longer than the beam is wide, many beamlines now allow helical data collection, whereby during rotation the crystal is translated through the beam along the rotation axis (Evans, Axford, Waterman et al., 2011; Zeldin et al., 2013). This increases the effective total crystal volume, allowing a higher total dose to be used for the data set; however, it does require the crystal to diffract uniformly (see §4). An estimate of radiation tolerance is still very useful, as it allows one to calculate how slowly the crystal can be translated, which in turn maximizes the yield of diffracted photons from a given crystal volume.
For CCD detectors, which have a lower ). In such cases the approach should be first to collect a quick low-dose low-resolution data set that ensures that all of the reflections can be integrated reliably, followed by a high-dose pass for high-resolution reflections. For the latter, useful rules of thumb can be generated (Holton & Frankel, 2010), such as limiting exposures on ADSC Q315r detectors so that background levels do not exceed 100 counts (see equation 18 of Holton & Frankel, 2010). (Even pixel-array detectors can show a `temporal overload' effect if diffracted photons `pile up' by arriving too close in time, but this occurs only for beams consisting of extremely high-flux pulses, such as X-ray free-electron lasers, and we are not aware of MX beamlines where this is of practical relevance.)
than hybrid pixel detectors, one has to be alert to the possibility of overexposure, since this will lead to individual pixels being overloaded. Consequently, the intensity of the affected reflections cannot be reliably established, leading either to wrong measurements or to otherwise incomplete data in the lowest resolution shells, depending on whether one instructs the integration program to reconstruct the intensity or to reject the reflections as `overloads'; both outcomes present significant problems for structure solution (Dauter, 2010For experiments where a full data set can be collected in a very few seconds, such as very strongly diffracting or low-mosaicity crystals on sufficiently intense beamlines, one should be aware of beam flicker, a high-frequency fluctuation of beam intensity which has many origins. A quick experiment requires the spindle to rotate so fast that each spot is in diffraction condition briefly enough to fall only within an intensity peak or trough of the beam, which introduces potentially significant systematic errors between related reflections. One relevant scenario is when collecting the `low-resolution' pass and, for the sake of saving time, the crystal rotation speed is increased instead of reducing the beam transmission. Beam flicker is a complex property of any beamline and very difficult to characterize, let alone link to diffraction quality; it is easily avoided by maintaining reasonable oscillation speeds (>0.5 s deg−1 at Diamond beamlines) and attenuating the beam instead.
6.6. Crystal-to-detector distance
The detector should on the one hand be placed as far as possible from the crystal to dilute the background scattering of the direct beam by air, the sample mount and indeed the crystal itself, but close enough to record all possible reflections (Dauter, 2010). We usually take the best image from the initial characterization (see §6.1) and integrate it with MOSFLM. As a rule of thumb, we set the resolution limit to where the signal-to-noise ratio drops below 1 in this initial and somewhat crude assessment. Recently, it has been suggested (Karplus & Diederichs, 2012) that somewhat higher resolution data should be allowed than has been traditionally recommended; indeed, this probably matches common practice anyway (and certainly ours), since most experimenters tend to hope for better resolution than will probably be achieved; what is new is the recommendation to include such measurements in processing and Nevertheless, the distance should remain somewhat reasonable, e.g. not setting it for 2 Å resolution data when no spots are evident beyond 3 Å resolution.
7. Data processing and diagnostics
Given how many things can go wrong, it is absolutely crucial to be aware of whether the data being collected are of useful quality or not. In the best case, poor data allow an accurate diagnosis (e.g. misaligned beam, decaying crystal), allowing one to rerun the experiment and obtain better data; in the worst case one can recognize the futility of the project and proceed to the next one.
It is thus uncontestably best practice to attempt data processing in as close to real time as possible. Many synchrotrons offer not only excellent computing facilities but also automated data processing; the facilities at the Diamond Light Source have become indispensable for much of our work, particularly since the installation of continuous-readout PILATUS detectors (Winter & McAuley, 2011). However, the onus still remains on the user to actually look at the output, know how to interpret it, attempt to correlate it to the actual experiment and attempt data processing manually when automated methods fail.
8. Summary
This article emphasizes data collection from novel poorly characterized crystal systems. Of course, many shortcuts make sense when a system is well known, e.g. multiple protein–ligand complexes (often called `molecular substitution'; Wasserman et al., 2012). However, for difficult projects we have found that one cannot be careful enough. Regardless of how easy it seems on modern facilities, X-ray data collection is actually a complicated experiment, and given how tiny the signal actually is, it is astonishing that we are able to measure what we do. The fact that this can nowadays be easily forgotten is a testament to the huge developments over recent decades.
Nevertheless, crystals that need special care will appear with great regularity; for these, an intimate knowledge of the experiment will remain crucial for many years to come.
Acknowledgements
The SGC is a registered charity (No. 1097737) that receives funds from the Canadian Institutes for Health Research, Genome Canada, GlaxoSmithKline, Lilly Canada, the Novartis Research Foundation, Pfizer, Takeda, AbbVie, the Canada Foundation for Innovation, the Ontario Ministry of Economic Development and Innovation and the Wellcome Trust (092809/Z/10/Z). We thank the Swiss Light Source and the staff of the PXII beamline, in particular Clemens Schulze-Briese, Ehmke Pohl and Santina Rossi, as well as Diamond Light Source and the staff of all MX beamlines for providing excellent facilities and patient support. Finally, we thank the reviewers for highly constructive comments that helped to greatly improve the manuscript.
References
Abad-Zapatero, C. (2012). Acta Cryst. D68, 613–617. Web of Science CrossRef CAS IUCr Journals Google Scholar
Aishima, J., Owen, R. L., Axford, D., Shepherd, E., Winter, G., Levik, K., Gibbons, P., Ashton, A. & Evans, G. (2010). Acta Cryst. D66, 1032–1035. Web of Science CrossRef CAS IUCr Journals Google Scholar
Battye, T. G. G., Kontogiannis, L., Johnson, O., Powell, H. R. & Leslie, A. G. W. (2011). Acta Cryst. D67, 271–281. Web of Science CrossRef CAS IUCr Journals Google Scholar
Bourenkov, G. P. & Popov, A. N. (2010). Acta Cryst. D66, 409–419. Web of Science CrossRef CAS IUCr Journals Google Scholar
Brockhauser, S., Svensson, O., Bowler, M. W., Nanao, M., Gordon, E., Leal, R. M. F., Popov, A., Gerring, M., McCarthy, A. A. & Gotz, A. (2012). Acta Cryst. D68, 975–984. Web of Science CrossRef CAS IUCr Journals Google Scholar
Brockhauser, S., White, K. I., McCarthy, A. A. & Ravelli, R. B. G. (2011). Acta Cryst. A67, 219–228. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chavas, L. M. G., Matsugaki, N., Yamada, Y., Hiraki, M., Igarashi, N., Suzuki, M. & Wakatsuki, S. (2012). J. Synchrotron Rad. 19, 450–454. Web of Science CrossRef CAS IUCr Journals Google Scholar
D'Arcy, A., Villard, F. & Marsh, M. (2007). Acta Cryst. D63, 550–554. Web of Science CrossRef CAS IUCr Journals Google Scholar
Dauter, Z. (1999). Acta Cryst. D55, 1703–1717. Web of Science CrossRef CAS IUCr Journals Google Scholar
Dauter, Z. (2010). Acta Cryst. D66, 389–392. Web of Science CrossRef CAS IUCr Journals Google Scholar
Dauter, Z., Jaskolski, M. & Wlodawer, A. (2010). J. Synchrotron Rad. 17, 433–444. Web of Science CrossRef IUCr Journals Google Scholar
Debreczeni, J. É., Bunkóczi, G., Ma, Q., Blaser, H. & Sheldrick, G. M. (2003). Acta Cryst. D59, 688–696. Web of Science CrossRef CAS IUCr Journals Google Scholar
Delagenière, S. et al. (2011). Bioinformatics, 27, 3186–3192. Web of Science PubMed Google Scholar
Derewenda, Z. S. (2010). Acta Cryst. D66, 604–615. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ditzel, L., Löwe, J., Stock, D., Stetter, K.-O., Huber, H., Huber, R. & Steinbacher, S. (1998). Cell, 93, 125–138. Web of Science CrossRef CAS PubMed Google Scholar
Evans, G., Axford, D. & Owen, R. L. (2011). Acta Cryst. D67, 261–270. Web of Science CrossRef CAS IUCr Journals Google Scholar
Evans, G., Axford, D., Waterman, D. & Owen, R. L. (2011). Crystallogr. Rev. 17, 105–142. Web of Science CrossRef Google Scholar
Evans, P. (2006). Acta Cryst. D62, 72–82. Web of Science CrossRef CAS IUCr Journals Google Scholar
Evans, P. & Murshudov, G. (2013). Acta Cryst. D69, 1204–1214. CrossRef IUCr Journals Google Scholar
Flot, D., Mairs, T., Giraud, T., Guijarro, M., Lesourd, M., Rey, V., van Brussel, D., Morawe, C., Borel, C., Hignette, O., Chavanne, J., Nurizzo, D., McSweeney, S. & Mitchell, E. (2010). J. Synchrotron Rad. 17, 107–118. Web of Science CrossRef CAS IUCr Journals Google Scholar
Fodje, M., Janzen, K., Berg, R., Black, G., Labiuk, S., Gorin, J. & Grochulski, P. (2012). J. Synchrotron Rad. 19, 274–280. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gabadinho, J. et al. (2010). J. Synchrotron Rad. 17, 700–707. Web of Science CrossRef CAS IUCr Journals Google Scholar
Garman, E. (1999). Acta Cryst. D55, 1641–1653. Web of Science CrossRef CAS IUCr Journals Google Scholar
Garman, E. F. & Weik, M. (2013). J. Synchrotron Rad. 20, 1–6. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gileadi, O., Knapp, S., Lee, W. H., Marsden, B. D., Müller, S., Niesen, F. H., Kavanagh, K. L., Ball, L. J., von Delft, F., Doyle, D. A., Oppermann, U. C. & Sundström, M. (2007). J. Struct. Funct. Genomics, 8, 107–119. CrossRef PubMed CAS Google Scholar
González, A., Moorhead, P., McPhillips, S. E., Song, J., Sharp, K., Taylor, J. R., Adams, P. D., Sauter, N. K. & Soltis, S. M. (2008). J. Appl. Cryst. 41, 176–184. Web of Science CrossRef IUCr Journals Google Scholar
Heras, B. & Martin, J. L. (2005). Acta Cryst. D61, 1173–1180. Web of Science CrossRef CAS IUCr Journals Google Scholar
Hilgart, M. C., Sanishvili, R., Ogata, C. M., Becker, M., Venugopalan, N., Stepanov, S., Makarov, O., Smith, J. L. & Fischetti, R. F. (2011). J. Synchrotron Rad. 18, 717–722. Web of Science CrossRef CAS IUCr Journals Google Scholar
Holton, J. M. (2009). J. Synchrotron Rad. 16, 133–142. Web of Science CrossRef CAS IUCr Journals Google Scholar
Holton, J. M. & Frankel, K. A. (2010). Acta Cryst. D66, 393–408. Web of Science CrossRef CAS IUCr Journals Google Scholar
Incardona, M.-F., Bourenkov, G. P., Levik, K., Pieritz, R. A., Popov, A. N. & Svensson, O. (2009). J. Synchrotron Rad. 16, 872–879. Web of Science CrossRef IUCr Journals Google Scholar
Karplus, P. A. & Diederichs, K. (2012). Science, 336, 1030–1033. Web of Science CrossRef CAS PubMed Google Scholar
Kendrew, J. C., Dickerson, R. E., Strandberg, B. E., Hart, R. G., Davies, D. R., Phillips, D. C. & Shore, V. C. (1960). Nature (London), 185, 422–427. CrossRef PubMed CAS Web of Science Google Scholar
Kmetko, J., Husseini, N. S., Naides, M., Kalinin, Y. & Thorne, R. E. (2006). Acta Cryst. D62, 1030–1038. Web of Science CrossRef CAS IUCr Journals Google Scholar
Krojer, T. & von Delft, F. (2011). J. Synchrotron Rad. 18, 387–397. Web of Science CrossRef CAS IUCr Journals Google Scholar
Leal, R. M. F., Bourenkov, G. P., Svensson, O., Spruce, D., Guijarro, M. & Popov, A. N. (2011). J. Synchrotron Rad. 18, 381–386. Web of Science CrossRef CAS IUCr Journals Google Scholar
Liu, Z.-J., Chen, L., Wu, D., Ding, W., Zhang, H., Zhou, W., Fu, Z.-Q. & Wang, B.-C. (2011). Acta Cryst. A67, 544–549. Web of Science CrossRef CAS IUCr Journals Google Scholar
Liu, Q., Dahmane, T., Zhang, Z., Assur, Z., Brasch, J., Shapiro, L., Mancia, F. & Hendrickson, W. A. (2012). Science, 336, 1033–1037. Web of Science CrossRef CAS PubMed Google Scholar
McCarthy, A. A., Brockhauser, S., Nurizzo, D., Theveneau, P., Mairs, T., Spruce, D., Guijarro, M., Lesourd, M., Ravelli, R. B. G. & McSweeney, S. (2009). J. Synchrotron Rad. 16, 803–812. Web of Science CrossRef IUCr Journals Google Scholar
McPherson, A. & Cudney, B. (2006). J. Struct. Biol. 156, 387–406. Web of Science CrossRef PubMed CAS Google Scholar
Morris, C. et al. (2011). Acta Cryst. D67, 249–260. Web of Science CrossRef CAS IUCr Journals Google Scholar
Mueller, M., Jenni, S. & Ban, N. (2007). Curr. Opin. Struct. Biol. 17, 572–579. Web of Science CrossRef PubMed CAS Google Scholar
Mueller, M., Wang, M. & Schulze-Briese, C. (2012). Acta Cryst. D68, 42–56. Web of Science CrossRef CAS IUCr Journals Google Scholar
Otwinowski, Z. & Minor, W. (1997). Methods Enzymol. 276, 307–326. CrossRef CAS Web of Science Google Scholar
Owen, R. L., Axford, D., Nettleship, J. E., Owens, R. J., Robinson, J. I., Morgan, A. W., Doré, A. S., Lebon, G., Tate, C. G., Fry, E. E., Ren, J., Stuart, D. I. & Evans, G. (2012). Acta Cryst. D68, 810–818. Web of Science CrossRef CAS IUCr Journals Google Scholar
Owen, R. L., Rudiño-Piñera, E. & Garman, E. F. (2006). Proc. Natl Acad. Sci. USA, 103, 4912–4917. Web of Science CrossRef PubMed CAS Google Scholar
Paithankar, K. S. & Garman, E. F. (2010). Acta Cryst. D66, 381–388. Web of Science CrossRef CAS IUCr Journals Google Scholar
Perrakis, A., Cipriani, F., Castagna, J.-C., Claustre, L., Burghammer, M., Riekel, C. & Cusack, S. (1999). Acta Cryst. D55, 1765–1770. Web of Science CrossRef CAS IUCr Journals Google Scholar
Pflugrath, J. W. (1999). Acta Cryst. D55, 1718–1725. Web of Science CrossRef CAS IUCr Journals Google Scholar
Pflugrath, J. W. (2004). Methods, 34, 415–423. Web of Science CrossRef PubMed CAS Google Scholar
Ravelli, R. B. G. & McSweeney, S. M. (2000). Structure, 8, 315–328. Web of Science CrossRef PubMed CAS Google Scholar
Rupp, B. (2010). Biomolecular Crystallography. New York: Garland Science. Google Scholar
Sanchez-Weatherby, J., Bowler, M. W., Huet, J., Gobbo, A., Felisaz, F., Lavault, B., Moya, R., Kadlec, J., Ravelli, R. B. G. & Cipriani, F. (2009). Acta Cryst. D65, 1237–1246. Web of Science CrossRef CAS IUCr Journals Google Scholar
Song, J., Mathew, D., Jacob, S. A., Corbett, L., Moorhead, P. & Soltis, S. M. (2007). J. Synchrotron Rad. 14, 191–195. Web of Science CrossRef CAS IUCr Journals Google Scholar
Vedadi, M. et al. (2006). Proc. Natl Acad. Sci. USA, 103, 15835–15840. Web of Science CrossRef PubMed CAS Google Scholar
Walter, T. S., Meier, C., Assenberg, R., Au, K. F., Ren, J., Verma, A., Nettleship, J. E., Owens, R. J., Stuart, D. I. & Grimes, J. M. (2006). Structure, 14, 1617–1622. Web of Science CrossRef PubMed CAS Google Scholar
Warkentin, M., Berejnov, V., Husseini, N. S. & Thorne, R. E. (2006). J. Appl. Cryst. 39, 805–811. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wasserman, S. R., Koss, J. W., Sojitra, S. T., Morisco, L. L. & Burley, S. K. (2012). Trends Pharmacol. Sci. 33, 261–267. Web of Science CrossRef CAS PubMed Google Scholar
Weik, M., Ravelli, R. B. G., Kryger, G., McSweeney, S., Raves, M. L., Harel, M., Gros, P., Silman, I., Kroon, J. & Sussman, J. L. (2000). Proc. Natl Acad. Sci. USA, 97, 623–628. Web of Science CrossRef PubMed CAS Google Scholar
Wernimont, A. & Edwards, A. (2009). PLoS One, 4, e5094 CrossRef PubMed Google Scholar
Winn, M. D. et al. (2011). Acta Cryst. D67, 235–242. Web of Science CrossRef CAS IUCr Journals Google Scholar
Winter, G. & McAuley, K. E. (2011). Methods, 55, 81–93. Web of Science CrossRef CAS PubMed Google Scholar
Zeldin, O. B., Gerstel, M. & Garman, E. F. (2013). J. Synchrotron Rad. 20, 49–57. Web of Science CrossRef CAS IUCr Journals Google Scholar
Zhang, Z., Sauter, N. K., van den Bedem, H., Snell, G. & Deacon, A. M. (2006). J. Appl. Cryst. 39, 112–119. Web of Science CrossRef CAS IUCr Journals Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.