 1. Introduction
 2. Data sets
 3. CSPAD detector metrology refinement
 4. Refinement engines and sparse matrices
 5. Advanced refinement: a second detector
 6. Ensemble refinement and crystal isomorphism
 7. Timedependent ensemble refinement of the entire experiment
 8. Merging and error models
 9. Discussion
 10. Data and software availability
 Supporting information
 References
 1. Introduction
 2. Data sets
 3. CSPAD detector metrology refinement
 4. Refinement engines and sparse matrices
 5. Advanced refinement: a second detector
 6. Ensemble refinement and crystal isomorphism
 7. Timedependent ensemble refinement of the entire experiment
 8. Merging and error models
 9. Discussion
 10. Data and software availability
 Supporting information
 References
research papers
Improving signal strength in serial crystallography with DIALS geometry refinement
^{a}Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, ^{b}STFC Rutherford Appleton Laboratory, Didcot OX11 0QX, England, ^{c}CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, England, ^{d}Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot OX11 0DE, England, and ^{e}MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
^{*}Correspondence email: nksauter@lbl.gov
The DIALS diffractionmodeling software package has been applied to serial crystallography data. Diffraction modeling is an exercise in determining the experimental parameters, such as incident beam wavelength, crystal and orientation, and detector geometry, that are most consistent with the observed positions of Bragg spots. These parameters can be refined by nonlinear leastsquares fitting. In previous work, it has been challenging to refine both the positions of the sensors (metrology) on multipanel imaging detectors such as the CSPAD and the orientations of all of the crystals studied. Since the optimal models for metrology and crystal orientation are interdependent, alternate cycles of panel and crystal have been required. To simplify the process, a sparse linear algebra technique for solving the normal equations was implemented, allowing the detector panels to be refined simultaneously against the diffraction from thousands of crystals with excellent computational performance. Separately, it is shown how to refine the metrology of a second CSPAD detector, positioned at a distance of 2.5 m from the crystal, used for recording lowangle reflections. With the ability to jointly refine the detector position against the ensemble of all crystals used for it is shown that ensemble greatly reduces the apparent nonisomorphism that is often observed in the unitcell distributions from stillshot serial crystallography. In addition, it is shown that batching the images by timestamp and rerefining the detector position can realistically model small, timedependent variations in detector position relative to the sample, and thereby improve the integrated structurefactor intensity signal and heavyatom anomalous peak heights.
Keywords: XFEL; metrology; DIALS; refinement; sparse algebra.
1. Introduction
Serial crystallographic methods are widening the scope of structural biology, allowing the examination of macromolecular structure with short radiation pulses that generate diffraction from samples nearly free of radiation damage. Roomtemperature experiments preserve the physiologically relevant dynamic motion of proteins that cryopreservation quenches and can span multiple time points along enzymatic pathways. Presentday developments began with the introduction of Xray freeelectron laser (XFEL) sources (Bergmann et al., 2017); however, the latest generation of synchrotronradiation sources have introduced pulse durations and focus sizes that confer some of the same benefits.
While groundbreaking in its promise, serial crystallography presents numerous technical challenges, including those involving data analysis. With short Xray pulses, ranging from microseconds at synchrotrons to femtoseconds at XFELs, crystalline specimens are essentially still for the duration of a single shot, after which the crystal is replaced. This contrasts with conventional methods, in which a single crystal is continually rotated on a goniometer. The dataprocessing workflows are broadly similar for both methods, consisting of the location of Bragg spots by a spotfinding algorithm, the determination of the
by an indexing procedure, the integration of diffraction intensities at predicted Bragg spot positions, and finally the scaling and merging of repeated Bragg measurements. Also, in both cases the approach involves inverse modeling, in which a computer representation of the experiment is used to predict properties of the diffraction image, including the Bragg spot positions, after which the parameters of the model are iteratively adjusted to best match the observed images.However, while the treatment of rotation shots has been well established for several decades, experimental innovations in serial crystallography have required new models. XFEL facilities, in particular, introduced pixelarray detectors that are uniquely designed to integrate Xray signals over periods of femtoseconds. These carry the performance tradeoff of being constructed as multipanel units, with the added issue that the geometrical relationship between the individual panels (the `metrology') must be included in the computational model. Our group (Hattne et al., 2014) and others (Yefanov et al., 2015; Ginn & Stuart, 2017) have shown that the positions and orientations of individual detector panels, which are poorly determined initially, can be determined to subpixel accuracy by iterative nonlinear leastsquares which minimizes the residual difference between observed and predicted Bragg spot positions.
Despite the success of the Ha14 (Hattne et al., 2014) metrology code included with our program cctbx.xfel, it has been understood that the implementation would eventually need to be redesigned for several reasons. Ha14 embeds the image data from all detector panels into a single squareshaped data array that represents the whole detector, with panels in their approximate geometric positions, against a backdrop of pixels that are set to a special value to signify the inactive areas between panels. Superficially, this is open to the criticism that the inactive areas waste memory and disk space. However, the disadvantage has proven to be more severe owing to the necessity of encoding the number of panels and their dimensions, as well as the need for special code to ignore the inactive pixels for image processing. As a result, while the code supports the 64panel CSPAD (Fig. 1; Hart et al., 2012) installed at the Linac Coherent Light Source (LCLS) at the time of the Ha14 publication, it cannot be easily adapted for use with subsequent detector models, including the eightpanel MPCCD detector at SACLA (Kameshima et al., 2014) and the 128panel AGIPD detector at the European XFEL (Henrich et al., 2011). Even the temporary loss of a single sensor on the CSPAD detector requires the reduced sensor complement to be hardcoded.
Several additional considerations led us to abandon the singlearray approach to data representation. Firstly, the Ha14 design unnecessarily conflates the concepts of measurement and model. For example, if we determine after data collection that our model should move one of the sensors two pixels to the right, a new copy of the data array has to be created to reflect the updated sensor position. Furthermore, the singlearray approach does not allow the possibility that the distances between sensors can assume fractional pixel values or that the sensors might be slightly rotated with respect to each other. Thus, the Ha14 code is forced to maintain a separate data structure that encodes corrections to the unitpixel metrology. A better software design, adopted here, is to maintain two data structures, one that simply contains the original detectorpanel measurements in their unaltered forms (as a list of rectangular sensor arrays of pixels) and another that represents the complete vector description of each panel, including the origin vector d_{0} that locates the panel in relation to the crystal and two vectors d_{x} and d_{y} that define the fast and slow readout directions (Parkhurst et al., 2014). This approach also removes the undesirable requirements in Ha14 that all detector panels are coplanar and that the plane of the detector is normal to the beam.
There is also good reason to reorganize our geometric description of a multipanel detector (referred to below as simply the `detector model') with a hierarchical design that mirrors the physical construction of the device (Brewster et al., 2014). For the CSPAD detector in particular (Fig. 1), we assign four levels of organization, with the overall detector composed of four separately constructed quadrants, each of which in turn is composed of eight siliconwafer sensors. The silicon sensors are bumpbonded to two 194 × 185 pixel ASIC (applicationspecific integrated circuit) arrays (Hart et al., 2012). Model elements at each level contain d_{0} vectors expressed relative to the next highest level (Fig. 1). The LCLS facility has the capability of determining the sensor positions within each quadrant at the time of assembly to pixellevel accuracy, using optical microscopy methods. Expressing our detector model as a hierarchy allows us to insert the LCLS quadrantlevel calibrations at the appropriate level, to be used as starting values for the leastsquares with the main uncertainties therefore being the interrelationship of the four quadrants and the position of the detector as a whole. Additionally for the CSPAD, defining explicit readout directions (d_{x} and d_{y}) accounts correctly for the pinwheel construction of the device, where quadrants have an approximately 90° relation to each other, rotated around a common origin. Within each quadrant, groups of two sensors also have an approximate 90° relationship (Fig. 1). Therefore, in contrast to the monolithic array of Ha14, the present design implements separate data arrays corresponding to each sensor, which, while having a common layout in memory, represent four distinct orientations in space.
To implement our replacement for Ha14, we adopted the DIALS software framework (Diffraction Integration for Advanced Light Sources; Winter et al., 2018) that has previously been used for inverse modeling of synchrotronbased rotation crystallography experiments (Waterman et al., 2016). One relevant difference is that while rotation experiments generally treat one crystal at a time, the of multipanel detector geometry required us to combine Bragg positional data from thousands of crystals. The parameterfitting problem is thus highly interdependent, with all detectorpanel positions feeding into the of the orientation and unitcell parameters of each crystal, while simultaneously each crystal model determines the positions of all of the detector panels. Standard methods in iterative leastsquares parameter such as the Levenberg–Marquardt algorithm (§4), involve the construction of a set of linear equations with as many unknowns n as free parameters; therefore, an n × n normal matrix must be decomposed (Bevington & Robinson, 2003). Naively expressed, this is a very large matrix; for example, 32 sensor tiles with xy translations and one rotation each, plus 3000 hexagonal crystals with three orientation angles plus a and c parameters, would produce a total of n = 15 096. As a short cut, the work presented in Ha14 employed alternating cycles of alternating between the detector panels and the individual crystal models, such that the full matrix is never constructed. However, for the work presented below, we wished as a general principle to minimize the construction of arbitrary pathways (such as detector panels first then crystal models) and to rely whenever possible on the global of all free parameters. To this end, we exploited the fact that many of the parameters are independent (for example, all of the crossterms involving two distinct detector panels or two distinct crystals contribute zerovalued coefficients to the normal equations). Since the sparsely dependent structure of the normal equations is known ahead of time, we show (§4) how sparse linear algebra techniques can be employed to substantially reduce the computational resources needed to solve the problem.
Also, we show below how the DIALS framework can be adapted to describe serial crystallography experiments involving two imaging detectors at different crystaltodetector distances (§5) and how the simultaneous of detector and crystal models improves the accuracy of poorly measured unitcell axis lengths for unitcell axes that are oriented nearly parallel to the Xray beam (§6). Finally, considering recent reports from other groups describing how small changes in the crystaltodetector distance can affect experimental results (Nass et al., 2016), we develop a procedure to discover small timedependent changes in the distance, thereby improving the integrated Bragg spot signal (§7).
2. Data sets
We reprocessed thermolysin diffraction patterns collected at the CXI endstation at LCLS (Table 1). Sample preparation and injection, beamline parameters and datacollection methods are described in Kern et al. (2014). 760 110 shots were collected over 107 min, with an incident photon energy set point of about 9.75 keV. The front CSPAD was positioned at either 130 or 105 mm from the sampleinsertion position, while the back detector was positioned ∼2.5 m from the sampleinsertion position, so that lowangle diffraction transmitted through the central aperture of the front CSPAD was collected on the back detector. Data collection occurs in `runs', which represent continuous time intervals (typically 5–10 min each) during which experimental parameters are held constant. The runs listed in Table 1 are grouped by front detector distance.

We also reprocessed Cry3A toxin data collected about a week later on the same CSPAD detector at CXI. 380 740 shots were collected over 53 min at an incident photon energy set point of 8.5 keV. Sample preparation and injection, beamline parameters and datacollection methods are described in Sawaya et al. (2014).
3. CSPAD detector metrology refinement
We refined the CSPAD detector metrology using customwritten code for serial crystallography incorporated into dials.refine. §3.1 describes the hierarchical organization of the CSPAD and §3.2 describes the automatic determination of initial quadrant locations using powder patterns. Given this initial alignment, we can index the data (§3.3), perform joint on the detector and crystal models (§§3.4 and 3.5) and assess the accuracy of the results (§3.6).
3.1. CSPAD hierarchy
Our detector model represents the panels of the CSPAD in a fourlevel hierarchy (Fig. 1): detector, quadrant, sensor and ASIC. Switching the local frame of reference between levels involves a coordinate transformation F_{parent→child}, defined as a change of basis from a parent coordinate system to a child coordinate system or back again (i.e. F^{−1}_{parent→child} = F_{child→parent}). The transformation F can be expressed with an origin vector that translates from the origin of the parent frame to the origin of the child frame and a unitary matrix describing a rotation. The first frame shift, F_{lab→d}, moves from the laboratory origin (i.e. the crystal location) to the center of the detector as a whole. Next, we describe four detectortoquadrant frame shifts, F_{d→q0} through F_{d→q3}. There are then 32 quadranttosensor frame shifts, F_{qi→s0} through F_{qi→s7}, where i ranges from 0 to 3. Finally, since all pairs of ASICs are constrained to have the same threepixel gap between them, there are exactly two sensortoASIC frame shifts, F_{s→a0} and F_{s→a1}, applied to all ASICs.
The full transformation of position p from the laboratory frame to the ASIC frame would be expressed as
For convenience, we express F_{lab→d} as a homogenous transformation matrix consisting of the components of the rotation matrix (d_{x}, d_{y}, d_{n}) (where d_{n} is the normal vector, d_{x} × d_{y}), and the translation vector d_{0}:
The other frame shifts are expressed in the same form but are generated from different sets of d_{0}, d_{x} and d_{y} vectors. The 4 × 4 homogenous transformation matrix allows both rotation and translation to be expressed by single matrix multiplication. The global, cumulative d_{0}, d_{x} and d_{y} vectors used to compute pixel positions (Parkhurst et al., 2014) can then be easily derived from (1) given a cumulative frame shift F:
In (3), after multiplying F by a fourelement vector, we drop the last element in the resultant homogenous vector to construct the d vectors. The location of a pixel in the plane of the ASIC chip is determined using a pixeltomillimetre conversion function that takes into account pixel size (including rectangular pixels and optionally parallax effects; Parkhurst et al., 2014; Waterman et al., 2016).^{1}
The reverse operation (determining p_{lab} given p_{a}) can be performed using reversed and inverted matrix multiplications from (1):
These transformations group the 64 ASICs into hierarchical sets. With this organization, it is possible to refine the detector as a whole without modifying the frames of child components or to refine a quadrant as a whole without modifying the frames of its parent or children, and so on. Detailed specifications as to how these transformations are recorded have been presented previously (Brewster et al., 2014).
3.2. Automatic CSPAD quadrant alignment using rotational autocorrelation
Within each CSPAD quadrant, the initial panel positions are determined at the LCLS facility using an optical microscope. Therefore, when the CSPAD is assembled and installed, the positions of the quadrants relative to each other and to the beam are unknown.
We developed an automated approach for deriving quadrant positions; more specifically, we calculate the xy positions (in the detector plane) of the four sensors closest to the direct beam (Fig. 1, shaded in pink). Firstly, we generate a `composite maximum' image, taking the maximum pixel values among all images in a dataset run. Overlaying all of the Bragg spots from a run in this manner generates a virtual powder pattern, since the individual crystals have random orientations. If a quadrant is properly placed, after rotating a strong powder pattern 45° around the beam center, the overlapping pixel values will be highly correlated. We can therefore perform a grid search over xy offsets for each quadrant separately (limiting our examination to the sensor closest to the beam center), searching for the position with the highest rotational autocorrelation coefficient (CC). This produces a heat map (Fig. 2) where each pixel [for example (3, 4)] represents the CC when the quadrant is translated by that amount (three pixels in x, four pixels in y). The coordinates of the heatmap maximum give the best positional correction for that quadrant (Table 2).

We tested three virtual powder patterns to estimate the quadrant positions for the front CSPAD detector using the thermolysin data (Fig. 2a). The first is derived from a weak run (run 22) with few hits, leading to thin powder rings. The second is derived from a strong run (run 14) with many hits. The third is a composite of many runs collected at a single detector distance (runs 13–22). In all three patterns, the discontinuities in the powder rings indicate that the quadrants are not aligned.
Fig. 2(b) shows the rotational autocorrelation heat maps for these three virtual powder patterns. For the weakest, run 22, the maximum of the heat map is unclear. It is better resolved for run 14 and is strongest for the composite pattern (runs 13–22). Alternating bands of low and high correlation occur owing to the alternating strong and weak bands in the powder pattern. When, after translating the quadrant, bands in the 45° rotation overlap with similarly bright bands in the unrotated pattern, the CC is higher. Likewise, the translation can cause bands in the rotated pattern to systematically overlap gaps in the unrotated pattern and produce a low CC.
For weaker patterns or sparser data, it can be useful to try many rotations. We repeated the rotation test for each of the virtual powder patterns at different angles from 20 to 70° in 2.5° increments. For each xy offset, we selected the maximum CC observed from all of the angles tested to generate a new heat map. This eliminates the `beat pattern' of local maxima in singleangle heat maps and in general produces smoother heat maps with clear global maxima (Fig. 2c). For weak data (for example run 22), however, the global maximum may still be very far off from the best quadrant position. In such cases, it is prudent to crosscheck by manual examination of virtual powder patterns overlaid with circles emanating from the beam center.
Once the quadrant positions have been optimized, it is often possible to estimate the sampletodetector distance to within ∼1 mm (data not shown) by overlaying the corrected composite image with predicted rings calculated from the known unitcell parameters of the protein producing the virtual powder pattern.
3.3. Initial indexing
After et al., 2014). We then indexed all of the images, determining initial basis vectors using the onedimensional Fourier method (Steller et al., 1997), while guiding the selection of candidate basis vectors using a target (a = b = 92.9, c = 130.4 Å, α = β = 90, γ = 120°; Hattne et al., 2014). For each image, we independently refined the crystal orientation matrix and unitcell parameters, while holding the wavelength and detector position constant. This produced 118 318 indexed lattices from both detector distances (∼130 and 105 mm). In subsequent sections, we refine the detector metrology using a subset of these lattices from initial indexing (§§3.5 and 3.6) and then use the refined metrology to reindex the data, producing successfully indexed patterns that previously did not index (§3.7).
of the quadrant positions by rotational autocorrelation, we performed initial indexing of the thermolysin data on the front CSPAD to generate a data set from which we could refine the complete detector metrology. We determined a starting detector distance by performing rounds of indexing at increasing distances and choosing the distance producing the highest number of indexed images, as described previously (Hattne3.4. target function
Computational models for stillshot experiments were incorporated into the parameterrefinement framework (Waterman et al., 2016) within DIALS. Bestfit models for the detector, crystal and beam were determined by minimizing the nonlinear leastsquares target function
where the index i traverses all m Bragg spot measurements in the entire data set, x and y refer to the fast and slow coordinates of the Bragg spot centers of mass on their respective detector panels, the subscript `obs' refers to the observed position and the subscript `calc' refers to the position predicted by the computational model. The quantity ψ_{calc} is the smallest crystal rotation angle required to place the reciprocallattice point exactly in the reflecting condition described by Bragg's law. As described previously (Sauter et al., 2014), it is necessary to include this restraint to prevent the crystal orientation model from rotating around axes that are perpendicular to the beam vector, as these rotations do not directly change the Bragg spot positions. The weighting scheme (w_{i,x}, w_{i,y} and w_{i,ψ} for the ith observation) uses a for w_{i,x} and w_{i,y} equal to the inverse variance of the observed spot position and a constant weight for the ψ_{calc} angle. The default w_{i,ψ} value of 10^{6} generally puts the ψ_{calc} term on the same scale as the x and y terms.
3.5. of the detector model
To determine the correct positions of the CSPAD detector panels to subpixel accuracy, we conducted iterative nonlinear leastsquares parameter optimizations designed to jointly refine the detector geometry and crystal models. We used Bragg spot positions measured on thermolysin diffraction images selected from the 130 mm run group (Table 1) and limited our to the 3000 images with the most reflections indexed to the corners of the detector. In addition to the geometric of the detector, explained below, we treated the two hexagonal unitcell lengths and the three crystal orientation angles as free parameters refined independently for each shot. The beam direction is considered to be static, and since each Xray pulse has slightly different mean energy, measured by the beamline instrumentation, this measured energy is used here without refinement.
Two different protocols were developed, `hierarchical mode' (Table 3) and `expanding mode' (Table 4), consisting of sequences of either three or nine optimizations, respectively, with each step in the sequence including a wider list of free geometric parameters describing the detector. The general motivation for this was to avoid trapping the detector geometry in a local minimum of (5) and instead to refine the most reliable parameters first. In particular, the `expanding mode' protocol refines the positions of the four sensors (one per quadrant) closest to the direct beam, before sequentially adding groups of sensors at larger diffraction angles, as illustrated in Fig. 3.

The refinable parameters for a detector panel or group of panels include distance (the translation along d_{n}), Shift1 and Shift2 (the translations along d_{x} and d_{y}) and τ_{1}, τ_{2} and τ_{3} (the rotations around d_{n}, d_{x} and d_{y}, respectively). Tables 3 and 4, which list the details of the two protocols, summarize which geometric parameters are refined as the optimizations progress from the entire detector to each quadrant separately and finally to individual sensors. Each row represents a separate operation of up to 3000 crystal models and one detector model. The output models from each row are used as inputs for the next. Since the crystal rotation around the beam axis is directly correlated with the detector rotation around this axis, we fix τ_{1}. This is different from rotation crystallography as rotation around the goniometer breaks the degeneracy between the detector and crystal rotations around the beam axis. However, we do refine τ_{1} at the levels of the individual quadrants and sensors. Furthermore, we fixed the detector τ_{2} and τ_{3}. This is not strictly necessary, as DIALS is capable of refining three translations and rotations for all detector elements. of the tilt is routinely performed for synchrotron data. However, for this particular experiment we found that of the detector τ_{2} and τ_{3} produced little difference in the results (data not shown). For the quadrants and sensors we fixed τ_{2} and τ_{3} as well as distance offsets, so as to constrain all detector panels to be coplanar, because we considered the of independent panel tilts and distances to be beyond the scope of this study.
As detailed in Tables 3 and 4, our sequence of optimizations begins at the level of refining the overall detector distance and xy shift and ends at the level of individual sensor xy shifts and τ_{1} rotations: we do not refine the relative positions of the 2 × 1 pairs of ASICs independently. Doing so would have no physical meaning, as each pair of ASICs is bonded to a single chip, oriented by lithography and uniformly three pixels apart from one another, and any deviations from this are understood to be the result of absorbing error elsewhere in the model. Finally, when refining the τ_{1} angles at the level of individual quadrants or sensors, we are careful to lock down one of them (the first one in each group, τ_{1group1}), since only N − 1 of the angles are independent, where N is the number of detector panels to be refined. In other words, in Table 3 one of the four quadrants in row 2 is fixed and one of the 32 sensors in row 3 is fixed.
After the entire optimization protocol has ended, we repeat the entire cycle up to four times to assess the ability to converge to a stationary solution (Fig. 3a and Tables 5 and 6). We defined convergence as (i) the rootmeansquared differences (r.m.s.d.s) between observed and calculated spot positions no longer decreasing and (ii) the detector positions no longer shifting appreciably. Before each subsequent cycle, the 3000 images were reindexed and crystals with a poor r.m.s.d. were discarded as outliers (Brewster et al., 2016). For this reason, subsequent cycles have fewer than 3000 crystals contributing to the joint (Table 5).


3.6. accuracy and precision
The greatest improvement in r.m.s.d. is found after cycle 1 (Fig. 3a and Table 5, expanding protocol), but reindexing the images and rerefining gives an additional improvement (cycle 2). It is expected that the metrology should converge rapidly, and we see that subsequent cycles do not appreciably improve the spot predictions, so we regard cycles 3 and 4 as controls.
Again, each cycle (1–4) consists of indexing and ) reduces this data set to ∼580 000 reflections. After reindexing and rerefinement (which again includes outlier rejection), cycle 2 produces a data set with only ∼330 000 indexed reflections. In this work, we do not investigate the cause of this falloff; however, we note that it is associated with a radial streaking of reflections at high resolution (Hattne et al., 2014) producing poorly measured spot centroids for reflections with a high Δψ angle. moves the tiles such that these reflections are no longer close enough to their predictions to be indexed with our monochromatic beam model, since when the centroids from the elongated reflection are moved to their noninteger are no longer close enough to a round number (the default cutoff is 0.3 of a whole integer). Because of this, and because r.m.s.d. is sensitive to sample size, we additionally computed an r.m.s.d. using only those reflections indexed in all cycles and both modes. This `common set' of reflections, as determined by observed spot centroids in pixels (an invariant property of the reflections), consists of 218 954 reflections. Determining the r.m.s.d. on only this set ensures that the cycles and modes are comparable. We can see that most of the improvement occurs during a single round of indexing and (cycle 1, 157.9 to 60.1 µm in the expanding case, or roughly half a pixel for these 110 µm CSPAD pixels). An additional round of reindexing and (cycle 2) improves the expanding case by a small amount to 50.1 µm. Cycles 3 and 4 do not appreciably change the r.m.s.d. or tile positions (Table 6). Note that the average change in sensor position (Δxy) is 9.8, 11.1 and 9.0 µm in cycles 2, 3 and 4, respectively, while the overall detector and quadrant shifts are much lower (<2 µm). This suggests that reindexing and reoptimization impart random xy shifts at the sensor level while not appreciably improving the accuracy, thus giving a rough estimate for the precision of about 10 µm. This is similar to the precision that we previously reported for the Ha14 method.
The initial data set consists of ∼700 000 reflections across 3000 images with an r.m.s.d. of 221 µm. During outlier rejection (Sauter & Poon, 2010An interesting trend can be seen in cycles 3 and 4, in which minimal improvement in r.m.s.d. after
is seen. Hierarchy level 0, the whole detector, is refined using only the innermost reflections. During cycles 3 and 4 the detector as a whole shifts away from the crystal by nearly 100 µm. Then, after all the sensors have been added while refining at hierarchy level 2, the detector distance shifts back to where it started. This indicates a small, resolutiondependent misprediction of reflections.3.7. Reindexing using refined metrology
With cycle 4 of the expanding a = b = 93.28, c = 130.81 Å). This produced 119 774 primary lattices (an improvement of nearly 1500 lattices from the unrefined metrology) and 46 476 secondary lattices, giving a total of 166 250 lattices.
showing the best r.m.s.d.s, we repeated the indexing and integration of all shots with this metrology, including searching for a second lattice from a possible second crystal on each image. We also took into account the small change in detector distance from the final refined model (a decrease of about 36 µm), and we used as a target the mean of the refined crystal models (4. engines and sparse matrices
Parameteroptimization engines require a target function to be minimized, a set of parameters and a set of observations. A given engine will modify the parameters in `steps' and accept the incremental change if the target function decreases. The direction of the step is generally determined by a gradient vector consisting of the first derivative ∂L/∂p of the target function L with respect to each parameter p, which indicates how each parameter needs to change to lower the target function. The parameters that have the most influence on the target function can be determined using curvatures, the second derivatives ∂^{2}L/∂p^{2} of the target with respect to each parameter, which determine (in an inverse sense) the step size for each parameter. The engine continues to take steps until some convergence criteria are reached. The overall scale of our optimization problem is uncommonly large, with tens or hundreds of thousands of free parameters (§1), and as such is not adequately treated in recent macromolecular crystallography diffraction modeling literature. We therefore survey the available methods briefly, investigating three potential approaches in common crystallographic use, before finally choosing a fourth method that is adopted from the sparsealgebra community.
Firstly, the limitedmemory Broyden–Fletcher–Goldfarb–Shanno (LBFGS) algorithm is a quasiNewton method that uses a lowmemory approximation to the secondderivative matrix (the Hessian matrix) to compute a step size between iterations, and is thus suitable for optimization problems with large number of parameters (Liu & Nocedal, 1989). It does not rely on analytical second derivatives that can be difficult to derive and thus present a barrier to programming. However, with a large number of parameters LBFGS has a poor convergence rate, as hundreds to thousands of steps may be necessary to find a minimum.
Construction of the approximate Hessian may be seeded by providing a vector of curvatures representing the diagonal elements of the Hessian matrix. Providing the curvatures can dramatically improve the performance of LBFGS, and thus constitutes our second optimization method. As implemented in DIALS, the second partial derivatives are not analytically calculated.^{2} However, we can use the assumption that the target function L is of leastsquares form to calculate an approximate value for the second derivative,
This is the approximation used in the Gauss–Newton algorithm as a modification of Newton's method (see equation 10.24 in §10.3 of Nocedal & Wright, 2006). The approximation has the pleasing property of improving as the approaches convergence. In many instances, we find that supplementing the LBFGS algorithm with approximate curvatures is essential for reducing the number of steps before convergence to an acceptable level.
LBFGS, even with curvatures, can still take too many steps for our joint et al., 2016) that a third algorithm, based on Gauss–Newton methods, but modified by the Levenberg–Marquardt (LevMar) approach to remain robust in the presence of covariance, would be ideal since it explicitly takes account of the nonlinear leastsquares form of the target function and as a result requires far fewer steps, while avoiding any need for second derivatives. Briefly, the Gauss–Newton method derives the vector δ of increments to the current parameter estimates by utilizing the Jacobian matrix J, defined as the matrix of partial derivatives of each residual term r with respect to each parameter. In this matrix, row i represents the derivative of the ith residual. Note that there are m observations but 3m residuals, since (5) gives three residuals (x_{calc} − x_{obs}, y_{calc} − y_{obs} and ψ_{calc}) for each measurement. Furthermore, column j represents the derivative with respect to the jth freely refined parameter, with n total parameters:
of the detector and crystals. We previously noted (WatermanMatrix J is used to construct a set of normal equations Aδ = b, where A = J^{T}J and is a symmetric matrix of size n × n, b = −J^{T}r and r is the vector of all residuals (Nocedal & Wright, 2006). LevMar modifies A, adding a damping factor that affects the step size (Bevington & Robinson, 2003; Nocedal & Wright, 2006). Considering the large matrix sizes here, it is prohibitively expensive to explicitly compute the inverse matrix A^{−1} in order to solve for δ. However, it is possible to perform a Cholesky decomposition, expressing the matrix as a product of a lower triangular matrix L and its transpose, A = L^{T}L, and then to derive δ by backsubstitution. While this algorithm offers the best convergence behavior for optimization sizes typical of rotationbased crystallography (Waterman et al., 2016), the performance of the Cholesky decomposition limits problem sizes to n < 5000 (Fig. 4a).
To overcome this performance limitation, we took advantage of the fact that both the Jacobian and normal matrices for our problem are sparse, meaning we have prior knowledge that most of the elements are zero. Elements of the normal matrix are (structurally) zero when they represent crossterms between independent free parameters. For example, the unitcell parameters and orientation angles for a given crystal are independent of the parameters describing all of the other crystals. Only the crossterms relating detectorpanel parameters and crystals are in general nonzero. Knowing which elements of A are zero leads us to deduce, using graph theory, which elements of the Cholesky factor are also structural zeroes (Liu, 1990; Mehta & Sahni, 2004; Rennich et al., 2014), thus allowing a dramatically reduced level of computational effort to calculate the nonzero elements of the Cholesky factor. Sparsematrix Cholesky decomposition methods, which are well known in mathematics but not in crystallography, therefore afford us our fourth algorithmic approach, and the only one that is suitable for a 3000crystal problem. We incorporated the opensource Eigen linearalgebra library (Guennebaud & Jacob, 2010) into our software distribution for this application.
To evaluate the performance and memory requirements of the four engines, we simultaneously refined between 50 and 5000 random images from a single run jointly with the 32 sensor positions (Fig. 4). For each problem size and each engine, we performed ten independent trials to obtain an average run time, accounting for local variation in the computing environment. Time trials were run singleprocess on a 12core, 64bit Intel Xeon X5675 processor (3.07 GHz) with a 12 MB cache and 24 GB RAM running Red Hat Enterprise Linux Server 7.3. C++ code was compiled under GCC 4.8.5. The initial detector model had quadrants aligned using rotational autocorrelation from a powder pattern, but was otherwise unrefined, meaning that this was equivalent to the third step of Table 3, skipping the first two steps. Starting with the largest data set, each smaller data set was a random subset of the next largest. We disabled outlier rejection in order to focus on the performance of the steps themselves. We averaged the ten run times, but since the inputs were the same the r.m.s.d.s, number of steps until convergence and memory usages (Figs. 4b, 4c and 4d) were constant within each group of ten trials. Sparsematrix LevMar had the shortest run time of the four (Fig. 4a), while the LBFGS implementations were markedly slower owing to the number of steps that were needed for convergence. Once the number of parameters exceeded 5000, LevMar became completely unacceptable in its run time. Indeed, the final data points using 3000 and 5000 images were terminated after 48 h in the queue at LCLS. Of the four, LBFGS without curvatures took the most steps to converge (Fig. 4b) and had slightly poorer r.m.s.d. values than the other three (Fig. 4c; see below).
Fig. 4(d) shows the memory savings achieved using sparsematrix LevMar compared with LevMar. The normal matrix size (the square of the number of parameters n) is shown on a log scale. Likewise, the number of nonzero values in the normal matrix and the number of nonzero values in the Cholesky factor are shown (these nearly overlap). The size of the normal matrix increases on the order of k^{1.89}, where k is the number of images, while the number of nonzero values increases on the order of k^{0.95} to k^{1.01} (Table 7). The savings in memory by using sparse matrices is concurrent with a decrease in run time as a function of the number of images, from order k^{2.45} for LevMar to k^{1.13} for sparsematrix LevMar.

To clearly show the convergence behavior of LBFGS, we removed the termination condition where e). LBFGS with curvatures still terminated early owing to roundoff checks in the LBFGS minimizer, but the same final r.m.s.d. was reached by both LBFGS engines. LBFGS took over 6000 steps to reach the r.m.s.d. that LBFGS with curvatures reached in less than 500 steps.
stops if the r.m.s.d.s cease changing within a certain threshold and ran the refinements for 10 000 steps (Fig. 45. Advanced a second detector
The DIALS platform is highly flexible and configurable, being capable of refining experimental geometry from many crystals and detectors, even if the diffraction pattern is spread across more than one detector simultaneously. As an example, we refined the metrology of the back CSPAD using thermolysin patterns (Table 1) recorded on both the front and back detectors. The front detector, located either 130 or 105 mm from the crystal position, has a central aperture that, while designed to transmit the nondiffracting beam, also permits lowangle Bragg reflections to be recorded on the back detector approximately 2.5 m from the sample (Fig. 5a, top). A detector in this position is not typically used in XFEL experiments, but we can cite two possible roles. Firstly, one can record the fine detail of lowresolution reflections. As the large crystaltodetector distance spreads the reflections out over many pixels, we can potentially analyse the spot shape and mosaic character of the crystals. Secondly, we used the back CSPAD in Duyvesteyn et al. (2018) to examine bacteriophage phiX174, a crystalline viral condensate with a large (∼500 Å) that diffracted to poor resolution (∼50 Å), to the point where diffraction was only seen on the back detector. An analysis of the Bragg spots revealed the and approximate unitcell dimensions, given a histogram of the reciprocalspace reflection distances. Both these use cases can benefit from an accurate detector metrology; therefore, with thermolysin as a standard, we used crystal orientations from lattices recorded on the front detector to index reflections on the back detector from the same crystal and then refined the back detector panels using these indexed reflections.
To perform this, we manually positioned the quadrants to bestfit powder patterns recorded on the back detector, as the diffraction was too sparse to obtain a good fit by rotational autocorrelation. We then performed spotfinding on the back detector images for all Xray events in runs 11–22 that had been successfully indexed on the front detector, and then indexed these back detector reflections using the crystal models derived from the front detector lattices. This yielded 12 313 images where it was possible to index at least one reflection on the back detector. We then performed perimage r.m.s.d. filtering, removing images with an r.m.s.d. over 1.5 times the interquartile range (Tukey's rule of thumb; Tukey, 1977), leaving 9893 images. Perimage r.m.s.d. filtering mainly removed images with false spots found along streaks produced by jet diffraction, which were visible as a long spike on a varying radial axis.
Next, we refined the back detector geometry against this data set using the hierarchical protocol. We fixed the crystal models, which had been refined against front detector data, as we only had a few reflections per image on the back detector. Table 8 shows the r.m.s.d., which improves with each successive hierarchical level of metrology Focusing on the common set only, the r.m.s.d. decreased from 740.5 to 361.3 µm during metrology or slightly over three pixels. Owing to the large sampletodetector distance, these reflections cover many more pixels on the back detector (the mean number of pixels per reflection on the front detector for these 9893 images is 3.5, but on the back detector it is 22.8), yielding an r.m.s.d. on the same fractional order of accuracy as for the front detector (about half a pixel).

Fig. 5(b) shows the composite maximum of the images collected from runs 11–22 after metrology indicating the of each powder ring. The average signal from these reflections is well correlated with the calculated intensities from PDB entry 4ow3 from Hattne et al. (2014) (Fig. 5c). This indicates that the refined metrology would be sufficiently accurate to integrate data from these images. Notably, a 001 ring is present, even though 001 is systematically absent in P6_{1}22. We speculate that this arises from surface effects in small microcrystals with a high surface area to volume ratio. In any case there is some type of disorder that breaks the perfect 6_{1} screw symmetry of the crystal.
6. Ensemble and crystal isomorphism
Having optimized the detector models (§3), we now focus on the improvement of the crystal model, which consists of the crystal orientation and the unitcell parameters. It has been widely observed in serial crystallography that within the ensemble of crystals merged together into a single data set, the unitcell parameters exhibit an unusually broad distribution, far beyond that experienced in singlecrystal work. Fig. 6(a), illustrates, for example, that the a axis of Cry3a in C222_{1} apparently varies from 116 to 120 Å (blue trace). This result was calculated by evaluating 1000 Cry3a patterns from run 4. Beginning with the previously refined front detector model determined from our thermolysin data, we indexed each Cry3a and refined the crystal parameters (unit cell and orientation) using traditional nonlinear leastsquares so as to best fit the reflections on each individual image. After 33 images with high r.m.s.d. were removed, giving a final data set consisting of 967 patterns.
It is critical to understand whether the apparent spread in unitcell parameters represents true physical variation or simply an inability to measure the cell accurately. Indeed, if crystal cell lengths truly vary on the order of 3%, for example owing to differing hydration conditions (Russi et al., 2011), it would challenge our ability to merge the diffraction patterns, since the structurefactor intensities from such a plastic ensemble would have too much variation to be usefully merged (Crick & Magdoff, 1956). We hypothesized instead that our widely distributed unitcell measurements are a consequence of collecting data in still shots. In contrast to goniometerbased rotation experiments, where the is well sampled in all directions, still shots only sample to a limited depth along the direction of the incident beam. Therefore, if the a axis of an orthorhombic crystal is oriented along the beam, the a parameter should have a greater uncertainty than the b or c parameters. We chose an orthorhombic crystal form (Cry3a) to clearly test this supposition. Fig. 6(b) supports the hypothesis, showing that measurements of cell axes aligned with the beam have a much greater variation than cell axes oriented orthogonal to the beam (blue distributions).
We next asked whether it was possible to correct the biased distribution of unit cells with either of two protocols. In the first protocol, which did not provide a solution, we rerefined the unitcell and orientation parameters, and also allowed the detector position (sampletodetector distance and xy translation in the detector plane) to vary independently for each image. The resulting distributions are plotted in green (Fig. 6 and Supplementary Table S1). The unitcell lengths were generally shorter, suggesting that the refined detector distance provides a better model (the sampletodetector distance decreased from 111.00 mm to an average of 109.70 ± 0.64 mm). However, the artificially wide variation in unitcell lengths did not improve; rather, it worsened slightly for cell axes oriented orthogonally to the beam. Unfortunately, we believe that this is the method that has been used historically for most serial crystallography work to date; at least, it is the approach taken by our program cctbx.xfel until this investigation.
In the second protocol, we sought to allow the detector to refine to its optimal position, but without allowing the detector model complete independence for every image. To this end, we performed joint , red models) exhibit a similar decrease in unitcell lengths (with concurrent shortening of the detector distance to 109.82 mm), but now the unitcell standard deviations become much tighter than in the starting model. At least in this serial crystallography case, the apparent nonisomorphism of crystals seems to be an accuracy issue, which can be corrected by the joint of the crystal ensemble against a single detector model, a new protocol that is enabled by the sparsealgebra LevMar technique outlined above.
in which a single detector model was simultaneously refined against the ensemble of all crystal models in the data set. No explicit restraints were placed on the unitcell parameters. The results (Fig. 67. Timedependent ensemble of the entire experiment
From §6, it is clear that serial crystallography data sets can be acutely sensitive to experimental geometry, to the point that a nonphysical standard deviation of the crystaltodetector distance of only 0.64 mm produces a noticeable and deleterious effect on the lattice parameters. Other literature results have also highlighted the need for precise distance determination, including a paper by Nass et al. (2016) that presents data where optimal unitcell distributions and dataquality metrics can only be obtained by accounting for a series of distance shifts spanning 0.62 mm that occurred over a period of 3 days. Indeed, datamodeling software serves a decisive role in cases where it is not feasible to experimentally measure precise sampletodetector distances. Our L785 thermolysin study provides one such example: the detector position is known from motor encoders, but the sample position can readily change depending on how the electrokinetic (Sierra et al., 2012) is inserted and the flow characteristics of the liquid jet at any given point in time.
To investigate the possibility of a time dependence in the sampletodetector distance parameter in our experiment, we split the data chronologically into runs. We then further subdivided each run into a series of chronological batches each containing ∼3000–4500 images. For each batch, we performed joint xy shift in the detector plane) against the ensemble of all crystal models in the batch. We then recomputed the mosaic estimates for each crystal (Sauter et al., 2014), used these models to predict Bragg reflections, and integrated the Bragg intensities.
of a single detector model (freely refining the overall distance and theFig. 7(a) shows the resulting batchwise distance determinations. Baseline distance values are also indicated. For the first group (runs 11–22), the baseline is the direct result of metrology from expandingmode (§3.6), 129.97 mm, based on the 3000 bestdiffracting images. We see that timedependent ensemble produces slightly larger distance estimates, averaging 130.00 ± 0.01 mm. The cause of this increase is unknown, but we speculate that it could result from the timedependent batches having fewer reflections in the detector corners compared with the analysis performed in §3.6 using a subsample of images with the most reflections in the corners. (As mentioned previously, a slight systematic radial misprediction of reflections may cause geometry to be slightly resolutiondependent.) For runs 26–29, the baseline distance of 104.97 mm was produced by subtracting the motorencoder offset (25 mm) for these runs from the calibrated distance in §3.6. It is clear that the distance values for runs 26–29 differ substantially (0.1–0.3 mm) from the baseline, and moreover that there is a 0.2 mm variability over the 36 min period, changing even within the duration of a single run.^{3} We assume that the underlying cause is that the flow direction of the liquid jet from the electrokinetic injector changed continually over time.
We expected that correctly accounting for our timevarying sampletodetector distance would lead to improved Bragg spot predictions, as well as improvements in the data quality similar to those seen by Nass et al. (2016). As a dataquality metric (Fig. 7b) we chose the signal strength, 〈I/σ_{c}(I)〉, where I is the integrated intensity of a measurement, σ_{c}(I) is the uncertainty in the measurement attributable to photoncounting statistics (Leslie, 2006) and 〈〉 represents the mean over all measurements of all For each batch, we compared the resolutionbinned 〈I/σ_{c}(I)〉 averages with the Bragg spots predicted from either the batchwise or the baseline distance values. Each line represents the resolutiondependent percent change in signal strength for a single batch. Runs 11–22 (distance 130 mm) exhibit a modest increase in signal of around 2–3%. However, runs 26–29 (distance 105 mm), especially runs 28–29, exhibit an appreciable increase in signal strength, up to 25% higher at midresolution for the last two batches, where the sample position moved 55 µm midrun. Thus, timedependent, batchwise ensemble appears to offer the possibility of detecting and correcting changes in the experimental model on timescales not previously understood to present a challenge for data quality.
8. Merging and error models
Table 9 shows a comparison between the originally published thermolysin structure (PDB entry 4tnl) and the data reprocessed in this work, modeled using the expandingmode metrology in §3.6. Data were scaled and merged using four alternate protocols differing in the use of postrefinement (Sauter, 2015), the use of timedependent ensemble of the detector distance (§7) and the choice of error model used to adjust the estimated errors of the integrated, merged intensities. Supplementary Tables S2–S5 give detailed statistics for the four resulting data sets.
‡A second zinc site, as shown in Uervirojnangkoorn et al. (2015), is observable in our data as well but was not modeled in this work. 
Critically, interpreting signal quality is highly dependent on the error model, i.e. how the uncertainties in the measured reflection intensities are treated. During integration, we determine a baseline estimate of the error from photoncounting statistics (Leslie, 2006), referred to above as σ_{c}(I), with this being the only source of uncertainty that is readily quantified. Other sources of error from detector calibration, partiality correction, crystal orientation and cell dimensions, among many others, are much more difficult to estimate and therefore propagate. However, because the errors from counting statistics derived from integration represent only a small part of the overall experimental uncertainty, it is imperative to inflate the error determined from counting statistics to better represent the error seen in the sample.
One such treatment, Ha14, was proposed in Hattne et al. (2014). In that work, we considered the distribution of all intensity measurements produced on a single serial crystallography still shot. Familiar principles would lead us to believe that the measurements would form an exponentially decreasing distribution, as originally discussed by Wilson (1949). However, since most (if not all) spots on a still shot are partials, and since the degree of partiality is not known a priori, the prediction of stillshot spots puts us in a difficult position. In order to predict all of the diffracted Bragg spots, we need to slightly overmodel the mosaic parameters (Sauter et al., 2014). As a consequence, many of the predicted reflections will not contain any photons, and the process of integration simply produces noise, which has a Gaussian distribution rather than an exponential one. A recent paper (Sharma et al., 2017) shows this explicitly. In the Ha14 approach, we look at the distribution of the quantity I/σ_{c}(I) over all measurements on the image. The negative values of I/σ_{c}(I) are assumed to form the lower half of a Gaussian distribution (with a mean of zero) for which we determine the standard deviation σ_{neg}. With the assumption that the negative measurements represent the background noise level, we therefore use σ_{neg} as a constant, dimensionless multiplicative factor to inflate the photoncounting uncertainty, giving a new uncertainty estimate for each measurement,
Separately, we have adapted the error model employed by SCALA (Evans, 2006, 2011), Ev11. This model expresses the uncertainty in terms of three refined parameters: SdFac (a multiplicative factor), SdB (a factor proportional to reflection intensity) and SdAdd (a factor proportional to intensity squared), as formulated in the following:
where I_{hl} is a single reflection measurement of h, σ_{c}^{2}(I_{hl}) is the error in that measurement from integration summation, 〈I_{h}〉 is the mean of all measurements of h and σ_{Ev11}(I_{hl}) is the corrected error in a single reflection measurement as treated by Evans (2011).
Comparing Ha14 and Ev11, these two models produce very different error estimates of the error when propagated to the merged intensities. It is likely that Ha14 substantially underestimates the error in the data, producing overall I/σ_{Ha14}(I) estimates of ∼300. The overall I/σ_{Ev11}(I) estimates of ∼30 are more reasonable based on what is known generally about error in crystallographic experiments (Diederichs, 2010). Ha14 was never intended as a final description of the error model; however, Ev11, while giving reasonable numerical error ranges, does not account for the propagation of errors from partiality corrections that are inherently needed for serial crystallography stills.
9. Discussion
DIALS provides a general representation of complex experimental geometry that we have shown here to be suitable for serial crystallography. We demonstrated this by refining the metrology of the CSPAD detector at LCLS using a thermolysin data set. Firstly, we used a rotational autocorrelation approach to position the quadrant locations using a virtual powder pattern. We then treated all 32 CSPAD sensors as independent panels using an expanding approach to metrology first refining the inner tiles and then progressively adding tiles in increasing resolution shells. After refining the metrology, we show a large improvement of the r.m.s.d. of the observed versus predicted spot locations (from 157.9 to 60.1 µm, or about half a pixel given the CSPAD pixel size of 110 µm). We observed a further improvement after reindexing the images using the new metrology and rerefining in cycles until convergence (giving a final r.m.s.d. of 50.1 µm after two cycles).
In order for joint parameter
to be practical using thousands of crystal models with a multipanel detector, we implemented a nonlinear leastsquares approach using sparsematrix algorithms to efficiently solve the normal equations required for computing step sizes. This allows us to use a single joint target function instead of alternating between crystal and detector models, as had been performed previously.We were also able to use the crystal orientation matrices from lattices observed on one CSPAD to refine the metrology of a second CSPAD detector positioned 2.5 m from the sampleinteraction region, demonstrating the general approach to geometry DIALS. We can combine detectors, refine overall detector distance and panel positions simultaneously, and handle detectors with any number of segments in arbitrary orientations.
used byWhen joint
is applied to the ensemble of all crystals used for a data set, it has the added benefit of improving the accuracy of unitcell length estimation for axes closely aligned with the beam, which are difficult to measure from still images. Serial crystallography data sets typically exhibit a broad distribution of unitcell parameters derived from indexing, presumably reflecting several factors, including an inability to accurately model the detector position, crystal parameters and beam parameters, as well as true nonisomorphism among crystals. Ensemble offers a method for removing several types of modeling uncertainty, producing a clearer picture of the inherent nonisomorphism of the crystals, which in the case of our thermolysin data was very low. We note that worries about nonisomorphism in serial crystallography have been broadly expressed, and our results appear to alleviate some of the concern.Ensuring a correct detector position is vital for maximizing the number of integrated patterns and correcting for asymmetry in the unitcell distributions (Hattne et al., 2014; Nass et al., 2016). It is expected that the sample position can and will vary slightly over time during data collection, and even drift from its original position, as scientists replace the sample, recalibrate equipment positions or simply bump instruments. To this end, we tried to correct for systematic changes in detector position over time by applying a timedependent ensemble approach to improving the integrated data quality across an entire experiment. We batched the data into granular time intervals and rerefined the detector position, crystal models and mosaic estimates within each batch, using the new models to predict and integrate the Bragg spots. This gave a slight improvement in signal strength for data collected at the same time that the detector metrology calibration data were collected (130 mm detector distance). When the detector was moved to 105 mm, rerefining the detector position for each batch further improved the integrated signal strength.
After metrology DIALS or converting it to LCLS geometry format for general users of nonDIALS software (Brewster et al., 2016). However, changes in detector position can require tileposition rerefinement. Small, resolutiondependent systematic errors in spot prediction (§3.6) can be absorbed by local tile movements, so potentially users should always rerefine the detector metrology when moving the detector.
the improved tile positions can be reused during later data collection by either specifying the refined geometry file directly inWe merged the data using four different protocols. The bestpractice method, as measured by the anomalous peak height for the Zn atom of 74.0σ, included postrefinement, timedependent ensemble and the Ev11 error model. Removing postrefinement, removing timedependent ensemble or using the Ha14 error model each resulted in lower anomalous peak heights (42.6, 69.4 and 44.6σ, respectively). We recommend that software users experiment with these algorithm choices when analyzing future XFEL data sets. Scripts to do so have been made available (see §10).
10. Data and software availability
Raw thermolysin data files for experiment L785 are available from the Coherent Xray Imaging Data Bank as deposition 81 (http://www.cxidb.org/id81.html). Jupyter notebooks that reproduce all seven figures in this work are available at https://github.com/phyynx/dials_refinement_brewster2018. All of the metrology integration and merging approaches outlined here are implemented in the software package DIALS, which is publicly available at https://dials.github.io. Documentation regarding metrology and XFEL processing in DIALS is also available in Brewster et al. (2016) and on the cctbx.xfel wiki page at http://cci.lbl.gov/xfel.
Supporting information
Supplementary tables. DOI: https://doi.org//10.1107/S2059798318009191/lp5037sup1.pdf
Footnotes
^{1}More complex panels, such as curved or nonrectangular chips, can be represented in DIALS given an appropriate pixeltomillimetre implementation. At present, the pixels need to be addressable using two dimensions, but the software can be extended to triangular or hexagonal pixel shapes should the need arise. In the case of the CSPAD all panels have the same pixel size, but this need not be true for other detectors represented by models in DIALS.
^{2}We developed a prototype using explicit second derivatives but found this to be a timeconsuming process that does not lend itself well to rapid software development and algorithm improvement.
^{3}For runs 11–22 the mean r.m.s.d. for all of the batches after ensemble was 40.5 µm. For runs 26–29 it was 73.6 µm. Why runs 26–29 have a larger r.m.s.d. is unknown. We attempted ensemble of each batch again, but instead of only refining detector distance and xy shift, we also refined all of the sensor positions (see Table 4, last row). This ensemble did not further improve the r.m.s.d.s for runs 26–29 (data not shown).
Acknowledgements
We thank Vittal Yachandra and Jan Kern for their continuing collaboration on XFEL data collection. We thank Johan Hattne and Helen Ginn for productive conversations regarding metrology
and Petrus Zwart for insight into XFEL error modeling.Funding information
This research was supported by NIH grant GM117126 to NKS for dataprocessing methods, NIH grant GM110501 to JY for via CCP4 and the Wellcome Trust [202933/Z/16/Z]. Portions of this research were carried out at LCLS at the SLAC National Accelerator Laboratory, supported by the US DOE, Office of Science, OBES under Contract No. DEAC0276SF00515. Data processing was performed in part at the National Energy Research Scientific Computing Center, supported by the DOE Office of Science, Contract No. DEAC0205CH11231.
serial crystallography, Diamond Light Source, STFCReferences
Bergmann, U., Yachandra, V. & Yano, J. (2017). Xray Free Electron Lasers. London: Royal Society of Chemistry. Google Scholar
Bevington, P. R. & Robinson, D. K. (2003). Data Reduction and Error Analysis for the Physical Sciences, 3rd ed. Boston: McGraw–Hill. Google Scholar
Brewster, A. S., Hattne, J., Parkhurst, J. M., Waterman, D. G., Bernstein, H. J., Winter, G. & Sauter, N. K. (2014). Comput. Crystallogr. Newsl. 5, 19–24. https://www.phenixonline.org/newsletter/CCN_2014_01.pdf. Google Scholar
Brewster, A. S., Waterman, D. G., Parkhurst, J. M., Gildea, R. J., MichelsClark, T., Young, I. D., Bernstein, H. J., Winter, G., Evans, G. & Sauter, N. K. (2016). Comput. Crystallogr. Newsl. 7, 32–53. https://www.phenixonline.org/newsletter/CCN_2016_07.pdf. Google Scholar
Crick, F. H. C. & Magdoff, B. S. (1956). Acta Cryst. 9, 901–908. CrossRef CAS IUCr Journals Web of Science Google Scholar
Diederichs, K. (2010). Acta Cryst. D66, 733–740. Web of Science CrossRef CAS IUCr Journals Google Scholar
Duyvesteyn, H. M. E., Ginn, H. M., Pietilä, M. K., Wagner, A., Hattne, J., Grimes, J. M., Hirvonen, E., Evans, G., Parsy, M. L., Sauter, N. K., Brewster, A. S., Huiskonen, J. T., Stuart, D. I., Sutton, G. & Bamford, D. H. (2018). Sci. Rep. 8, 3771. CrossRef Google Scholar
Evans, P. (2006). Acta Cryst. D62, 72–82. Web of Science CrossRef CAS IUCr Journals Google Scholar
Evans, P. R. (2011). Acta Cryst. D67, 282–292. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ginn, H. M. & Stuart, D. I. (2017). J. Synchrotron Rad. 24, 1152–1162. CrossRef IUCr Journals Google Scholar
Guennebaud, G. & Jacob, B. (2010). Eigen v.3. http://eigen.tuxfamily.org/. Google Scholar
Hart, P. et al. (2012). Proc. SPIE, 8504, 85040C. CrossRef Google Scholar
Hattne, J. et al. (2014). Nature Methods, 11, 545–548. Web of Science CrossRef CAS PubMed Google Scholar
Henrich, B. et al. (2011). Nucl. Instrum. Methods Phys. Res. A, 633, S11–S14. Web of Science CrossRef CAS Google Scholar
Kameshima, T., Ono, S., Kudo, T., Ozaki, K., Kirihara, Y., Kobayashi, K., Inubushi, Y., Yabashi, M., Horigome, T., Holland, A., Holland, K., Burt, D., Murao, H. & Hatsui, T. (2014). Rev. Sci. Instrum. 85, 033110. Web of Science CrossRef PubMed Google Scholar
Kern, J. et al. (2014). Nature Commun. 5, 4371. Web of Science CrossRef Google Scholar
Leslie, A. G. W. (2006). Acta Cryst. D62, 48–57. Web of Science CrossRef CAS IUCr Journals Google Scholar
Liu, J. W. H. (1990). SIAM J. Matrix Anal. Appl. 11, 134–172. CrossRef Google Scholar
Liu, D. C. & Nocedal, J. (1989). Math. Program. 45, 503–528. CrossRef Web of Science Google Scholar
Mehta, D. P. & Sahni, S. (2004). Handbook Of Data Structures And Applications. Boca Raton: Chapman & Hall/CRC. Google Scholar
Nass, K., Meinhart, A., Barends, T. R. M., Foucar, L., Gorel, A., Aquila, A., Botha, S., Doak, R. B., Koglin, J., Liang, M., Shoeman, R. L., Williams, G., Boutet, S. & Schlichting, I. (2016). IUCrJ, 3, 180–191. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Nocedal, J. & Wright, S. (2006). Numerical Optimization, 2nd ed. New York: SpringerVerlag. Google Scholar
Parkhurst, J. M., Brewster, A. S., FuentesMontero, L., Waterman, D. G., Hattne, J., Ashton, A. W., Echols, N., Evans, G., Sauter, N. K. & Winter, G. (2014). J. Appl. Cryst. 47, 1459–1465. Web of Science CrossRef CAS IUCr Journals Google Scholar
Rennich, S. C., Stosic, D. & Davis, T. A. (2014). Proceedings of the 4th Workshop on Irregular Applications: Architectures and Algorithms, pp. 9–16. Piscataway: IEEE. Google Scholar
Russi, S., Juers, D. H., SanchezWeatherby, J., Pellegrini, E., Mossou, E., Forsyth, V. T., Huet, J., Gobbo, A., Felisaz, F., Moya, R., McSweeney, S. M., Cusack, S., Cipriani, F. & Bowler, M. W. (2011). J. Struct. Biol. 175, 236–243. Web of Science CrossRef CAS PubMed Google Scholar
Sauter, N. K. (2015). J. Synchrotron Rad. 22, 239–248. Web of Science CrossRef CAS IUCr Journals Google Scholar
Sauter, N. K., Hattne, J., Brewster, A. S., Echols, N., Zwart, P. H. & Adams, P. D. (2014). Acta Cryst. D70, 3299–3309. Web of Science CrossRef IUCr Journals Google Scholar
Sauter, N. K. & Poon, B. K. (2010). J. Appl. Cryst. 43, 611–616. Web of Science CrossRef CAS IUCr Journals Google Scholar
Sawaya, M. R. et al. (2014). Proc. Natl Acad. Sci. USA, 111, 12769–12774. Web of Science CrossRef CAS PubMed Google Scholar
Sharma, A., Johansson, L., Dunevall, E., Wahlgren, W. Y., Neutze, R. & Katona, G. (2017). Acta Cryst. A73, 93–101. Web of Science CrossRef IUCr Journals Google Scholar
Sierra, R. G. et al. (2012). Acta Cryst. D68, 1584–1587. Web of Science CrossRef CAS IUCr Journals Google Scholar
Steller, I., Bolotovsky, R. & Rossmann, M. G. (1997). J. Appl. Cryst. 30, 1036–1040. Web of Science CrossRef CAS IUCr Journals Google Scholar
Tukey, J. W. (1977). Exploratory Data Analysis. Reading: Addison–Wesley. Google Scholar
Uervirojnangkoorn, M., Zeldin, O. B., Lyubimov, A. Y., Hattne, J., Brewster, A. S., Sauter, N. K., Brunger, A. T. & Weis, W. I. (2015). Elife, 4, e05421. Web of Science CrossRef Google Scholar
Waterman, D. G., Winter, G., Gildea, R. J., Parkhurst, J. M., Brewster, A. S., Sauter, N. K. & Evans, G. (2016). Acta Cryst. D72, 558–575. Web of Science CrossRef IUCr Journals Google Scholar
Wilson, A. J. C. (1949). Acta Cryst. 2, 318–321. CrossRef IUCr Journals Web of Science Google Scholar
Winter, G., Waterman, D. G., Parkhurst, J. M., Brewster, A. S., Gildea, R. J., Gerstel, M., FuentesMontero, L., Vollmar, M., MichelsClark, T., Young, I. D., Sauter, N. K. & Evans, G. (2018). Acta Cryst. D74, 85–97. Web of Science CrossRef IUCr Journals Google Scholar
Yefanov, O., Mariani, V., Gati, C., White, T. A., Chapman, H. N. & Barty, A. (2015). Opt. Express, 23, 28459–28470. Web of Science CrossRef PubMed Google Scholar
This is an openaccess article distributed under the terms of the Creative Commons Attribution (CCBY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.