## CCP4 study weekend

## Measurement errors and their consequences in protein crystallography

^{a}Department of Biochemistry, UT Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas 75390-9038, Texas, USA, and ^{b}Department of Molecular Physiology and Biological Physics, University of Virginia, 1300 Jefferson Park Avenue, Charlottesville, VA 22908, USA^{*}Correspondence e-mail: zbyszek@work.swmed.edu

This article analyzes the relative impact of various types of measurement uncertainties on different stages of *DENZO* and *SCALEPACK* are presented.

Keywords: measurement errors; uncertainty estimation; sigma estimates; *DENZO*; *SCALEPACK*.

### 1. Definitions

In this article, the terms `measurement error' and `measurement uncertainty' will be used in their precise statistical meanings. The formal meaning of error is the difference between the result of a measurement and the true value of the measurand. The true value of the measured quantity is typically not known, so the error is not known either and has to be described by a statistical distribution. This distribution is estimated based on the overall knowledge of how the measurement was made and also on the internal consistency of measurements. The width of the distribution is the uncertainty of the measurement and in the case of a Gaussian probability distribution the σ is a synonym for this uncertainty. The application of Bayes's theorem can convert the error probability distribution into the probability distribution of measured value. This article focuses on estimating σ, which describes only the experimental input to the Bayesian reasoning, rather than the subsequent applications of Bayesian statistics (French & Wilson, 1978).

Standard abbreviations for crystallographic phasing methods are used: MAD, multiple-wavelength anomalous diffraction; SAD, single-wavelength anomalous diffraction; MIR, multiple

The abbreviation for the method may be preceded by the atomic symbol of the heavy atom or anomalous scatterer.### 2. Introduction

The measurement of diffraction peak intensities starts the multi-step process of obtaining a three-dimensional atomic structure from the collected data. To solve the structure, all measured intensities *I _{m}* have to be on the same scale, preferably the same as that of the squared amplitude of the structure factors:

where *K* is the scale factor of a particular measurement and |**F**|^{2} is the squared amplitude of the **F**.

Any conclusions based on intensity measurements are always affected by a degree of uncertainty resulting from the errors inherent to the measurement process. Assuming a Gaussian probability function of the intensity measurement error, the estimate of uncertainty can be expressed by a sigma value σ* _{I}*.

Procedures that determine the scale factor *K* have also a level of uncertainty owing to the unavoidable computational simplifications in describing the sample and the experimental setup; for example, beam stability, geometry of diffraction and X-ray absorption in the crystal.

During data processing, we usually assume that the intensity of multiple measurements of a Bragg reflection, including symmetry-related reflections, arises from a single ) can be extended in the form

However, this assumption may not be satisfactory in many experimental situations, for example owing to structural variations between crystals. To accommodate the above potential uncertainties, (1where σ* _{I}*, σ

*and*

_{K}**σ**represent the estimates of uncertainties regarding the intensity of a diffraction peak, a scale factor and a of a given

_{F}*hkl*, respectively. The ± sign is a shorthand notation to describe a Gaussian probability function which will be used throughout this paper. In case of a its uncertainty

**σ**is the two-dimensional Gaussian function of a complex variable. In cases when more than one such sign appears in an equation, the probability functions have to be appropriately convolved. The scale factor

_{F}*K*is determined by procedures (Otwinowski

*et al.*, 2003) that make assumptions about the experiment. Uncertainty in

*K*mostly arises from these assumptions necessarily being only approximate. It is convenient to describe the uncertainty of the scale factor

*K*in relative terms using the form exp(±σ

*) ≃ (1 ± σ*

_{K}*). The main purpose of (2) is to emphasize that every component of (1) has some level of uncertainty.*

_{K}In macromolecular crystallography, there are two main situations where we have to consider the significance of errors. Firstly, in σ* _{I}* is significant and its estimate is only important for weak intensities (§2.2.1). Secondly, for obtaining the phase information, which is always obtained from the differences between the diffraction intensities. Such differences are typically relatively small and for this reason even small uncertainties of the three types (σ

*, σ*

_{I}*and*

_{K}**σ**) can be significant. Here, the consequences of errors are particularly important for large intensities (§2.2.3.).

_{F}#### 2.1. Types of errors

The classification of measurement errors in crystallography is based on statistical properties of their distribution and correlations. The simplest type of error is one with no correlation, described by a well defined, typically Gaussian, probability distribution. This is effectively a definition of random error.

The error is called `systematic' when a group of measurements is affected in a well defined, correlated fashion. When such a correlation is included as a part of the problem analysis it can be considered an effect rather than an error. The remaining errors of large magnitude, which should be rare, are called measurement outliers.

##### 2.1.1. Random errors

An unavoidable source of random error in measurements arises from the quantum nature of X-rays. The resulting error is described by the σ value being the square root of the expected number of photons. The relative error of a diffraction peak intensity measurement owing to counting statistics is

of counting statistics, which can be effectively approximated by a Gaussian function, with thewhere *n* is the number of photons. Random error results not only from fluctuations in the number of photons in the peak, but also from fluctuations in the number of photons in the background measured together with the peak. Thus, to effectively measure small differences in diffraction intensities, a large number of photons is required.

(3) defines the lowest possible error in a measurement, which needs to be adjusted for the efficiency of the instrument. Random error from counting statistics in integrating detectors (CCD and image plate) is multiplied by the detector inefficiency factor, which is typically about 1.2. These detectors also have electronic read-out noise and, in the case of CCDs, dark-current noise, which add other components to the random error. When considering the experimental strategy, the oscillation range for diffraction images affects the X-ray background and electronic read-out noise in opposite ways. Since it is best to minimize the sum of these two effects, it is convenient to convert the electronic noise into the equivalent X-ray background noise by expressing it as a (wavelength-dependent) number of photons per pixel.

Random-error magnitude, being very predictable, should be assessed after a test exposure(s) to define the optimal data-collection strategy. Formal prediction of random and other types of errors can be used to choose between the alternative experimental strategies (Popov & Bourenkov, 2003).

##### 2.1.2. Systematic errors

Systematic errors can be classified according to their sources and to the types of correlation among the measured values of the diffraction peaks. Systematic errors arise from simplifying assumptions about the instrumentation, the sample and diffraction physics and from approximations in computational procedures. Depending on how the systematic error affects groups of measurements, it can be characterized as belonging to one of the following categories.

##### 2.1.3. Outliers

There is a group of sporadic but significant errors that do not belong to either random or systematic error categories. For example, cosmic radiation or radioactivity can randomly create large peaks in diffraction images (zingers). As a consequence, some diffraction intensities calculated by an integration program can be highly incorrect. Measurements affected by such errors are called outliers. They can be recognized during the analysis of symmetry-related observations by differing from other measurements much more than expected from the estimates of experimental errors (Blessing, 1997). This simple concept of outlier analysis is not straightforward to apply in practice owing to its sensitivity to assumptions about data errors. In particular, this analysis can consider consequences of unaccounted for systematic effects as outliers.

#### 2.2. Error assessment

Analysis of errors should start with an overall assessment of how they affect the structure-determination procedure and the final result. For example, in the molecular-replacement method the impact of errors is very different than in other macromolecular crystallography procedures.

##### 2.2.1. Molecular replacement

A target function in *et al.*, 1998). All other functions used in have very similar properties. Random errors have a minimal impact on the value of the correlation function unless, on average, they reach the level of average measured intensities, where averages are considered in resolution shells. Owing to the model typically only approximating the real structure, is usually limited to low-resolution data, for which experimental random errors are not significant. However, is very sensitive to systematically missing the strongest intensities owing to detector saturation. Missing measurements effectively have an implied value of zero in the simplest form of the target function in the molecular-replacement method,

and, in the case of reflections saturating the detector, it is better to use approximate values of such reflections rather than ignoring them. This could be achieved, for example, by fitting intensities from the reflection tail, as discussed by Leslie (1999) for *MOSFLM*. This option is also available in other programs (Otwinowski & Minor, 1997). In a more elaborate version of the target function,

the implied values of missing reflections are equal to the average intensities in the resolution shells. It makes the

only slightly less sensitive to the missing data.##### 2.2.2. of molecular structure

In a typical macromolecular *R*_{free} factor of about 20% or higher. The main source of the discrepancy between the predicted and the observed intensities arises from the atomic model inadequately describing the diffracting electron density rather than from the measurement error. A typical magnitude of the structure-factor error of the atomic model is closely related to the *R*_{free} in atomic The relative error of the intensity is twice the relative error of the structure-factor amplitude. As a consequence, only (relative) measurement errors exceeding twice the *R*_{free} value impact on the procedure.

##### 2.2.3. Experimental phasing

Experimental phasing is based on measuring differences of same-index (or symmetry-related) reflections between different crystals in MIR, at different wavelengths for dispersive differences in MAD and between Friedel symmetry-related reflections in SAD, MIRAS and MAD. The magnitude of these differences is related to the magnitude of the phasing signal. The quality of phasing information is defined by the phasing power, which can be generalized as

where `phasing signal' represents the magnitude of the phasing signal and its error is the sum of the contributions from the experimental error and the error in the modeling of the phasing signal, including non-isomorphisms. The magnitude of the phasing power is resolution dependent. The resolution where the phasing power drops substantially below 1 defines in practice the limit of useful contribution to *etc*.). Any discussion of phasing-power magnitude has to consider that it can be improved equally well by an increase in phasing signal or by reducing errors associated with the signal (8). For different types of experiments, the following practical observations can be applied.

### 3. Correction of systematic effects

Systematic errors, when accounted for, can be considered to be a feature of the experiment.

Some types of systematic effects have minimal impact on the result (structure, phasing *etc*.). For example, systematic underestimation of the diffraction intensity by a constant factor will only produce a change in the overall scale factor during atomic Such a change will fully compensate for this type of error. If such underestimation slowly changes with resolution, its main impact will be a small change in the overall *B* factor, an issue of little significance. However, other types of systematic effects, if ignored, may impact on particularly experimental phasing. The following three categories of systematic effects can be corrected for by more elaborate data analysis.

#### 3.1. Scaling corrections

The practice of correcting for various multiplicative effects has a long history (Hamilton *et al.*, 1965; Fox & Holmes, 1966; Monahan *et al.*, 1967; Diamond, 1969; Rossmann, 1979; Rossmann *et al.*, 1979; Evans, 1993, 1997; Leslie, 1993, 1999; Otwinowski, 1993; Otwinowski & Minor, 1997, 2001). Absorption correction parameterized by spherical harmonics (Katayama, 1986) has been added to most scaling programs in the last few years. Corrections for inaccuracies in crystal rotation and corrections for an integration inaccuracy, the so-called `missing-tail' correction (Evans, 1997), are other recent improvements.

#### 3.2. Corrections for non-isomorphism

The assumptions about the internal isomorphism of crystal(s) used to produce a single data set are often quite problematic. For data with a high multiplicity of symmetry-related reflections it is feasible to model non-isomorphisms, as discussed previously (§2.1.2). This analysis can be performed when merging already scaled data.

The introduction of intense synchrotron beamlines to crystallography improved the resolution of data, particularly from small crystals, but not necessarily the low-resolution *R*_{merge} statistics. Beam-intensity and goniostat-rotation fluctuations are partially responsible for these results. Another source of poor merging quality is the fact that high radiation doses induce chemical changes that cannot be corrected by time- and resolution-dependent scaling. These changes represent a systematic effect that can be corrected for in principle. The impact of uncorrected radiation-induced non-isomorphism on MAD experiments is discussed below.

#### 3.3. analysis

It was recently recognized that ). One approach is to correct the already scaled and merged data for this problem. Such deconvolution of double (multiple) measurements results in their errors being correlated. One can ignore this correlation, but to include this additional information in structure-solving programs, the programs would have to be modified to analyze directly.

by perfect superimposition of crystal lattices, if allowed by space-group symmetry, is quite frequent (Yeates, 1997### 4. Error estimation

Experienced researchers can sometimes be assured that experimental errors only impact 2.2.1.). Otherwise, if practical, systematic errors should be corrected for and, if errors are unavoidable, their consequences may be minimized by optimal weighting of the results. Even if errors are small enough to be ignored, their magnitude first has to be estimated in order to provide assurance of their insignificance.

and final conclusions in a minimal fashion. In such a lucky situation, experimental errors may be ignored (§#### 4.1. Estimation of random errors

In theory, the rules for propagation of uncertainties in raw data to the final results are well defined (Fisher, 1959; Diamond, 1969). Unfortunately, for virtually all detectors now used in macromolecular crystallography, the pixel measurements are highly correlated on the short distance scale. The distances involved are short enough to make the errors of separate Bragg peaks independent, but error correlations complicate the estimates of individual intensity peak uncertainties. Instead of calculating a random-error estimate from a complex theory, the practical approach is to account for differences in symmetry-related observations with equations that have been validated by extensive experience. Owing to the history of how such estimates were derived, they account not only for random error but also for a small amount of systematic errors present in all experiments.

The programs *DENZO* (Otwinowski, 1993) and *MOSFLM* (Leslie, 1993) initially estimated errors of integrated diffraction peaks recorded on X-ray film. Subsequently, their error-estimate equations were adjusted to fit detectors with larger dynamic range. Since these two programs together are used in more than 90% of the structure determinations deposited in the PDB, their design philosophy defines the prevailing approach to estimating random errors. The complex process by which this is performed in *DENZO* is described below.

Preliminary error estimates, which are subsequently adjusted to describe better the disagreements among measurements in *DENZO*, are given by

where *p _{i}* is the fraction of a predicted profile in a particular pixel

*i*,

*b*is the calculated value of the background for the pixel

_{i}*i*,

*I*is the profile-fitted intensity,

*n*is the number of pixels used in background estimation and

_{b}*e*is the error density parameter defined for each instrument, which can also be overridden by the user (Gewirth, 1998). The sums are over all the pixels in a reflection profile. The left sum is the main contribution resulting from the uncertainty of the pixel measurements in the The right sum under the square root is the contribution of the background estimate uncertainty to the measured intensity.

_{d}Next, the *g* (goodness of profile fitting) factor is calculated, describing how well the predicted profile fits a particular intensity peak

where *n _{i}* is the number of pixels in a reflection profile and

*m*is the observed value of intensity for the pixel

_{i}*i.*For weak reflections, the parameter

*g*should be relatively close to 1; if it is systematically off by a large factor, the parameter

*e*should be adjusted. The next step depends on the value of

_{d}*g*:

The values of σ* _{D}* and

*g*are then output by

*DENZO*. Subsequently, the

*SCALEPACK*program applies an additional level of adjustment to the output produced by

*DENZO*,

Together, (11) and (12) produce a simpler formula.

The steps described in (11) and (12) are performed separately, instead of applying (13) directly, owing to the need to preserve compatibility with the old *DENZO* output file format, which is based on a previous (prior to version 1.97) method of estimating random error.

The value of σ* _{S}* is subsequently scaled by a user-adjustable factor

*E*(called the error scale factor in

_{S}*SCALEPACK*), with typical value 1.3, to make disagreements among symmetry-related measurements consistent with the scaled σ

*:*

_{S}However, even a scaled σ* _{S}* does not account for all types of errors and additional adjustments are needed for a variable component of systematic errors.

#### 4.2. Estimation of systematic errors

##### 4.2.1. Estimation of multiplicative errors

As described in §1, multiplicative errors result from the imprecision of scale factors applied to the integrated diffraction peak intensities. The magnitude of such errors tends to be in the range of single-digit percent. Still, such small errors can be of importance when calculating the differences between measurements used in phasing procedures. Errors in the scale factors are definitely not random and they have rather complex correlations. There is a correlated component of errors that equally affects the measurements of intensities in phasing differences, so it does not impact on the differences themselves. Normally, one is only interested in estimating the magnitude of the remaining component of scaling errors, described by σ* _{K}*. The practice of estimating the multiplicative errors by comparing symmetry-related reflections has an advantage of estimating only the relevant component of multiplicative errors. The overall magnitude of the scaling error would have to be estimated differently, but typically it can be ignored since it is of little relevance to macromolecular crystallography.

The scaled errors (14) from an integration program can be combined with σ* _{K}* into the final estimated error of the measurement,

The σ* _{E}* is used to check whether the observed differences between symmetry-related measurements statistically agree with the final estimate of the measurement error. In an ideal case, the normalized goodness-of-fit index (often called normalized χ

^{2}, one of the most important statistics in merging programs) should be about 1. If it is significantly below 1, the errors are overestimated and either

*E*or σ

_{S}*should be reduced. Such an adjustment does not have to be very precise as, for example, a χ*

_{K}^{2}of 0.9 means that the magnitude of estimated error is probably underestimated only by about 5%. If χ

^{2}is much larger then 1, it may indicate that

*E*and/or σ

_{S}*should be increased. However, large increases of these parameters should not be automatically applied. Firstly, the values of these parameters should be compared with the previous measurements of similar crystals under similar experimental conditions. In most cases, the values of*

_{K}*E*and σ

_{S}*tend to be consistent in similar experiments. Unexpectedly large values of χ*

_{K}^{2}may indicate that the error model defined by (15) is not adequate. When a more detailed analysis eliminates the obvious reasons for such a problem (poorly edited beam-stop shadow, hardware failures, mistakes in processing

*etc*.), the most likely source of unaccounted for differences between symmetry-related measurements is non-isomorphism.

##### 4.2.2. Estimation of non-isomorphism error

Even though variations in crystal structure factors arising from non-isomorphism do not result from the measurement error, if left uncorrected they can have the same impact on the merging statistics and phasing differences. To include the non-isomorphisms in the overall analysis of data variations, it is convenient to convert the uncertainty in the structure factors to the same scale as the measurement error.

In the case of non-isomorphism, it is reasonable to assume that the level of structure-factor uncertainty is smaller than the magnitude of

So we can approximateFor centrosymmetric reflections, the equation simplifies to the form

**σ _{F}** symbolizes a shorthand notation of a Gaussian probability function of a complex variable, which describes the uncertainty of the

**F**. Since there is no standard convention to describe the width of such a distribution, the term 〈|

**σ**|

_{F}^{2}〉 is used to unambiguously specify that width. (17) needs to be integrated over the cosine of phase difference between

**F**and

**σ**. When calculating the variance of the resulting distribution, an average value of cosine squared equal to 1/2 appears, resulting in |

_{F}**F**|

^{2}having the following magnitude of uncertainty:

and for centrosymmetric data the corresponding magnitude is

The estimates of data variation from non-isomorphisms (18 and 19) should be combined with estimates of the measurement error (15) to obtain an overall estimate of uncertainty. Typically, we do not have an *a priori* estimate of 〈|**σ _{F}**|

^{2}〉, so we need to determine it by generalizing the procedure used to estimate the values of

*E*and σ

_{S}*. All the parameters of this overall estimate should be adjusted to obtain a reasonable agreement between the predicted and the observed spread of data.*

_{K}(18) can also be applied to differences between measurements caused by in order to estimate the magnitude of the phasing signal. When applying (19), one has to remember that in that case there are no Bijvoet differences.

While analyzing the consequences of non-isomorphism, one has to consider that its impact on experimental phasing is not random, particularly in MAD experiments. For data measured consecutively at different wavelengths, the correlations between phasing signal and radiation-induced non-isomorphism are very different for Bijvoet differences and for dispersive differences. When rotating a crystal around twofold symmetry, Friedel pairs diffract together, so radiation damage affects them equally and does not affect the difference between them. Otherwise, the members of Friedel pairs are collected at various times during data collection at one wavelength. Radiation-induced changes are quite uniform and linear with dose, so they will still average similarly for both components of the *et al.*, 2000).

### 5. Weighting data by error estimates

#### 5.1. Using sigmas to define data limits

The purpose of estimating errors is to minimize their consequences. The simplest form of using error estimates is to decide which observations should be used at a particular stage of analysis. The most widely used approach of this type is to define the resolution limit, which is typically different at different stages of structure solution. For example, a reasonably defined upper resolution limit in the atomic *I*/σ(*I*) test]. Typically, the resolution limit will be lower in the heavy-atom and still lower in to locate heavy-atom positions. Other criteria for excluding reflections are the ratio of intensity to sigma for a particular reflection being larger than a certain number or its sigma exceeding a particular value. These criteria are simple to apply but unfortunately the thresholds are rarely established by a formal statistical reasoning; instead, they are derived from past experiences with similar analysis. Rather than introducing limits, a better method of using sigmas is to assign a continuous weight between zero and one for every measurement, instead of effectively restricting the weights to the values of zero and one when using exclusion/inclusion criteria.

#### 5.2. Using sigmas to calculate weights

In macromolecular crystallography, the measurement-error estimates are used to calculate weights in heavy-atom and atomic *et al.*, 1997; Schneider & Sheldrick, 2002). Other procedures, such as calculation of difference maps, solvent flattening and non-crystallographic averaging, typically do not use continuous weighting. This shows that methods of macromolecular crystallography can still be improved by means of optimal handling of uncertainties. This would be especially important in case of weaker observations, which are now rejected by data limits, but still contain a certain amount of information. Applying weights at all stages of the structure-determination process is part of a general trend of implementing more elaborate Bayesian statistical reasoning in macromolecular crystallography.

### 6. Discussion

The main challenge in a macromolecular crystallography experiment is to obtain sufficient experimental information to solve a structure and/or answer detailed questions about it. What defines this information is the signal-to-error ratio, so it is equally important to maximize the signal and to minimize the error. As we reach the radiation-damage limit, there are few remaining methods to increase the number of diffracted photons: growing larger crystals, improving the crystal microscopic order and using multiple crystals. Additionally, the phasing signal can be improved for some heavy atoms (sulfur, iodine, calcium and a few others) by going to longer wavelengths up to a point when crystal absorption severely limits the number of diffracted photons. Since it is very difficult to increase the signal, minimizing errors becomes the main pursuit.

In this light, errors should not be treated as just a nuisance but rather as a subject of analysis. Their sources and magnitude should be understood even before the experiment. Since many crystals and many data-collection sessions are typically used to solve a structure, errors and their sources should be continuously reassessed. It is important to separately estimate each source of error, as they have to be minimized by different, sometimes even conflicting, approaches.

The main variability in inaccuracy of results produced by instruments is in the amount of systematic rather than random errors. There are often larger variations among instruments of the same type than between types of the instruments, so it is important to ascertain the quality of a particular experimental setup at a particular time. This assessment combined with the expected magnitude of phasing signal can be used to reasonably predict the quality of phasing information and its suitability to solve the structure.

Since a large fraction of overall error is systematic in nature, it can be reduced by advances in experimental protocols and corrected by data-analysis programs (Evans, 1999; Otwinowski *et al.*, 2003). Such progress will make weak phasing sources, particularly those already present in native proteins, more suitable for structure solving.

### Acknowledgements

This work was supported by grant GM53163 from the National Institutes of Health.

### References

Barna, S. L., Tate, M. W., Gruner, S. M. & Eikenberry, E. F. (1999). *Rev. Sci. Instrum.* **70**, 2927–2934. Web of Science CrossRef CAS

Blessing, R. H. (1997). *J. Appl. Cryst.* **30**, 421–426. CrossRef CAS Web of Science IUCr Journals

Brünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J.-S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T. & Warren, G. L. (1998). *Acta Cryst.* D**54**, 905–921. Web of Science CrossRef IUCr Journals

Burmeister, W. P. (2000). *Acta Cryst.* D**56**, 328–341. Web of Science CrossRef CAS IUCr Journals

Dauter, Z. (2003). *Acta Cryst.* D**59**, 2004–2016. Web of Science CrossRef CAS IUCr Journals

Diamond, R. (1969). *Acta Cryst.* A**25**, 43–55. CrossRef CAS IUCr Journals Web of Science

Evans, P. R. (1993). *Proceedings of the Daresbury CCP4 Study Weekend. Data Collection and Processing*, edited by L. Sawyer, N. Isaacs & S. Bailey, pp. 114–123. Warrington: Daresbury Laboratory.

Evans, P. R. (1997). *Proceedings of the CCP4 Study Weekend. Recent Advances In Phasing*, edited by K. S. Wilson, G. Davies, A. W. Ashton & S. Bailey, pp. 97–102. Warrington: Daresbury Laboratory.

Evans, P. R. (1999). *Acta Cryst.* D**55**, 1771–1772. Web of Science CrossRef CAS IUCr Journals

Fisher, R. A. (1959). *Statistical Methods and Scientific Inference.* Edinburgh: Oliver & Boyd.

Fox, G. C. & Holmes, K. C. (1966). *Acta Cryst.*** 20**, 886–891. CrossRef CAS IUCr Journals Web of Science

French, S. & Wilson, K. (1978). *Acta Cryst.* A**34**, 517–525. CrossRef CAS IUCr Journals Web of Science

Gewirth, D. (1998). *HKL Manual.* Charlottesville, VA, USA: HKL Research, Inc.

Hamilton, W. C., Rollett, J. S. & Sparks, R. A. (1965). *Acta Cryst.* **18**, 129–130. CrossRef IUCr Journals Web of Science

Katayama, C. (1986). *Acta Cryst.* A**42**, 19–23. CrossRef CAS Web of Science IUCr Journals

Leiros, H. K. S., McSweeney, S. M. & Smalås, A. O. (2001). *Acta Cryst.* D**57**, 488–497. Web of Science CrossRef CAS IUCr Journals

Leslie, A. (1993). *Proceedings of the CCP4 Study Weekend. Data Collection and Processing*, edited by N. Isaacs, L. Sawyer & S. Bailey, pp. 44–51. Warrington: Daresbury Laboratory.

Leslie, A. G. W. (1999). *Acta Cryst.* D**55**, 1696–1702. Web of Science CrossRef CAS IUCr Journals

Monahan, J. E., Schiffer, M. & Schiffer, J. P. (1967). *Acta Cryst.* **22**, 322. CrossRef IUCr Journals Web of Science

Murshudov, G. N., Vagin, A. A. & Dodson, E. J. (1997). *Acta Cryst.* D**53**, 240–255. CrossRef CAS Web of Science IUCr Journals

Otwinowski, Z. (1993). *Proceedings of the CCP4 Study Weekend. Data Collection and Processing*, edited by N. Isaacs, L. Sawyer & S. Bailey, pp. 56–62. Warrington: Daresbury Laboratory.

Otwinowski, Z., Borek, D., Majewski, W. & Minor, W. (2003). *Acta Cryst.* A**59**, 228–234. Web of Science CrossRef CAS IUCr Journals

Otwinowski, Z. & Minor, W. (1997). *Methods Enzymol.* **276**, 307–326. CrossRef CAS Web of Science

Otwinowski, Z. & Minor, W. (2001). *International Tables for Crystallography*, Vol. *F*, edited by M. G. Rossmann & E. Arnold, pp. 226–235. Dordrecht: Kluwer Academic Publishers.

Parsons, S. (2003). *Acta Cryst.* D**59**, 1995–2003. Web of Science CrossRef CAS IUCr Journals

Popov, A. N. & Bourenkov, G. P. (2003). *Acta Cryst.* D**59**, 1145–1153. Web of Science CrossRef CAS IUCr Journals

Ramagopal, U. A., Dauter, M. & Dauter, Z. (2003). *Acta Cryst.* D**59,** 868–875. Web of Science CrossRef CAS IUCr Journals

Ravelli, R. B. G. & McSweeney, S. M. (2000). *Struct. Fold. Des.* **8**, 315–328. Web of Science CrossRef CAS

Rice, L. M., Earnest, T. N. & Brunger, A. T. (2000). *Acta Cryst.* D**56**, 1413–1420. Web of Science CrossRef CAS IUCr Journals

Rossmann, M. G. (1979).* J. Appl. Cryst.* **12**, 225–238. CrossRef CAS IUCr Journals Web of Science

Rossmann, M. G., Leslie, A. G. W., Abdel-Meguid, S. S. & Tsukihara, T. (1979). *J. Appl. Cryst.* **12**, 570–581. CrossRef CAS IUCr Journals Web of Science

Schneider, T. R. & Sheldrick, G. M. (2002). *Acta Cryst.* D**58**, 1772–1779. Web of Science CrossRef CAS IUCr Journals

Gruner, S. M., Eikenberry, E. F. & Tate, M. W. (2001). *International Tables for Crystallography*, Vol. *F*, edited by M. G. Rossmann & E. Arnold, pp. 143–153. Dordrecht: Kluwer Academic Publishers.

Weik, M., Berges, J., Raves, M. L., Gros, P., McSweeney, S., Silman, I., Sussman, J. L., Houee-Levin, C. & Ravelli, R. B. G. (2002). *J. Synchrotron. Rad.* **9**, 342–346. Web of Science CrossRef CAS IUCr Journals

Weik, M., Ravelli, R. B., Kryger, G., McSweeney, S., Raves, M. L., Harel, M., Gros, P., Silman, I., Kroon, J. & Sussman, J. L. (2000). *Proc. Natl Acad. Sci. USA*, **97**, 623–628. Web of Science CrossRef PubMed CAS

Yang, F., Dauter, Z. & Wlodawer, A. (2000). *Acta Cryst.* D**56**, 959–964. Web of Science CrossRef CAS IUCr Journals

Yeates, T. O. (1997). *Methods Enzymol.* **276**, 344–358. CrossRef CAS PubMed Web of Science

© International Union of Crystallography. Prior permission is not required to reproduce short quotations, tables and figures from this article, provided the original authors and source are cited. For more information, click here.