scientific commentaries
Serial crystallography for the masses?
aESRF, 71 Avenue des Martyrs, Grenoble, 38000, France
*Correspondence e-mail: wright@esrf.fr
Keywords: serial crystallography; sparse data; reconstruction of diffraction intensity; EMC algorithm.
There are many important materials which do not form nice single crystals for X-ray diffraction experiments. When the crystal size is too small then powder diffraction is normally used instead. But what is the minimum crystal size needed to get three-dimensional single-crystal data? Usually this depends on instrumentation, but in this issue Ayyer et al. (2015) show that drastic improvements can still be made in software and algorithms.
A 100 µm crystal diffracts X-rays a million times more strongly than a 1 µm crystal, as diffracting power relates to crystal volume, not size. Signal intensity also depends on electron density and so crystals of dense heavy metals give the strongest X-ray scattering. Proteins are much more challenging; the unit cells are very large, the electron density is low, and large regions of the structure are disordered. For synchrotron radiation sources, a minimum crystal size of around 20 µm is realistic for many proteins (Sliz et al., 2003) with a theoretical lower bound of about 1.2 µm for good quality lysozyme crystals (Holton & Frankel, 2010). Radiation damage is the limitation that prevents smaller protein crystals from surviving long enough to collect a full three-dimensional dataset.
Neutze et al. (2000) suggested that a (FEL) would have enough X-ray to do away with crystals altogether. Single molecule structures should be possible provided the X-ray beam can be diffracted in the femtoseconds before the molecule is destroyed. Miao et al. (2001) showed the could be solved from single molecule data, and one of the first steps would be to orient many two-dimensional snapshots into a three-dimensional dataset. Similar to cryo-electron microscopy, reliably finding the particle orientation for each image needs sufficient intensity statistics. This challenge motivated Loh & Elser (2009) to invent the EMC algorithm. Simplistically, EMC assigns each two-dimensional projection the most likely orientation based on the current estimate of the three-dimensional data. The process is started by using noise for the three-dimensional data and this estimate is updated using the newly orientated projections. After several iterations the procedure can settle down on a self-consistent set of orientations. Using three-dimensional averaging gives a large boost in the signal to noise compared with pair-wise comparisons of single projections. Orientation finding turned out to be much easier for FEL data from crystals because existing software can index individual frames. The randomized orientations create a problem when the crystal symmetry is lower than the lattice symmetry which Liu & Spence (2014) recently overcame using a version of the EMC algorithm applied to peak intensities.
In this issue, Ayyer et al. have applied the EMC algorithm to finding snap shot orientations with a known and real experimental data. Their data only needed to be precise enough to find the most likely orientation, not for complete indexing, which greatly reduces the counting statistics and number of peaks needed. Only 48 photons per frame were needed for the algorithm to converge (Fig. 1). Accumulating the snap shots directly in three-dimensional is also convenient for subsequent integration and processing; Yefanov et al. (2014) also assembled snap shot data in three-dimensional and showed that diffuse scattering in between the Bragg peaks can be extracted. Using a three-dimensional frame accumulation seems like it might even increase resolution limits for conventional data that are weak at high angles but can be measured with some redundancy.
Orientation finding is a ubiquitous problem in crystallography, from the use of the rotation function in et al., 2014). In the current EMC algorithm an exhaustive search of all orientation space is performed and this search scales poorly with the resolution of the orientation grid. If this technique takes off then further optimizations may be expected. Ayyer et al. have shown the EMC algorithm works when the data were very weak and in a restricted orientation space. Assuming the computations will scale to full orientation space, the only problem is to collect such sparse data. This has only recently been possible with X-rays: a very fast detector which has no read noise is needed for these experiments. Having just one crystal in the beam at a time might also be difficult to confirm.
to indexing thousands of grains in a single rotation dataset (NervoBy using data from a laboratory instrument Ayyer et al. highlight that these revolutionary methods of serial crystallography do not need a FEL and they could make an impact on the synchrotron community in years to come. When a sample diffracts well as a powder but does not grow larger crystals, this will be a method of choice. This new way to obtain three-dimensional data could bring far more complex structures into reach for `powder' diffraction.
References
Ayyer, K., Philipp, H. T., Tate, M. W., Wierman, J. L., Elser, V. & Gruner, S. M. (2015). IUCrJ, 2, 29–34. CrossRef IUCr Journals Google Scholar
Holton, J. M. & Frankel, K. A. (2010). Acta Cryst. D66, 393–408. Web of Science CrossRef CAS IUCr Journals Google Scholar
Liu, H. & Spence, J. C. H. (2014). IUCrJ, 1, 393–401. CrossRef CAS PubMed IUCr Journals Google Scholar
Loh, N.-T. D. & Elser, V. (2009). Phys. Rev. E, 80, 026705. Web of Science CrossRef Google Scholar
Miao, J., Hodgson, K. O. & Sayre, D. (2001). PNAS, 98, 6641–6645. CrossRef PubMed CAS Google Scholar
Nervo, L., King, A., Wright, J. P., Ludwig, W., Reischig, P., Quinta da Fonseca, J. & Preuss, M. (2014). J. Appl. Cryst. 47, 1402–1416. CrossRef CAS IUCr Journals Google Scholar
Neutze, R., Wouts, R., van der Spoel, D., Weckert, E. & Hajdu, J. (2000). Nature, 406, 752–757. Web of Science CrossRef PubMed CAS Google Scholar
Sliz, P., Harrison, S. C. & Rosenbaum, G. (2003). Structure, 11, 13–19. Web of Science CrossRef PubMed CAS Google Scholar
Yefanov, O., Gati, C., Bourenkov, G., Kirian, R. A., White, T. A., Spence, J. H. C., Chapman, H. N. & Barty, A. (2014). Philos. Trans. R. Soc. B, 369, 20130333. CrossRef Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.