editorial
Introduction to the virtual collection of papers on Artificial neural networks: applications in X-ray photon science and crystallography
aDepartment of Cell and Molecular Biology, Uppsala University, Husargatan 3 (Box 596), SE-751 24 Uppsala, Sweden
*Correspondence e-mail: tomas.ekeberg@icm.uu.se
Keywords: artificial intelligence; AI; machine learning; deep learning; neural networks.
Artificial intelligence (AI) is more present than ever, both in our society in general and in science. At the center of this development has been the concept of deep learning, the use of
(ANNs) that are many layers deep and can often reproduce human-like behavior much better than other machine-learning techniques.This development has been going on for a long time, and it is ambiguous to identify a specific starting date for it. Possible candidates might be the first ANNs in 1943 (McCulloch & Pitts, 1943), the popularization of backpropagation to train many layers in a network at once in 1985 (Rumelhart et al., 1986) and the application to identify human handwriting in 1989 (Le Cun et al., 1989).
Today the growth of this field is seemingly exponential, with major breakthroughs being reported almost every year. Some of the most notable examples are
(1) beating a professional player at the game of go (Silver et al., 2016),
(2) residual neural networks (ResNets) for image recognition (He et al., 2016),
(3) AlphaFold protein structure prediction (Jumper et al., 2021),
(4) ChatGPT and other large language models (LLMs) (https://gluebenchmark.com/leaderboard).
During the same time frame, deep learning has also seeped into our society in many less prominent cases such as
(1) automatic image processing of photographs from our cell-phone cameras,
(2) recommendations for music or movies on streaming services,
(3) optimization of server hall power efficiency.
It is certainly no understatement that AI has become a foundation for society as we know it today. And so far, the technology has mainly been developed by big tech companies.
With this knowledge, many scientists have started to investigate what these new tools could mean for their field of research. While some areas of science were fairly slow to adopt these methods, we are now seeing a rapid spread of AI applications in many different fields, and the contributions in this collection of articles from Journal of Applied Crystallography (https://journals.iucr.org/special_issues/2024/ANNs) manifest that observation for X-ray photon science and crystallography.
Most applications of AI are, however, not of the transformative kind, comparable to the effect that AlphaFold had on the field of structure prediction. Instead, we have examples such as preprocessing that gives smaller improvements to data quality, or automation of a mundane task that would otherwise have to be done by a researcher.
Classification. One of the first applications of machine learning was to classify data. Famous examples include recognizing handwritten digits and letters or identifying human faces. In photon science this is also a common use case, and recently we have, for example, seen it being used for femtosecond X-ray imaging patterns (FXI) (Assalauova et al., 2022), X-ray photon correlation spectroscopy (Timmermann et al., 2022) and serial femtosecond crystallography (Rahmani et al., 2023). This is often a convenient way to speed up a researcher's work by automating an otherwise labor-intensive task, by classifying a small set of patterns by hand and then training a machine-learning algorithm to classify a bigger dataset in a similar fashion.
Another example is from neutron diffraction (Hao et al., 2023) where a neural network approach was used not only to filter out relevant data but also to label regions of interest in each diffraction pattern.
A common critique towards neural network approaches is their black-box nature—that we usually have very limited understanding of the analysis that a trained network actually performs. A particularly interesting development is, therefore, the attempts to understand the internal workings of our networks. One such example is reported by Nawaz et al. (2023).
Data analysis. A more recent development is that neural networks take a more active role in data analysis in addition to classification. We see cases in the entire pipeline of data processing. A successful example is data preprocessing where, for example, missing regions of data can be recreated using deep neural networks. This has been shown both for small/wide-angle X-ray scattering (SAXS/WAXS) (Chavez et al., 2022) and FXI (Bellisario et al., 2022) experiments.
It is still fairly rare for neural networks to perform the bulk of the analysis. One such case is, however, a study of band gap structures in metal–organic frameworks where an ANN trained on a large set of simulated data was applied to experimental data with remarkable success (Gómez-Peralta et al., 2022). In another example by Lim et al. (2023), subsurface temperatures are estimated during laser melting comparing experimental data with ANN-based simulations.
Parameter estimation. A more common application is the use of neural networks to provide parameters to an analysis that is otherwise using conventional methods or to reduce the search space for an existing algorithm. Chitturi et al. (2021) use neural networks to estimate lattice parameters for X-ray powder diffraction data, which can reduce the search space for this time-consuming algorithm by more than a factor of 100. A similar example comes from X-ray reflectivity, where a neural network was used to estimate model parameters in a fraction of the time required for traditional methods (Mareček et al., 2022). Recently, a web-service called CrystalMELA was also released, which can estimate the using several machine-learning methods (Corriero et al., 2023)
In addition to direct applications, AI is mentioned by a vast number of articles, and it is clear that there is a curiosity and optimism about AI techniques within the field. The implementation of AI techniques is, however, most likely in its infancy. Most examples included here are the work of small talented teams of researchers using relatively modest hardware. The effects are already tangible, by speeding up research, identifying hidden patterns and freeing up time for researchers by automating tasks. Given the success so far, more effort and more computational power could potentially have the same transformative effect in our field as AlphaFold has had on protein structure prediction.
References
Assalauova, D., Ignatenko, A., Isensee, F., Trofimova, D. & Vartanyants, I. A. (2022). J. Appl. Cryst. 55, 444–454. Web of Science CrossRef CAS IUCr Journals Google Scholar
Bellisario, A., Maia, F. R. N. C. & Ekeberg, T. (2022). J. Appl. Cryst. 55, 122–132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chavez, T., Roberts, E. J., Zwart, P. H. & Hexemer, A. (2022). J. Appl. Cryst. 55, 1277–1288. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chitturi, S. R., Ratner, D., Walroth, R. C., Thampy, V., Reed, E. J., Dunne, M., Tassone, C. J. & Stone, K. H. (2021). J. Appl. Cryst. 54, 1799–1810. Web of Science CrossRef CAS IUCr Journals Google Scholar
Corriero, N., Rizzi, R., Settembre, G., Del Buono, N. & Diacono, D. (2023). J. Appl. Cryst. 56, 409–419. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gómez-Peralta, J. I., Bokhimi, X., García-Peña, N. G., Quintana-Owen, P. & Rodríguez-Gattorno, G. (2022). J. Appl. Cryst. 55, 1538–1548. Web of Science CrossRef IUCr Journals Google Scholar
Hao, Y., Feng, E., Lu, D., Zimmer, L., Morgan, Z., Chakoumakos, B. C., Zhang, G. & Cao, H. (2023). J. Appl. Cryst. 56, 519–525. Web of Science CrossRef CAS IUCr Journals Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. (2016). 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, https://doi.org/10.1109/CVPR.2016.90. Piscataway: IEEE. Google Scholar
Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., Tunyasuvunakool, K., Bates, R., Žídek, A., Potapenko, A., Bridgland, A., Meyer, C., Kohl, S. A. A., Ballard, A. J., Cowie, A., Romera-Paredes, B., Nikolov, S., Jain, R., Adler, J., Back, T., Petersen, S., Reiman, D., Clancy, E., Zielinski, M., Steinegger, M., Pacholska, M., Berghammer, T., Bodenstein, S., Silver, D., Vinyals, O., Senior, A. W., Kavukcuoglu, K., Kohli, P. & Hassabis, D. (2021). Nature, 596, 583–589. Web of Science CrossRef CAS PubMed Google Scholar
Le Cun, Y., Jackel, L. D., Boser, B., Denker, J. S., Graf, H. P., Guyon, I., Henderson, D., Howard, R. E. & Hubbard, W. (1989). IEEE Commun. Mag. 27, 41–46. CrossRef Web of Science Google Scholar
Lim, R. E., Mukherjee, T., Chuang, C., Phan, T. Q., DebRoy, T. & Pagan, D. C. (2023). J. Appl. Cryst. 56, 1131–1143. Web of Science CrossRef CAS IUCr Journals Google Scholar
Mareček, D., Oberreiter, J., Nelson, A. & Kowarik, S. (2022). J. Appl. Cryst. 55, 1305–1313. CrossRef IUCr Journals Google Scholar
McCulloch, W. & Pitts, W. (1943). Bull. Math. Biophys. 5, 115–133. CrossRef Google Scholar
Nawaz, S., Rahmani, V., Pennicard, D., Setty, S. P. R., Klaudel, B. & Graafsma, H. (2023). J. Appl. Cryst. 56, 1494–1504. Web of Science CrossRef CAS IUCr Journals Google Scholar
Rahmani, V., Nawaz, S., Pennicard, D., Setty, S. P. R. & Graafsma, H. (2023). J. Appl. Cryst. 56, 200–213. Web of Science CrossRef CAS IUCr Journals Google Scholar
Rumelhart, D., Hinton, G. & Williams, R. (1986). Nature, 323, 533–536. CrossRef Web of Science Google Scholar
Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel, T. & Hassabis, D. (2016). Nature, 529, 484–489. Web of Science CrossRef CAS PubMed Google Scholar
Timmermann, S., Starostin, V., Girelli, A., Ragulskaya, A., Rahmann, H., Reiser, M., Begam, N., Randolph, L., Sprung, M., Westermeier, F., Zhang, F., Schreiber, F. & Gutt, C. (2022). J. Appl. Cryst. 55, 751–757. Web of Science CrossRef CAS IUCr Journals Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.