- 1. Introduction
- 2. The von Mises distribution
- 3. The generalized von Mises distribution and its equivalence with the Hendrickson–Lattman distribution
- 4. The first trigonometric moment of a circular probability distribution and the best Fourier synthesis
- 5. The first trigonometric moment of the GvM2 distribution
- 6. Conclusions
- References
- 1. Introduction
- 2. The von Mises distribution
- 3. The generalized von Mises distribution and its equivalence with the Hendrickson–Lattman distribution
- 4. The first trigonometric moment of a circular probability distribution and the best Fourier synthesis
- 5. The first trigonometric moment of the GvM2 distribution
- 6. Conclusions
- References
short communications
A note on the Hendrickson–Lattman phase probability distribution and its equivalence to the generalized von Mises distribution
aSchool of Biological Sciences, University of Auckland, Auckland, New Zealand
*Correspondence e-mail: rl.kingston@auckland.ac.nz
Hendrickson & Lattman [Acta Cryst. (1970), B26, 136–143] introduced a method for representing crystallographic phase probabilities defined on the unit circle. Their approach could model the bimodal phase probability distributions that can result from experimental phase determination procedures. It also provided simple and highly effective means to combine independent sources of phase information. The present work discusses the equivalence of the Hendrickson–Lattman distribution and the generalized von Mises distribution of order two, which has been studied in the statistical literature. Recognizing this connection allows the Hendrickson–Lattman distribution to be expressed in an alternative form which is easier to interpret, as it involves the location and concentration parameters of the component von Mises distributions. It also allows clarification of the conditions for bimodality and access to a simplified analytical method for evaluating the trigonometric moments of the distribution, the first of which is required for computing the best Fourier synthesis in the presence of phase, but not amplitude, uncertainty.
Keywords: crystallographic phase determination; probability density functions; circular statistics; computational crystallography.
1. Introduction
To enable determination of protein structures using X-ray crystallography, a variety of methods for experimental phase determination were developed and refined over several decades [see Hendrickson (2023) for a review]. All of these methods involved the systematic perturbation of Bragg diffraction from the crystal, by manipulating either the chemical composition of the crystal, the physical properties of the irradiating X-rays or both. A well understood feature of some approaches to experimental phase determination, including single and single wavelength is that a twofold geometric ambiguity in the phase results, even in the absence of error (Matthews, 1970; Vijayan, 1980; Dauter et al., 2002; McCoy & Read, 2010; Hendrickson, 2014). Hence the practical application of these phase determination procedures naturally generates bimodal phase probability distributions. This complexity must be captured in any mathematical function used to represent these probabilities. In addition, resolving the crystallographic for biological molecules experimentally often requires the combination of phase information from independent experiments.
The probability density function introduced by Hendrickson & Lattman (1970) addressed these issues. It has the form
Here (A, B, C, D) are the four coefficients of the distribution, which encode the phase information, and N is a normalization constant. Depending on the values of the coefficients, (1) may be either unimodal or bimodal. Most conveniently, when (1) is used to represent phase probabilities, independent sources of phase information can be combined through simple addition of the coefficients (A, B, C, D) because of the exponential form of the distribution.
Hence, (1), being both sufficiently flexible and numerically very convenient, became widely used to represent phase probability distributions in protein crystallography. We note that the Hendrickson–Lattman distribution is useful for modeling the phase probability distributions of acentric data, where the phase can take any value in the range 0–2π. For centric data, where there are only two phase possibilities, always separated by π, a discrete circular probability mass function provides the most straightforward descriptor.
Although the treatment of error in experimental phase determination has become increasingly sophisticated and is now generally based on the principle of ; Bricogne et al., 2003; McCoy & Read, 2010), the Hendrickson–Lattman distribution is still used to represent phase probability distributions in modern crystallographic software. Hence some clarification of its basic characteristics seems worthwhile.
with joint consideration of uncertainty in both amplitudes and phases (Read, 2003Hendrickson & Lattman (1970) briefly noted the similarities between their probability density distribution and the von Mises distribution, and the connection has been remarked on subsequently (Murshudov et al., 2011). However, to the best of our knowledge, these observations have not been systematically developed. Fully documenting the relation between the Hendrickson–Lattman and von Mises distributions and placing the procedures used by Hendrickson & Lattman (1970) within the framework of circular statistics is the purpose of this short review.
2. The von Mises distribution
The von Mises probability density function is central to circular statistics, being the circular analog of the Gaussian probability density function on a line, and its properties are consequently very well documented (Batschelet, 1981; Fisher, 1993; Mardia & Jupp, 1999; Jammalamadaka & Sengupta, 2001). Like the Gaussian, the von Mises distribution is a mirror symmetric mono-modal distribution, defined by two parameters (Fig. 1). μ is a location parameter. The function takes on its maximum value at μ, which is both the modal and mean value of the distribution. κ is a concentration parameter, named because as κ increases the distribution becomes more concentrated around μ. The von Mises probability density function is given by
where μ ∈ [0, 2π), κ ≥ 0, and I0 is the modified Bessel function of the first kind and order 0.
A simple extension of the von Mises distribution allows for multimodality, subject to symmetry restrictions (Mardia & Spurr, 1973; Batschelet, 1981). The multimodal von Mises probability density function is given by
where μ ∈ [0, 2π/n), κ ≥ 0 and n is a positive integer that specifies the number of modes. The modes of this highly symmetric distribution are separated by 2π/n, as depicted in Fig. 2 for the monomodal (n = 1), bimodal (n = 2) and trimodal (n = 3) cases.
3. The generalized von Mises distribution and its equivalence with the Hendrickson–Lattman distribution
The ordinary von Mises distribution [equation (2a), Fig. 1] is both unimodal and mirror symmetric, whereas its multimodal extension [equation (2b), Fig. 2] has both mirror and rotational symmetry. This limits applications. An important generalization of the von Mises distribution (Gatto & Jammalamadaka, 2007), which allows for both bimodality and asymmetry, is given by
where μ1 ∈ [0, 2π) and μ2 ∈ [0, π) are location parameters, and κ1 ≥ 0 and κ2 ≥ 0 are concentration parameters. The distribution (3) can be considered to arise from the multiplication of a unimodal and a bimodal von Mises distribution (Fig. 3), and is hence termed the generalized von Mises distribution of order 2 (the GvM2 distribution). Incorporating multimodal von Mises distributions of higher order into the product gives rise to an infinite series of probability distributions [see Gatto & Jammalamadaka (2007) and Gatto (2009) for context and commentary]; however, GvM distributions with order greater than two have found limited practical applications. The GvM2 distribution (3) appears to have been first proposed by Maksimov (1967) and its properties are now well studied (Yfantis & Borgman, 1982; Gatto & Jammalamadaka, 2007; Gatto, 2008, 2009; Salvador & Gatto, 2022a,b).
The normalizing constant G0 appearing in (3) ensures that the distribution is a probability density function, and is obtained by definite integration of the function over the unit circle. This integral cannot be evaluated in closed form, but can be written in terms of an infinite series expansion (Yfantis & Borgman, 1982; Gatto & Jammalamadaka, 2007) as
where In are the modified Bessel functions of the first kind and integer order n. The derivation of this result relies on the Jacobi–Anger expansion (Olver et al., 2010):
As the modified Bessel functions decrease rapidly to zero with increasing order (Oldham et al., 2009), accurate evaluation of the normalizing constant G0 using (4) can be achieved with only the first few summands of the infinite series.
The GvM2 distribution (3) can be symmetric or asymmetric, unimodal or bimodal, depending on its parameters (μ1, μ2, κ1, κ2) [see the literature (Yfantis & Borgman, 1982; Gatto & Jammalamadaka, 2007; Kato & Jones, 2010; Salvador & Gatto, 2022a) for demonstration and discussion]. If κ1 = 0, the GvM2 distribution [equation (3)] reduces to a bimodal von Mises distribution [equation (2b) with n = 2], whereas if κ2 = 0, it reduces to a monomodal von Mises distribution [equation (2a); equation 2(b) with n = 1]. If κ1 = 0 and κ2 = 0, the GvM2 distribution reduces to the uniform circular distribution (Yfantis & Borgman, 1982). The general conditions for bimodality of the GvM2 distribution are elaborated below.
As written, the connections between the GvM2 (3) and the Hendrickson–Lattman distribution (1) are not immediately seen. However the GvM2 distribution can be reparameterized (Gatto & Jammalamadaka, 2007) as follows.
If
then the GvM2 probability density function can be expressed as
where the constant K(λ1, λ2, λ3, λ4) is an appropriate transformation of the normalizing factor appearing in the denominator of (3),
Practically, the normalizing constant K(λ1, λ2, λ3, λ4) [equation (8)] can be evaluated using (4). Equations (6) have the form of a polar-to-Cartesian coordinate transformation. Parameters (μ1, μ2, κ1, κ2) can therefore be recovered from parameters (λ1, λ2, λ3, λ4) using
The reparameterized version of the GvM2 distribution (7) is clearly equivalent to the Hendrickson–Lattman probability distribution (1), with A = λ1, B = λ2, C = λ3, D = λ4 and N = exp(−K) = 1/(2πG0). The equivalence has been noted previously (Murshudov et al., 2011). The GvM2/Hendrickson–Lattman distributions belong to the exponential family of probability distributions, with (7) being the canonical representation of that family. The relationships between the two parameterizations of the Hendrickson–Lattman/GvM2 distribution are illustrated in Fig. 4. Some aspects of the distribution are easier to recognize when it is written in the order factorized form (3) rather than the expanded form of (1) or (7). For example, when μ1 approaches μ2, the GvM2 probability density function (3) approaches mirror symmetric, and is either unimodal or bimodal with peaks at antipodal positions [Fig. 4(a)] (Gatto & Jammalamadaka, 2007; Salvador & Gatto, 2022b). When both concentration parameters (κ1, κ2) become small, the distribution approaches uniform circular.
The order factorized form of the GvM2 distribution (3) also allows analysis of the conditions for bimodality of the distribution, which are of particular interest in crystallography. These conditions are most readily expressed in terms of two derived quantities: the scaled ratio of the two concentration parameters, ρ = κ1/4κ2; and the difference between the location parameters, δ = μ1 − μ2 mod(π). When ρ ≤ 1/2, the GvM2 distribution is always bimodal [see e.g. Figs. 3(c) and 4(a)]. When ρ ≥ 1, the GvM2 distribution is always unimodal [see e.g. Fig. 4(b)]. When 1/2 < ρ < 1, the GvM2 distribution may be either unimodal or bimodal ,dependent on the value of δ [see e.g. Figs. 4(c) and 4(d)], the detail being somewhat complex as it involves the roots of a quartic equation. Full details are given by Salvador & Gatto (2022a).
Hendrickson & Lattman (1970) actually used a functionally equivalent reparameterization of the probability distribution in their paper. To facilitate analytical integration of the distribution, and calculation of its normalizing constant N, they perform a change of variables, almost identical to (9), which effectively switches from the expanded form of the distribution (1) or (7) to the order factorized form (3). Allowing for the variations in definitions and notation, the result obtained for the normalization constant [equation (21a) of Hendrickson & Lattman (1970)] is the same as (4), up to a factor of 2π. Other integrations were performed that enable calculation of the best Fourier synthesis. Before considering these results, we first reframe the crystallographic problem being treated using the terminology of directional statistics.
4. The first trigonometric moment of a circular probability distribution and the best Fourier synthesis
As with probability distributions defined on the line, probability distributions defined on the circle can be characterized by a series of moments, which are obtained by integration of products of the distribution. However, these moments must be defined differently because of the circular periodicity. The trigonometric moments used to characterize circular distributions are named for the trigonometric functions that appear inside the integral. Unlike the regular moments, the trigonometric moments are complex-valued quantities. Though trigonometric moments of arbitrary order can be defined, we consider here only the first trigonometric moment which is defined as (Fisher, 1993; Mardia & Jupp, 1999; Jammalamadaka & Sengupta, 2001)
where f(θ) is the probability density function.
The quantities
are the components of the first trigonometric moment expressed in Cartesian form.
The quantities
are the components of the first trigonometric moment expressed in polar form.
In the field of circular statistics, the modulus (ρ1) of the first trigonometric moment is termed the mean length (sometimes the mean resultant length), while the argument (θ1) is termed the mean direction. For the ordinary von Mises distribution (2a), the mean length and mean direction are given by (Fisher, 1993; Mardia & Jupp, 1999; Jammalamadaka & Sengupta, 2001)
The first trigonometric moments for particular instantiations of the ordinary von Mises distribution are displayed in Fig. 1. The first trigonometric moment identifies the center of mass of a circular probability density function. The mean length, which can vary between 0 and 1, provides a useful measure of the dispersion of a unimodal distribution, such as the von Mises, though the interpretation is less straightforward for a potentially multimodal distribution such as the generalized von Mises.
Irrespective of the form of a circular probability distribution, the first trigonometric moment is of particular importance in crystallography. This is because, ignoring errors in the Fourier amplitudes, and given probability density functions for the phases, the best Fourier synthesis (in a least-squares sense) is obtained using the product of the first trigonometric moment and the measured Fourier amplitudes as coefficients. Therefore, the required coefficients are
where Fbest(hkl) represents the complex Fourier coefficients and |F(hkl)| represents the measured Fourier amplitudes. Hence the best Fourier synthesis is computed using the mean direction as the phase, while weighting the Fourier amplitudes by the mean length. This is the essential result given in the foundational paper by Blow & Crick (1959) [see the literature (Matthews, 1970; Vijayan, 1980; McCoy & Read, 2010) for discussion]. In crystallographic applications, the mean length has historically been termed the `figure of merit', and the mean direction the `best' or `centroid' phase (Matthews, 1970; Vijayan, 1980).
5. The first trigonometric moment of the GvM2 distribution
We now consider the analytical evaluation of the first trigonometric moment of the GvM2 distribution, which involves the integrals in (11). For the GvM2 distribution, no closed form solution for these integrals exists. However, as for the normalizing constant of the distribution [equation (4)], solutions can again be obtained that involve rapidly converging series expansions. For clarity, we restate the results obtained by Hendrickson & Lattman (1970), using the standard notation for the GvM2 distribution (3). The procedure described by Hendrickson & Lattman (1970), when applied to evaluate the integrals
yields
The proof again rests on the repeated use of the Jacobi–Anger expansion (5) and standard trigonometric identities. By making substitutions that reflect the variant re-parameterization of the probability density function used by Hendrickson & Lattman (1970),
then expressions (16) are seen to be equivalent to the equations appearing at the bottom of page 141 of the article by Hendrickson & Lattman (1970), up to a factor of 2π (noting the presence of a typographical error resulting in an erroneous change of sign when specifying the Bessel functions). The result (16) can also be obtained from the expressions for the trigonometric moments of arbitrary order, reported by Yfantis & Borgman (1982), who used an identical method of derivation.
Without loss of generality, we now consider the case where μ1 = 0. For any GvM2 distribution this can be achieved by an angular coordinate transformation. When setting μ1 = 0, expressions (16) for the components of the first trigonometric moment simplify to
where δ = μ1 − μ2 is the difference between the location parameters of the distribution. This is a computationally more convenient way to analytically evaluate the integrals, and is also the result given by Gatto (2009), made specific for the first trigonometric moment.
The first trigonometric moments for particular instantiations of the GvM2 distribution, evaluated using (18), are displayed in Fig. 4.
6. Conclusions
Directional data are ubiquitous in the physical and biological sciences, so it is probably unsurprising that the circular probability distribution developed by Hendrickson & Lattman (1970) was independently discovered and characterized by others. The exponential form of the Hendrickson–Lattman probability distribution confers many desirable properties. However, the Hendrickson–Lattman coefficients A, B, C and D lack straightforward meaning. Recognizing the equivalence of the Hendrickson–Lattman and GvM2 distributions allows reparameterization of the distribution to a more intuitive form that reflects the relationship with the von Mises distribution. It also allows a fuller appreciation of the general mathematical and statistical properties of the distribution, including the conditions for bimodality, and access to analytical procedures for computing all its trigonometric moments. There may be applications in crystallography where the inferential properties of the Hendrickson–Lattman/GvM2 distribution become important (i.e. when the parameters of the distribution need to be inferred, on the basis of computational procedures that effectively sample phase probabilities), and these have been studied in the statistical literature.
Acknowledgements
Open access publishing facilitated by The University of Auckland, as part of the Wiley–The University of Auckland agreement via the Council of Australian University Librarians.
Funding information
The following funding is acknowledged: Royal Society of New Zealand (grant No. 20-UOA-138).
References
Batschelet, E. (1981). Circular Statistics in Biology. London: Academic Press. Google Scholar
Blow, D. M. & Crick, F. H. C. (1959). Acta Cryst. 12, 794–802. CrossRef CAS IUCr Journals Web of Science Google Scholar
Bricogne, G., Vonrhein, C., Flensburg, C., Schiltz, M. & Paciorek, W. (2003). Acta Cryst. D59, 2023–2030. Web of Science CrossRef CAS IUCr Journals Google Scholar
Dauter, Z., Dauter, M. & Dodson, E. J. (2002). Acta Cryst. D58, 494–506. Web of Science CrossRef CAS IUCr Journals Google Scholar
Fisher, N. I. (1993). Statistical Analysis of Circular Data. Cambridge University Press. Google Scholar
Gatto, R. (2008). Stat. Comput. 18, 321–331. Web of Science CrossRef Google Scholar
Gatto, R. (2009). Statistics, 43, 409–421. Web of Science CrossRef Google Scholar
Gatto, R. & Jammalamadaka, S. R. (2007). Stat. Methodol. 4, 341–353. CrossRef Google Scholar
Hendrickson, W. A. (2014). Q. Rev. Biophys. 47, 49–93. Web of Science CrossRef PubMed Google Scholar
Hendrickson, W. A. (2023). IUCrJ, 10, 521–543. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Hendrickson, W. A. & Lattman, E. E. (1970). Acta Cryst. B26, 136–143. CrossRef CAS IUCr Journals Google Scholar
Jammalamadaka, S. R. & Sengupta, A. (2001). Topics in Circular Statistics. Singapore: World Scientific. Google Scholar
Kato, S. & Jones, M. C. (2010). J. Am. Stat. Assoc. 105, 249–262. Web of Science CrossRef CAS Google Scholar
Maksimov, V. M. (1967). Theory Probab. Appl. 12, 267–280. CrossRef Web of Science Google Scholar
Mardia, K. V. & Jupp, P. E. (1999). Directional Statistics. Chichester: John Wiley. Google Scholar
Mardia, K. V. & Spurr, B. D. (1973). J. R. Stat. Soc. Ser. B (Methodological), 35, 422–436. Google Scholar
Matthews, B. W. (1970). Crystallographic Computing, pp. 146–159. Copenhagen: Munksgaard. Google Scholar
McCoy, A. J. & Read, R. J. (2010). Acta Cryst. D66, 458–469. Web of Science CrossRef CAS IUCr Journals Google Scholar
Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367. Web of Science CrossRef CAS IUCr Journals Google Scholar
Oldham, K. B., Myland, J. C. & Spanier, J. (2009). An Atlas of Functions. Springer. Google Scholar
Olver, F. W., Lozier, D. W., Boisvert, R. F. & Clark, C. W. (2010). NIST Handbook of Mathematical Functions. New York: Cambridge University Press. Google Scholar
Read, R. J. (2003). Acta Cryst. D59, 1891–1902. Web of Science CrossRef CAS IUCr Journals Google Scholar
Salvador, S. & Gatto, R. (2022). Comm. Statist. Theory Methods, pp. 1–17. Google Scholar
Salvador, S. & Gatto, R. (2022b). Comput. Stat. 37, 947–974. Web of Science CrossRef Google Scholar
Vijayan, M. (1980). Computing in Crystallography, pp. 1901–1925. Bangalore: The Indian Academy of Sciences. Google Scholar
Yfantis, E. A. & Borgman, L. E. (1982). Comm. Statist. Theory Methods, 11, 1695–1706. CrossRef Web of Science Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.