research papers
Hirshfeld atom like
with alternative electron density partitionsaBiological and Chemical Research Centre, Department of Chemistry, University of Warsaw, Żwirki i Wigury 101, Warszawa, 02-089 Warszawa, Poland
*Correspondence e-mail: ml.chodkiewicz@uw.edu.pl
Hirshfeld atom R factor (e.g. much smaller than for refinements with different quantum chemistry methods, i.e. Hartree–Fock and coupled cluster) and therefore no single partition was clearly the best in terms of experimental data reconstruction. In the case of structural parameters the differences between the partitions are comparable to those related to the choice of other factors. We have observed the systematic effects of the partition choice on bond lengths and ADP values of polar hydrogen atoms. The bond lengths were also systematically influenced by the choice of electron density calculation methodology. This suggests that GAR-derived structural parameters could be systematically improved by selecting an optimal combination of the partition and quantum chemistry method. The results of the refinements were compared with those of neutron diffraction experiments. This allowed a selection of the most promising partition methods for further optimization of GAR settings, namely the Hirshfeld, iterative stockholder and minimal basis iterative stockholder.
is one of the most successful methods for the accurate determination of structural parameters for hydrogen atoms from X-ray diffraction data. This work introduces a generalization of the method [generalized atom (GAR)], consisting of the application of various methods of partitioning electron density into atomic contributions. These were tested on three organic structures using the following partitions: Hirshfeld, iterative Hirshfeld, iterative stockholder, minimal basis iterative stockholder and Becke. The effects of partition choice were also compared with those caused by other factors such as quantum chemical methodology, basis set, representation of the crystal field and a combination of these factors. The differences between the partitions were small in terms ofKeywords: Hirshfeld atom refinement; electron density partition; generalized atom refinement; GAR; HAR.
CCDC references: 2040066; 2042674; 2042675; 2042676; 2042677; 2042678; 2042679; 2042680; 2042681; 2042682; 2042683; 2042684; 2042685; 2042686; 2042687; 2042688; 2042689; 2042690; 2042691; 2042692; 2042693; 2042694; 2042695; 2042696; 2042697; 2042698; 2042699; 2042700; 2042701; 2042702; 2042703; 2042704; 2042705; 2042706; 2042707; 2042708; 2042709; 2042710; 2042711; 2042712; 2042713; 2042714
1. Introduction
Ongoing progress in experimental technique development in X-ray crystallography makes this method an excellent tool to observe aspherical electron density deformations that can be attributed to bond formation and other interactions. However, the simplest and most popular approach, in fact, the only one practically available for many decades is the independent atom model (IAM), which treats the crystal as a set of spherical atomic densities centred on the atomic nuclei. However, it does not take into account the aspherical nature of atomic electron densities. For this reason, IAM fails to correctly describe those aspects of molecular geometry which are influenced by aspherical electron density deformations, such as the positions and anisotropic displacement parameters of hydrogen atoms. Consequently, the bond lengths formed by hydrogen atoms are on average shorter by 0.1 Å compared with their benchmark values as reported by neutron diffraction experiments and anisotropic refinements of hydrogen atom thermal motions resulting in non-positive definite ADP values. Therefore, methods introducing the various models of aspherical atomic scattering factors were developed (Weiss, 1964; DeMarco & Weiss, 1965; Kurki-Suonio, 1968; Stewart, 1969, 1973; Hirshfeld, 1971; Hansen & Coppens, 1978). Unfortunately the most successful and popular of them, the multipole model proposed by Hansen & Coppens (1978), does not allow for free of hydrogen atom positions and ADP values (Hoser et al., 2009), except in certain special cases of high-resolution good-quality data (Zhurov et al., 2011; Woinska et al., 2019). This problem was overcome by use of the transferable aspherical atom model (TAAM) which takes advantage of the fact that the parameters of the multipole model are similar to those of atoms in similar chemical environments (Pichon-Pesme et al., 1995) and uses predefined sets of such parameters for This allows for free of hydrogen positions and leads to more accurate X—H bond lengths, as shown previously for a number of databanks of multipole model parameters (Bąk et al., 2011).
Performing Hirshfeld atom ; Capelli et al., 2014) which utilizes stockholder partitioning (Hirshfeld, 1977) of the electron density for a molecule in the crystal turned out to be an even more promising method as it avoids the atomic density transferability assumption used in TAAM and the limitations of the electron density model used in multipole formalism (Koritsanszky et al., 2011). This implements the atomic aspherical structure factors obtained from the Hirshfeld-partitioned electron density of the asymmetric unit/molecule/cluster in the crystal calculated by an iterative procedure with the effects of the crystal environment included via surrounding the central unit by a cluster of electric multipoles. The improved model of the aspherical atomic resulted in more accurate and precise of the hydrogen atom positions and considerable progress in the of hydrogen ADP values (Capelli et al., 2014; Woińska et al., 2016, 2017; Malaspina et al., 2017; Orben & Dittrich, 2014; Dittrich et al., 2017; Sanjuan-Szklarz et al., 2020), even for X-ray data of standard resolution. Therefore HAR was added to a group of methods for deriving ADP values for hydrogen atoms, including the `TLS+ONIOM' approach based on ab initio calculations (Whitten & Spackman), with the TAAM model (Dittrich et al., 2008), TLS-based analysis available via the web service SHADE (Munshi et al., 2008) and lattice dynamical models (Hoser & Madsen, 2016; 2017).
(HAR) (Jayatilaka & Dittrich, 2008HAR was applied to et al., 2019; Orben & Dittrich, 2014), of compounds that contain transition metals (Woińska et al., 2016; Bučinský et al., 2016, 2019; Malaspina et al., 2019) and refinements including relativistic effects (Bučinský et al., 2016, 2019; Malaspina et al., 2019). Initial work aimed at optimizing HAR for the of macromolecules is also available: HAR-ELMO (Malaspina et al., 2019) and fragHAR (Bergmann et al., 2020). A recent study of TAAM (K. Jha et al., 2020) using the same set of test systems, as in an analogous study of HAR (Woińska et al., 2016) revealed that HAR produced bond lengths slightly closer to those obtained from neutron diffraction than TAAM. The average bond length underestimation was 0.020 Å for TAAM and 0.014 Å for HAR. It should be noted that such results apply for specific TAAM parameterizations and specific HAR methodologies (defined by the quantum chemical method, basis set and representation of crystal field).
of anharmonic thermal motions (WoinskaNevertheless, HAR is still not a fully mature method since there are still many areas with potential for improvement including long computational times required for repeated quantum mechanical calculation; quality of hydrogen ADP values refined with HAR;
of structures other than those of molecular crystals (network structures, ionic crystals) may be suboptimal; of disorder is not yet properly handled; of structures containing heavy metals is difficult due to the limited choice of available basis sets and challenges related to application of relativistic methods; lack of a well established optimal combination of settings (including the quantum chemistry method, basis set and representation of crystal field).It must also be stressed that increasing the applicability of HAR, adapting the method to perform more challenging tasks such as OLEX2 (Dolomanov et al., 2009) in the pre-installed HARt interface, enabling simple access to the basic functionalities of HAR (Fugel et al., 2018). with HARt, similar to the classical version of HAR (Jayatilaka & Dittrich, 2008; Capelli et al., 2014) can be carried out against F, unlike IAM in OLEX2, which is based on F2. As far as treatment of macromolecular structures is concerned, the development of HAR-based methods is proceeding in two directions: database-related methods and fragmentation techniques. The first involves the HAR-ELMO method (Malaspina et al., 2019) which combines HAR with libraries of extremely localized molecular orbitals (Meyer & Genoni, 2018). This method was tested on a few small-molecule structures and proved capable of locating hydrogen atoms (in terms of bond lengths and ADP values) as accurately and precisely as traditional HAR at significantly lower computational costs. It was also successfully applied in the of two and the of crambin (for two X-ray datasets collected at different subatomic resolutions). The other fragmentation-related method was first implemented in the fragHAR method (Bergmann et al., 2020) using molecular fractionation with the conjugate caps method of fragmentation (Zhang & Zhang, 2003) in order to divide the molecule of interest into smaller fragments for which a wavefunction can be calculated with quantum mechanical methods. This method was implemented in the TONTO program (Jayatilaka & Grimwood, 2003) and tested on three oligopeptide crystal structures. It yields hydrogen positions and ADP values in statistical agreement with HAR; however, any interactions that involve hydrogen atoms must be given special attention during the fragment-selection process.
of macromolecular structures and increasing its popularity among users requires the creation of new software tools and/or the incorporation of the method in existing, commonly used programs dedicated to the processing of crystallographic data. A step in this direction was its implementation in a popular program for chemical crystallographyHAR is based on a model defined by computational chemistry methods (e.g. Hartree–Fock or density functional theory with BLYP functional) as well as the basis set and representation of the molecular environment in the crystal. Although there are recommendations to guide the choice of model settings (Capelli et al., 2014; Fugel et al., 2018), these are not yet well established. Most HAR refinements have been performed with the Hartree–Fock method or a density functional approach with BLYP functional, which are usually not considered methods of choice for accurate quantum mechanical calculation for molecular systems. Very recently, an application of more accurate quantum chemistry methods [second-order Møller–Plesset perturbation theory (MP2) and coupled cluster singles and doubles (CCSD)] have been tested (Wieduwilt et al., 2020); however, conclusions about the advantages of these methods over computationally cheaper methods were not discussed. It was observed that an agreement between experimental and HAR-derived structure factors for L-alanine improves in the order HF, BLYP, MP2 ≃ B3LYP, CCSD.
In this study we examine additional options for choosing the model. We test the effects of the choice of electron density partitioning on atomic contributions. Since we are no longer limited to the partition proposed by Hirshfeld, the resulting method can be thought of as a generalization of HAR; to distinguish this from HAR it will be referred to as generalized atom
(GAR).If the effects of partition choice on
accuracy are comparably important as those of the other settings in GAR and the newly introduced partitions are not clearly inferior compared with Hirshfeld partition then we can conclude that there is an important new dimension in the search for an optimal HAR-like model. In this work we aim to identify promising directions for such a search.Partitioning electron density into atomic contributions is closely related to popular chemical concepts such as net atomic charge and atoms in molecules. Many partition methods have been proposed: those related to Bader's quantum theory of atoms in molecules (Bader, 1990), to the Mulliken (1955) and Löwdin (1955) population analyses (among others), stockholder partitioning as proposed by Hirshfeld (1977), and related methods such as the iterative Hirshfeld method (Bultinck et al., 2007), the iterative stockholder partitioning method (Lillestolen & Wheatley, 2008), the minimal basis iterative stockholder (Lillestolen & Wheatley, 2008) and the DDEC6 method (Manz & Limas, 2016). Electron density partitions are also used for computational purposes [e.g. in the Becke scheme for numerical integration (Becke, 1988)]. In this preliminary test of the effect of electron density partitioning on HAR-like we tested partitions implemented in the quantum chemistry program HORTON (Verstraelen et al., 2017) which mainly includes partitions inspired by the one introduced by Hirshfeld (1977).
2. Methodology
2.1. Investigated structures
In this work, three organic systems were selected for testing: urea, oxalic acid dihydrate and 9,10-bis-diphenylthiophosphoranylanthracene·toluene (SPAnPS). Urea and oxalic acid dihydrate have been used in many studies on accurate refinements with aspherical atom models, and for electron density distributions in crystals (Stevens et al., 1979; Stevens & Coppens, 1980; Zobel et al., 1993; Krijn et al., 1988; Swaminathan et al., 1984; Gatti et al., 1994; Birkedal et al., 2004; Jayatilaka & Dittrich, 2008; Pisani et al., 2011; Wall, 2016). Urea and oxalic acid are small polar organic molecules, which makes testing them relatively easy with, in general, computationally demanding methods such as GAR. The polarity of these systems is an advantage when examining the differences between the Hirshfeld and iterative Hirshfeld partitions as they are expected to be more pronounced than in non-polar systems. Ionic systems would probably be even more appropriate for such comparison, but HAR methodology for these kinds of systems is not yet well established so was not examined. The third system, SPAnPS, contains substantially larger molecules with no polar hydrogen atoms. Each X-ray dataset has known neutron measurements at the same temperature that we measured. We used the following sources of datasets/structures: urea – neutron structure (Swaminathan et al., 1984) and X-ray data (Birkedal et al., 2004) both at 123 K, oxalic acid (Kamiński et al., 2014), and SPAnPS (Köhler et al., 2019). ADP values for oxalic acid neutron measurement were scaled isotropically because we noticed systematic differences in ADP values of non-hydrogen atoms. The scale factors were derived by the method of least squares (Blessing, 1995). One of possible explanations is a difference in true temperatures of diffraction experiments.
2.2. Partitions
The following partitions were examined in this study: the original stockholder partition proposed by Hirshfeld (H) (Hirshfeld, 1977) which was used in HAR, the iterative Hirshfeld (IH) (Bultinck et al., 2007), iterative stockholder (IS) (Lillestolen & Wheatley, 2008), the minimal basis iterative stockholder (MBIS) (Verstraelen et al., 2016) and the partition proposed by Becke (B) (Becke, 1988). Most of the tested methods are based on the stockholder partition of the electron density, which expresses the electron density ρ of an atom a at point r as
with a summation over all atoms in the system (indexed with subscript k), wk(r) is the spherical weighting function for the kth atom and Rk is the position of the kth atom. Such methods can be viewed as an extension of the original stockholder method proposed by Hirshfeld.
In the original Hirshfeld partition the function wk(r) corresponds to the spherically averaged atomic densities of the isolated atoms. Hirshfeld partitioning leads to relatively low partial charges (Davidson & Chakravorty, 1992), which has been considered to be a deficiency in the method (Bultinck et al., 2007). These low partial charges are not surprising since they are maximally similar to the isolated atoms under an information theory framework (Nalewajski & Parr, 2000).
The iterative Hirshfeld method is similar to Hirshfeld partitioning but also takes the atomic charges into account for the weighting function. The weighting function for a given atom is a combination of the electron densities of the isolated neutral atom and an isolated ion(s) of a given element. This combination is chosen in such a way that it reproduces the charge of the corresponding atom. This method produces considerably higher partial charges than Hirshfeld partitioning which can lead to problems [e.g. in the case of one highly polar oxide, calculation of the electron density of a nonexistant oxygen dianion was required (Verstraelen et al., 2013)].
The iterative stockholder method does not require supplementary atomic density gas-phase calculations. It uses spherically averaged atomic densities for the atoms as weight functions with the initial weight functions normalized to wk(r) = 1 for all atoms. While this method avoids some problems which appear in the Hirshfeld and iterative Hirshfeld methods, it has been shown (Bultinck et al., 2009) that the spherically averaged atomic densities resulting from this method can be not monotonically decaying, a counterintuitive result.
In the case of the minimal basis iterative stockholder method, the weighting function for a given atom is expressed using a minimal basis set of spherical Slater-type functions. This method is similar to the iterative stockholder method.
The partition proposed by Becke was designed to deal with three-dimensional integration in molecular systems and therefore it is not expected that the resulting atomic charges will correspond to chemical intuition. This partition is not based on stockholder-type partitions. Instead it is similar to Voronoi tessellation with adjustments for atomic sizes and `softened' boundaries (atomic densities are continuous).
2.3. Implementation
A locally modified version of OLEX2 (Dolomanov et al., 2009) was used in the refinements. It incorporated a development version of the DiSCaMB library (Chodkiewicz et al., 2018) into the olex2.refine module which allows for the application of form factors corresponding to aspherical atomic densities.
In general, GAR implementation is quite similar to HAR implementation (Capelli et al., 2014; Fugel et al., 2018). The information necessary for electron density calculation for a non-periodic molecular system representing a given is generated by computational chemistry software. This information incorporates either the set of molecular orbitals or the first-order reduced density matrix. The molecular system was built from molecule(s) comprising the but this can be increased in order to better represent the molecular environment in the crystal. The effect of the crystal field can be modelled by surrounding the studied system with a set of atomic electric multipole moments. Atomic electron densities corresponding to a chosen partition of molecular electron density are calculated and used in the computation of atomic form factors. The form factors are then used in least squares Since the leads to a new geometry, new quantum chemical calculations are run and a new least squares is performed. This procedure consisting of quantum chemical calculations followed by least squares is repeated until convergence criteria are met.
In practice, our implementation differs not only by allowing more electron density partitions, but also on many other points, among them the most important is probably Fobs2. Quantum chemical calculations are performed with GAUSSIAN16 (Frisch et al., 2016). The atomic multipole moments needed to represent the effect of the crystal field are calculated using the Hirshfeld partition, even if the atomic form factors are calculated for the other partitions. In this way the equal treatment of crystal field effects for refinements based on different partitions is preserved. Atomic multipoles are calculated in a self-consistent embedding scheme in which a newly calculated electron density is the source for new multipole moments which generate new representations of the crystal field, giving rise to new electron density results. Cycles of such calculations are performed until the differences between the components of the multipole moments are smaller than 0.003 a.u. Only point charges and dipoles are used. Charges representing dipoles are separated by 0.02 Å.
againstThe electron densites at molecular integration grid points were calculated with HORTON (Verstraelen et al., 2017), which reads in the first-order reduced density matrix from a Gaussian formatted check-point file. It also performs molecular electron density partitioning and prints out the atomic electron densities and the details of the molecular integration grid. This is implemented as a part of the DiSCaMB library which is responsible for the calculation of atomic multipoles and atomic form factors.
A Becke-type multicenter integration scheme for molecular integrals (Becke, 1988) is used with pruned grids as defined in HORTON (Verstraelen et al., 2017). A radial grid is generated using the power transform (r = axp) and a Lebedev–Laikov grid (Lebedev & Laikov, 1999) is used for angular integration. Form factor calculation involves the largest predefined grid in HORTON. Parameters of the grid are element specific, e.g. for carbon atoms 148 radial points are used and up to 1730 angular points.
Least squares Fobs2 as implemented in olex2.refine (Bourhis et al., 2015) with no use of additional SHELX-type parameters in the weighting scheme, i.e. the weights are defined as , where is the variance of the observed intensity. Absence of additional parameters in the weighting scheme allows for direct comparison of the discrepancy in R factors. The whole GAR procedure is finished when the difference in geometry after the least squares and the one used in the quantum chemical calculations is less than 0.001 Å for both the atomic positions and the lengths.
is performed against2.4. Reported statistics
In order to statistically assess the results of GAR refinements, we have compared discrepancy R factor values, the lengths of covalent bonds to hydrogen atoms and anisotropic displacement parameters for the hydrogen atoms. The difference between the values obtained from X-ray and neutron measurements are referred to as ΔR for bond lengths and ΔUij for ADP tensor components. Their average absolute values are calculated (〈|ΔR|〉 and 〈|ΔUij|〉, respectively) and also averaged (〈ΔR〉) in the case of ΔR. We also calculated the average ratio of the square of the difference to its variance (we reported the root of that value) referred to as the weighted root mean square difference for the bond lengths, wRSMD(ΔR), and for the ADP values, wRSMD(ΔUij). The statistics are defined as follows,
and
where the subscript X indicates X-ray values, N the neutron values while the angle brackets (chevrons) denote the average value of the expression in the brackets (averaged over atoms). It should be noted that the lower value of wRSMD is not an indicator that one method is better than another since it can happen that a method with higher accuracy and precision corresponds also to higher wRSMD, an example is provided further in the text where the effects of electron density partition choice on ADP values are discussed. We also used the average values of the S12 similarity index as introduced by Whitten & Spackman (2006). It is defined as S12 = 100(1 − R12), where R12 describes the overlap between the density distribution functions (p1, p2) for nuclei defined by two ADP tensors:
In order to identify patterns in bond lengths, the average ratio of the X-ray to neutron bond lengths 〈RX/RN〉 was calculated. In order to identify patterns in the ADP values, we compared the averaged ratios of the volumes of `vibrational' ellipsoids 〈VX/VN〉 [e.g. known from ORTEP (Johnson, 1965)]. This ratio was calculated by taking into account: (1) the volume of the ellipsoid is proportional to the product of the lengths of its semi-axes, (2) the semi-axes of the thermal ellipsoids are proportional to the eigenvalues of the ADP tensor in Cartesian coordinates and (3) the product of the matrix eigenvalues is equal to its determinant. Taking (1)–(3) together gives
where UXCart and UNCart are the X-ray and neutron measurement-based ADP tensors in the Cartesian coordinates, respectively.
When the average over the atoms or bonds is reported, the uncertainty indicated in brackets is given as the population standard deviation, defined as
3. Results
In order to compare the results of the GAR refinements with the different partitioning schemes, we focused on the structural parameters related to the hydrogen atoms, specifically the lengths of bonds involving hydrogen atoms and hydrogen ADP values, since accurate determination of these with X-ray
is more challenging than for heavier atoms. Unless stated otherwise, all results refer to those parameters.In order to put the analyzed differences between the experimentally derived X-ray and neutron bond lengths (ΔR) into context here are some related values. For example, the standard deviations for bond lengths from neutron measurements referenced in this work for N—H and O—H bonds lie in the ranges 2–3 and 2.5–6.3 mÅ for HAR (for B3LYP/cc-pVTZ), whereas in the case of C—H bonds in SPAnPS those numbers are 1–5 mÅ for neutron data and 3.4–5.7 mÅ for HAR. For example, X—H bond lengths from neutron diffraction data (Allen & Bruno, 2010) for functional groups are C(sp3)—O—H 0.970 (12), C(aryl)—O—H 0.992 (17) and O—C(sp2)—O—H 1.018 (22) Å, and the differences between those values are 22, 26 and 48 mÅ. The analogous values for HAR (Woińska et al., 2016; HAR with BLYP/cc-pVDZ) are (data for maximum resolution) 0.953 (28), 0.965 (32) and 0.983 (35) Å and the differences between the HAR and the neutron values (ΔR) for maximum available resolution are −17, −27 and −35 mÅ (however, they are smaller for 0.8 Å resolution: −8, −8, −32 mÅ). In the case of IAM the differences are much larger: −122, −147 and −132 mÅ. Although the average distances for HAR are much closer to neutron ones than those from IAM, there are still considerable differences, comparable to the differences in O—H bond lengths for different functional groups. This can be partially explained by the fact that different sets of molecules were used for evaluation of the averages for HAR and neutron data. On the other hand, the average HAR bond lengths reported by Woińska et al. (2016) seem to be systematically shorter than neutron bond lengths in the case of polar hydrogen atoms. We should however remember that these are shorter for HAR using DFT with the BLYP functional and cc-pVDZ basis set. HAR with Hartree–Fock produces longer bonds for polar hydrogens [see the results in the work by Capelli et al. (2014) or Wieduwilt et al. (2020)] as well as higher R factors. The problem of the choice of optimal settings for HAR is still far from exhaustively explored.
3.1. Testing factors other than electron density partition
Before discussing the effects of the electron density partitioning method, we will describe the results of tests involving other components of the model including the quantum chemistry method, the representation of the crystal field and the basis set, thereby allowing a comparison between the effects of the electron density partition and the effects related to the other settings (which were not tested in HAR against Fobs2). The tests of these factors were performed on urea and oxalic acid since, as relatively small systems, they are well suited for testing computationally demanding settings such as post-Hartree–Fock methods or quantum mechanical representation of molecular surrounding.
These factors were tested with the Hirshfeld partition only. Unless stated otherwise, electron density was calculated using B3LYP, the cc-pVTZ basis set, and the crystal field was represented by atomic point charges and dipoles located at the atoms in molecules with at least one atom within 8 Å of any atom of the molecule for which the wavefunction is calculated. Those settings were selected on the basis of results from the initial phase of refinements for this section. Ideally when testing one of the components of the model, one would use the optimal setting for the other components. This is not possible in practice and not always necessary. At some point, higher levels of theory ceased producing improvements in the results. Little gain has been observed upon switching from the cc-pVTZ to the cc-pVQZ basis set, suggesting that the cc-pVTZ set is more than adequate. In the case of crystal field representation, a model with point charges seemed to produce results of similar quality to those from the more expensive model. B3LYP was selected as the quantum chemical method for the tests since it belonged to the set of methods (B3LYP, MP2, CCSD) which gave the best R factors in the initial tests and the two other methods are much more computationally demanding.
The results of the tests are presented in Table 1. Urea and oxalic acid dehydrate structures have five unique hydrogen atoms (see Fig. 1), all bonded to electronegative atoms (N and O), referred to throughout the text as polar hydrogen atoms. Values related to the structural descriptors in Table 1 are given as an average over those atoms. Values of the descriptors are given separately for urea and oxalic acid in Table S1 of the supporting information. Individual values of the structural parameters are shown in Figs. 2–5.
3.1.1. Basis set
The effect of the choice of basis set on HAR was tested with the family of correlation-consistent basis sets developed by Dunning and coworkers (Dunning, 1989; Kendall et al., 1992; Woon & Dunning, 1993), from the smallest to the largest: cc-pVDZ, cc-pVTZ, cc-pVQZ. Each of the listed basis sets roughly doubles the number of functions of the previous one, making wavefunction calculations considerably slower since computational time formally scales as N4 (where N is the number of base functions) in the case of the least computationally expensive methods used.
Similar to the earlier tests of HAR (Capelli et al., 2014), it appears that the cc-pVTZ basis set is sufficient since switching to cc-pVQZ brings only a small reduction in wR2 [≤0.02 p.p. (percent point)] and relatively small changes in the structural parameters [see Table 1 and Figs. 2(a)–(c)]. Switching from cc-pVDZ to cc-pVTZ leads to a much larger reduction in wR2 (≤0.3 p.p.). The cc-pVTZ basis set is also visibly better than the cc-pVDZ basis set in terms of the similarity between the X-ray and neutron determined ADP values in the case of oxalic acid [see Figs. 2(b)–2(c)], but not in the case of urea. Some patterns can be observed for volumes of thermal ellipsoids: for all five hydrogen atoms those derived with cc-pVDZ are smaller than those obtained with cc-VTZ [Fig. 2(d)]. Discrepancies in bond lengths were especially visible for O1—H1 bond in oxalic acid [ΔR 8(6) mÅ for cc-pVTZ and 24 (6) mÅ for cc-pVDZ], which is involved in a very strong hydrogen bond (O1—H1⋯O3, H1 to O3 distance 1.423 Å). Results for the other bonds alone do not suggest that cc-pVTZ is superior to cc-pVDZ for estimating bond lengths. Also the results in the work by Capelli et al. (2014) do not show that the use of a larger basis set (cc-pVTZ) leads to better bond lengths (for HAR with Hartree–Fock and for DFT with the BLYP functional). In that work it was also observed that cc-pVTZ leads to better ADP values. In summary, switching from cc-pVDZ to the larger basis set cc-pVTZ seems to improve the ADP values. In the case of one of the bonds in this study it also significantly improved the bond length, but it was unclear if this was an isolated case or if such improvement can be expected in certain situations.
3.1.2. Quantum chemistry method
The following methods were compared: DFT with B3LYP and the BLYP functional, Hartree–Fock(HF), and the post-Hartree–Fock methods: MP2 and CCSD. The same set of quantum chemistry methods was tested in refinements for L-alanine (Wieduwilt et al., 2020); however, effects of the crystal field were not taken into account in that work. We can roughly order the accuracy of the methods for calculating the energies and corresponding properties in the following way: HF < MP2 ≤ CCSD and BLYP < B3LYP, also MP2 ≃ B3LYP. A general perception of the accuracy of the quantum chemistry method is reflected in the values of the discrepancy factor wR2, where HF gave the highest values, greater than B3LYP by ≤0.51 p.p. Similar trends were observed in the work by Wieduwilt et al. (2020); however, in the current work MP2, CCSD and B3LYP gave similar agreement factors (measured as wR2) whereas in the other study CCSD gave superior results. The higher wR2 values do not automatically translate into higher discrepancies between X-ray and neutron structural parameters (e.g. HF produces relatively good bond lengths in this work).
It has been observed that the choice of quantum chemistry method in HAR has a systematic effect on the lengths of bonds to hydrogen. For example, Hartree–Fock of Gly-L-Ala produces systematically too long and BLYP too-short bonds (Capelli et al., 2014). Too-short N—H bonds (up to 36 mÅ) for with BLYP were also reported for carbamazepine (Sovago et al., 2016). We have also observed some trends in bond lengths. For all bonds, they can be ordered in the following way (average X-ray value minus the neutron value in mÅ given in parentheses): BLYP (−12.5) < B3LYP (−8) < CCSD (−3.6) ≤ HF (4.2). The same ordering was also observed for L-alanine (Wieduwilt et al., 2020) for all polar bonds to hydrogen (in the –NH3+ group). Although the differences are sometimes within experimental uncertainty, the fact that the order of the bond lengths can be observed for all bonds [Fig. 3(a)] suggests that this is not an artefact but a real trend in the bond length values. Some of the differences are quite substantial, the largest one, between BLYP and HF, takes, on average, a value of 17 mÅ.
In terms of bond length accuracy (see 〈|ΔR|〉 in Table 1), BLYP produces the largest discrepancies in bond lengths [12 (6) mÅ on average], B3LYP smaller [8(5) mÅ] and Hartree–Fock and post-Hartree–Fock methods the smallest (3.5–4.6 mÅ). Different conclusions on relative accuracy could be drawn from results for Gly-L-Ala (Capelli et al., 2014), where the discrepancies for polar bonds for BLYP and Hartree–Fock were quite similar (on average 11 versus 14 mÅ). However, those results are not in contrast with the observation that post-Hartree–Fock methods produce relatively good bond lengths and that B3LYP produces better bond lengths than BLYP. Certainly more tests are required to establish relative accuracy of bond length estimation with various quantum chemistry methods, but it can already be concluded that BLYP leads to too-short bond lengths.
Clear assessment of relative accuracy is also not possible for ADP determination. Some of the worst values of ADP accuracy descriptors are associated with the HF method [Figs. 3(b) and 3(c)], which also gave the worst ADP in terms of 〈|ΔUij|〉 and 〈S12〉 (Table 1). Also for Gly-L-Ala (Capelli et al., 2014) HF-derived ADPs were worse than those from BLYP in terms of 〈|ΔUij|〉. However, the evidence for the inferiority of the ADP values from HF calculations in the current work is not strong, hence it is probably not possible to draw such a conclusion on the basis of visual inspection of 〈|ΔUij|〉 for individual atoms [Fig. 3(b)]. In the case of S12 [Fig. 3(c)] there is one atom with a much higher S12 value for HF than for other methods (S12 = 8.7 versus <3) – in oxalic acid, H1 which participates in a very strong hydrogen bond. Interestingly, the large 〈|ΔUij|〉 in urea from HF (0.0078 for HF versus 0.0044 Å2 for MP2) does not translate into a visibly larger wRMSD (1.9 versus 1.7, see Table S1). This is caused by the fact that standard deviations for HF-derived Uij are about 50% larger than those for MP2 (although they are quite similar in the case of oxalic acid). No trends in the volumes of thermal ellipsoids have been observed [Fig. 3(d)].
3.1.3. Representation of molecular environment
The most common approach applied in HAR uses point multipoles. It is clear that the lack of such representation leads to an inferior structural model in terms of the averaged discrepancies in both bond lengths and ADP values, and a larger wR2 for the tested systems (see Table 1). For all hydrogen atoms, ADP value agreement factors 〈|ΔUij|〉 and S12 for models with crystal field representations (CFR) were close to or better than those for models with no such representation [Figs. 4(b) and 4(c)]. Volumes of thermal ellipsoids were larger for models with no CFR [Fig. 4(d)]. For bond lengths, the largest discrepancies were also generated with models with no CFR [Fig. 4(a)]. Significant effects for neglecting strong intermolecular interactions were also observed in HAR refinements with fragmentation (fragHAR) for (Bergmann et al., 2020).
Typically in HAR only the molecules/ions constituting the et al., 2018) that the accuracy of HAR may improve when molecules/ions surrounding the central part are treated at the quantum mechanical level. We have tested two variants of this approach. In one, quantum mechanical calculations were performed on a cluster of molecules including the central part (urea or oxalic acid with two neighbouring water molecules) and molecules involved in hydrogen bonds with the central part, which lead to a cluster of 56 atoms in the case of urea, and 58 atoms in the case of oxalic acid. This variant is referred to as `qm cluster smaller' in Fig. 4. Another variant involved those molecules which are up to 3.5 Å from the central part. Corresponding clusters included 88 atoms in the case of urea and 136 atoms for oxalic acid. This variant is referred to as `qm cluster larger' in Fig. 4. The clusters were also surrounded by point multipoles (using an 8 Å threshold). Replacing the point-charge values and dipoles with explicit quantum mechanical representations of the surrounding molecules did not generate a visible improvement, but significantly increased the computational cost of It was suggested (Capelli et al., 2014) that such a representation could improve the accuracy of polar N—H bond lengths and its lack could be a reason why that accuracy was lower than for C—H bonds. Our results do not support such a supposition; however, it seems quite probable that an explicit quantum mechanical representation of neighbouring molecules would be advantageous for systems with very strong interactions. Oxalic acid dihydrate has a very strong hydrogen bond (O1—H1⋯O3), but it was always treated at the quantum mechanical level in this work. This is related to the technical aspects of the implementation (all components of the have to be represented in the same quantum mechanical calculations).
(hereafter referred as the `central part') are treated at a quantum mechanical level. It was reported (Fugel3.1.4. Combination of less expensive HAR settings
In this study, we have also examined a combination of settings which are computationally less expensive than B3LYP/cc-pVTZ with surrounding multipoles, which is used as a reference model in this paragraph. We have also performed TAAM UBDB data bank (Volkov et al., 2007; Dominiak et al., 2007; Jarzembska & Dominiak, 2012; Kumar et al., 2019) using a locally modified version of OLEX2. The reference model clearly gave a better agreement factor than the computationally less expensive models (wR2, see Table 1). It also outperformed them in terms of accuracy for ADP values in terms of 〈|ΔUij|〉 and S12 (see Table 1), those values are also consistently relatively low for the method for all hydrogen atoms [see Figs. 5(b) and 5(c)].
with theInterestingly the combination of the quantum chemistry method which led to the highest wR2 – Hartree–Fock – with the smallest basis set tested – cc-pVDZ – led to the best bond lengths. This seems to be the result of two systematic effects: the Hartree–Fock method giving slightly too-long bonds combined with a smaller basis set which, in the case of Hartree–Fock, leads to shortening of the bond (which can be observed for all bonds with polar hydrogen atoms, see Fig. S2 of the supporting information). Similarly in the work by Capelli et al. (2014) it was observed that the smaller basis set does not lead to inferior bond lengths compared with the larger one for the Hartree–Fock method and average bond lengths are smaller for this basis set.
In the case of urea, lower quality HAR models gave results comparable to TAAM(UBDB) [worse in the case of HAR with HF/cc-pVDZ(−), see Table S1]. In the case of oxalic acid dehydrate, with very strong hydrogen bonds, TAAM(UBDB) gave clearly worse results [see Table S1 and Fig. 5(a)]. We have also included structural models based on the standardized neutron bond lengths in the comparison. They are in relatively good agreement with those from neutron experiments except for the H1 atom in oxalic acid involved in a very strong hydrogen bond, for which the discrepancy reaches 54 mÅ (which is still less than 73 mÅ in the case of TAAM).
3.2. Electron density partition
In terms of wR2 statistics, the differences between the partitions are quite small (see Table 2), maximally 0.07 p.p., which was much less than between models using the cc-pVTZ and cc-pVDZ basis sets (maximum 0.3 p.p.) or between the HF and B3LYP methods (maximum 0.51 p.p.) or – to a lesser extent – between the models with and without surrounding multipoles (up to 0.15 p.p.). The differences were relatively small despite quite large differences in the atomic charge values (see Table 3), e.g. IH partition gave a charge of 0.53 on the oxalic acid H1 atom while the H partition gave 0.12, which means that for H partition the atoms carry almost twice as many electrons as for IH partition (0.88 e versus 0.47 e). Usually the absolute values of the charge for IH, IS and MBIS are significantly larger than those for H and B in the case of polar hydrogen atoms.
|
|
Standard uncertainties (SU) for bond lengths (σbond) and ADP values (σADP) varied significantly between partitions (see Table 4). The highest values of σbond for the covalent bonds to hydrogen were observed for refinements with iterative Hirshfeld partition. A similar situation was found for the hydrogen ADP values. The smallest SU values were obtained for the B partition. Those differences were more pronounced for the polar hydrogen atoms (i.e. in urea and oxalic acid), e.g. the average σbond in oxalic acid is 8.6 mÅ for IH and 2.7 mÅ for B. For SPAnPS (the non-polar hydrogens), those values were 3.6 and 2.6 mÅ, respectively.
|
In the case of SPAnPS, larger discrepancies were observed for atoms with larger ADP values and/or those bonded to carbon atoms with larger ADP values and/or with higher contributions from anharmonic terms in their atomic displacement descriptions. This effect is shown in Table 5 for Hirshfeld partitions (see Table S2 for data for all partitions) in which the statistics for four groups of hydrogen atoms in SPAnPS are presented (see Fig. 6): (1) bonded to the carbon atoms for which no anharmonic motion was refined in the work by Köhler et al. (2019), (2) other atoms in the larger molecule, (3) aryl hydrogens in toluene and (4) methyl hydrogens in toluene. There was a very clear increase in 〈|ΔR|〉 for subsequent groups; when hydrogen atoms and their bonding partners had larger ADP values and the anharmonic displacement effects were more pronounced, 〈|ΔR|〉 was larger. Similar patterns could be observed for 〈|ΔUij|〉, which was similar for the two groups of hydrogen atoms present in the larger molecule and much larger for the hydrogen atoms in toluene, especially in the methyl group. The larger absolute error in ADP values and bond lengths for atoms with larger ADP values was an expected observation. The more interesting one, however, is a clear increase of the ratio of X-ray to neutron bond lengths (see RX/RN in Table 5), which rise in the following way: 0.995, 0.998, 1.000, 1.025. One of the possible reasons could be an effect introduced by convolution approximation which is specific to X-ray models. The SPAnPS example suggests that, in general, the discrepancies resulting from large ADP values and anharmonic effects could be much larger than those related to electron density partitions.
|
Only atoms of group (1) were used for further analysis in SPAnPS (see Table 6) in order to avoid additional discrepancies between X-ray and neutron results which can be expected for atoms with possibly higher contributions from anharmonic effects and/or with large ADP values. Standard uncertainties for the bond lengths for this group are 2.8–4.6 mÅ for X-ray and 1–1.5 mÅ for neutron data. In this case, 〈|ΔR|〉 for all partitions are very similar, about 5 mÅ, and the differences in 〈|ΔR|〉 between the partitions did not exceed 1 mÅ and were smaller than the population standard deviations (∼3 mÅ), indicating that all partitions gave C—H bond lengths of similar accuracy for that structure. While wRMSD values in the 1.2–1.56 range did not give a clear indication that there was a statistical difference between the X-ray and neutron bond lengths, all GAR-derived bonds were shorter than the neutron ones suggesting that this is a systematic effect.
|
The situation is quite different in the case of urea and oxalic acid, which have only polar hydrogen atoms (see Tables 2 and 7). Systematic differences between the partitions could be observed. In all cases the bond lengths can be ordered in the following way: B < H < IS, MBIS, IH [see Fig. 7(a)]. IS produced similar bond lengths to MBIS. For N—H bonds (urea) HI also gave similar bond lengths except for O—H (oxalic acid), which were all longer by about 5 mÅ than for IS. The differences in the average bond length between the X-ray and neutron measurements for those bonds were B −18.2, H −8.3, IH 1.7, IS −3.1 and MBIS −3.6 mÅ (see Table 7).
|
In terms of bond length, the accuracy for polar hydrogen atoms in MBIS, IS and IH are very similar [4.4 (13), 5.0 (33) and 5.5 (37) mÅ], H is slightly worse [8(5) mÅ]. The differences between MBIS, IS and IH appear to be too small relative to the spread of the results (measured as population standard deviations, see Table 7) to conclude on the superiority of any of the methods in bond length estimation. H partition probably produces worse results, but more research is needed to justify this statement since the difference between the methods is not large compared with the error spread for each method (see Table 7) and standard uncertainties of bond lengths. We can clearly point to B as an inferior partition in terms of ΔR (see Fig. S3 and Table 7). Interestingly, it is comparable to other methods in the case of urea but much worse in the case of oxalic acid.
Reported values of wRMSD provide information on the differences relative to its standard deviations. The value for the Hirshfeld partition (2.1) borders the value for which we can conclude that the bond lengths for this partition are statistically different from those obtained from neutron diffraction, in this case they are shorter.
In the case of ADP values, a clear trend could be observed for the volumes of thermal ellipsoids (see the 〈VX/VN〉 statistics in Table 2). For all individual hydrogen atoms in all of the tested systems, the volumes can be ordered in the following way: B < H, IS, MBIS < IH. This regular difference is especially striking in the case of the polar hydrogen atoms [see Fig. 7(d)]. Iterative Hirshfeld produces worse ADP values then the other methods, in terms of both 〈|ΔUij|〉 and S12 statistics [see Figs. 7(b) and 7(c)]. ADP values from IH are too large, whereas B ADP values tend to be too small (see 〈VX/VN〉 values in Table 2). Data for IH and H are good examples of the situation when values with higher accuracy and precision, in this case for H in terms of |ΔUij| (Table 2 and 7) and σUij (Table 4), can at the same time correspond to higher wRMSD values (Table 2).
With B producing inferior bond lengths, IH inferior ADP values and H probably overly short bond lengths, MBIS and IS seem to be the most promising methods. When comparing the partition methods we should bear in mind that the values are reported for a particular method of computational chemistry – DFT with B3LYP functional – and could differ for other methods. Recommendations for GAR settings should be given for a set of settings (e.g. IH with B3LYP/cc-pVTZ and point multipoles for crystal field representation) rather than for each setting separately (e.g. recommendation for IH without specifying the other settings). In principle, finding such an optimal set of settings would require an exhaustive search over all combinations of the settings. In practice we can try to identify the most promising combinations without exhaustive search. For example, IH produces too-large ADP values for B3LYP and replacing B3LYP with other quantum chemistry methods probably cannot solve this problem since for these methods no systematic trends were observed for thermal ellipsoids volumes. Similarly in the case of B partition. The most promising partitions are therefore H, IS and MBIS. In the case of quantum chemistry methods we can probably eliminate BLYP as it seems to produce too-short bonds and there is probably no partition among the `promising' ones that can correct for that. Also Hartree–Fock would not be a good first choice since it produces clearly inferior agreement factors and this cannot be changed with a different partition, since partitions have a very small effect on the agreement factors. Therefore in the case of quantum chemistry methods B3LYP and post-Hartree–Fock methods seem to be the most promising. Hartree–Fock is also certainly worth further testing as it gave relatively accurate bond lengths in this work despite high wR2 agreement factor values.
From a practical point of view, refinements with IH partitions turned out to be the slowest to converge in terms of the number of wavefunction calculations required. For SPAnPS and urea, we observed an oscillatory behaviour of some structural parameters when comparing the results of consecutive least square refinements. To account for this, we averaged the structural parameters from the last two least squares refinements and averaged the atomic form factors from the last two calculations of the wavefunction, which led to significantly improved convergence (see Fig. S5).
4. Conclusions
HAR is a wR2 agreement factor, the differences caused by the electron density partitioning scheme were much smaller than those introduced by the choice of basis set (e.g. cc-pVDZ versus cc-pVTZ), the method of wavefunction calculation (e.g. HF versus B3LYP), or the difference between the refinements with and without crystal field representation (in the case of polar molecules). Therefore none of the partitions were clearly the best in terms of experimental data reconstruction. In the case of the structural parameters, the differences between the partitions were comparable to those caused by changing the other settings. Since the results depend on a combination of GAR settings, it is possible to asses such a combination and not an individual component of the GAR model. In principle, finding such an optimal combination of the settings would require an exhaustive search over all the possible combinations of the settings.
technique that allows for the determination of accurate structural parameters for hydrogen atoms from X-ray diffraction data. A generalization of HAR (GAR) was introduced in this work. Aside from the Hirshfeld electron density partition used in the original version of HAR, other partitions were applied: Becke, Hirshfeld, iterative Hirshfeld, iterative stockholder and the minimal basis iterative stockholder. The effects of the electron density partitioning choice on GAR-like were tested on the structures of two small polar organic molecules (urea and oxalic acid dihydrate) and a larger one (SPAnPS) with no polar hydrogen atoms. The effects of partition choice were also compared with those caused by other settings of GAR such as (1) the quantum chemistry method (Hartree–Fock, DFT with BLYP and DFT with B3LYP functional, CCSD, MP2), (2) the basis set, (3) representation of the crystal field and (4) some combination of these factors. Since the set of tested structures is rather small, the results should be treated as suggestive of the most promising HAR-like settings as well as an indication of the most promising directions for a further search for the optimal choice of such settings, and not as a concluding recommendation. In all of the GAR refinements the hydrogen atom positions and anisotropic displacement parameters were refined freely and these refinements led to positively defined ADP tensors. The discrepancies between GAR and neutron-derived bond lengths to hydrogen atoms were much smaller than those which can be expected for the classical model (IAM) with spherical electron densities – the maximal average difference for GAR was 26 mÅ and minimal 3.3 mÅ, compared with about 100 mÅ which can be expected for IAM. In terms of theOur analysis does not clearly show which combinations of setting should be recommended for GAR. Among the tested partitions the most promising ones for the further search of optimal settings are Hirshfeld partition, iterative stockholder and minimal basis iterative stockholder. Among the quantum chemistry methods DFT with BLYP functional seems to be the least likely to be used in an optimal combination of settings.
We have analyzed structural parameters related to hydrogen atoms. Many systematic effects related to choice of GAR settings were observed, especially in the case of polar hydrogen atoms. For example, when comparing the results of the refinements using different partitions, in all of the cases bond lengths can be ordered in the following way: Becke < Hirshfeld < other partitions. A clear trend can be also observed in the thermal ellipsoid volumes – Becke partition, for all of the atoms in all of the tested structures, produces the lowest values whereas the iterative Hirshfeld produces the largest values. The effects of a particular partition on the
could be summarized as follows (when used with the B3LYP functional and cc-pVTZ basis set):· The Becke partition was expected to produce inferior results as it was designed purely for the purpose of numerical integration used in density functional theory calculations. It indeed gave the worst results in terms of wR2 (still a relatively small difference) and seemed to be more prone to produce inferior results than the other partitions. For polar hydrogens, it led to the shortest bonds and the smallest ADP values. However, it gave the smallest standard uncertainties for the hydrogen parameters.
· The Hirshfeld partition gave relatively good ADP values, wR2, and slightly too-short bond lengths (8.3 mÅ on average)
· Iterative Hirshfeld –gave the largest bond lengths for the polar hydrogens, on average 1.7 mÅ longer than those from neutron measurement. It also gave the largest ADP values, which led to the largest discrepancies in terms of absolute difference when compared with the neutron diffraction results. It also led to the highest standard uncertainties and was more prone to problems with convergence of refinement.
· The iterative stockholder and minimal basis iterative stockholder gave similar results. Bonds to polar hydrogen atoms were on average longer than those from the Hirshfeld partition and slightly more similar to those from the neutron diffraction measurements and the ADP values were relatively good. They seem to produce a slightly better result than other partitions but more research is needed to verify this observation.
Tests of the other factors influencing the HAR-like
performed on the crystal structures of urea and oxalic acid showed also some systematic trends (all refinements were performed with Hirshfeld partition of the electron density):· Similarly to the previous publications regarding the classical HAR approach, the cc-pVDZ basis set was clearly inferior to cc-pVTZ in terms of wR2 but extending the basis set further to cc-pVQZ did not give a clear improvement. The cc-pVTZ basis set gave systematically larger volumes for thermal ellipsoids than cc-pVDZ. For oxalic acid it also led to clearly better structural parameters.
· Bonds to hydrogen atoms derived using GAR with the Hartree–Fock method were systematically shorter when the cc-pVDZ basis set was used instead of cc-pVTZ (by 6 mÅ on averge).
· Systematic differences in bond lengths calculated with different quantum chemistry methods have been observed for polar hydrogen atoms. They can be ordered in the following way for averaged X-ray–neutron values (given in parentheses in mÅ): BLYP(−12) < B3LYP(−8) < CCSD(−4) < HF(4).
· Hartree–Fock was clearly an inferior method for wavefunction calculation in terms of wR2. B3LYP seemed to be a better choice than BLYP in terms of bond length accuracy and agreement factors, and the application of the most expensive methods (MP2, CCSD) gave similar agreement factors as B3LYP but with better bond lengths. Yet when B3LYP was paired with other electron density partitions – i.e. the iterative stockholder – the bond length accuracy improved and was similar to the accuracy of MP2 and CCSD with the Hirshfeld partition.
· While there is a clear advantage in representing the crystal field with multipoles, we did not observe further improvement when treating the surrounding molecules quantum mechanically (such an improvement was expected for compounds with very strong intermolecular interactions).
with no crystal field representation led to the largest volumes of thermal ellipsoids in the case of polar hydrogen atoms.· TAAM, which is based on the use of fixed, predefined parameters derived from the Hansen–Coppens multipole model, performed roughly similarly to HAR with B3LYP/cc-pVDZ and no crystal field representation with point multipoles or with HF/cc-pVDZ and point multipoles in the case of urea. It was however clearly worse than any of the HAR approaches in the case of oxalic acid, which is a system with very strong hydrogen bonds.
Those results may differ when other partition methods are used rather than the applied Hirshfeld, especially in the case of quantum chemistry methods which also exhibit systematic differences in bond lengths.
While it is becoming clear that GAR (including HAR) is probably the most accurate method for deriving structural parameters for hydrogen atoms from X-ray
it is still unclear what the optimal method settings are. With this work we add another dimension to the methodology. The next step will be to test various combinations of the settings using a larger set of test structures and a wider choice of quantum chemistry methods and electron density partitions. Further improvement towards ultra-accurate X-ray refinements may also require a more advanced model of atomic displacements and an examination of the effects of the convolution approximation.Supporting information
CCDC references: 2040066; 2042674; 2042675; 2042676; 2042677; 2042678; 2042679; 2042680; 2042681; 2042682; 2042683; 2042684; 2042685; 2042686; 2042687; 2042688; 2042689; 2042690; 2042691; 2042692; 2042693; 2042694; 2042695; 2042696; 2042697; 2042698; 2042699; 2042700; 2042701; 2042702; 2042703; 2042704; 2042705; 2042706; 2042707; 2042708; 2042709; 2042710; 2042711; 2042712; 2042713; 2042714
Supporting figures and tables. DOI: https://doi.org/10.1107/S2052252520013603/lt5030sup1.pdf
Supporting cifs. DOI: https://doi.org/10.1107/S2052252520013603/lt5030sup2.zip
Acknowledgements
The authors declare that they have no competing interests.
Funding information
Calculations have been carried out using resources (grant No. 115) provided by Wroclaw Centre for Networking and Supercomputing (https://wcss.pl). The authors thank the Polish National Science Centre for financial support within OPUS (grant No. 2018/31/B/ST4/02142). The work has been accomplished at the TEAM TECH Core Facility for crystallographic and biophysical research to support the development of medicinal products sponsored by the Foundation for Polish Science (FNP). Support was also provided by the European Regional Development Fund under the Operational Programme Innovative Economy (2007–2013).
References
Allen, F. H. & Bruno, I. J. (2010). Acta Cryst. B66, 380–386. Web of Science CrossRef CAS IUCr Journals Google Scholar
Bader, R. F. W. (1990). Atoms in Molecules: a Quantum Theory. Oxford University Press. Google Scholar
Bąk, J. M., Domagała, S., Hübschle, C., Jelsch, C., Dittrich, B. & Dominiak, P. M. (2011). Acta Cryst. A67, 141–153. Web of Science CrossRef IUCr Journals Google Scholar
Becke, A. D. (1988). J. Chem. Phys. 88, 2547–2553. CrossRef CAS Web of Science Google Scholar
Bergmann, J., Davidson, M., Oksanen, E., Ryde, U. & Jayatilaka, D. (2020). IUCrJ, 7, 158–165. CrossRef CAS PubMed IUCr Journals Google Scholar
Birkedal, H., Madsen, D., Mathiesen, R. H., Knudsen, K., Weber, H.-P., Pattison, P. & Schwarzenbach, D. (2004). Acta Cryst. A60, 371–381. Web of Science CSD CrossRef CAS IUCr Journals Google Scholar
Blessing, R. H. (1995). Acta Cryst. B51, 816–823. CrossRef CAS Web of Science IUCr Journals Google Scholar
Bourhis, L. J., Dolomanov, O. V., Gildea, R. J., Howard, J. A. K. & Puschmann, H. (2015). Acta Cryst. A71, 59–75. Web of Science CrossRef IUCr Journals Google Scholar
Bučinský, L., Jayatilaka, D. & Grabowsky, S. (2016). J. Phys. Chem. A, 120, 6650–6669. Web of Science PubMed Google Scholar
Bučinský, L., Jayatilaka, D. & Grabowsky, S. (2019). Acta Cryst. A75, 705–717. CrossRef IUCr Journals Google Scholar
Bultinck, P., Cooper, D. L. & Van Neck, D. (2009). Phys. Chem. Chem. Phys. 11, 3424–3429. Web of Science CrossRef PubMed CAS Google Scholar
Bultinck, P., Van Alsenoy, C., Ayers, P. W. & Carbó-Dorca, R. (2007). J. Chem. Phys. 126, 144111. Web of Science CrossRef PubMed Google Scholar
Capelli, S. C., Bürgi, H.-B., Dittrich, B., Grabowsky, S. & Jayatilaka, D. (2014). IUCrJ, 1, 361–379. Web of Science CSD CrossRef CAS PubMed IUCr Journals Google Scholar
Chodkiewicz, M. L., Migacz, S., Rudnicki, W., Makal, A., Kalinowski, J. A., Moriarty, N. W., Grosse-Kunstleve, R. W., Afonine, P. V., Adams, P. D. & Dominiak, P. M. (2018). J. Appl. Cryst. 51, 193–199. [Reference added OK?] Google Scholar
Davidson, E. R. & Chakravorty, S. (1992). Theor. Chim. Acta, 83, 319–330. CrossRef CAS Google Scholar
DeMarco, J. J. & Weiss, R. J. (1965). Phys. Rev. 137, A1869–A1871. CrossRef Google Scholar
Dittrich, B., Lübben, J., Mebs, S., Wagner, A., Luger, P. & Flaig, R. (2017). Chem. Eur. J. 23, 4605–4614. Web of Science CrossRef CAS PubMed Google Scholar
Dittrich, B., McKinnon, J. J. & Warren, J. E. (2008). Acta Cryst. B64, 750–759. Web of Science CSD CrossRef CAS IUCr Journals Google Scholar
Dolomanov, O. V., Bourhis, L. J., Gildea, R. J., Howard, J. A. K. & Puschmann, H. (2009). J. Appl. Cryst. 42, 339–341. Web of Science CrossRef CAS IUCr Journals Google Scholar
Dominiak, P. M., Volkov, A., Li, X., Messerschmidt, M. & Coppens, P. (2007). J. Chem. Theory Comput. 3, 232–247. Web of Science CrossRef CAS PubMed Google Scholar
Dunning, T. H. (1989). J. Chem. Phys. 90, 1007–1023. CrossRef CAS Web of Science Google Scholar
Frisch, M. J., Trucks, G. W., Schlegel, H. B., Scuseria, G. E., Robb, M. A., Cheeseman, J. R., Scalmani, G., Barone, V., Petersson, G. A., Nakatsuji, H., Li, X., Caricato, M., Marenich, A. V., Bloino, J., Janesko, B. G., Gomperts, R., Mennucci, B., Hratchian, H. P., Ortiz, J. V., Izmaylov, A. F., Sonnenberg, J. L., Williams-Young, D., Ding, F., Lipparini, F., Egidi, F., Goings, J., Peng, B., Petrone, A., Henderson, T., Ranasinghe, D., Zakrzewski, V. G., Gao, J., Rega, N., Zheng, G., Liang, W., Hada, M., Ehara, M., Toyota, K., Fukuda, R., Hasegawa, J., Ishida, M., Nakajima, T., Honda, Y., Kitao, O., Nakai, H., Vreven, T., Throssell, K., Montgomery, J. A. Jr, Peralta, J. E., Ogliaro, F., Bearpark, M. J., Heyd, J. J., Brothers, E. N., Kudin, K. N., Staroverov, V. N., Keith, T. A., Kobayashi, R., Normand, J., Raghavachari, K., Rendell, A. P., Burant, J. C., Iyengar, S. S., Tomasi, J., Cossi, M., Millam, J. M., Klene, M., Adamo, C., Cammi, R., Ochterski, J. W., Martin, R. L., Morokuma, K., Farkas, O., Foresman, J. B. & Fox, D. J. (2016). Gaussian16, Revision C. 01. Gaussian, Inc., Wallingford, Connecticut, USA. Google Scholar
Fugel, M., Jayatilaka, D., Hupf, E., Overgaard, J., Hathwar, V. R., Macchi, P., Turner, M. J., Howard, J. A. K., Dolomanov, O. V., Puschmann, H., Iversen, B. B., Bürgi, H.-B. & Grabowsky, S. (2018). IUCrJ, 5, 32–44. Web of Science CSD CrossRef CAS PubMed IUCr Journals Google Scholar
Gatti, C., Saunders, V. R. & Roetti, C. (1994). J. Chem. Phys. 101, 10686–10696. CrossRef CAS Web of Science Google Scholar
Hansen, N. K. & Coppens, P. (1978). Acta Cryst. A34, 909–921. CrossRef CAS IUCr Journals Web of Science Google Scholar
Hirshfeld, F. L. (1971). Acta Cryst. B27, 769–781. CSD CrossRef CAS IUCr Journals Web of Science Google Scholar
Hirshfeld, F. L. (1977). Theor. Chim. Acta, 44, 129–138. CrossRef CAS Web of Science Google Scholar
Hoser, A. A., Dominiak, P. M. & Woźniak, K. (2009). Acta Cryst. A65, 300–311. Web of Science CrossRef CAS IUCr Journals Google Scholar
Hoser, A. A. & Madsen, A. Ø. (2016). Acta Cryst. A72, 206–214. Web of Science CSD CrossRef IUCr Journals Google Scholar
Hoser, A. A. & Madsen, A. Ø. (2017). Acta Cryst. A73, 102–114. Web of Science CrossRef IUCr Journals Google Scholar
Jarzembska, K. N. & Dominiak, P. M. (2012). Acta Cryst. A68, 139–147. Web of Science CrossRef CAS IUCr Journals Google Scholar
Jayatilaka, D. & Dittrich, B. (2008). Acta Cryst. A64, 383–393. Web of Science CrossRef CAS IUCr Journals Google Scholar
Jayatilaka, D. & Grimwood, D. J. (2003). Comput. Sci. 4, 142–151. Google Scholar
Jha, K. K., Gruza, B., Kumar, P., Chodkiewicz, M. L. & Dominiak, P. M. (2020). Acta Cryst. B76, 296–306. Web of Science CrossRef IUCr Journals Google Scholar
Johnson, C. K. (1965). ORTEP. Report ORNL-3794. Oak Ridge National Laboratory, Tennessee, USA. Google Scholar
Kamiński, R., Domagała, S., Jarzembska, K. N., Hoser, A. A., Sanjuan-Szklarz, W. F., Gutmann, M. J., Makal, A., Malińska, M., Bąk, J. M. & Woźniak, K. (2014). Acta Cryst. A70, 72–91. Web of Science CSD CrossRef IUCr Journals Google Scholar
Kendall, R. A., Dunning, T. H. & Harrison, R. J. (1992). J. Chem. Phys. 96, 6796–6806. CrossRef CAS Web of Science Google Scholar
Köhler, C., Lübben, J., Krause, L., Hoffmann, C., Herbst-Irmer, R. & Stalke, D. (2019). Acta Cryst. B75, 434–441. Web of Science CSD CrossRef IUCr Journals Google Scholar
Koritsanszky, T., Volkov, A. & Chodkiewicz, M. (2011). New Directions in Pseudoatom-Based X-ray Charge Density Analysis In Electron Density and Chemical Bonding II. Structure and Bonding, vol 147. Springer, Berlin, Heidelberg. Google Scholar
Krijn, M. P. C. M., Graafsma, H. & Feil, D. (1988). Acta Cryst. B44, 609–616. CSD CrossRef CAS Web of Science IUCr Journals Google Scholar
Kumar, P., Gruza, B., Bojarowski, S. A. & Dominiak, P. M. (2019). Acta Cryst. A75, 398–408. Web of Science CrossRef IUCr Journals Google Scholar
Kurki-Suonio, K. (1968). Acta Cryst. A24, 379–390. CrossRef CAS IUCr Journals Web of Science Google Scholar
Lebedev, V. I. & Laikov, D. N. (1999). Dokl. Math. 59, 477–481. Google Scholar
Lillestolen, T. C. & Wheatley, R. J. (2008). Chem. Commun. 45, 5909. CrossRef Google Scholar
Löwdin, P. (1955). J. Chem. Phys. 21, 374–375. Google Scholar
Macrae, C. F., Sovago, I., Cottrell, S. J., Galek, P. T. A., McCabe, P., Pidcock, E., Platings, M., Shields, G. P., Stevens, J. S., Towler, M. & Wood, P. A. (2020). J. Appl. Cryst. 53, 226–235. Web of Science CrossRef CAS IUCr Journals Google Scholar
Malaspina, L. A., Edwards, A. J., Woińska, M., Jayatilaka, D., Turner, M. J., Price, J. R., Herbst-Irmer, R., Sugimoto, K., Nishibori, E. & Grabowsky, S. (2017). Cryst. Growth Des. 17, 3812–3825. CSD CrossRef CAS Google Scholar
Malaspina, L. A., Wieduwilt, E. K., Bergmann, J., Kleemiss, F., Meyer, B., Ruiz-López, M. F., Pal, R., Hupf, E., Beckmann, J., Piltz, R. O., Edwards, A. J., Grabowsky, S. & Genoni, A. (2019). J. Phys. Chem. Lett. 10, 6973–6982. Web of Science CrossRef CAS PubMed Google Scholar
Manz, T. A. & Limas, N. G. (2016). RSC Adv. 6, 47771–47801. CrossRef CAS Google Scholar
Meyer, B. & Genoni, A. (2018). J. Phys. Chem. A, 122, 8965–8981. Web of Science CrossRef CAS PubMed Google Scholar
Mulliken, R. S. (1955). J. Chem. Phys. 23, 1833–1840. CrossRef CAS Web of Science Google Scholar
Munshi, P., Madsen, A. Ø., Spackman, M. A., Larsen, S. & Destro, R. (2008). Acta Cryst. A64, 465–475. Web of Science CrossRef CAS IUCr Journals Google Scholar
Nalewajski, R. F. & Parr, R. G. (2000). Proc. Natl Acad. Sci. USA, 97, 8879–8882. CrossRef PubMed CAS Google Scholar
Orben, C. M. & Dittrich, B. (2014). Acta Cryst. C70, 580–583. Web of Science CSD CrossRef IUCr Journals Google Scholar
Pichon-Pesme, V., Lecomte, C. & Lachekar, H. (1995). J. Phys. Chem. 99, 6242–6250. CrossRef CAS Web of Science Google Scholar
Pisani, C., Dovesi, R., Erba, A. & Giannozzi, P. (2011). Modern Charge-Density Analysis, edited by C. Gatti & P. Macchi, pp. 79-132. Dordrecht: Springer. Google Scholar
Sanjuan-Szklarz, W. F., Woińska, M., Domagała, S., Dominiak, P. M., Grabowsky, S., Jayatilaka, D., Gutmann, M. & Woźniak, K. (2020). IUCrJ, 7, 920–933. CSD CrossRef CAS PubMed IUCr Journals Google Scholar
Sovago, I., Gutmann, M. J., Senn, H. M., Thomas, L. H., Wilson, C. C. & Farrugia, L. J. (2016). Acta Cryst. B72, 39–50. Web of Science CSD CrossRef IUCr Journals Google Scholar
Stevens, E., Coppens, P., Feld, R. & Lehmann, M. (1979). Chem. Phys. Lett. 67, 541–543. CrossRef CAS Web of Science Google Scholar
Stevens, E. D. & Coppens, P. (1980). Acta Cryst. B36, 1864–1876. CSD CrossRef CAS IUCr Journals Google Scholar
Stewart, R. F. (1969). J. Chem. Phys. 51, 4569–4577. CrossRef CAS Web of Science Google Scholar
Stewart, R. F. (1973). J. Chem. Phys. 58, 1668–1676. CrossRef CAS Web of Science Google Scholar
Swaminathan, S., Craven, B. M. & McMullan, R. K. (1984). Acta Cryst. B40, 300–306. CSD CrossRef CAS Web of Science IUCr Journals Google Scholar
Verstraelen, T., Ayers, P. W., Van Speybroeck, V. & Waroquier, V. (2013). J. Chem. Theory Comput. 9, 2221–2225. CrossRef CAS PubMed Google Scholar
Verstraelen, T., Tecmer, P., Heidar-Zadeh, F., González-Espinoza, C. E., Chan, M., Kim, T. D., Boguslawski, K., Fias, S., Vandenbrande, S., Berrocal, D. & Ayers, P. W. (2017). HORTON 2.1.1, https://theochem.github.com/horton/. Google Scholar
Verstraelen, T., Vandenbrande, S., Heidar-Zadeh, F., Vanduyfhuys, L., Van Speybroeck, V., Waroquier, M. & Ayers, P. W. (2016). J. Chem. Theory Comput. 12, 3894–3912. CrossRef CAS PubMed Google Scholar
Volkov, A., Messerschmidt, M. & Coppens, P. (2007). Acta Cryst. D63, 160–170. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wall, M. E. (2016). IUCrJ, 3, 237–246. Web of Science CrossRef CAS PubMed IUCr Journals Google Scholar
Weiss, R. J. (1964). Phys. Lett. 12, 293–295. CrossRef CAS Web of Science Google Scholar
Whitten, A. E. & Spackman, M. A. (2006). Acta Cryst. B62, 875–888. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wieduwilt, E. K., Macetti, G., Malaspina, L. A., Jayatilaka, D., Grabowsky, S. & Genoni, A. (2020). J. Mol. Struct. 1209, 127934. CrossRef Google Scholar
Woińska, M., Grabowsky, S., Dominiak, P. M., Woźniak, K. & Jayatilaka, D. (2016). Sci. Adv. 2, e1600192. Web of Science PubMed Google Scholar
Woon & Dunning (1993). J. Chem. Phys. 98, 1358–1371. Google Scholar
Woińska, M., Jayatilaka, D., Dittrich, B., Flaig, R., Luger, P., Woźniak, K., Dominiak, P. M. & Grabowsky, S. (2017). ChemPhysChem, 18, 3334–3351. Web of Science PubMed Google Scholar
Woinska, M., Wanat, M., Taciak, P., Pawinski, T., Minor, W. & Wozniak, K. (2019). IUCrJ, 6, 868–883. Web of Science CSD CrossRef CAS PubMed IUCr Journals Google Scholar
Zhang, D. W. & Zhang, J. Z. H. (2003). J. Chem. Phys. 119, 3599–3605. Web of Science CrossRef CAS Google Scholar
Zhurov, V. V., Zhurova, E. A., Stash, A. I. & Pinkerton, A. A. (2011). Acta Cryst. A67, 160–173. Web of Science CSD CrossRef CAS IUCr Journals Google Scholar
Zobel, D., Luger, P. & Dreißig, W. (1993). Z. Naturforsch. A, 48, 53–54. CrossRef CAS Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.