research papers
A third blind test of
predictionaThe Pfizer Institute for Pharmaceutical Materials Science, University Chemical Laboratory, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, England, bCambridge Crystallographic Data Centre, 12 Union Road, Cambridge CB2 1EZ, England, cDepartment of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742-2021, USA, dSchool of Pharmacy and Pharmacal Sciences, Purdue University, West Lafayette, Indiana, USA, eDipartimento di Chimica Fisica e Inorganica and INSTM-UdR, Universita di Bologna, Viale Risorgimento 4, I-40136 Bologna, Italy, fKarpov Institute of Physical Chemistry, Vorontsovo pole 10, 103064 Moscow, Russia, gOrganic Chemical Laboratory, ETH-Zurich, CH-8093 Zurich, Switzerland, hDepartment of Crystal and Structural Chemistry, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands, iPerformance Chemicals Research, BASF AG, 67056 Ludwigshafen, Germany, jCenter for High Performance Computing, University of Utah, Salt Lake City, Utah 84112-0190, USA, kInstitute of Inorganic and Analytical Chemistry, University of Frankfurt, Marie Curie Str. 11, 60439 Frankfurt, Germany, lInstitute of Pharmaceutical Innovation, University of Bradford, Bradford BD7 1DP, England, mAccelrys Inc., San Diego, California, USA, nCenter for Process Systems Engineering, Department of Chemical Engineering and Chemical Technology, Imperial College London, South Kensington Campus, London SW7 2AZ, England, oDepartment of Chemistry, University College London, 20 Gordon St, London WC1H 0AJ, England, pThe Royal Institution, 21 Albermarle St, London W1X 4BS, England, qBaker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA, and rSolid State Chemistry Group and CMBI, University of Nijmegen, The Netherlands
*Correspondence e-mail: gmd27@cam.ac.uk
Following the interest generated by two previous blind tests of Z′ = 2 structures were allowed. Part-way through the test, a report was discovered for one of the molecules, which could no longer be considered a blind test. Hence, a second molecule from the same category (small, rigid with common atom types) was offered to the participants as a replacement. Success rates within the three submitted predictions were lower than in the previous tests – there was only one successful prediction for any of the three `blind' molecules. For the `simplest' rigid molecule, this lack of success is partly due to the observed structure crystallizing with two molecules in the As in the 2001 blind test, there was no success in predicting the structure of the flexible molecule. The results highlight the necessity for better energy models, capable of simultaneously describing conformational and packing energies with high accuracy. There is also a need for improvements in search procedures for crystals with more than one independent molecule, as well as for molecules with conformational flexibility. These are necessary requirements for the prediction of possible thermodynamically favoured polymorphs. Which of these are actually realised is also influenced by as yet insufficiently understood processes of nucleation and crystal growth.
prediction (CSP1999 and CSP2001), a third such collaborative project (CSP2004) was hosted by the Cambridge Crystallographic Data Centre. A range of methodologies used in searching for and ranking the likelihood of predicted crystal structures is represented amongst the 18 participating research groups, although most are based on the global minimization of the lattice energy. Initially the participants were given molecular diagrams of three molecules and asked to submit three predictions for the most likely of each. Unlike earlier blind tests, no restriction was placed on the possible of the target crystal structures. Furthermore,Keywords: prediction; blind; polymorphs; crystal structure prediction.
1. Introduction
This paper reports the results of a third blind test of ). Both problems depend on our ability to calculate the most stable arrangements of large molecular aggregates from general principles and both are also subject to mostly unknown kinetic influences. Blind tests have been reported in both prediction (Lommerse et al., 2000; Motherwell et al., 2002) and protein folding (Moult et al., 1999) communities, and both have generated considerable interest. Therefore, the present collaboration was organized as a follow-up to the 1999 (Lommerse et al., 2000) and 2001 (Motherwell et al., 2002) CSP blind tests and the results presented here should be digested in the context of the earlier studies.
prediction (CSP), hosted by the Cambridge Crystallographic Data Centre and which we will refer to as CSP2004. Researchers have been struggling with the challenge of reliably predicting the observed crystal structure(s) of a given molecule, which shares many of its difficulties with the prediction of protein folding. Similarities and differences between the two types of problem have recently been discussed (Dunitz & Scheraga, 2004Periodically repeating the blind test can assess progress in the field, but the necessarily small number of molecules studied in each blind test means that the choice of one slightly more difficult or easier molecule (whose difficulty often cannot be judged until the predictions have been made) can have as much or more influence on the results as the methodological advances made since the last such project. In total, 11 molecules have now been studied under blind test conditions [the four molecules in the current blind test are numbered (VIII)–(XI)] and the successes and failures in these overall results point to the areas where improved methodologies are most urgent.
2. Organization and approach
The organization of this latest collaboration, CSP2004, was essentially the same as the first two, which have already been published (CSP1999, Lommerse et al., 2000, and CSP2001, Motherwell et al., 2002). Invitations to participate were sent to ca 25 research groups known to be active in the field and 18 of these groups participated. We initially intended to include three molecules in the test, retaining the same three categories of molecules as in CSP1999 and CSP2001:
All methods of Z′ < 3. Therefore, unlike the first two blind tests, participants had to contemplate the possibility that the molecules crystallize in uncommon space groups, or with Z′ = 2.
prediction involve the generation of possible crystal structures, followed by some form of ranking of their likelihood of being observed. Results of the CSP2001 blind test suggested that the generation of all the likely crystal structures in the most common space groups, with one rigid molecule in the is no longer a major problem. Therefore, the challenges for search methods are flexible molecules, less common space groups and crystals with more than one independent molecule. We therefore relaxed the limitations on that were used in the first two blind tests – candidate crystals for CSP2004 were allowed to be in any and only restricted toCrystallographers were contacted with a request for unpublished structures and suitable candidates were sent to an independent referee (Professor A. L. Spek, Laboratory of Crystal and Structural Chemistry, Utrecht University) who checked that they conformed to our criteria. After considerable effort, we collected six candidates for category 1, two for category 2 and six for category 3. (Here, we make a plea to the crystallographic community; suitable structures of molecules that fit the three categories were difficult to find. We would be grateful for contacts from crystallographers with suitable unpublished structures for the next blind test.)1
To choose which of these molecules were used in the blind test, we took sketches of the molecular structures to a second independent colleague, this time a non-crystallographer with no access to the crystal structures – Professor Jeremy Sanders, University Chemical Laboratory, Cambridge. He was asked to choose one molecule from each of the three categories. The contributing crystallographers agreed to postpone the publication of these structures and they were held until April 2004 by Professor Spek. The molecular diagrams, as shown in the first three rows of Table 1, were sent by email to all of the participants on 1 October 2003. Participants were asked to send their predictions for each molecule to Professor Spek by 1 April 2004. We kept the format the same as in previous blind tests, allowing each participating group to submit three predictions for each molecule, with reasons for the prediction. Analysis of longer lists of generated structures provided useful insight following the CSP2001 workshop, so participants were encouraged to submit extended lists of their other predicted structures separately from their `official' three, but preferably in order of likelihood (i.e. structures 4, 5, etc.).
On 31 December 2003 we became aware2 that, while unpublished, the of molecule VIII (hydantoin) had been presented in a poster session of a meeting of the British Crystallographic Association in March 2002 (Yu & Schwalbe, 2002). Many of the participants had already spent considerable effort on their calculations for this molecule, so we agreed that predictions should still be submitted for molecule (VIII), with participants indicating whether information contained in the conference abstract (space group, unit-cell dimensions and a diagram of the hydrogen bonding, Yu & Schwalbe, 2002) had influenced their predictions. While most of the participants agreed to submit their predictions without the use of information in the report, the predictions for molecule (VIII) could no longer be considered truly `blind'. Hence, a second molecule from category 1 was chosen as a replacement – its molecular diagram and conditions of crystallization (last row of Table 1) were circulated on 7 January 2004. The timeline for the blind test and dates of the workshop could not be rearranged, so predictions for this molecule [henceforth referred to as molecule (XI)] had to be submitted along with (VIII) to (X). Thus, only half the time available for the other targets was available for predictions on molecule (XI).
On 5 April 2004 the experimental crystal structures of all four molecules were circulated to each participant for post-analysis of their predictions and, on 4 and 5 May 2004 a workshop was held at the Cambridge Crystallographic Data Centre to discuss the results.
3. Methodologies
Methodologies varied significantly amongst the 18 groups who submitted predictions. For brevity, we provide very short descriptions of the methodologies in Table 2. References are provided for most of the methods used and details specific to the work on the CSP2004 molecules are available in the supplementary information.3
|
Although the techniques vary, all methods of
prediction involve three general steps:There have been no major advances in step (i) since CSP2001. Some new algorithms for structure generation (step ii) have been applied since the last blind test (particularly the development of search algorithms designed to take advantage of modern computing technology). The e.g. the semiclassical density sums, or PIXEL, approach (Gavezzotti, 2002, 2003a). No major progress has been made since CSP2001 in defining the non-energetic criteria of ranking putative structures.
of energy models for step (iii) continues partly through further parameterization of empirically derived model potentials, but also the formulation of new models for intermolecular interactions,3.1. Methods of generating the molecular structure
Two approaches can be used to treat the molecular structure during via gas-phase quantum chemical calculations and it is assumed that the environment of the crystal has no effect on the bond lengths, angles and torsions. Here, the choice of method used to generate the rigid molecular structure can be crucially important – the effect of assumed molecular structure on crystal energy calculations can be large (Beyer & Price, 2000).
predictions; the molecule can be treated as flexible and varied along with the usually through bond stretching, angle bending and torsional terms in a force-field. Alternatively, the molecule can be treated as rigid throughout the calculations. In the latter approach, the molecular structure is usually generated3.2. Generating trial crystal structures
Methods for generating crystal packing arrangements are diverse and those used during this and earlier blind tests include the stepwise building-up of dimers and layers, grid-based systematic searches, genetic algorithms, Monte Carlo and random searches. Since random sampling often exhibits gaps and clusters of points, some participants in CSP2004 instead chose to follow a low-discrepancy Sobol' sequence (Press et al., 1992; Sobol', 1967), which ensures a more uniform, and thus more efficient, sampling. Another difference between search methods is the use of symmetry; because of the large search space involved in finding all possible crystal structures, many methods make use of space-group symmetry to reduce the computing time required. For these methods, each (and Z′) is searched separately, so the choice of which symmetries to consider is crucial. Ideally, all space groups would be searched, with varying numbers of molecules in the for each. However, computing time is finite and the majority of known structures of organic molecules populate a handful of space groups, so it is usual to restrict searches to a set of the most commonly observed. Much more extensive searches (e.g. involving hundreds of thousands of candidate structures in large numbers of space groups) are now carried out by some researchers by exploiting parallel computation on distributed computer networks. Three participants did not use the information about the most common space groups to generate trial structures. Instead, they considered the P1 with different numbers (up to eight) of independent molecules and checked for symmetry in the final structures, using CRYCOM (Dzyabchenko, 1994), PLATON (Spek, 1980–2004) or the symmetry finder in Cerius2 (Accelrys Inc., 1997).
3.3. Ranking of structures
As in CSP1999 and CSP2001, the final ranking of crystal structures is still almost exclusively based on the minimized lattice energy, so varies from participant to participant depending on the choice of model for the intra- and intermolecular energies. The lattice energy is `temperature-less', in that entropic contributions to the free energy are by and large neglected, although some groups have improved the thermodynamic model to include estimates of the vibrational contributions to the free energy. Others have introduced additional criteria (such as calculated mechanical properties and morphologies or structural isotropy) as an initial attempt to consider kinetic differences between predicted structures. Previous blind tests and many other studies show that dozens of possibilities often exist within a few kJ mol−1, sometimes hundreds within, say, 5 kJ mol−1 of the calculated global minimum (Day et al., 2004). Therefore, the required accuracy of calculated relative energies is very high and any additional or alternate criteria that can reliably discriminate between likely and unlikely structures will be valuable. Still other participants have chosen scoring criteria based on statistical comparisons to the known structures in the Cambridge Structural Database as an alternative to lattice energy minimization.
3.4. Treatments of the molecular flexibility in (X)
We summarize the various methods of treating the flexible molecule (X) in Table 3, adopting the nomenclature shown in Fig. 1 for the six exocyclic torsion angles.
|
4. Results
This paper is accompanied by a large amount of supplementary material : the coordinates of the experimental structures, lists of predicted structures by each participant, as well as detailed descriptions of methodology, results and post-analysis by many of the participants.
4.1. Experimental structures
Small rigid molecules fitting our criteria for category 1 with no published 4 The molecules are linked into chains via pairs of N—H⋯O=C (RNO = 2.913 Å) interactions forming centrosymmetric motifs (Fig. 2, left), which pack into a layered in the C2/c (Fig. 2, right). The reported at the BCA conference in 2002 is now published (Yu et al., 2004).
are now uncommon. Therefore, a colleague was asked to grow crystals of hydantoin [molecule (VIII)], a readily available molecule with no reported as a candidate for CSP2004. The compound was obtained from Aldrich and recrystallized from methanol by slow evaporation.Compound (IX) is a derivative of a commercial pigment (2,9-dibromoanthanthrone is commercially produced as a scarlet pigment for use in coatings). The diiodo derivative was synthesized by the pigments research group of Hoechst in the eighties. Due to the low solubility, harsh crystallization conditions had to be applied to grow a single crystal. The molecule crystallizes in layers (Paulus et al., 2005), with molecules on an inversion centre in the P21/c, and with neighbouring molecules forming close I⋯O contacts (Fig. 3, left). The C—I⋯O angle is almost 180° and the I⋯O contact distance of 3.144 Å is consistent with the polar flattening observed in contact distances with halogen atoms (Nyburg & Faerman, 1985; the sum of van der Waals radii is approx 3.4 Å). These slightly buckled sheets stack parallel to the (10) plane (Fig. 3, right).
The crystals of several nitrotoluene derivatives, synthesized almost a century ago, were discovered in the School of Agricultural Sciences at the University of Milano and submitted as candidates for CSP2004 (Demartin et al., 2004). One of these, 2-acetamido-4,5-dinitrotoluene, was selected as molecule (X); it crystallizes in the P21/c with one molecule per (Watkin et al., 2004). The amide substituent is twisted out of the plane of the benzene ring, with φ = 41°. This twist orients the polar H atom at a sufficient angle away from the ring methyl substituent to form N—H⋯O=C hydrogen bonds (RNO = 2.865 Å) forming molecular chains along the b axis of the (Fig. 4, left). Within these chains, the aromatic parts of the molecules stack in an offset face-to-face manner, while the main interaction between antiparallel chains is the interlocking of acetamide methyl groups between the two nitro substituents (Fig. 4, right).
Azetidine [molecule (XI)] is a low melting compound, so a sample of the pure liquid (obtained from Aldrich) was crystallized in a capillary at 170 K using the laser-assisted zone-refinement technique of Boese (Boese & Nussbaumer, 1994). The molecule is puckered in the observed (Parsons, 2005) and, predictably, packs with N—H⋯N hydrogen bonding. There are several possible arrangements for hydrogen bonding and the observed is built up from the chains shown in Fig. 5 (left), with N⋯N distances of 3.121 and 3.102 Å. These chains contain two azetidine molecules in the and pack in an antiparallel manner along the crystallographic b axis in the P21/c (Fig. 5, right).
4.2. Comparison of calculated and experimental structures
The predicted and experimental crystal structures were compared with two similarity measures. We performed the first screen for correct predictions using Chisholm's COMPACK program (Chisholm & Motherwell, 2005), which searched the lists of submitted crystal structures for predictions that reproduced the experimental structure to within a specified tolerance. The COMPACK program avoids the use of space-group symmetry by defining a in terms of a molecular packing environment; the experimental structure is represented by a molecule and its coordination shell, which we took as the closest 12–16 molecules. The relationships between these molecules are then represented by a set of interatomic distances and angles, which are searched for in the target structures. If the distances and angles match to within specified tolerances, then the coordination spheres are overlaid and a root-mean-squared deviation (RMSD) in atomic positions is calculated for all 12–16 molecules. In these calculations, we ignored the H atoms because of the usual uncertainties in the X-ray determination of their positions. Typically, a matched gave an RMSD of less than 1 Å for a 16-molecule coordination sphere. A second screen for correct predictions was performed by Dzyabchenko on the three predictions by each group using the program CRYCOM (Dzyabchenko, 1994). This method also generates an RMSD value for the similarity between structures based on a comparison of the fractional atomic coordinates for the reference molecule, suitably transformed by space-group operators for maximum overlap. These RMSD values are directly comparable to those reported in CSP2001. The two methods found the same set of correct predictions amongst the submitted structures for all four molecules and examples of the best simulations of the true crystal structures are shown in Fig. 6. In cases where the structure was not found in the submitted list, many of the participants energy minimized the experimental using the same energy model as used in the predictions to test how well their methods describe the energy minimum. These results are presented in an `energy minimized in post-analysis' section of each table.
We present the results and discuss the difficulties encountered for molecules (IX), (X) and (XI), whose predictions were purely blind, in §4.4. Although many of the predictions for molecule (VIII) were performed without recourse to the information in the BCA conference abstract (Yu & Schwalbe, 2002), we distinguish the molecule (VIII) results by presenting them in a separate section.
4.3. Molecule (VIII) predictions
From the 18 participants, 15 attempted the prediction of molecule (VIII) – the other three (Dzyabchenko, Motherwell and Schmidt) abandoned their calculations when the conference report was discovered. Most of those who did submit predictions had either finished their calculations before the conference abstract was discovered or continued their calculations with no bias towards the observed ). In CSP1999, the small rigid molecule was known to have two polymorphs, one of which was successfully predicted by 4 of 11 participants, while the other was not predicted amongst anyone's `top 3'. Only one polymorph is known for the small rigid molecule studied in CSP2001 and two out of 15 groups had successful predictions in their top three. For molecule (VIII), four groups found the observed structure in their three official predictions, as ranks 1, 1, 1 and 2. Six of the others had found the correct but higher in their list of generated structures (Table 4). This molecule crystallized in one of the most common space groups for organic molecules (C2/c, Z = 8), so was easily located by most search methods that rely on the use of these groups. On the other hand, searching in P1 with eight independent molecules is a very difficult task, even for rigid molecules. Nevertheless, the experimental structure was located by one of the participants (Scheraga & Arnautova) who carried out a global search in P1, Z = 8.
or hydrogen-bonding pattern. The results for this molecule are in line with previous blind test results for molecules of this category (Fig. 7
‡Predictions were not ranked on lattice energy alone – a good reproduction of the experimental structure was found as the global minimum in lattice energy in the search with a planar molecule (first line), and second lowest in energy when searched with an `envelope' molecular structure (second line). However, these were not amongst the submitted predictions, as the three for submission were chosen fairly arbitrarily on energy, growth rate and mechanical properties as described in the supplementary material . §Only one structure submitted as the lowest energy in the search with dimer geometry based on the BCA conference abstract (Yu & Schwalbe, 2002). ¶Submitted as the lowest energy structure found in C2/c, the of the observed This was the highest density structure in the entire search, but some lower energy structures were found in other space groups. |
For the ranking of the predicted structures, many model potentials are available that have been parameterized to similar molecules, so most participants found the experimentally observed −1 of the global minimum in lattice energy (Table 4). Almost all of the predicted structures are compressed in the a direction and expanded along c, indicating a consistent problem with either most of the force fields used, the assumed molecular geometry or the temperature-less nature of lattice-energy minimization. Besides these distortions of the lattice, the predicted structures reproduce the experimentally observed crystal very well. The lowest RMSD in atomic positions are around 0.5–0.7 Å for a sphere of 16 molecules using Chisholm's comparison algorithm (Chisholm & Motherwell, 2005) or about 0.1 Å from Dzyabchenko's comparison method (Dzyabchenko, 1994). These are similar to the deviations in the predictions of the corresponding molecule in CSP2001. The overlay in Fig. 6 is typical of most of the correctly predicted structures and demonstrates the quality of these predictions.
either at or within a few kJ molOne exception is Leusen's predicted structure: his third predicted structure does correspond to the same lattice-energy minimum obtained by energy minimizing the observed structure, so is the closest to this structure that could be predicted with his choice of force field. However, the deviations are too great to be considered a successful prediction (high RMSD and errors in lattice vectors of > 10%) – here, the search algorithm has located the correct structure, but the predictions are a failure due to inadequacies of the chosen force field.
A newly discovered polymorph of a similar molecule (barbituric acid) indicates that slight molecular deformations can be an issue even for what seems to be a typical rigid molecule (Lewis et al., 2004). However, for (VIII) the choice of different molecular structures had only a small effect on search results – Price & Lewis found the experimental as a global minimum with a planar model, and the second lowest with an envelope molecular structure.
Besides molecule (VIII), participants in this third blind test felt that the chosen molecules were generally more difficult prediction targets than the molecules of previous blind tests.
4.4. Purely blind predictions – molecules (IX), (X) and (XI)
4.4.1. Molecule (XI)
Molecule (XI) was attempted by all 18 groups, but no correct predictions were made amongst the first three predictions of any of the participants. Here, a large factor in the poor predictive success is that the molecule crystallizes with two molecules in the 5 only eight of the 18 groups could have produced Z′ = 2, P21/c structures in their structure generation. Of these, three participants did generate the observed structure, but ranked it outside of the top 3 (Table 5). Leusen encountered similar problems as molecule (VIII) – the closest lattice-energy minimum to the true structure was present in his extended list, but severely distorted from the experimentally observed structure. Again, the structure generation found the appropriate minimum, but the force field was unsatisfactory.
In fact, because of limitations of some methods or constraints on computing time,
‡The unstable P21/c structures, after minimization with symmetry constraints were removed, resulting in a final structure in the space roup with Z′ = 4. The lowering of the energy is very small for van Eijck (0.09 kJ mol−1) and Pantelides & Karamertzanis (0.001 kJ mol−1), while the structure lowers in energy by 0.38 kJ mol−1 with Day's model, making this structure lower in energy than any other hypothetical structure. §Deviations from the experimental structure are so pronounced that no RMSD could be calculated. ¶Minimization in P1. |
Crystallization with two molecules in the σ = 0.018 Å; Allen et al., 1987). Similarly, the C—N bond lengths in the crystal of (XI) – 1.474 (3), 1.465 (4), 1.470 (3) and 1.465 (4) Å – are all at least 0.1 Å shorter than the average bond lengths in other crystals of azetidines (1.484 Å, σ = 0.018 Å; Allen et al., 1987). The sensitivity of lattice-energy calculations to the assumed molecular structure is such that such abnormalities led to large variations in the calculated relative energies (Table 5). Compounding the problem was the observation that many distinct crystal structures exist in a small energy window around the global minimum (e.g. Day found 20 within 1 kJ mol−1; Pantelides & Karamertzanis found 77 unique stationary points within 1 kJ mol−1). The accuracy required in the energy ranking is evident in Pantelides & Karamertzanis' and van Eijck's predictions; the observed structure is within ca 1 kJ mol−1 of the global minimum in both cases, but ranked 24th and 30th in their lists, respectively. Given this observation, it was somewhat surprising that the calculation of the and vibrational contributions to the free energy (see Day; Della Valle & Venuti) led to minimal rearrangements of the stability ordering.
was not the only difficulty with molecule (XI). Some participants had problems modelling this molecular crystal, even after the experimentally observed crystal was made available; the strained four-membered ring is not typical of molecules used in parameterizing many of the common force fields. Some force fields predicted a planar ring structure, so the experimental could not be adequately modelled. Others found that even quantum mechanical calculations did not reproduce the observed molecular structure as accurately as expected for such a small molecule. The observed molecular structure is atypical of such azetidine rings – the bond lengths are shorter than in the crystals of other molecules with azetidine rings – the C—C bond lengths of 1.517 (4), 1.514 (4), 1.510 (4) and 1.514 (3) Å are all much shorter than the average observed bond lengths of 1.548 Å (A final comment on the experimentally observed structure of (XI) – at least four participants reported that this structure corresponds to a saddle point on the P21/c symmetry of the structure, shifting it to a , Z′ = 4 crystal (blue overlay in Fig. 6). This shift lowers the energy by a tiny amount – 0.008 (Pantelides & Karamertzanis), 0.08 (van Eijck), 0.04 (Price & Torrisi) and 0.4 kJ mol−1 (Day). For the latter, this energy lowering is enough to make this structure more stable than the three submitted structures. At least on this (exp-6 + multipoles) surface, the experimental structure appears to correspond to the global minimum in lattice energy. With such a low energy barrier, the observed P21/c symmetry may be a thermal average over two symmetrically equivalent lower symmetry structures. Such a flat surface would also lead to a large amplitude degree of freedom with an important increase and possible disorder, although none is reported in the determination (Parsons, 2005).
surface – small negative eigenvalues of the Hessian were observed, indicating a saddle point on a flat surface. This flat surface and instability further complicated the calculations on this molecule. Relaxing the along the path to lower energy breaks theIn short, while molecule (XI) may at first seem to be a typical `simple' target, there were a multitude of complications making this a very difficult system to predict.
4.4.2. Molecule (IX)
Molecule (IX) was attempted by 15 groups and the experimental structure was present in only one participant's `top 3' – as the global minimum in lattice energy from Day's search. Eight others predicted the correct structure, but outside of their best three (Table 6) and several of these show large distortions away from the observed structure – RMSD values greater than 1 Å are fairly high for rigid molecules. Predictions for molecule (IX) were less successful than for the corresponding molecules in previous blind tests (Fig. 8), which were predicted in the best three guesses by 1 of 8 (CSP1999) and 4 of 15 (CSP2001) groups.
‡Reported as matches to the experimental structure, but with extreme deviations. |
The poor success rate may be partly due to the size of the molecule (35 atoms), but is largely a consequence of problems in modelling iodine, whose repulsive wall shows an extreme anisotropy; Nyburg and Faerman found the axial ratio of end-on to side-on contact distances is 0.83 (Nyburg & Faerman, 1985). The lattice parameters in the one correct prediction are all within 1% of the observed values and the small RMSD shows very good structural reproduction (see overlay in Fig. 6). For this prediction, a sophisticated and computationally expensive model potential had to be developed specifically, using anisotropic atom–atom repulsion and electrostatic models to correctly represent the I atoms' contribution to the lattice energy. The parameterization and testing of such models was hindered by the poor quality of the reported crystal structures of similar compounds, e.g. CSD refcodes IOANTQ, IONTRQ, IANTRQ, whose R factors are 0.106, 0.130 and 0.0175, with no hydrogen positions reported.
4.4.3. Molecule (X)
Molecule (X) (the conformationally flexible molecule) was attempted by 15 groups, but there were no correct `top three' predictions. For comparison, the flexible molecule in CSP1999 (Fig. 9, left) was only predicted once from 11 submissions, while there were no successes from 11 submissions in CSP2001 (Fig. 9, right). Problems with simultaneously treating the conformation and crystal packing meant that when a close match to the experimental structure was found in seven of the participants' extended lists, the ranking on lattice energy was generally poor and deviations in lattice parameters were often higher than for the rigid molecules (Table 7). The overlay in Fig. 6 shows the best reproduction (van Eijck's 135th ranked structure) of the experimentally observed crystal structure.
‡Minimization in P1. §Estimating the intramolecular energy from MP2/6-31G** partially optimized experimental molecular geometries. |
Improving the predictability of such molecules requires advances in search procedures for the packing of conformationally flexible molecules, as well as greater accuracy in models of intra- and intermolecular energies. It was found that all of the torsion angles defined in Fig. 1 are important in determining the packing of molecule X – τ1, φ, ψ and ω all influence the intermolecular hydrogen bonding, while τ2 and τ3 have an important effect on close packing of the molecules. More details are available in the individual accounts in the supplementary information . The balance of the calculated intra- and intermolecular energies was crucial – the gas-phase minimum energy structures from quantum chemical calculations had the acetamide substituent in the plane of the benzene ring (θ = 0°), with the polar H atom shielded by the ortho-methyl substituent, preventing efficient intermolecular hydrogen bonding. The energy required for the observed 41.5° twist of the acetamide substituent was calculated to be between 5 and 14 kJ mol−1, and the improvement in calculated lattice energies with this conformation did not compensate for this conformational energy. Crystals in which conformations of the constituent molecules are far above the gas-phase energy minimum must be critically stabilized by non-bonded interactions in the crystal. Such cases may be the greatest challenge for energy models in prediction. One approach is the coupling of quantum mechanics calculations with very accurate force fields during energy minimization and this has led to successful predictions for some molecules where crystal and vacuum conformations are entirely different (van Eijck et al., 2001a,b).
5. Discussion
5.1. Overall success rates
The overall success rates in this blind test are a combination of the successes and failures of different aspects of the prediction methodology – namely, the generation and the evaluation of possible structures.
5.1.1. The search problem
After CSP2001, it was felt by many of the participants that methodologies for generating the structures of rigid molecules corresponding to the common space groups with Z′ = 1 were then reliable. For the category 1 molecule in CSP2001, 11 out of 15 participants generated the experimentally observed somewhere in their lists. For molecule (VIII), 11 out of 14 groups (not counting the search using the reported dimer as a starting point) generated the correct structure. These success rates are fairly good and some of the methodologies used in more than one blind test appear to be consistently successful. Where it seems that simple rigid molecule searches need improvement is when more than one independent molecule is present in the Only eight out of 18 participants searched for structures with Z′ > 1, some due to limitations of their search methodology and others because of limited computing resources and time. However, even those search algorithms that have the capacity to search with multiple independent molecules performed poorly; several participants reported poor sampling of phase space with Z′ > 1 and, indeed, only three groups generated the correct structure for molecule (XI). The predictability of such structures is important – over 10% of homonuclear organic crystals have Z′ > 1 (Steiner, 2000). Furthermore, the application of prediction to salts and co-crystals requires the improvement of search methods with multiple molecular units. There has been success in searches for salt structures of flexible molecules, demonstrating that the search problem can be solved (Leusen, 2003). However, the time and computational expense required prohibit such predictions from becoming routine without significant increases in computing power. Some advances will come with increasing use of parallel computing and distributed computing networks, but methodological advances will also be needed.
In CSP2001 the search methods were as successful with the category 2 molecule as with category 1: 12 out of 15 submitted sets of structures contained the experimentally observed
This year's molecule (IX) has internal symmetry which predisposes it to crystallize on a crystallographic inversion centre. This should have simplified the search, but the success rate dropped to only nine out of 15 participants generating the correct structure. Why the poor search results? This is probably due partly to the size of the molecule, but the main reason must be the critical difficulties in modelling the I atoms – both in quantum chemical calculations used to generate electrostatic models and in limitations of the spherical atom–atom method for interactions with I atoms, which are known to behave anisotropically. Most of the search algorithms rely on a reasonable model for interaction energies; minima on the model surface must be reasonably close to minima on the `real' energy surface and the relative energies of the minima must also be reasonably accurate. Search methods must discard high-energy structures to avoid infinitely many solutions, so a model that wrongly assigns a high energy to the correct may result in a failed search.The search algorithms struggled with the flexible molecule. Only seven out of 15 groups generated the correct structure somewhere in their extended list – approximately the same success rate as in CSP2001 (four of 11). Apparently, little progress has been made in solving the problems involved in simultaneously searching both molecular conformations and crystal-packing alternatives. The problems are partly the same as for Z′ > 1 crystals – the extra quickly expand the phase space that must be sampled. Participants took different approaches to this problem. Some performed rigid-body searches on a range of conformations and took the conformational energy from quantum mechanical calculations. Others searched phase space with rigid molecular structures, but allowed these to adapt to the crystalline environment during energy minimization with some or all of the intramolecular coordinates free to vary. Still others included conformational flexibility from the start – defining intramolecular during the search and treating them as parameters in the same way as changes to the cell dimensions and positions of the molecules in the Each of these approaches had some success. Perhaps more important is the inadequacy of many current force fields in satisfactorily modelling both intra- and intermolecular energies, which are on such different scales. Energy models that cannot treat these contributions accurately and in a balanced manner will inevitably lead to inconsistent or failed searches. Attempts to couple atomistic intermolecular potentials with ab initio calculations of intramolecular energies during energy minimization, e.g. van Eijck et al. (2001a,b) and Pantelides & Karamertzanis (this work), are a possible way forward.
While there are still challenges in this structure generation aspect of
prediction, we are encouraged by the consistent success of some methods – two participants (Day; Pantelides & Karamertzanis) generated the experimental somewhere in their list for all three molecules and others (van Eijck found three out of four, Dzyabchenko two out of three and Schmidt one out of two) missed only one of the molecules that they attempted. The comparison of methods and understanding of any failures should guide the continued advancement in search methodologies.5.1.2. Ranking of the generated structures
Most methods still rely on calculated lattice energies as the sole scoring function for ranking predicted structures and the discrepancies in relative energies amongst the successful predictions and post-analysis energy minimizations highlight the variety of energy models applied to this test. The part of the energy models that has seen the most attention is the description of electrostatic interactions. There were several approaches used in CSP2004 to improve on the atomic point-charge model: atomic multipole expansions (Day; van Eijck; Price et al.), off-nuclear charges (Pantelides & Karamertzanis) and bond dipoles (Dzyabchenko), and the representation of the molecular charge density in terms of a large number (ca 104) of charged pixels (Schweizer & Dunitz). A recent study (Day et al., 2005) has directly compared the prediction of a large set of rigid molecules using atomic point charges and atomic multipole models. The results demonstrated an improved success rate from approximately 40% of the experimentally observed structures found amongst the lowest three in lattice energy with a point charge model to 55% using atomic multipole expansions. Optimized off-nuclear charge sites appear to be able to reproduce molecular electrostatic potentials to about the same accuracy as atomic multipole models (Karamertzanis & Pantelides, 2004) and bond dipoles give a comparable quality of molecular electrostatic potential, so such models should lead to similar improvements. The pixel approach, recently developed by Gavezzotti (Gavezzotti, 2002, 2003a) and used by Schweizer & Dunitz in this year's blind test, is an exciting new methodology for quantifying intermolecular interactions and shows promise for rationalizing and predicting organic crystal structures (Gavezzotti, 2003b,c). While all of these methodologies can offer improved reliability over the spherical atom–atom approach for small, rigid molecules, there is much work to be done in transferring such methods to larger molecules and improving on the consistently disappointing prediction results for flexible molecules. For example, the conformational dependence of atomic multipoles means that a static set of multipoles offers little improvement over atomic charges in the energy minimization of conformationally flexible molecules (Brodersen et al., 2003). Better results are obtained by recalculating the multipoles as the conformation changes during energy minimization (van Eijck et al., 2001a,b); this procedure is promising, but currently too expensive for handling large molecules.
Model potentials can be fine-tuned even further by modifying the form and parameters of the model of van der Waals interactions specifically for the molecule being studied. Such an approach led to the only successful prediction for the truly blind molecules in this test [Day, molecule (IX)]. Currently, the time required to build such a molecule-specific model means that the approach is not generally applicable. However, systematic approaches to building such model potentials are developing (Day & Price, 2003; Mitchell & Price, 2000).
Besides lattice energy, several additional selection criteria have been tried. As an improvement within the thermodynamic model, a few participants calculated the vibrational contribution to the crystals' free energies (Price et al.) and free energy minimization was also attempted (Day; Della Valle & Venuti). For the molecules in this study, such calculations did not lead to significant reordering of the thermodynamic stabilities. It is also clear that thermodynamics do not control the crystallization process alone – kinetic factors can be crucial. Two groups (Boerrigter; Price et al.) re-ranked their lattice energy predictions by relating other calculated properties (e.g. structural isotropy, mechanical properties and morphology calculations) to possible kinetic preferences for certain structures. Such ideas have shown promise in other studies (Anghel et al., 2002; Beyer et al., 2001) and may play an important role in future crystal-structure prediction methodologies. However, in the current blind test, these extra scoring functions failed to improve on lattice-energy predictions; in fact, Price & Lewis would have correctly predicted the structure of molecule (VIII) had they not submitted a slightly less stable structure with a predicted favourable growth rate instead of their global minimum with the planar molecular conformation.
6. Conclusions
Blind predictions have been attempted for a set of molecules in a test organized by the Cambridge Crystallographic Data Centre. Besides molecule (VIII), whose structure had unfortunately already been partially reported, the overall success rates remain poor and do not reflect any major advances since the previous blind test, CSP2001 (Motherwell et al., 2002). In the previous test, there were six successes within the three predictions allowed for each molecule. For the truly blind predictions [i.e. excluding molecule (VIII)] in the present test, there was only one success – the prediction of molecule (IX)'s as a global minimum in lattice energy by one of the participants. Analysis of the lowest energy predictions, extended lists of generated structures and post-analysis of the experimentally observed crystal structures has highlighted possible reasons for the poorer success rates compared with the previous blind tests and so the blind test has been very useful for participants in pointing to the current obstacles to reliable prediction. In fact, despite the lack of observable progress, there is a feeling among the participants that the programs and methodologies are improving.
In summary, search methods must be improved for crystals with multiple independent molecules and conformationally flexible molecules. To find the most stable arrangement of molecules, the basis of energy calculations still needs to be improved. Electrostatic models that are more detailed than the traditional atomic point charge model are becoming more common and seem to improve the reliability of lattice-energy calculations. Descriptions of the van der Waals interactions are still largely empirically derived and the growing amount of experimental data has been vital in the development of model potentials. While such models are often satisfactory, the treatment of less common atom types (such as iodine) and atoms in bonding environments that are not adequately represented in empirical parameterizations [such as the strained ring in molecule (XI)] may require more specifically developed models. The successful advances in energy models for rigid molecules (such as atomic multipoles and off-centre charge sites) should be extended to flexible molecules, for which energy models still need improvement. Most current approaches to the intermolecular energy are still based on atom–atom interactions, with or without atomic point charges or multipoles, and, while this may be computationally convenient, its physical basis leaves something to be desired. At the close intermolecular distances that occur in crystals, the accurate calculation of the electrostatic energy requires the interaction between the molecular charge density distributions rather than between a few localized charge monopoles, dipoles, quadrupoles, etc. As computational possibilities improve, the role of individual atomic nuclei as the loci of intermolecular attractions and repulsions will possibly need to be replaced by better models based on more delocalized quantum-mechanical charge-density distributions (Gavezzotti, 2003b,c).
However, even with advances in our models for the energy, most methodologies continue to rely solely on global minimization of the lattice energy. A few participants included estimated vibrational contributions to the free energy of predicted structures through lattice dynamical calculations, thus accounting for
differences between polymorphs. In the present blind test, such calculations did not significantly change the stability order of the hypothetical crystals, so had little effect on the reliability of the predictions. Such calculations may become more common as such a capability is implemented in the programs and the required computing time becomes available for such calculations on the thousands of structures that are produced for a given molecule.Of course, we recognize that crystallization is not an equilibrium process, so not under ). These extra criteria did not improve predictions for the molecules in this blind test, but the testing of such ideas is a necessary step towards including kinetic criteria in CSP and towards an understanding of crystallization and polymorphism.
It seems that reasonable success can be achieved through lattice-energy minimization, which suggests that molecules often tend to find their lowest-energy arrangement during crystallization. However, in some cases, although we do not actually know with what frequency, metastable crystals will be favoured at the nucleation event and preserved during crystal growth. To thoroughly account for such effects in prediction, a full treatment of nucleation at the molecular level is required and a detailed description of crystallization conditions would be necessary. While there has been some progress in the molecular-level modelling of nucleation and crystal growth, general application to the large number of hypothetical structures in CSP is currently far beyond computational feasibility. In the absence of the ability to model some participants have tried to relate simpler calculated properties (such as structural isotropy and crystal morphologies) to preferences for some of the low-energy predicted structures during Alternatively, statistical comparisons to the known structures in the Cambridge Structural Database may uncover kinetic preferences for some hypothetical structures (Desiraju, 2002In conclusion, this third blind test has provided an objective evaluation of the state of the art in
prediction. There are technical and fundamental obstacles to reliable predictions and the comparison of a variety of approaches and problems encountered while working on a difficult set of molecules has served to highlight the immediate areas requiring attention and methodological innovation.Supporting information
10.1107/S0108768105016563/de5014sup1.cif
contains datablocks VIII, IX, X, XI. DOI:Supporting information file. DOI: 10.1107/S0108768105016563/de5014sup2.pdf
Supporting information file. DOI: 10.1107/S0108768105016563/de5014sup3.txt
Data collection: COLLECT (Nonius BV, 1997) for (VIII), (X); R3m/V-software (Siemens) for (IX); SMART (Siemens, 1993) for (XI). Cell
DENZO/SCALEPACK (Otwinowski & Minor, 1996) for (VIII), (X); R3m/V-software (Siemens) for (IX); SAINT (Siemens ,1995) for (XI). Data reduction: DENZO/SCALEPACK (Otwinowski & Minor, 1996) for (VIII), (X); R3m/V-software (Siemens) for (IX); SAINT (Siemens ,1995) for (XI). Program(s) used to solve structure: SIR92 (Altomare et al., 1994) for (VIII), (X), (XI); SHELXS97 (Sheldrick, 1990) for (IX). Program(s) used to refine structure: CRYSTALS (Watkin et al. 2001) for (VIII), (XI); SHELXL97 (Sheldrick, 1997) for (IX). Molecular graphics: CAMERON (Watkin et al. 1996) for (VIII), (XI). Software used to prepare material for publication: CRYSTALS (Watkin et al. 2001) for (VIII), (XI).C3H4N2O2 | F(000) = 416.000 |
Mr = 100.08 | Dx = 1.669 Mg m−3 |
Monoclinic, C2/c | Mo Kα radiation, λ = 0.71073 Å |
a = 9.3538 (7) Å | Cell parameters from 730 reflections |
b = 12.1757 (11) Å | θ = 5–27° |
c = 7.2286 (6) Å | µ = 0.14 mm−1 |
β = 104.593 (4)° | T = 190 K |
V = 796.70 (11) Å3 | Plate, colourless |
Z = 8 | 0.18 × 0.10 × 0.02 mm |
Nonius Kappa CCD diffractometer | 886 reflections with I > −10.00u(I) |
Graphite monochromator | Rint = 0.02 |
ω scans | θmax = 27.4°, θmin = 5.5° |
Absorption correction: multi-scan DENZO/SCALEPACK (Otwinowski & Minor, 1996) | h = −12→12 |
Tmin = 0.99, Tmax = 1.00 | k = −14→15 |
1593 measured reflections | l = −9→9 |
886 independent reflections |
Refinement on F2 | 0 restraints |
R[F2 > 2σ(F2)] = 0.061 | P = P(6)*max(Fo2,0) + (1-P(6))Fc2 Method = SHELXL 97 (Sheldrick, 1997) W = 1. / [Sigma2(F*) + (P(1)p)2 + P(2)p + P(4) + P(5)Sin(theta)] P(i) are: 0.00 1.89 0.00 0.00 0.00 0.333 |
wR(F2) = 0.131 | (Δ/σ)max = 0.000095 |
S = 1.13 | Δρmax = 0.27 e Å−3 |
886 reflections | Δρmin = −0.25 e Å−3 |
77 parameters |
C3H4N2O2 | V = 796.70 (11) Å3 |
Mr = 100.08 | Z = 8 |
Monoclinic, C2/c | Mo Kα radiation |
a = 9.3538 (7) Å | µ = 0.14 mm−1 |
b = 12.1757 (11) Å | T = 190 K |
c = 7.2286 (6) Å | 0.18 × 0.10 × 0.02 mm |
β = 104.593 (4)° |
Nonius Kappa CCD diffractometer | 886 independent reflections |
Absorption correction: multi-scan DENZO/SCALEPACK (Otwinowski & Minor, 1996) | 886 reflections with I > −10.00u(I) |
Tmin = 0.99, Tmax = 1.00 | Rint = 0.02 |
1593 measured reflections |
R[F2 > 2σ(F2)] = 0.061 | 77 parameters |
wR(F2) = 0.131 | 0 restraints |
S = 1.13 | Δρmax = 0.27 e Å−3 |
886 reflections | Δρmin = −0.25 e Å−3 |
x | y | z | Uiso*/Ueq | ||
C1 | 0.7974 (3) | 0.00484 (19) | 0.8218 (3) | 0.0271 | |
N2 | 0.6630 (2) | 0.01532 (16) | 0.6856 (3) | 0.0271 | |
C3 | 0.6326 (3) | 0.12247 (19) | 0.6338 (3) | 0.0280 | |
C4 | 0.7522 (3) | 0.1883 (2) | 0.7508 (4) | 0.0326 | |
N5 | 0.8510 (3) | 0.10869 (19) | 0.8685 (3) | 0.0374 | |
O6 | 0.5237 (2) | 0.15458 (14) | 0.5136 (3) | 0.0376 | |
O7 | 0.8518 (2) | −0.08264 (14) | 0.8866 (3) | 0.0366 | |
H41 | 0.802 (3) | 0.224 (3) | 0.685 (5) | 0.0388* | |
H42 | 0.723 (3) | 0.236 (3) | 0.815 (5) | 0.0388* | |
H51 | 0.943 (3) | 0.119 (2) | 0.959 (4) | 0.0309* | |
H2 | 0.610 (4) | −0.038 (3) | 0.623 (5) | 0.0500* |
U11 | U22 | U33 | U12 | U13 | U23 | |
C1 | 0.0268 (12) | 0.0249 (11) | 0.0285 (11) | −0.0012 (9) | 0.0050 (9) | 0.0002 (9) |
N2 | 0.027 (1) | 0.0198 (9) | 0.0303 (11) | −0.0027 (7) | −0.0006 (8) | −0.0005 (7) |
C3 | 0.0328 (12) | 0.0236 (11) | 0.0255 (12) | −0.0033 (9) | 0.0035 (9) | 0.0005 (8) |
C4 | 0.0474 (15) | 0.0179 (11) | 0.0314 (13) | −0.003 (1) | 0.0077 (11) | 0.0001 (9) |
N5 | 0.0327 (12) | 0.0373 (12) | 0.0362 (13) | −0.004 (1) | −0.003 (1) | −0.0052 (9) |
O6 | 0.0437 (11) | 0.0257 (9) | 0.035 (1) | 0.0016 (7) | −0.0063 (8) | 0.0020 (7) |
O7 | 0.036 (1) | 0.0262 (9) | 0.0417 (11) | 0.0036 (7) | −0.0009 (8) | 0.0020 (7) |
C22H8I2O2 | F(000) = 524 |
Mr = 558.08 | Dx = 2.309 Mg m−3 |
Monoclinic, P21/c | Mo Kα radiation, λ = 0.71072 Å |
a = 4.202 (1) Å | Cell parameters from 25 reflections |
b = 20.956 (4) Å | θ = 3.5–23.5° |
c = 9.276 (2) Å | µ = 3.93 mm−1 |
β = 100.63 (3)° | T = 293 K |
V = 802.8 (3) Å3 | 0.38mm × 0.08mm × 0.04mm mm |
Z = 2 |
R3m/V (Siemens) diffractometer | Rint = 0.074 |
Radiation source: fine-focus sealed tube | θmax = 25.0°, θmin = 1.9° |
Graphite monochromator | h = −5→5 |
Absorption correction: psi-method ? | k = 0→24 |
Tmin = 0.278, Tmax = 0.337 | l = −11→11 |
2784 measured reflections | 1 standard reflections every 69 reflections |
1398 independent reflections | intensity decay: <1% |
1057 reflections with I > 2σ(I) |
Refinement on F2 | Primary atom site location: structure-invariant direct methods |
Least-squares matrix: full | Secondary atom site location: difference Fourier map |
R[F2 > 2σ(F2)] = 0.036 | Hydrogen site location: inferred from neighbouring sites |
wR(F2) = 0.083 | H atoms treated by a mixture of independent and constrained refinement |
S = 1.06 | w = 1/[σ2(Fo2) + (0.0199P)2] where P = (Fo2 + 2Fc2)/3 |
1398 reflections | (Δ/σ)max < 0.001 |
118 parameters | Δρmax = 0.67 e Å−3 |
0 restraints | Δρmin = −0.54 e Å−3 |
C22H8I2O2 | V = 802.8 (3) Å3 |
Mr = 558.08 | Z = 2 |
Monoclinic, P21/c | Mo Kα radiation |
a = 4.202 (1) Å | µ = 3.93 mm−1 |
b = 20.956 (4) Å | T = 293 K |
c = 9.276 (2) Å | 0.38mm × 0.08mm × 0.04mm mm |
β = 100.63 (3)° |
R3m/V (Siemens) diffractometer | 1057 reflections with I > 2σ(I) |
Absorption correction: psi-method ? | Rint = 0.074 |
Tmin = 0.278, Tmax = 0.337 | 1 standard reflections every 69 reflections |
2784 measured reflections | intensity decay: <1% |
1398 independent reflections |
R[F2 > 2σ(F2)] = 0.036 | 0 restraints |
wR(F2) = 0.083 | H atoms treated by a mixture of independent and constrained refinement |
S = 1.06 | Δρmax = 0.67 e Å−3 |
1398 reflections | Δρmin = −0.54 e Å−3 |
118 parameters |
Geometry. All e.s.d.'s (except the e.s.d. in the dihedral angle between two l.s. planes) are estimated using the full covariance matrix. The cell e.s.d.'s are taken into account individually in the estimation of e.s.d.'s in distances, angles and torsion angles; correlations between e.s.d.'s in cell parameters are only used when they are defined by crystal symmetry. An approximate (isotropic) treatment of cell e.s.d.'s is used for estimating e.s.d.'s involving l.s. planes. |
Refinement. Refinement of F2 against ALL reflections. The weighted R-factor wR and goodness of fit S are based on F2, conventional R-factors R are based on F, with F set to zero for negative F2. The threshold expression of F2 > σ(F2) is used only for calculating R-factors(gt) etc. and is not relevant to the choice of reflections for refinement. R-factors based on F2 are statistically about twice as large as those based on F, and R- factors based on ALL data will be even larger. |
x | y | z | Uiso*/Ueq | ||
C01 | −0.6628 (17) | −0.0254 (3) | −0.3520 (8) | 0.0434 (17) | |
H01 | −0.7587 | 0.0052 | −0.4187 | 0.065* | |
C02 | −0.7525 (17) | −0.0890 (3) | −0.3744 (8) | 0.0479 (18) | |
H02 | −0.9064 | −0.1010 | −0.4553 | 0.072* | |
C03 | −0.6104 (15) | −0.1337 (3) | −0.2753 (8) | 0.0424 (17) | |
H03 | −0.6739 | −0.1761 | −0.2893 | 0.064* | |
C04 | −0.3734 (14) | −0.1181 (3) | −0.1537 (7) | 0.0329 (14) | |
C05 | −0.2822 (14) | −0.0524 (3) | −0.1318 (7) | 0.0334 (14) | |
C06 | −0.0458 (14) | −0.0333 (3) | −0.0101 (7) | 0.0306 (14) | |
C07 | 0.2124 (15) | 0.1637 (3) | 0.0509 (8) | 0.0362 (15) | |
C08 | −0.0110 (15) | 0.1448 (3) | −0.0646 (7) | 0.0383 (16) | |
H08 | −0.1107 | 0.1752 | −0.1310 | 0.057* | |
C09 | −0.0971 (14) | 0.0801 (3) | −0.0869 (7) | 0.0323 (14) | |
C10 | −0.3481 (16) | 0.0621 (3) | −0.2157 (7) | 0.0397 (16) | |
C11 | −0.4364 (14) | −0.0068 (3) | −0.2338 (7) | 0.0354 (15) | |
O1 | −0.4750 (13) | 0.1019 (2) | −0.3044 (6) | 0.0606 (17) | |
I1 | 0.31891 (11) | 0.26105 (2) | 0.08029 (6) | 0.04881 (19) |
U11 | U22 | U33 | U12 | U13 | U23 | |
C01 | 0.051 (4) | 0.037 (4) | 0.040 (4) | 0.002 (3) | 0.003 (3) | 0.006 (3) |
C02 | 0.054 (4) | 0.044 (4) | 0.044 (4) | −0.004 (3) | 0.004 (3) | −0.006 (4) |
C03 | 0.053 (4) | 0.024 (3) | 0.053 (5) | −0.002 (3) | 0.018 (4) | −0.006 (3) |
C04 | 0.036 (3) | 0.026 (3) | 0.040 (4) | 0.001 (2) | 0.013 (3) | 0.004 (3) |
C05 | 0.041 (3) | 0.025 (3) | 0.036 (4) | 0.002 (3) | 0.011 (3) | −0.001 (3) |
C06 | 0.035 (3) | 0.024 (3) | 0.036 (4) | 0.001 (2) | 0.013 (3) | 0.000 (3) |
C07 | 0.043 (3) | 0.023 (3) | 0.044 (4) | −0.001 (3) | 0.014 (3) | −0.002 (3) |
C08 | 0.051 (4) | 0.027 (3) | 0.040 (4) | 0.006 (3) | 0.019 (3) | 0.007 (3) |
C09 | 0.041 (3) | 0.024 (3) | 0.033 (4) | 0.003 (3) | 0.009 (3) | 0.003 (3) |
C10 | 0.053 (4) | 0.029 (3) | 0.037 (4) | 0.009 (3) | 0.005 (3) | 0.005 (3) |
C11 | 0.042 (4) | 0.027 (3) | 0.039 (4) | 0.000 (3) | 0.010 (3) | 0.002 (3) |
O1 | 0.074 (4) | 0.034 (3) | 0.062 (4) | 0.002 (2) | −0.017 (3) | 0.009 (3) |
I1 | 0.0528 (3) | 0.0224 (2) | 0.0702 (4) | −0.0045 (2) | 0.0089 (2) | −0.0005 (2) |
C01—C11 | 1.369 (9) | C06—C09i | 1.389 (8) |
C01—C02 | 1.390 (10) | C06—C06i | 1.451 (12) |
C01—H01 | 0.9300 | C07—C08 | 1.347 (9) |
C02—C03 | 1.370 (9) | C07—C04i | 1.429 (9) |
C02—H02 | 0.9300 | C07—I1 | 2.096 (6) |
C03—C04 | 1.398 (9) | C08—C09 | 1.410 (9) |
C03—H03 | 0.9300 | C08—H08 | 0.9300 |
C04—C05 | 1.433 (9) | C09—C06i | 1.389 (8) |
C04—C07i | 1.429 (9) | C09—C10 | 1.489 (9) |
C05—C06 | 1.417 (8) | C10—O1 | 1.223 (7) |
C05—C11 | 1.414 (8) | C10—C11 | 1.493 (8) |
C11—C01—C02 | 121.5 (6) | C05—C06—C06i | 120.3 (7) |
C11—C01—H01 | 119.3 | C08—C07—C04i | 120.6 (6) |
C02—C01—H01 | 119.3 | C08—C07—I1 | 119.3 (5) |
C03—C02—C01 | 118.8 (6) | C04i—C07—I1 | 120.1 (5) |
C03—C02—H02 | 120.6 | C07—C08—C09 | 121.4 (6) |
C01—C02—H02 | 120.6 | C07—C08—H08 | 119.3 |
C02—C03—C04 | 122.6 (6) | C09—C08—H08 | 119.3 |
C02—C03—H03 | 118.7 | C06i—C09—C08 | 121.0 (6) |
C04—C03—H03 | 118.7 | C06i—C09—C10 | 119.9 (6) |
C03—C04—C05 | 118.2 (5) | C08—C09—C10 | 119.1 (5) |
C03—C04—C07i | 124.1 (6) | O1—C10—C09 | 121.6 (6) |
C05—C04—C07i | 117.8 (6) | O1—C10—C11 | 121.1 (6) |
C06—C05—C11 | 120.7 (5) | C09—C10—C11 | 117.3 (5) |
C06—C05—C04 | 121.0 (6) | C01—C11—C05 | 120.6 (6) |
C11—C05—C04 | 118.4 (5) | C01—C11—C10 | 119.1 (6) |
C09i—C06—C05 | 118.2 (6) | C05—C11—C10 | 120.3 (5) |
C09i—C06—C06i | 121.5 (7) | ||
C11—C01—C02—C03 | 0.1 (13) | C07—C08—C09—C10 | 179.4 (6) |
C01—C02—C03—C04 | −1.3 (12) | C06i—C09—C10—O1 | −178.9 (7) |
C02—C03—C04—C05 | 1.2 (11) | C08—C09—C10—O1 | 1.9 (11) |
C02—C03—C04—C07i | −177.8 (7) | C06i—C09—C10—C11 | 0.4 (10) |
C03—C04—C05—C06 | 179.4 (6) | C08—C09—C10—C11 | −178.9 (6) |
C07i—C04—C05—C06 | −1.5 (10) | C02—C01—C11—C05 | 1.1 (12) |
C03—C04—C05—C11 | 0.1 (10) | C02—C01—C11—C10 | 179.7 (7) |
C07i—C04—C05—C11 | 179.2 (6) | C06—C05—C11—C01 | 179.5 (7) |
C11—C05—C06—C09i | 179.8 (6) | C04—C05—C11—C01 | −1.2 (10) |
C04—C05—C06—C09i | 0.5 (10) | C06—C05—C11—C10 | 0.9 (10) |
C11—C05—C06—C06i | −1.1 (12) | C04—C05—C11—C10 | −179.7 (6) |
C04—C05—C06—C06i | 179.5 (7) | O1—C10—C11—C01 | 0.1 (11) |
C04i—C07—C08—C09 | 0.9 (11) | C09—C10—C11—C01 | −179.1 (6) |
I1—C07—C08—C09 | −178.3 (5) | O1—C10—C11—C05 | 178.7 (7) |
C07—C08—C09—C06i | 0.2 (11) | C09—C10—C11—C05 | −0.5 (10) |
Symmetry code: (i) −x, −y, −z. |
C9H9N3O5 | F(000) = 496.000 |
Mr = 239.19 | Dx = 1.528 Mg m−3 |
Monoclinic, P21/n | Mo Kα radiation, λ = 0.71073 Å |
a = 12.5693 (4) Å | Cell parameters from 2364 reflections |
b = 4.8531 (1) Å | θ = 5–27° |
c = 17.2663 (5) Å | µ = 0.13 mm−1 |
β = 99.1624 (15)° | T = 150 K |
V = 1039.81 (5) Å3 | Plate, pale yellow |
Z = 4 | 0.83 × 0.15 × 0.04 mm |
Nonius Kappa CCD diffractometer | 1497 reflections with I > 3.00u(I) |
Graphite monochromator | Rint = 0.04 |
ω scans | θmax = 27.5°, θmin = 5.2° |
Absorption correction: multi-scan DENZO/SCALEPACK (Otwinowski & Minor, 1996) | h = −16→16 |
Tmin = 0.98, Tmax = 0.99 | k = −6→5 |
4326 measured reflections | l = −22→22 |
2357 independent reflections |
Refinement on F2 | 20 restraints |
R[F2 > 2σ(F2)] = 0.032 | P = P(6)*max(Fo2,0) + (1-P(6))Fc2 Method = SHELXL 97 (Sheldrick, 1997) W = 1. / [Sigma2(F*) + (P(1)p)2 + P(2)p + P(4) + P(5)Sin(theta)] P(i) are: 0.328E-01 0.500 0.00 0.00 0.00 0.333 |
wR(F2) = 0.078 | (Δ/σ)max = 0.000200 |
S = 1.04 | Δρmax = 0.19 e Å−3 |
1497 reflections | Δρmin = −0.24 e Å−3 |
181 parameters |
C9H9N3O5 | V = 1039.81 (5) Å3 |
Mr = 239.19 | Z = 4 |
Monoclinic, P21/n | Mo Kα radiation |
a = 12.5693 (4) Å | µ = 0.13 mm−1 |
b = 4.8531 (1) Å | T = 150 K |
c = 17.2663 (5) Å | 0.83 × 0.15 × 0.04 mm |
β = 99.1624 (15)° |
Nonius Kappa CCD diffractometer | 2357 independent reflections |
Absorption correction: multi-scan DENZO/SCALEPACK (Otwinowski & Minor, 1996) | 1497 reflections with I > 3.00u(I) |
Tmin = 0.98, Tmax = 0.99 | Rint = 0.04 |
4326 measured reflections |
R[F2 > 2σ(F2)] = 0.032 | 181 parameters |
wR(F2) = 0.078 | 20 restraints |
S = 1.04 | Δρmax = 0.19 e Å−3 |
1497 reflections | Δρmin = −0.24 e Å−3 |
x | y | z | Uiso*/Ueq | ||
C1 | −0.11496 (12) | 0.5270 (3) | 0.17561 (9) | 0.0172 | |
C2 | −0.07824 (12) | 0.4022 (3) | 0.11197 (9) | 0.0181 | |
C3 | 0.00456 (13) | 0.2130 (3) | 0.12495 (9) | 0.0197 | |
C4 | 0.05357 (12) | 0.1464 (3) | 0.20089 (9) | 0.0189 | |
C5 | 0.01682 (12) | 0.2781 (3) | 0.26387 (9) | 0.0179 | |
C6 | −0.06717 (12) | 0.4686 (3) | 0.25078 (9) | 0.0177 | |
N7 | 0.06285 (11) | 0.2111 (3) | 0.34129 (8) | 0.0196 | |
C8 | 0.08511 (13) | 0.3976 (3) | 0.40076 (9) | 0.0207 | |
O9 | 0.07502 (11) | 0.6454 (2) | 0.38929 (7) | 0.0315 | |
C10 | 0.12185 (16) | 0.2782 (4) | 0.4805 (1) | 0.0280 | |
C11 | 0.14477 (14) | −0.0572 (4) | 0.21298 (11) | 0.0247 | |
N12 | −0.11871 (11) | 0.4842 (3) | 0.03132 (8) | 0.0213 | |
O13 | −0.1081 (1) | 0.3211 (3) | −0.02156 (6) | 0.0286 | |
O14 | −0.1578 (1) | 0.7153 (3) | 0.02015 (7) | 0.0292 | |
N15 | −0.2115 (1) | 0.7036 (3) | 0.16574 (7) | 0.0190 | |
O16 | −0.20322 (9) | 0.9277 (2) | 0.19842 (7) | 0.0280 | |
O17 | −0.29431 (9) | 0.6110 (3) | 0.12827 (7) | 0.0293 | |
H71 | 0.0734 (14) | 0.043 (4) | 0.352 (1) | 0.0230* | |
H31 | 0.0311 (14) | 0.139 (4) | 0.082 (1) | 0.0243* | |
H61 | −0.0945 (13) | 0.549 (4) | 0.293 (1) | 0.0216* | |
H3 | 0.1465 (17) | 0.094 (4) | 0.4771 (12) | 0.0500* | |
H4 | 0.0586 (17) | 0.275 (5) | 0.5083 (12) | 0.0500* | |
H5 | 0.1722 (16) | 0.396 (5) | 0.5107 (12) | 0.0500* | |
H6 | 0.1770 (16) | −0.080 (5) | 0.1689 (12) | 0.0500* | |
H7 | 0.2017 (16) | 0.002 (5) | 0.2541 (12) | 0.0500* | |
H8 | 0.1243 (17) | −0.229 (4) | 0.2314 (12) | 0.0500* |
U11 | U22 | U33 | U12 | U13 | U23 | |
C1 | 0.0162 (7) | 0.0138 (8) | 0.0222 (8) | −0.0014 (6) | 0.0048 (6) | −0.0002 (6) |
C2 | 0.0201 (8) | 0.0170 (8) | 0.0177 (8) | −0.0026 (7) | 0.0047 (6) | 0.0009 (6) |
C3 | 0.0223 (8) | 0.0162 (8) | 0.0224 (8) | −0.0016 (7) | 0.0088 (7) | −0.0018 (7) |
C4 | 0.0188 (8) | 0.0136 (8) | 0.0247 (8) | −0.0026 (6) | 0.0049 (6) | −0.0004 (6) |
C5 | 0.0197 (8) | 0.0135 (8) | 0.0205 (8) | −0.0038 (6) | 0.0032 (6) | 0.0004 (6) |
C6 | 0.0186 (8) | 0.0160 (8) | 0.0195 (8) | −0.0026 (6) | 0.0059 (6) | −0.0010 (7) |
N7 | 0.0249 (7) | 0.0118 (7) | 0.0212 (7) | 0.0012 (6) | 0.0010 (5) | 0.0019 (6) |
C8 | 0.0216 (8) | 0.0169 (9) | 0.0235 (8) | 0.0003 (7) | 0.0028 (6) | −0.0001 (7) |
O9 | 0.0478 (8) | 0.0149 (6) | 0.0286 (7) | 0.0008 (6) | −0.0038 (6) | 0.0005 (5) |
C10 | 0.036 (1) | 0.022 (1) | 0.0236 (9) | 0.0028 (8) | −0.0020 (8) | 0.0008 (7) |
C11 | 0.0235 (9) | 0.0209 (9) | 0.0301 (9) | 0.0030 (7) | 0.0061 (7) | 0.0007 (8) |
N12 | 0.0226 (7) | 0.0216 (8) | 0.0207 (7) | −0.0002 (6) | 0.0068 (5) | 0.0017 (6) |
O13 | 0.0369 (7) | 0.0296 (7) | 0.0202 (6) | 0.0001 (6) | 0.0072 (5) | −0.0065 (5) |
O14 | 0.0372 (7) | 0.0248 (7) | 0.0266 (6) | 0.0085 (6) | 0.0080 (5) | 0.0061 (5) |
N15 | 0.0196 (7) | 0.0194 (7) | 0.0189 (6) | 0.0014 (6) | 0.0057 (5) | 0.0015 (6) |
O16 | 0.0261 (6) | 0.0173 (6) | 0.0418 (7) | 0.0017 (5) | 0.0089 (5) | −0.0055 (5) |
O17 | 0.0202 (6) | 0.0394 (8) | 0.0265 (6) | 0.0007 (6) | −0.0024 (5) | −0.0071 (6) |
C3H7N | F(000) = 256.000 |
Mr = 57.10 | Dx = 1.007 Mg m−3 |
Monoclinic, P21/c | Melting point: 190 K |
Hall symbol: -P 2ybc | Mo Kα radiation, λ = 0.71073 Å |
a = 9.507 (3) Å | Cell parameters from 1395 reflections |
b = 9.122 (3) Å | θ = 2–25° |
c = 9.790 (3) Å | µ = 0.06 mm−1 |
β = 117.469 (4)° | T = 170 K |
V = 753.2 (4) Å3 | Cylinder, colourless |
Z = 8 | 1.00 × 0.66 × 0.66 mm |
Bruker SMART APEX diffractometer equipped with an Oxford Cryosystems low-temperature device and an OHCD laser-assisted crystallisation device (Scientific Consulting, Essen, Germany). | 1075 reflections with I > 2.00u(I) |
Graphite monochromator | Rint = 0.04 |
ϕ & ω scans | θmax = 28.7°, θmin = 2.4° |
Absorption correction: multi-scan SADABS (Siemens, 1996) | h = −10→12 |
Tmin = 0.717, Tmax = 1.000 | k = −11→12 |
4625 measured reflections | l = −12→13 |
1797 independent reflections |
Refinement on F2 | Hydrogen site location: difference Fourier map |
Least-squares matrix: full | The positional parameters of the hydrogen atoms were restrained subject to the restraint that all chemically equivalent bond lengths and angles were similar. Two overall isotropic displacement parameters were refined for the methylene and amino hydrogen atoms. |
R[F2 > 2σ(F2)] = 0.050 | Method = Prince-modified Chebychev polynomial, (Carruthers & Watkin, 1979) W = [weight] * [1-(deltaF/6*sigmaF)2]2 57.9 95.3 55.5 21.3 4.03 |
wR(F2) = 0.166 | (Δ/σ)max = 0.003 |
S = 1.01 | Δρmax = 0.14 e Å−3 |
1797 reflections | Δρmin = −0.14 e Å−3 |
118 parameters | Extinction correction: Larson 1970 Crystallographic Computing eq 22 |
Primary atom site location: structure-invariant direct methods | Extinction coefficient: 222.2 (199) |
C3H7N | V = 753.2 (4) Å3 |
Mr = 57.10 | Z = 8 |
Monoclinic, P21/c | Mo Kα radiation |
a = 9.507 (3) Å | µ = 0.06 mm−1 |
b = 9.122 (3) Å | T = 170 K |
c = 9.790 (3) Å | 1.00 × 0.66 × 0.66 mm |
β = 117.469 (4)° |
Bruker SMART APEX diffractometer equipped with an Oxford Cryosystems low-temperature device and an OHCD laser-assisted crystallisation device (Scientific Consulting, Essen, Germany). | 1797 independent reflections |
Absorption correction: multi-scan SADABS (Siemens, 1996) | 1075 reflections with I > 2.00u(I) |
Tmin = 0.717, Tmax = 1.000 | Rint = 0.04 |
4625 measured reflections |
R[F2 > 2σ(F2)] = 0.050 | 118 parameters |
wR(F2) = 0.166 | The positional parameters of the hydrogen atoms were restrained subject to the restraint that all chemically equivalent bond lengths and angles were similar. Two overall isotropic displacement parameters were refined for the methylene and amino hydrogen atoms. |
S = 1.01 | Δρmax = 0.14 e Å−3 |
1797 reflections | Δρmin = −0.14 e Å−3 |
Experimental. The following data were produced using XPREP (Sheldrick, 2002) INTENSITY STATISTICS FOR DATASET # 1 azetid.hkl Resolution #Data #Theory %Complete Redundancy Mean I Mean I/s R(int) R(sigma) Inf − 2.10 101 103 98.1 3.23 69.9 18.54 0.0293 0.0457 2.10 − 1.60 117 117 100.0 3.40 17.4 19.83 0.0199 0.0400 1.60 − 1.40 103 103 100.0 3.24 9.9 15.93 0.0261 0.0453 1.40 − 1.25 126 127 99.2 3.10 6.8 14.41 0.0375 0.0479 1.25 − 1.15 119 119 100.0 2.81 4.8 13.33 0.0479 0.0552 1.15 − 1.05 177 177 100.0 2.62 2.3 8.23 0.0647 0.0771 1.05 − 1.00 109 109 100.0 2.54 1.8 8.77 0.0723 0.0886 1.00 − 0.95 140 140 100.0 2.31 1.7 7.55 0.0942 0.1020 0.95 − 0.90 164 165 99.4 2.21 1.1 5.21 0.1096 0.1407 0.90 − 0.85 222 222 100.0 2.14 0.7 3.72 0.1814 0.2229 0.85 − 0.80 265 268 98.9 1.94 0.3 1.94 0.4053 0.4562 0.80 − 0.75 247 338 73.1 1.17 0.3 1.55 0.4250 0.5806 0.75 − 0.74 15 88 17.0 0.18 0.0 0.00 14.2449 —————————————————————————— 0.85 − 0.74 527 694 75.9 1.34 0.3 1.70 0.4135 0.5361 Inf − 0.74 1905 2076 91.8 2.23 6.8 8.02 0.0353 0.0576 Merged [A], lowest resolution = 9.12 Angstroms, 67 outliers downweighted |
x | y | z | Uiso*/Ueq | ||
N1 | 0.37675 (19) | 0.53435 (17) | 0.63871 (17) | 0.0492 | |
C2 | 0.3325 (3) | 0.5991 (3) | 0.4863 (2) | 0.0608 | |
C3 | 0.2523 (3) | 0.7229 (3) | 0.5267 (3) | 0.0647 | |
C4 | 0.2494 (3) | 0.6187 (3) | 0.6462 (3) | 0.0631 | |
N5 | 0.33617 (19) | 0.19478 (18) | 0.60288 (17) | 0.0490 | |
C6 | 0.3048 (3) | 0.1498 (3) | 0.4472 (2) | 0.0569 | |
C7 | 0.1294 (3) | 0.1480 (3) | 0.3995 (2) | 0.0679 | |
C8 | 0.1774 (3) | 0.1442 (3) | 0.5700 (2) | 0.0641 | |
H1 | 0.367 (2) | 0.4438 (17) | 0.639 (2) | 0.056 (4)* | |
H5 | 0.410 (2) | 0.149 (2) | 0.674 (2) | 0.056 (4)* | |
H21 | 0.417 (2) | 0.620 (3) | 0.465 (3) | 0.080 (2)* | |
H22 | 0.258 (2) | 0.536 (2) | 0.410 (2) | 0.080 (2)* | |
H31 | 0.324 (2) | 0.801 (2) | 0.577 (2) | 0.080 (2)* | |
H32 | 0.154 (2) | 0.760 (3) | 0.452 (2) | 0.080 (2)* | |
H41 | 0.274 (3) | 0.657 (3) | 0.745 (2) | 0.080 (2)* | |
H42 | 0.152 (2) | 0.567 (3) | 0.608 (2) | 0.080 (2)* | |
H61 | 0.343 (3) | 0.219 (2) | 0.401 (3) | 0.080 (2)* | |
H62 | 0.347 (3) | 0.0566 (18) | 0.444 (3) | 0.080 (2)* | |
H71 | 0.078 (3) | 0.2367 (18) | 0.355 (3) | 0.080 (2)* | |
H72 | 0.068 (3) | 0.0700 (19) | 0.337 (3) | 0.080 (2)* | |
H81 | 0.124 (3) | 0.208 (2) | 0.606 (3) | 0.080 (2)* | |
H82 | 0.175 (3) | 0.0471 (17) | 0.607 (3) | 0.080 (2)* |
U11 | U22 | U33 | U12 | U13 | U23 | |
N1 | 0.0574 (9) | 0.0421 (8) | 0.0418 (8) | 0.0021 (7) | 0.0176 (7) | −0.0012 (6) |
C2 | 0.0631 (13) | 0.0700 (14) | 0.0536 (11) | 0.0101 (11) | 0.031 (1) | 0.009 (1) |
C3 | 0.0633 (14) | 0.0660 (13) | 0.0619 (13) | 0.0171 (11) | 0.026 (1) | 0.016 (1) |
C4 | 0.0774 (15) | 0.0647 (14) | 0.0564 (12) | 0.0122 (11) | 0.0388 (11) | 0.005 (1) |
N5 | 0.0524 (9) | 0.0474 (9) | 0.0402 (8) | 0.0023 (7) | 0.0153 (7) | −0.0021 (6) |
C6 | 0.0582 (12) | 0.0704 (14) | 0.044 (1) | −0.003 (1) | 0.0247 (9) | 0.0008 (9) |
C7 | 0.0554 (13) | 0.0967 (18) | 0.0444 (11) | −0.0035 (12) | 0.0170 (9) | −0.0027 (11) |
C8 | 0.0586 (13) | 0.0902 (17) | 0.0483 (11) | 0.0017 (11) | 0.0288 (9) | −0.006 (1) |
N1—C2 | 1.474 (2) | N5—H5 | 0.839 (15) |
N1—C4 | 1.465 (3) | C6—C7 | 1.510 (3) |
N1—H1 | 0.832 (15) | C6—H61 | 0.946 (15) |
C2—C3 | 1.513 (3) | C6—H62 | 0.945 (15) |
C2—H21 | 0.940 (15) | C7—C8 | 1.514 (3) |
C2—H22 | 0.949 (15) | C7—H71 | 0.941 (15) |
C3—C4 | 1.518 (3) | C7—H72 | 0.943 (15) |
C3—H31 | 0.947 (15) | C8—H81 | 0.940 (14) |
C3—H32 | 0.943 (15) | C8—H82 | 0.962 (15) |
C4—H41 | 0.952 (15) | N1—N5 | 3.121 (2) |
C4—H42 | 0.951 (14) | N1—N5i | 3.102 (2) |
N5—C6 | 1.470 (2) | N1—H5i | 2.265 (15) |
N5—C8 | 1.465 (3) | N5—H1 | 2.297 (16) |
C2—N1—C4 | 89.07 (15) | C6—N5—H5 | 114.6 (15) |
C2—N1—H1 | 115.1 (14) | C8—N5—H5 | 113.9 (15) |
C4—N1—H1 | 115.0 (14) | N5—C6—C7 | 89.46 (15) |
N1—C2—C3 | 88.72 (15) | N5—C6—H61 | 111.8 (15) |
N1—C2—H21 | 115.7 (15) | C7—C6—H61 | 118.4 (15) |
C3—C2—H21 | 119.4 (16) | N5—C6—H62 | 113.2 (15) |
N1—C2—H22 | 108.5 (15) | C7—C6—H62 | 113.8 (15) |
C3—C2—H22 | 112.2 (14) | H61—C6—H62 | 109.0 (17) |
H21—C2—H22 | 110.4 (17) | C6—C7—C8 | 86.04 (16) |
C2—C3—C4 | 85.68 (16) | C6—C7—H71 | 113.3 (15) |
C2—C3—H31 | 111.6 (15) | C8—C7—H71 | 111.0 (15) |
C4—C3—H31 | 109.3 (15) | C6—C7—H72 | 118.0 (15) |
C2—C3—H32 | 119.9 (15) | C8—C7—H72 | 118.7 (15) |
C4—C3—H32 | 117.2 (15) | H71—C7—H72 | 108.4 (17) |
H31—C3—H32 | 110.7 (18) | N5—C8—C7 | 89.48 (15) |
N1—C4—C3 | 88.88 (15) | N5—C8—H81 | 113.7 (15) |
N1—C4—H41 | 115.3 (15) | C7—C8—H81 | 117.3 (15) |
C3—C4—H41 | 117.9 (15) | N5—C8—H82 | 114.2 (15) |
N1—C4—H42 | 113.6 (15) | C7—C8—H82 | 113.0 (15) |
C3—C4—H42 | 111.9 (15) | H81—C8—H82 | 108.4 (16) |
H41—C4—H42 | 108.4 (16) | N1—H1—N5 | 171.1 (19) |
C6—N5—C8 | 89.35 (14) | N1ii—H5—N5 | 175 (2) |
Symmetry codes: (i) −x+1, y+1/2, −z+3/2; (ii) −x+1, y−1/2, −z+3/2. |
Experimental details
(VIII) | (IX) | (X) | (XI) | |
Crystal data | ||||
Chemical formula | C3H4N2O2 | C22H8I2O2 | C9H9N3O5 | C3H7N |
Mr | 100.08 | 558.08 | 239.19 | 57.10 |
Crystal system, space group | Monoclinic, C2/c | Monoclinic, P21/c | Monoclinic, P21/n | Monoclinic, P21/c |
Temperature (K) | 190 | 293 | 150 | 170 |
a, b, c (Å) | 9.3538 (7), 12.1757 (11), 7.2286 (6) | 4.202 (1), 20.956 (4), 9.276 (2) | 12.5693 (4), 4.8531 (1), 17.2663 (5) | 9.507 (3), 9.122 (3), 9.790 (3) |
α, β, γ (°) | 90, 104.593 (4), 90 | 90, 100.63 (3), 90 | 90, 99.1624 (15), 90 | 90, 117.469 (4), 90 |
V (Å3) | 796.70 (11) | 802.8 (3) | 1039.81 (5) | 753.2 (4) |
Z | 8 | 2 | 4 | 8 |
Radiation type | Mo Kα | Mo Kα | Mo Kα | Mo Kα |
µ (mm−1) | 0.14 | 3.93 | 0.13 | 0.06 |
Crystal size (mm) | 0.18 × 0.10 × 0.02 | 0.38mm × 0.08mm × 0.04mm | 0.83 × 0.15 × 0.04 | 1.00 × 0.66 × 0.66 |
Data collection | ||||
Diffractometer | Nonius Kappa CCD diffractometer | R3m/V (Siemens) diffractometer | Nonius Kappa CCD diffractometer | Bruker SMART APEX diffractometer equipped with an Oxford Cryosystems low-temperature device and an OHCD laser-assisted crystallisation device (Scientific Consulting, Essen, Germany). |
Absorption correction | Multi-scan DENZO/SCALEPACK (Otwinowski & Minor, 1996) | Psi-method | Multi-scan DENZO/SCALEPACK (Otwinowski & Minor, 1996) | Multi-scan SADABS (Siemens, 1996) |
Tmin, Tmax | 0.99, 1.00 | 0.278, 0.337 | 0.98, 0.99 | 0.717, 1.000 |
No. of measured, independent and observed reflections | 1593, 886, 886 [I > −10.00u(I)] | 2784, 1398, 1057 [I > 2σ(I)] | 4326, 2357, 1497 [I > 3.00u(I)] | 4625, 1797, 1075 [I > 2.00u(I)] |
Rint | 0.02 | 0.074 | 0.04 | 0.04 |
(sin θ/λ)max (Å−1) | 0.648 | 0.595 | 0.649 | 0.676 |
Refinement | ||||
R[F2 > 2σ(F2)], wR(F2), S | 0.061, 0.131, 1.13 | 0.036, 0.083, 1.06 | 0.032, 0.078, 1.04 | 0.050, 0.166, 1.01 |
No. of reflections | 886 | 1398 | 1497 | 1797 |
No. of parameters | 77 | 118 | 181 | 118 |
No. of restraints | 0 | 0 | 20 | ? |
H-atom treatment | ? | H atoms treated by a mixture of independent and constrained refinement | ? | The positional parameters of the hydrogen atoms were restrained subject to the restraint that all chemically equivalent bond lengths and angles were similar. Two overall isotropic displacement parameters were refined for the methylene and amino hydrogen atoms. |
Δρmax, Δρmin (e Å−3) | 0.27, −0.25 | 0.67, −0.54 | 0.19, −0.24 | 0.14, −0.14 |
Computer programs: COLLECT (Nonius BV, 1997), R3m/V-software (Siemens), SMART (Siemens, 1993), DENZO/SCALEPACK (Otwinowski & Minor, 1996), SAINT (Siemens ,1995), SIR92 (Altomare et al., 1994), SHELXS97 (Sheldrick, 1990), CRYSTALS (Watkin et al. 2001), SHELXL97 (Sheldrick, 1997), CAMERON (Watkin et al. 1996).
N1—C2 | 1.474 (2) | C6—C7 | 1.510 (3) |
N1—C4 | 1.465 (3) | C7—C8 | 1.514 (3) |
C2—C3 | 1.513 (3) | N1—N5 | 3.121 (2) |
C3—C4 | 1.518 (3) | N1—N5i | 3.102 (2) |
N5—C6 | 1.470 (2) | N1—H5i | 2.265 (15) |
N5—C8 | 1.465 (3) | N5—H1 | 2.297 (16) |
C2—N1—C4 | 89.07 (15) | N5—C6—C7 | 89.46 (15) |
N1—C2—C3 | 88.72 (15) | C6—C7—C8 | 86.04 (16) |
C2—C3—C4 | 85.68 (16) | N5—C8—C7 | 89.48 (15) |
N1—C4—C3 | 88.88 (15) | N1—H1—N5 | 171.1 (19) |
C6—N5—C8 | 89.35 (14) | N1ii—H5—N5 | 175 (2) |
Symmetry codes: (i) −x+1, y+1/2, −z+3/2; (ii) −x+1, y−1/2, −z+3/2. |
Footnotes
1Please make initial contact with either Graeme Day (gmd27@cam.ac.uk) or Sam Motherwell (motherwell@ccdc.cam.ac.uk).
2Thanks are due to Professor Gautam Desiraju for bringing this reported structure to our attention.
3Supplementary data for this paper are available from the IUCr electronic archives (Reference: DE5014 ). Services for accessing these data are described at the back of the journal.
4We thank Mr Andrew Trask and Dr David J. Watkin for hydantoin crystallization and X-ray respectively.
5Recall that molecule (XI) was not added to the list of targets until midway through the test.
Acknowledgements
Many thanks are due to Professor A. L. Spek for accepting the role of independent referee, holding the experimental structures during the course of the blind test and collecting submitted predictions from each of the participants. We are grateful to the crystallographers who supplied candidate structures: Andrew Trask and David Watkin [molecule (VIII)], Erich F. Paulus and Martin U. Schmidt [molecule (IX)], Angelo Gavezzotti and David Watkin [molecule (X)], and Simon Parsons and Andrew Bond [molecule (XI)]. Also thanks to Professor Jeremy Sanders for choosing the blind test molecules from the list of candidates and James Chisholm for help in analysing the submitted structures. We also thank the CCDC for hosting the workshop. G. M. Day thanks the Pfizer Institute for Pharmaceutical Materials Science (University of Cambridge) for funding. A. Dzyabchenko thanks Harold A. Scheraga for computational resources of his laboratory. The J. C. Facelli group thanks the following grants: US NSF #INT-0071032, University of Buenos Aires #UBACYT-X098 and the Argentinean CONICET #PIP 02151/01. Pantelides and Karamertzanis gratefully acknowledge financial support from the Mitsubishi Chemical Corporation and the United Kingdom's Engineering and Physical Sciences Research Council (EPSRC) under Platform Grant GR/N08636. S. L. Price and their group thank the Basic Technology Program of the Research Councils UK and the EPSRC E-Science program for financial support, and the EPSRC for computational resources funded under GR/S06233. Martin U. Schmidt thanks Thomas Müller and Alexander Degen (both University of Frankfurt) for supporting ab initio calculations.
References
Accelrys Inc. (1997). Cerius2. San Diego, USA. Google Scholar
Accelrys Inc. (2004). MS Modelling, Release 3.0.1. San Diego, USA. Google Scholar
Allen, F. H., Kennard, O., Watson, D. G., Brammer, L. & Orpen, A. G. (1987). J. Chem. Soc. Perkin Trans. II, pp. S1–S19. CrossRef Google Scholar
Anghel, A. T., Day, G. M. & Price, S. L. (2002). CrystEngComm, 4, 348–355. Web of Science CrossRef CAS Google Scholar
Arnautova, Y. A., Jagielska, A., Pillardy, J. & Scheraga, H. A. (2003a). J. Phys. Chem. B, 107, 7143–7154. Web of Science CrossRef CAS Google Scholar
Arnautova, Y. A., Pillardy, J., Czaplewski, C. & Scheraga, H. A. (2003b). J. Phys. Chem. B, 107, 712–723. Web of Science CrossRef CAS Google Scholar
Bazterra, V. E., Ferraro, M. B. & Facelli, J. C. (2002a). J. Chem. Phys. 116, 5984–5991. Web of Science CrossRef CAS Google Scholar
Bazterra, V. E., Ferraro, M. B. & Facelli, J. C. (2002b). J. Chem. Phys. 116, 5992–5995. Web of Science CrossRef CAS Google Scholar
Bazterra, V. E., Ferraro, M. B. & Facelli, J. C. (2004a). Int. J. Quant. Chem. 96, 312–320. Web of Science CrossRef CAS Google Scholar
Bazterra, V. E., Ona, O., Caputo, M. C., Ferraro, M. B., Fuentealba, P. & Facelli, J. C. (2004b). Phys. Rev. A, 69, 053202. Web of Science CrossRef Google Scholar
Beyer, T., Day, G. M. & Price, S. L. (2001). J. Am. Chem. Soc. 123, 5086–5094. Web of Science CrossRef PubMed CAS Google Scholar
Beyer, T. & Price, S. L. (2000). CrystEngComm, 34, 183–190. Web of Science CrossRef Google Scholar
Boese, R. & Nussbaumer, M. (1994). Tranformations and Interactions in Organic Crystal Chemistry. International Union of Crystallography, Crystallographic Symposia, 7. Google Scholar
Brodersen, S. W., Leuson, F. J. J. & Engel, G. (2003). Phys. Chem. Chem. Phys. 5, 4923–4931. Web of Science CrossRef CAS Google Scholar
Busing, W. R. (1981). WMIN, Report ORNL-5747. Oak Ridge National Laboratory, Tennessee, USA. Google Scholar
Chisholm, J. & Motherwell, W. D. S. (2005). J. Appl. Cryst. 38, 228–231. Web of Science CrossRef IUCr Journals Google Scholar
Day, G. M., Chisholm, J., Shan, N., Motherwell, W. D. S. & Jones, W. (2004). Cryst. Growth Des. 4, 1327–1340. Web of Science CrossRef CAS Google Scholar
Day, G. M. Motherwell, W. D. S. & Jones, W. (2005). Cryst. Growth Des. 5, 1023–1033. Web of Science CrossRef CAS Google Scholar
Day, G. M. & Price, S. L. (2003). J. Am. Chem. Soc. 125, 16434–16443. Web of Science CrossRef PubMed CAS Google Scholar
Della Valle, R. G., Venuti, E. & Brillante, A. (2003). J. Chem. Phys. 118, 807–815. Web of Science CrossRef CAS Google Scholar
Demartin, F., Filippini, G., Gavezzotti, A. & Rizzato, S. (2004). Acta Cryst. B60, 609–620. Web of Science CSD CAS Google Scholar
Desiraju, G. R. (2002). Nat. Mater. 1, 77–79. Web of Science CrossRef PubMed CAS Google Scholar
Dunitz, J. D. & Scheraga, H. A. (2004). Proc. Nat. Acad. USA, 101, 14309–14311. Web of Science CrossRef CAS Google Scholar
Dzyabchenko, A. V. (1994). Acta Cryst. B50, 414–425. CrossRef CAS Web of Science IUCr Journals Google Scholar
Dzyabchenko, A. V. (2004). PMC, Version 2004. Karpov Institute of Physical Chemistry, Moscow. Google Scholar
Dzyabchenko, A. V., Agafonov, V. & Davydov, V. A. (1999). J. Phys. Chem. A, 103, 2812–2820. Web of Science CrossRef CAS Google Scholar
Eijck, B. P. van (2002). J. Comput. Chem. 23, 456–462. Web of Science CrossRef PubMed Google Scholar
Eijck, B. P. van & Kroon, J. (2000). Acta Cryst. B56, 535–542. Web of Science CrossRef IUCr Journals Google Scholar
Eijck, B. P. van, Mooij, W. T. M. & Kroon, J. (2001a). J. Comput. Chem. 22, 805–815. Web of Science CrossRef Google Scholar
Eijck, B. P. van, Mooij, W. T. M. & Kroon, J. (2001b). J. Phys. Chem. B, 105, 10573–10578. Web of Science CrossRef Google Scholar
Erk, P. (1999). Crystal Design: From Molecular to Application Properties in Crystal Engineering: From Molecules and Crystals to Materials, edited by D. Braga, F. Grepioni & G. A. Orpen, NATO Science Series C, Vol. 538. Dordrecht: Kluwer Academic Publishers. Google Scholar
Gavezzotti, A. (1999–2000). ZIP-PROMET. University of Milano, Italy. Google Scholar
Gavezzotti, A. (2002). J. Phys. Chem. B, 106, 4145–4154. Web of Science CrossRef CAS Google Scholar
Gavezzotti, A. (2003a). J. Phys. Chem. B, 107, 2344–2353. Web of Science CrossRef CAS Google Scholar
Gavezzotti, A. (2003b). CrystEngComm, 5, 429–438. Web of Science CrossRef CAS Google Scholar
Gavezzotti, A. (2003c). CrystEngComm, 5, 439–446. Web of Science CrossRef CAS Google Scholar
Gavezzotti, A. (2004). OPIX. University of Milano, Italy. Google Scholar
Gemmel, E., Beck, H. Bolte, M. & Egert, E. (1999). MOMO. University of Frankfurt, Germany. Google Scholar
Hofmann, D. W. M. & Apostolakis, J. (2003). J. Mol. Struct. 647, 17–39. Web of Science CrossRef CAS Google Scholar
Hofmann, D. W. M. & Lengauer, T. (1997). Acta Cryst. A53, 225–235. CrossRef CAS Web of Science IUCr Journals Google Scholar
Holden, J. R., Du, Z. Y. & Ammon, H. L. (1993). J. Comput. Chem. 14, 422–437. CrossRef CAS Web of Science Google Scholar
Karamertzanis, P. G. & Pantelides, C. C. (2004). Mol. Simulation, 30, 413–436. Web of Science CrossRef CAS Google Scholar
Karamertzanis, P. G. & Pantelides, C. C. (2005). J. Comput. Chem. 26, 304–324. Web of Science CrossRef PubMed CAS Google Scholar
Kuleshova, L. N., Hofmann, D. W. M. & Antipin, M. Y. (2005). Cryst. Rep. 50, 167–176. Web of Science CrossRef CAS Google Scholar
Leuson, F. J. J. (2003). Cryst. Growth Des. 3, 189–192. Web of Science CrossRef Google Scholar
Lewis, T. C., Tocher, D. A. & Price, S. L. (2004). Cryst. Growth Des. 4, 979–987. Web of Science CSD CrossRef CAS Google Scholar
Lommerse, J. P. M., Motherwell, W. D. S., Ammon, H. L., Dunitz, J. D., Gavezzotti, A., Hofmann, D. W. M., Leuson, F. J. J., Mooij, W. T. M., Price, S. L., Schweizer, B., Schmidt, M. U., van Eijck, B. P., Verwer, P. & Williams, D. E. (2000). Acta Cryst. B56, 697–714. Web of Science CSD CrossRef CAS IUCr Journals Google Scholar
Mitchell, J. B. O. & Price, S. L. (2000). J. Phys. Chem. A, 104, 10958–10971. Web of Science CrossRef CAS Google Scholar
Mooij, W. T. M., van Eijck, B. P. & Kroon, J. (1999). J. Phys. Chem. A, 103, 9883–9890. Web of Science CrossRef CAS Google Scholar
Motherwell, W. D. S. (2001). Mol. Cryst. Liq. Cryst. 356, 559–567. Web of Science CrossRef CAS Google Scholar
Motherwell, W. D. S., Ammon, H. L., Dunitz, J. D., Dzyabchenko, A., Erk, P., Gavezzotti, A., Hofmann, D. W. M., Leuson, F. J. J., Lommerse, J. P. M., Mooij, W. T. M., Price, S. L., Scheraga, H., Schweizer, B., Schmidt, M. U., van Eijck, B. P., Verwer, P. & Williams, D. E. (2002). Acta Cryst. B58, 647–661. Web of Science CrossRef CAS IUCr Journals Google Scholar
Moult, J., Hubbard, T., Fidelis, K. & Perdersen, J. T. (1999). Proteins: Struct. Funct. Genet. 37, 2–6. Google Scholar
Nyburg, S. C. & Faerman, C. H. (1985). Acta Cryst. B41, 274–279. CrossRef CAS Web of Science IUCr Journals Google Scholar
Parsons, S. (2005). To be submitted. Google Scholar
Paulus, E. F., Acs, A. & Schmidt, M. U. (2005). In preparation. Google Scholar
Pillardy, J., Arnautova, Y. A., Czaplewski, C., Gibson, K. D. & Scheraga, H. A. (2001). Proc. Nat. Acad. Sci. USA, 98, 12351–12356. Web of Science CrossRef PubMed CAS Google Scholar
Press, W. H., Teukolsky, S. A., Vetterling, W. T. & Flannery, B. P. (1992). Numerical Recipes in FORTRAN. Cambridge Unviersity Press. Google Scholar
Price, S. L., Willock, D. J., Leslie, M. & Day, G. M. (2001). DMAREL. https://www.ucl.ac.uk/~ucca17p/dmarelmanual/dmarel.html . Google Scholar
Schmidt, M. U. & Englert, U. (1996). J. Chem. Soc. Dalton Trans. pp. 2077–2082. CrossRef Web of Science Google Scholar
Schmidt, M. U. & Kalkhof, H. (1997). CRYSCA. Frankfurt, Clariant GmbH. Google Scholar
Signorini, G. F., Righini, R. & Schettino, V. (1991). Chem. Phys. 154, 245–261. CrossRef CAS Web of Science Google Scholar
Sobol', I. M. (1967). Comput. Math. Math. Phys. 7, 86–112. CrossRef Google Scholar
Spek, A. L. (1980–2004). PLATON. https://www.cryst.chem.uu.nl/platon . Utrecht University, The Netherlands. Google Scholar
Steiner, T. (2000). Acta Cryst. B56, 673–676. Web of Science CrossRef CAS IUCr Journals Google Scholar
Watkin, D. J., Motherwell, W. D. S., Cooper, R. I. & Pantos, S. (2004). Acta Cryst. E60, o2295–o2297. Web of Science CSD CrossRef IUCr Journals Google Scholar
Willock, D. J., Price, S. L., Leslie, M. & Catlow, C. R. A. (1995). J. Comput. Chem. 16, 628–647. CrossRef CAS Web of Science Google Scholar
Yu, F. & Schwalbe, C. H. (2002). Structure of hydantoin from a twinned crystal. British Crystallographic Association Spring Meeting. Nottingham, UK. Google Scholar
Yu, F.-L., Schwalbe, C. H. & Watkin, D. J. (2004). Acta Cryst. C60, o714–o717. Web of Science CSD CrossRef CAS IUCr Journals Google Scholar
© International Union of Crystallography. Prior permission is not required to reproduce short quotations, tables and figures from this article, provided the original authors and source are cited. For more information, click here.