Unveiling the self-association and desolvation in crystal nucleation

The importance of self-association and desolvation in the process of crystal nucleation is emphasized; the rearrangement of supramolecular structures also has a non-negligible effect on the nucleation kinetics.


Introduction
Crystallization from solution is one of the most widely used unit operations and has been extensively used in the food, chemical, dye, pharmaceutical and agrochemical industries. Nucleation is crucial as the first step of the crystallization process. It can affect the physical properties of a crystal, such as the structure, shape, defects, domain size and polymorphs (Ou et al., 2020).
Although there have been more and more studies on clusters in the nucleation process over the past decade, the molecular mechanism of the process is still unclear. During crystal nucleation, molecular aggregates in solution play an important role, though their characterization is extremely difficult to describe. Recent evidence suggests that solution chemistry can be an effective method to investigate selfassembly in the nucleation process by exploring the solutesolvent interactions and self-association of solute molecules in solution (Davey et al., 2013;Gebauer et al., 2014;Tang et al., 2018). For example, Davey and Trout used FTIR to study the existing form of butynoic acid in different solvents. In chloroform it is a carboxyl hydrogen-bonded dimer, whereas in ethanol, though the solute has the potential to form hydrogen bonds, it does not participate in the formation of dimers. This result also matches the crystallization behaviour. Dimers are easily formed in non-polar solvents and cyclic structures are easily formed in polar solvents (Parveen et al., 2005;Chen & Trout, 2008). Other systems were investigated using the solution chemistry method, such as 2,6-dihydroxybenzoic acid (Davey et al., 2001), p-acetanisidide (Saito et al., 2002), isonicotinamid (Kulkarni et al., 2012;Maggioni et al., 2017), benzoic acid Burton et al., 2010), tolfenamic acid Mattei & Li, 2012), mandelic acids (Davey et al., 2006), inosine (Chiarella et al., 2007), etc. Recent advances in computational and analytical techniques have also facilitated investigations of larger molecular clusters more efficiently (Sosso et al., 2016). Therefore, several attempts have been made to explain the evolution of the so-called 'growth unit' during nucleation (Gavezzotti et al., 1997;Di Tommaso, 2013;Zeglinski et al., 2018). By studying the molecular structure evolution pathways from solute molecules to supramolecular arrangements, the structure correlation between the solution aggregates and crystal syntheses can be explained (Byrn et al., 1976;Bernstein & Hagler, 1978;Habgood, 2012).
From these explorations it was found that not only can the final crystal structure and nucleation mechanism be determined by the solvent, but that the nucleation rate can also be affected by the solvent in the crystallization process. Therefore, a number of researchers suggested a link between the solution structure and measured nucleation kinetic data (Davey et al., 2015;Sullivan et al., 2014). For example, Davey et al. studied the influence of solvent on nucleation kinetics and concluded that the process of solute dimerization and desolvation is the speed-control step of the entire nucleation process (Sullivan et al., 2014). Similar conclusions have also been obtained in other systems, such as risperidone, where the stronger the solvent-solute interaction is, the slower the nucleation rate will be (Khamar et al., 2014;Mealey et al., 2015). However, other research studies showed contrasting results, for example, a series of structure-related benzoic acids were investigated in four solvents. For a variety of solvents and solutes, this assumption was still valid, but when all the solutes and solvents were considered, it did not hold, and eventually led to the aromatic stacking being assigned the key step in nucleation (Cruz-Cabeza et al., 2017).
In order to better understand the relationship between molecular conformations, crystal structure, solution chemistry and nucleation kinetics, investigations focused on the relationship between solution chemistry and nucleation kinetics were carried out here using phenacetin (PHEN) as the model compound, as shown in Fig. 1. The nucleation process of PHEN in six solvents [chloroform, acetonitrile, methanol, toluene, N,N-dimethylacetamide (DMA), dimethyl sulfoxide (DMSO)] was investigated through a combination of spectroscopic techniques (FTIR, NMR, NOESY) and computational methods [density functional theory (DFT)]. Crystallization of PHEN was carried out in six solvents, and the induction time under different supersaturations was measured to obtain the nucleation kinetic data. Furthermore, computational chemistry was also employed to aid in consistent interpretation, linking solute-solvent interactions and molecular conformations to nucleation behaviours.

Materials
Phenacetin (PHEN) was purchased from Shanghai Yuanye Biological Technology Co. Ltd, China, and its mass fraction purity was higher than 98%. All the solvents employed (methanol, chloroform, acetonitrile, toluene, DMA and DMSO) were analytical reagent grade with molar purity higher than 99.5% and were obtained from Tianjin Kewei Chemical Technology Co. Ltd, China. Chloroform-d (99.8% D), acetonitrile-d 3 (99.8% D), methanol-d 4 (99.8% D) and DMSO-d 6 (99.8% D) were purchased from SAAN Chemical Technology Co. Ltd of China. All chemicals were used without any further purification.

Induction time measurement
The solubility data for PHEN were collected using the gravimetric method (equilibrium for 24 h, three repeats) at 25 and 40 C, and are presented in Table S1 of the supporting information. For the purpose of solubility measurement, data were employed to give guidance for the concentration of detection in ATR-FTIR (attenuated total reflectance Fourier transform infrared spectroscopy) and prepare a solution with different concentrations to create specific supersaturations. The definition of the supersaturation ratio is S = x/x*, where x is the actual mole fraction of PHEN and x* is the equilibrium solubility mole fraction.
The induction time, which is defined as the time when the constant supersaturation is established (temperature of solution reaches nucleation temperature) to the moments at which the detectable crystal particles appear. The turbidimeter (Crystal Eyes, DMS-2, HEL Ltd) was used to monitor the formation of nuclei. To begin this process, a round-bottomed jacketed glass batch crystallizer (350 ml) was employed to prepare different concentrations of solution by dissolving appropriate amounts of PHEN in the respective solvents at 45 C. To stir thoroughly, a mechanical stirrer with an agitation speed of 300 rpm was used in the crystallizer. The temperature  was controlled by two thermostats (Xianou Laboratory Instrument Works Co. Ltd, Nanjing) connected to two tbranch pipes. The temperature accuracy was AE0.01 K. The temperature was first set at 45 C to dissolve the solid completely, then the t-branch pipes were changed to facilitate shock cooling to 25 C. The point at which the system dropped to 25 C was noted as the start of induction time, and when the turbidmeter indicated a sudden increase this was noted as the end point. In order to reduce the experimental error, six reproducible experiments were performed at each composition.
The relationship between the nucleation rate and supersaturation can be described well using the classical nucleation theory model: where J is the nucleation rate (m À3 s À1 ), A is nucleation kinetic parameter, S is the degree of supersaturation, B is the nucleation thermodynamic parameter, f 0 is the collision frequency factor independent of supersaturation, C 0 is the concentration of nucleation sites, v 0 is the volume of the solute molecule (m 3 ), is the interface energy (mJ m À2 ), k is Boltzmann constant (J K À1 ) and T is the absolute temperature (K). So there is a linear relationship between ln(J/S) and 1/ln 2 s. By plotting a linear fit, we can obtain the pre-exponential kinetic factor A from the intercept and the thermodynamic parameter B from the slope.

Powder X-ray diffraction analysis
Crystals were isolated immediately upon the appearance of solids under different conditions. Powder X-ray diffraction (PXRD) was used to identify samples on a Rigaku D/max-2500 (Rigaku) using Cu K radiation (0.15405 nm) in the 2 range 5-50 and with a scanning speed of 8 min À1 to determine the crystal form nucleated in these experiments. The results showed that all the PHEN obtained in this work was pure form I.

Single-crystal growth and crystal structure analysis
The slow solvent evaporation method was employed to obtain single crystals of PHEN form I. Specific amounts of PHEN solid were dissolved in methanol, and then the solution was transferred to an open beaker with parafilm. A few holes were made on the parafilm to ensure the solution evaporated slowly. The whole system was placed into an oven and kept at 293.15 K. Crystals of PHEN and its solvent of appropriate size for single-crystal X-ray diffraction (SCXRD) were obtained after several days. SCXRD measurements were conducted on a Rigaku Saturn 70 CCD diffractometer using Mo K radiation ( = 0.71073 Å ) with a graphite monochromator. Integration and scaling of intensity data were accomplished using the program SAINT (Bruker, 2017). The structures were solved using the SHELXS2014 (Sheldrick, 2014) suite of programs, and refinement was conducted using SHELXL2018 (Sheldrick, 2015).

FTIR spectroscopy
Solid spectroscopy data were collected using Bruker ATR-FTIR (attenuated total reflectance Fourier transform infrared spectroscopy), with a resolution value of 4 cm À1 , a scan time of 32 and wavenumber ranging from 400 to 4000 cm À1 . An ATR-FTIR spectrometer (ReactIRTM45, Mettler-Toledo) equipped with a Duradisc Dicomp probe was adopted to facilitate solution spectroscopy. For each sample, 32 scans were collected over a spectra range from 650 to 2800 cm À1 at 2 cm À1 resolution to investigate the molecular structure of PHEN at different concentrations in the six solvents tested. The concentration of PHEN solution used in this work was determined by solubilities in different solvents, which varies from unsaturated to supersaturated.

NMR spectroscopy
Different concentrations of 13 C-NMR and 1 H-NMR spectra were measured in DMSO, acetonitrile, methanol and chloroform. All the 13 C-NMR and 1 H-NMR spectra were detected using a 600 MHz liquid NMR spectrometer (Bruker AVANCE III) at 298 K after 32 and 1024 scans. The software Mestrenova (http://mestrelab.com/software/mnova/nmr/) was employed to process and analyze the data. The chemical shifts in the 1 H and 13 C spectra were determined relative to the internal reference TMS.

Nuclear Overhauser effect spectroscopy
2D NOESY experiments were carried out for PHEN solution in DMSO, methanol, acetonitrile and chloroform at room temperature using a 600 MHz Bruker AVANCE III NMR spectrometer. 2D NOE spectra were measured with a standard pulse for both F1 and F2 dimensions. The number of F1 increments was 256, each with 65 536 data points in the F2 dimension. The NOE mixing time was optimized to 0.8 s by measuring NOE buildups. The number of scans and dummy scans were set to be 16 and 2, respectively.

Crystal structure analysis
Hirshfeld surface and 2D fingerprint analyses were employed to quantitatively analyze and compare the intermolecular interactions of PHEN using Crystal Explorer 17 (Turner et al., 2017) software.
2.9. Computational method 2.9.1. Potential energy surface computation. For the potential energy surface (PES) scan, the conformer in phenacetin was extracted and its geometry optimized (herein, conformer A). Then the PES of phenacetin conformer A with dihedral angles 1 and 2 (Fig. 2) was generated by scanning for 18 steps with a step length of 10 for both 1 and 2 . All 2.9.2. Electrostatic potential distribution. It is generally believed that electrostatic potential can be used to predict and explain the relative molecular orientation and the strength of combination if a complex is mainly assembled by static electricity (such as a hydrogen bond, dihydrogen bond, halogen bond, etc.). The quantitative molecular surface analysis module of the Multiwfn program (Lu & Chen, 2012a,b) is capable of partitioning the whole van der Waals surface into multiple fragments, allowing us to study the characteristics of electrostatic potential distribution (Lu & Manzetti, 2014). Multiwfn and VMD (Humphrey et al., 1996) were used plot the van der Waals surface electrostatic potential distribution.
2.9.3. Solvent-solute interaction calculation. Density functional theory (DFT) calculations were performed by Gaussian09 to quantify the interactions in (1:1) molecular complexes of PHEN in the six solvents (Frisch et al., 2016). The geometries were optimized by the hybrid M06-2x function and 6-31 + G(d,p) basis set with the Grimme D3 dispersion correction using the SMD implicit solvation model (Grimme et al., 2010;Pratt et al., 2007). The Grimme dispersion correction allows a good description of weak interactions, such as van der Waals interactions. The binding energy (ÁE bind ) between two molecules is calculated using the following equation: where E AB is the energy of the PHEN-solvent complex, and E A and E B are the energies of the isolated monomer PHEN and the solvent, respectively. All the energies have been corrected for the zero-point vibrational energies. BSSE is the basis set superposition error and is calculated to correct the over-estimation of binding energies caused by overlapping of the basis functions (Boys & Bernardi, 1970).

Crystallization outcomes
The solid forms of PHEN in methanol, chloroform, acetonitrile, toluene, DMA and DMSO at different supersaturations and temperatures were studied. In all solutions form I was obtained and the corresponding PXRD patterns of are shown in Fig. 3.
The experimentally measured PXRD patterns of PHEN crystallized from different solvents and the simulated patterns from the Cambridge Structural Database (CSD) are compared in Fig. 3. The main diffraction peaks are consistent, indicating that they are all the same crystal form.

Nucleation rate in different solvents
The induction time of PHEN in different solvents at various supersaturations was measured and the results are shown in Table S2. Because the volume of solution used to measure the induction time is relatively large (generally ! 150 ml), the measured induction time fluctuation does not show the random phenomenon of the induction time usually observed in small volumes. By relating the induction time (t) to supersaturation (S), it is possible to estimate the nucleation rates and nucleation kinetic parameters of crystals (Zong et al., 2019). Fig. 4(a) presents the relationship between the nucleation rate J and solution supersaturation S. The results indicate that PHEN showed the fastest nucleation rate in acetonitrile within the experimental supersaturation, followed by methanol. In the case of low supersaturation, the nucleation rate of PHEN in DMSO is faster than that in toluene, whereas the situation is the opposite for high supersaturation. Among the six selected solvents, the slowest nucleation rate is in DMA and chloroform. In Fig. 4(b), ln(J/S) and ln 2 S show a good linear relationship in six solvents, indicating that method, supersaturation range and control of experiment conditions to conduct induction time measurements are suitable for this system. The kinetic parameter A, thermodynamic parameter B, molecular collision frequency f 0 C 0 and interfacial energy can be calculated from the slope and intercept, as shown in Table 1. It can be seen from the data that the interfacial energy followed the order of acetonitrile < methanol < DMSO < toluene < DMA < chloroform, almost the same with that of   nucleation rate data. That is, the greater the interface energy, the more difficult it is to nucleate, indicating that nucleation is mainly controlled by thermodynamic processes. In contrast, the order of molecular collision frequency was acetonitrile < methanol < DMSO < toluene < DMA < chloroform. A higher collision frequency should lead to a shorter induction time, but this parameter had no obvious relationship with the order of the nucleation rates. It can be explained that the nucleation rate of the same solute molecule in different solvents may be determined by the interface energy, and the interface energy is closely related to the interactions between the solute molecule and the solvent molecule, such as hydrogen bonding and solvation.

Crystal structure analysis
The single-crystal data of PHEN form I are presented in Table 2. According to the SCXRD data, PHEN form I belongs to the monoclinic crystal system and the space group is P2 1 /c. There is one PHEN molecule in the asymmetric unit.  Table 1 Calculated kinetic and thermodynamic parameters for PHEN nucleation in different solvents.

Figure 4
Relationships between (a) nucleation rates J and supersaturation S; (b) ln(J/S) and 1/(ln 2 S) according to classic nucleation theory.  The crystal structures of PHEN form I are shown in Fig. 5. The Hirshfeld surface was further used to quantify the different types of interactions and their contributions in crystal packing. The 2D fingerprint plot and the percentage of various contacts are shown in Fig. 6. In Fig. 5(a), form I was arranged alternately through N-HÁ Á ÁO interactions, which corresponds to OÁ Á ÁH as the strongest interactions in the 2D fingerprint plot. It can be seen that HÁ Á ÁH contacts and C-HÁ Á Á interactions contribute most to the Hirshfeld surface in form I, which is in agreement with the crystal structure analysis in Fig. 5. The HÁ Á ÁH contacts contributing the most part (56.2%) are likely due to the short contacts between the aromatic rings. The formation of a hydrogen bond between the amine and carboxyl group leads to close HÁ Á ÁH contacts. On the other hand, the hydrogen atoms on aromatic rings become close when aromatic interactions (C-HÁ Á Á and Á Á Á) are formed, which contributes another part of the HÁ Á ÁH contacts.

FTIR spectroscopy
The solid spectra of PHEN show strong bands for carbonyl stretching at 1643 and 1655 cm À1 , indicating the formation of strong hydrogen bonds, as shown in Fig. 7(a). The crystal structure in Fig. 5(a) shows the formation of hydrogen bonds between the carbonyl and the amine group, which is consistent with the results of the solid spectra.
Compared with the solid spectra, the carbonyl peak in the IR spectrum of PHEN in solution shows an obvious blue shift, this indicates the weakening of interactions of the carbonyl groups in the solution. The observed displacements of the (C O) modes upon solution results from the reduction in the C OÁ Á ÁH-N hydrogen bonding present between the molecules in solution. The different values in different solvents are associated with the involvement of hydrogen bonding. Thus, it is reasonable to assume that the solute molecules were solvated by the solvents in solution. The stronger the interactions, the more the carbonyl band is displaced to lower wavenumbers. This feature can be used to rank the strength of solvent-solute interactions (Khamar et al., 2014;Mealey et al., 2015). Thus, Fig. 7(a) shows that the interaction strength of the PHEN carbonyl with the solvent increases in the order toluene < acetonitrile < DMA < DMSO < chloroform < methanol.
The IR spectra of PHEN in acetonitrile, methanol, DMSO and toluene [Figs. 7(b)-7(e)] show strong bands for carbonyl stretching at 1688, 1670, 1681 and 1700 cm À1 , which represents varying degrees of solvation. Acetonitrile and DMSO can be hydrogen acceptors, whereas toluene is neither a hydrogenbond donor nor a hydrogen-bond acceptor. It can only form weak interactions through C-H. Thus, we expect PHEN to show the highest stretching vibration peak of the carbonyl group in toluene. The IR spectrum of PHEN in toluene shows two peaks in the carbonyl group region, the strong peaks at 1700 cm À1 indicate the existence of non-solvated aggregates in toluene, and the weaker one at 1680 cm À1 suggests a small fraction involve hydrogen bonding. The shoulder peak of low wavenumber in toluene is not obvious due to the too-low solubility of PHEN in toluene. With increasing solute concentration in methanol, the carbonyl peak shows a shoulder at a lower wavenumber about 1655 cm À1 which continually increases with concentration. This phenomenon suggests an increase in strong bound carbonyl species with an increase in the concentration of solute. It is difficult to interpret the solute-solute aggregation information in alcohol solutions, as alcohol can act as both a hydrogen bond donor and a hydrogen bond acceptor, which makes it difficult to differentiate solvent-solute and solute-solute interactions.
The IR spectra of PHEN in chloroform shows that the stretching peak of carbonyl group has a significant red shift with increasing concentration, indicating that the C O hydrogen-bond complex is formed. According Fig. S1 of the supporting information, the spectra of the same concentration of PHEN in chloroform and chloroform-d have the same value for carbonyls, indicating that the hydrogen-bond complex is a solute-solute aggregate formed by C OÁ Á ÁH-N, which is  consistent with the hydrogen bond formed in the crystal. At the same time, as the concentration increases, the blue shift of the benzene ring stretching peak also shows self-association between solutes. This supports that C OÁ Á ÁH-N and Á Á Á interactions play an important role in self-association. Furthermore, a similar trend was observed in DMA solution: with increasing concentration the spectra move towards their position in the solid state. increases, the proportion of solvent decreases and the solvation strength decreases. Chemical shifts of 1 H and 13 C in chloroform-d were present in Figs. 8(c) and 8(d). All the protons display downfield changes with increasing concentration, which supports desolvation and self-association. H 23 shows the largest changes, indicating that the carbonyl group plays an important role in the desolvation process. H 7 exhibits a shielding effect as the concentration increases, these changes can be attributed to self-association through C OÁ Á ÁH-N and PHEN-PHEN stacking in the solvent , which also corresponds to the deshielding effect of the carbonyl group; these correspond to the red shift of the carbonyl group and the blue shift of the C C of the benzene ring in the IR spectrum.
In acetonitrile-d 3 [ Fig. 8( f )], carbonyl 13 C of PHEN unveils similar concentration-dependent changes to those seen in chloroform-d. The deshielding effect of NH and carbonyl 13 C can be attributed to self-association through C OÁ Á ÁH-N

2D NOESY spectra
The structural details of these solute-solute and solutesolvent assemblies were further explored by 2D NOESY. As shown in Fig. 9, the NOE cross peak of H 7 and H 23 appears in chloroform-d solution. Combining the crystal structure data in Figs. 9(a) and 9(b), the distance between H 7 and H 23 is 6.695 Å . But when the PHEN molecules assemble by forming a C OÁ Á ÁH-N hydrogen bond, the distance between H 7 and H 23 is 4.271 Å . Additionally, an NOE cross peak appears when the space distance is closer than 5 Å . However, PHEN in acetonitrile-d 3 , methanol-d 4 and DMSO-d 6 did not show the NOE cross peak of H 7 and H 23 (Figs. S2 for acetonitrile, S3 for DMSO and S4 for methanol). Therefore, PHEN shows an obvious self-association effect in chloroform, which is consistent with FTIR and NMR spectroscopy results. The selfassembly is not obvious in acetonitrile, methanol and DMSO, this may be due to the solvation effect which creates an energy barrier for its self-association.

Molecular conformation
Conformation adjustment is a vital process during nucleation (Derdour & Skliar, 2014;Li et al., 2020). If there is a high rotation barrier between the conformations in solid and solution, conformation adjustment may have an obvious effect on the nucleation process, which could decrease the nucleation rate (Zeglinski et al., 2018). Thus, a PES about 1 and 2 was generated as shown in Fig. 10. From the PES, the results are evident. Regardless of whether it is in solution or solid, the energy change trend with the dihedral angle is the same. Both 1 and 2 reach the minimum energy at 179 , which is the most stable conformation. The conformations in the six solvents are essentially the same as in the solid, indicating they easily transform to the solid conformation during nucleation. Thus, it is reasonable to consider that the conformation adjustment does not have an important effect on the nucleation rate of PHEN in these six solvents.

Molecular interactions
The use of electrostatic potential is a well established approach to predict and explain the relative molecular  Top: (a) distance between H 7 and H 23 in a PHEN molecule, (b) distance between H 7 and H 23 while PHEN molecules assemble by forming a C OÁ Á ÁH-N hydrogen bond. Bottom: 2D NOESY of PHEN in chloroform-d at room temperature.  orientation and the strength of the combination if a complex is mainly assembled by static electricity (such as a hydrogen bond, dihydrogen bond or halogen bond). And the more negative (or positive) the electrostatic potential is, the more electrophilic (nucleophilic) the atom is likely to be. Therefore, the distribution of the van der Waals surface electrostatic potential of molecules can be analyzed and used to predict the most active sites; four sites on PHEN were selected to optimize the 1:1 solute-solvent complexes and calculate the binding energy. As can be seen from Fig. 11, the carbonyl oxygen exhibits the greatest electronegativity and was selected as site 1. The amino hydrogen showing the most positive surface electrostatic potential was selected as site 2. The other two sites are ethoxy oxygen (site 3) and benzene ringelectrons (site 4). The distributions of the van der Waals surface electrostatic potential of solvent molecules are shown in Fig. S4. The optimized geometries and binding energy results are presented in Fig. 12.
It can be seen from Fig. 12 that site 1 presents the largest binding energy in acetonitrile, chloroform and methanol, this is because these all have strong hydrogen-bond donors, favouring the formation of heterodimers which can significantly affect the binding energy. Toluene and DMA as hydrogen-bond donors show the largest binding energy at site 3. Only DMSO as a hydrogen-bond acceptor shows the largest binding energy at site 2.

Discussion
Given the above data, it is now possible to investigate the role of solvents in the nucleation process from a molecular perspective. First, the relative nucleation rate of PHEN does not correlate with the fundamental properties of solvents, such as boiling point and density. Also, there is no correlation between the nucleation rate and the relevant solubility.
In recent years, a number of works focused on building a link between the nucleation rate and the solvation strength, such as risperidone (Mealey et al., 2015), tolbutamide (-Zeglinski et al., 2018), fenoxycarb (Zeglinski et al., 2019), salicylic acid (Khamar et al., 2014) and PABA (Sullivan et al., 2014). A similar approach was also taken in this study. The nucleation rates followed the order of acetonitrile < methanol < DMSO < toluene < DMA < chloroform, which was consistent with the interface energy. IR and NMR spectroscopy show solute aggregation effects in DMA and chloroform, but the nucleation rates in chloroform and DMA are slowest. According to the calculated 1:1 solute-solvent binding energy, the binding energy of site 3 (ethoxy oxygen) and the nucleation rate are basically consistent, except for chloroform, indicating that site 3 plays a major role in the nucleation process.
Conformation adjustment also plays an important role in the nucleation process. Thus, the discrepancy in the chloroform and DMA cases may be caused by a high energy barrier between the conformations in chloroform and DMA. However, the computational results of the potential energy scan about the torsion angles ( 1 and 2 ) in different solvents demonstrated that neither an obvious conformational difference in chloroform and DMA nor a high energy barrier exist in this system. Thus, it is concluded that the conformation adjustment should not be the main reason for an abnormal low nucleation rate.
It may be that as the concentration increases, PHEN forms a supramolecular structure in these two solvents which differs from the crystal structure, and then needs to undergo a rearrangement process, so the nucleation rate is slowest. At the same time, the solvation in chloroform is through a special halogen-bond interaction, which may also cause a relatively large energy barrier to the subsequent rearrangement of the supramolecular structure. van der Waals surface electrostatic potential of PHEN plotted using Multiwfn and VMD.
However, when all the solvent systems are considered together, no clear relationship between the specific site interaction, the solvation strength obtained from IR spectroscopy, the conformation structure similarity, the way of gathering and the nucleation difficulty could be summarized. No single factor could individually describe the actual order of the nucleation difficulty and each factor does play a crucial role in certain situations. Therefore, we suggest that none of the four factors: the similarity of the solute in liquid and solid states, the specific site interaction, the way of gathering and the solvation strength, could be neglected. It is speculated that the rearrangement of the supramolecular structure plays a crucial role in determining the nucleation rate.

Conclusions
Investigations on the relationship between solution chemistry and nucleation kinetics have been carried out using PHEN as a model compound. We found that form I of PHEN could be obtained in acetonitrile, methanol, toluene, chloroform, DMA and DMSO. The crystal structures were analysed. FTIR, NMR and NOESY spectroscopies were used to analyze the solute species in solution. The results showed that in chloroform and DMA, PHEN undergoes nucleation through C OÁ Á ÁH-N hydrogen bond self-association, whereas in the other solvents PHEN more or less undergoes a desolvation process. However, the nucleation kinetics data showed that chloroform and DMA had the slowest nucleation rates. We speculate that the self-association in these two solutions produces a supramolecular structure which is different from the solid structure. Rearrangement plays a major role in the rate of nucleation. This work confirms the importance of the self-association and desolvation processes during crystal nucleation. However, since the nucleation of crystals is complicated and many factors could affect the nucleation process, much more work needs to be carried out to fully understand nucleation phenomena.