Emma H.
Wolpert
* and
Kim E.
Jelfs
*
Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, White City Campus, Wood Lane, London, W12 0BZ, UK. E-mail: e.wolpert@imperial.ac.uk; k.jelfs@imperial.ac.uk; Tel: +44 (0)20759 43438
First published on 25th October 2022
How molecules pack has vital ramifications for their applications as functional molecular materials. Small changes in a molecule's functionality can lead to large, non-intuitive, changes in their global solid-state packing, resulting in difficulty in targeted design. Predicting the crystal structure of organic molecules from only their molecular structure is a well-known problem plaguing crystal engineering. Although relevant to the properties of many organic molecules, the packing behaviour of modular porous materials, such as porous organic cages (POCs), greatly impacts the properties of the material. We present a novel way of predicting the solid-state phase behaviour of POCs by using a simplistic model containing the dominant degrees of freedom driving crystalline phase formation. We employ coarse-grained simulations to systematically study how chemical functionality of pseudo-octahedral cages can be used to manipulate the solid-state phase formation of POCs. Our results support those of experimentally reported structures, showing that for cages which pack via their windows forming a porous network, only one phase is formed, whereas when cages pack via their windows and arenes, the phase behaviour is more complex. While presenting a lower computational cost route for predicting molecular crystal packing, coarse-grained models also allow for the development of design rules which we start to formulate through our results.
Currently, state-of-the-art computational efforts focus on crystal structure prediction (CSP) which, although successful,5–7 is computationally expensive and must be employed on a case-by-case basis. This results in difficulty formulating design rules between structural motifs, chemical functionality, and their packing behaviour. Moreover, weak intermolecular interactions result in a high number of possible low-energy polymorphs and, coupled with the high computational cost of these calculations,8 this restricts the ability to use high-throughput approaches.5 Additionally, it is even more computationally expensive to apply CSP to multicomponent systems,9 where it has been shown that combining different molecules can allow property tuning in a way that is not possible with extended framework materials.10 Therefore, a novel method of predicting the packing of molecular materials must be introduced in order to simultaneously improve our understanding of molecular crystals, reduce computational efforts, and elucidate design rules for targeted phase formation.
In this paper, we develop a novel way of predicting the molecular solid-state based on a simplistic, coarse-grained model containing the dominant degrees of freedom driving crystalline phase formation. We focus on determining the packing behaviour of a subset of porous organic cages (POCs) formed through a condensation reaction between a trialdehyde and diamine in a [4 + 6] cycloimination, creating discrete organic molecules which contain a permanent internal cavity (Fig. 1). Although the cages formed are pseudo-octahedral geometrically, the molecules have pseudo-tetrahedral symmetry due to the different chemical features of the cages, where each face of the octahedra is either a cavity, known as a window, or contains an arene (Fig. 1(c)). The porous nature of POCs leads to a variety of potential applications, including in catalysis,11 sensing,12 encapsulation13 and molecules separations.14 Despite significant interest in POCs, their utilisation in industrial applications is hindered by the lack of understanding of how to control their assembly,15 both on a molecular level and in the solid-state. As techniques for predicting the molecular assembly of cage molecules become more refined,16,17 there still remains the question of how to predict and control the assembly in the solid-state.
As the internal cavity of the cages can only be accessed through their windows, changes in their solid-state phase behaviour can have a significant affect on the pore network in the material. Small changes in the chemical functionality of POCs can lead to non-intuitive changes in the global solid-state packing due to the fine balance between the weak dispersion forces (Fig. 2). These weak interactions are responsible for many of the advantages of POCs over extended frameworks such as their solution processability18 and the potential to control their solid-state assembly by manipulating the functionality of the cages or solvent used.19,20 However, facile prediction and control of the packing behaviour of novel cages is currently intractable as there remains a lack of understanding of how chemical functionality and solvent interplay in the formation of molecular solid-state materials. This complexity in predicting the packing behaviour of molecular crystals makes targeted design difficult.21
To implement the research presented in this paper, we use a general-purpose particle simulation toolkit, HOOMD-blue,22 to employ hard particle Monte Carlo (HPMC) simulations with directional interactions through “patches” (Fig. 3(a)). HPMC simulations as implemented within HOOMD-blue are often used to investigate the self assembly of structures formed by packing non-interacting polyhedra.23,24 Through these simulations, Glotzer and coworkers have shown that geometry alone can direct complex structural formation through an “entropic bond” governed by the polyhedral shape. The entropic bond is a purely statistical phenomena which explains that densely packed structures form from non-interacting particles due to the system maximising the number of microstates available. While the studies focused on colloidal, non-interacting particles, this has strong implications for packing in molecular materials. The entropic bonds are weak,25 on the order of a few kBT,26 which means enthalpic interactions dominate in most materials. However, the packing of organic molecules is dictated by weak intermolecular interactions, suggesting that the entropy gained from increasing the number of microstates available may influence the phase behaviour.
Within HOOMD-blue, patches can be added to hard particles in order to mimic directional interactions,27 which has been successfully applied to molecular materials as seen in the investigation of the self assembly of π-conjugated optoelectronic peptides.28 The benefit of using HOOMD-blue is the ease in using user-defined potentials as required for the introduction of coarse-grained models. This was demonstrated by Mansbach et al. who tracked the effects of core and side chain interaction strength and sterics on the morphology and kinetics of assembly using a coarse-grained model.28 These models allow for the formation of design rules by developing an understanding of the interactions and geometric building blocks required for targeted phase behaviour.29
Here we use HPMC simulations in conjunction with patchy particle models to investigate the effect of the combination of building block geometry and directional interactions on the solid-state phase behaviour of POCs. The basic framework rigidity of POCs means that geometrically, the molecules can be considered as polyhedra. For the POCs considered in this paper (Fig. 1), that means we can represent the molecular cages as hard octahedra for the HPMC simulations. The directional interactions are introduced through patches on each of the facets, where the different colour patches represent the reduction of symmetry of the octahedra due to the chemical makeup of the cages (i.e. the alternation of facets between windows or arenes) (Fig. 3(a)). Due to the simplicity of the model, this procedure drastically reduces computational efforts compared to CSP and facilitates the development of design rules required for targeted phase formation in the future. Although a simplistic model, we show that by manipulating the parameters of our coarse-grained model, we can reproduce the phase space spanned by the majority of octahedral POCs present in the Cambridge Structural Database.30 Finally, our results highlight new phases which are unreported within the literature. While the focus of this paper is on POCs, many of the motifs studied here are seen in other areas of molecular chemistry such as metal–organic cages.31,32 Thus our results may have wider reaching applications to predict and rationalise the solid-state behaviour of a range of molecular materials.
(1) |
(2) |
(3) |
(4) |
The angular term, Vang, is a measure of how directly the patches on adjacent particles point at each other. Here, θαij is the angle between the patch vector and the vector between the two neighbouring particles, rij, the magnitude of which is given by r (Fig. 3(b)). σang dictates the energetic penalty of particles deviating from being perfectly aligned, such that a larger σang value relates to worse alignment of the two octahedral facets, Fig. 3(c and d). σang is referred to as the width of the patch as changing σang narrows or widens the Gaussian function that describes the patchy interaction, effectively changing the angular width of the patch where the particle interacts through. Through this work, we examine how changing σang between 0.1 and 1.0 effects the solid-state phase behaviour of the interacting octahedral particles. We expect that this parameter encapsulates the effects of different substituents on the vertices as the substituents would likely change the directionality of the interaction due to steric effects, which is controlled by the patch width.
The torsional term, Vtor, describes the variation in the potential with rotation of the particle about the interparticle vector rij. ϕoffsetαβ is the preferred torsional angle between patches α and β, whereas ϕαβ is the actual torsional angle between patches α and β, such that there is an energy minimum where the two angles are the same. The preferred torsional angles used for each simulation are summarised in Table S3† and were chosen based on the likely lowest energy orientations of window-to-window and window-to-arene packings due to steric interactions, i.e. window-to-window prefers to be anti-aligned and window-to-arene prefers to be aligned. Similarly to σang, σtor dictates the energetic penalty of the particles deviation from the perfect torsional angle. In our simulations, we found that the results did not vary with σtor, and as such, much like other studies on patchy particles,35 we kept a constant ratio between σtor and σang such that σtor = 2σang. This is a physically reasonable approximation as, from chemical considerations, it is likely that σtor and σang are coupled such that the less the interaction cares about the alignment of the two molecules, the less likely it has a strong torsional preference i.e. the interaction overall will be less directional.
J is the measure of the interaction strength between the two octahedra. Varying σang in our model causes the clusters to be formed at different temperatures for the same value of J. This is because a smaller value of σang narrows the Gaussian function in eqn (3), resulting in a smaller value of Vang. As the absolute energy scale is not important to the results of this study, for simplicity a value of J was chosen for each simulation such that the transition temperature Tt occurred when kBTt ≈ 1. The interaction parameters used are summarised in Tables S1 and S2.† We note that for a given value of σang, J is larger for the window-to-window simulations than for the window-to-arene simulations. This is because each cage only has four windows, resulting in a maximum of four stabilising window-to-window “interactions” for each cage, whereas in the window-to-arene simulations, there can be up to eight stabilising interactions as the cages can be stabilised through interactions through the arene as well. This means that J must be larger to result in a transition occurring at the same temperature.
The simulations were performed on 512 particles of unit length in cubic boxes with a box length of 12 Å and periodic boundary conditions. As the potentials are attractive and the systems were initialised at low density, the simulations formed compact clusters with no mechanical stresses and structure defects that are often seen in bulk simulations. To determine if there were any finite size effects, some simulations were repeated with 4096 particles and evidenced the same results.
To determine the different structural phases formed, we calculated and compared the radial distribution functions (RDFs) of the centres of each octahedra within the clusters. Although this removed the orientational behaviour of the different phases, structures with similar RDFs were visually inspected to ensure that the orientational behaviour led to no meaningful differences between the structures. For structures at low σang, we were able to determine the space group of the clusters by abstracting a unit cell and using FINDSYM,36,37 a program used to identify the space group of a crystal, on a coarse-grained version of the structure and converted the solved structure back onto the equivalent structure formed with the cages. An overview of this process is given in Section S4.†
For structures at higher σang, as they are inherently more disordered due to the worse alignment of the cages (Fig. 3(c and d)), our procedure struggled to find a space group. In these cases, we were still able to abstract a unit cell and instead visually compared our structures to those found in the literature. An overview of which ones were solved using FINDSYM and which ones required visual inspection is given in Table S6† along with representative configurations of each visually solved phase (Fig. S3†). We then took the most prevalent ordered phases from the simulations and used them to colour the phase diagram by comparing the similarity of the RDF from the simulated data to the RDF of the solved phase using dynamic time warping, details of which are in Section S5.†
Fig. 6 (a) Phase diagram produced from simulations where patches of opposite types interact. Here the red, green, and blue components of the phase diagram are coloured based on the similarities of the phases to low temperature structures found at σang = 0.7, 0.2–0.4, and 0.5 respectively as described in more detail in Section S5.† We note that uncoloured areas of the phase diagram at low temperatures are due to the simulations timing out (as explained in Section 2.3) and the phase behaviour at these temperatures is considered to be unchanged from the last simulated structure. (b) Representative configurations of the low temperature phases showing how that angle of rotation of the nearest shell of cages rotates as a function of σang. From left to right the configurations represent the phases at σang = 0.2–0.4, 0.5, 0.6 and 0.7. The colour of the hexagons relate to the colour of the phase diagram where the phase is found. |
For the four different low temperature phases, the obvious question is how the phase behaviour differs with changing σang. Although the RDFs at higher values of σang suggest that the same phase persists from σang = 0.7–1.0, on inspection of the structures, the cages orientations are disordered when σang ≥ 0.8 (Fig. S8(a)†), likely due to the large values of σang and σtor leading to less strongly directional interactions towards the facets. This means that although the phase behaviour looks similar, we are hesitant to call this the same phase and instead suggest that phases at higher σang behave more like plastic crystals, much like in the case of high values of σang for the window-to-window simulations.
For the four orientationally ordered phases between σang = 0.2–0.7, when looking down one of the high symmetry directions, each cage is surrounded by a hexagonal arrangement of cages. Interestingly, the angle of the hexagonal arrangement around the central cage compared to the orientation of the central cage changes as a function of σang, Fig. 6(b). This suggests that the arrangement of the cages can be tuned almost linearly as a function of the patch width. To understand how this change in structure can affect the properties of the material, we examined the possible pore connectivity in each structure. We found that both the structures which form at small values of σang may be able to host 1D pore channels due to their extrinsic porosity (Fig. S9†), whereas the two ordered structures at higher values of σang can not. Therefore, understanding how the structure can be tuned as a function of patch width is a valuable task, both from a crystal engineering point of view, and for future studies focused on finding novel cages with pore connectivity. However, even if cages form either of the two structures at small values of σang, they might not exhibit porosity as if there were bulky groups on the vertices, the bulky groups could obstruct the connectivity by sitting within the pore channels.
On a molecular level, the manipulation of σang may be achieved by changing the chemical functionality of the vertices or by using directional solvents. Bulkier groups on the vertices are likely to result in smaller values of σang. Although bulkier groups might be thought to change the equilibrium distance between the cages (and thus σLJ in eqn (1)), results from force field calculations show that there is no change in the equilibrium bond distance for cages with vastly different steric behaviour e.g.CC1 and CC9 as shown in Section S9.† Instead, the reason why we expect that bulkier groups can be related to a decrease in σang is that the bulkier the group, the greater the energetic penalty for neighbouring cages to be unaligned from a central position—a measure of the patch width—due to repulsive steric interactions. Similarly, solvents can be used to obtain more directional interactions, for example through hydrogen bonding between the pores, and therefore smaller values of σang. We note that our simple model is unable to encapsulate all types of solvation effects, but instead use this as an example to highlight that some solvents may lead to more directional interactions along the 〈111〉 axes which could be used to navigate the phase behaviour seen here.
On changing temperature, the number of phase transitions seen in our simulations differs depending on the value of σang. At low σang (0.2–0.5), each phase undergoes one phase transition at kBT ≈ 1 to an ordered phase that persists down to low temperature. However, when σang ≥ 0.6, the structures undergo two phase transitions, one at kBT ≈ 1 and one at lower temperature, as evidenced by the different colours in the phase diagram (Fig. 6(a)). This first transition is to an amorphous cluster, a representative structure is included in Fig. S8(b),† and the second is to a positionally ordered phase at low temperature. Interestingly, the amorphous to ordered phase transition occurs for a wide range of σang, spanning areas of phase space with two distinct ordered structures at low temperature—both when σang = 0.6 and 0.7—as well as the plastic crystal phases where σang ≥ 0.8. The existence of two phases as a function of temperature suggests that the solid-state structure of cages which have a higher σang can be controlled by temperature. Thus, cages that relate to this area of phase space may form both an amorphous and ordered structure based on the temperature of the synthesis, whereas molecules which exist at lower values of σang may only evidence ordered structures no matter the synthesis temperature.
For cages where the dominant packing type is window-to-window, and a combination of window-to-window and arene-to-arene, only one solid-state structure is reported to exist (Fig. 7(a), window-to-window packing, and Fig. 7(b), window-to-window and arene-to-arene packing). This observation fits well with our simulations which also produce only one ordered structure from the two simulations, and on further analysis the structures produced in the simulations correspond to those seen experimentally (Fig. 8(a and b)).
When the dominant packing type is window-to-arene, three different crystal structures are reported to exist experimentally (Fig. 7(c–e)) which seemingly corroborates our window-to-arene results from the simulations that evidence a much richer phase diagram than for window-to-window. Yet, only two of the three experimentally known phases are found in our phase diagram, CC9-R3 (at σang = 0.6) (Fig. 8(c)) and CC1β′ (at σang = 0.7) (Fig. 8(d)). The remaining phase reported experimentally does not exist within any of our phase diagrams. However, this is not a surprising result. Experimentally, this structure is formed with both cages CC3 and CC4 upon desolvation from another ordered structure. As such, it has been suggested that the crystal structure identified is formed due to a solvent templation effect resulting in a single-crystal-to-single-crystal transformation upon desolvation, something not taken into account by our model, and so it is not expected to be found in our simulations.44,46 This result highlights how solvent can be used to alter the polymorphic behaviour of the crystal and even change the preference of the dominant interaction between facets. In this case, the change in packing behaviour for CC3 from CC3α to CC3β would likely decrease the porosity from the loss of the three dimensional pore network, however, for CC4, para-xylene can direct the packing behaviour from window-to-arene to window-to-window as seen in the formation of CC4β.44
While our results show that we can predict most of the crystal structures known to be formed by ordered, enantiopure, desolvated octahedral cages, there still remains two ordered phases produced in our window-to-arene simulations that do not match up to any of the structures in Table 1. The first of these structures is a novel phase (when σang = 0.5) (Fig. 8(e)) not reported within the literature, and the second is found in solvated forms of both CC3 with Et2O and CH2Cl2 and CC4 with MeOH (Fig. 8(f)). Although we were not intending to find solvated structures, the occurrence of the solvated crystal in this region of phase space is logical as the structure occurs at low values of σang = 0.2–0.4, which suggests the inclusion of the solvent increases the directionality, as it occurs at small patch widths. This is a logical result as the inclusion of solvents is likely to make the intermolecular interactions more directional, analogous to decreasing σang. Moreover, this phase is the same as the solvated structure which produces CC3β/CC4α upon desolvation.44,46 We note that the discussion of whether the affects of solvent can be represented by our model depends on how the solvent affects the phases. When the solvent is included in the pores, it leads to a more directional interaction which is able to be captured by our model. However, when the solvent is removed from the pores, as in the case for CC3β/CC4α, the act of desolvation disrupts the directional interactions introduced by the solvent without allowing the phase to properly re-equilibrate, causing a phase transition which our model cannot capture.44,46 An overview of where each experimental phase is found in our simulations is give in Table 2.
Experimentally reported structure | Dominant packing type | σ ang |
---|---|---|
PUDXES (CC3α) | Window-to-window | 0.1–0.6 |
OZECAY03 (CC4β) | ||
GANDUW (CC10) | Window-to-window/arene-to-arene | 0.1–0.6/0.1–0.6 |
NODVIN (CC3 with Et2O and CH2Cl2) | Window-to-arene | 0.2–0.4 |
OZEBUR (CC4 with MeOH) | ||
N/A | Window-to-arene | 0.5 |
GANDAC (CC9-R3) | Window-to-arene | 0.6 |
ELALAF (CC1β′) | Window-to-arene | 0.7 |
PUDXES02 (CC3β) | Window-to-arene | N/A |
OZECAY (CC4α) |
We have shown that when window-to-window packing dominates, only one phase is formed, as seen experimentally, whereas window-to-arene packing results in a much richer phase space containing four phases, only three of which have been seen experimentally. The coarse-grained model laid out in eqn (1) is able to produce the majority of the structures formed by the enantiopure, ordered, desolvated octahedral cages. By changing the width of the patch, and thus the directionality of the interactions between cages, we are able to manipulate the structural phases formed by our model, mimicking the structural changes seen when changing the chemical functionality. Now, the natural question is, given a novel cage, can we predict its likely solid-state structure using our simulations? And what design rules can we assimilate from our coarse-grained model?
Nevertheless, for the cages which pack preferentially via their window and arene facets, the energetic differences produced by the two cages slipping around the central patch position are in agreement with where their crystal structures are found relative to one another in the phase diagram. CC9 has a higher energetic penalty associated with displacements from the central patch position than CC1, mirroring the relationship that the solid-state structure of CC9-R3 is found at a lower σang than CC1β′ (Fig. S12†). These results indicate that similar calculations looking at the slipping of cages might be able to predict the packing behaviour of novel cages, particularly once a threshold is established between atomistic results and σang. For this we will need to find molecular cages whose crystalline structures relate to areas of our phase space currently unobserved experimentally.
As for design rules, our results have started to establish a relationship between chemical functionality, solvent, and solid-state phase space as represented by our model. Cages synthesised with bulkier diamine vertices (e.g.CC3, CC9, and CC10) tend to sit at lower values of σang than less bulky cages such as CC1 (Table 2). Additionally, the inclusion of solvent, as in the case of structures formed by CC3 and CC4, leads to low values of σang. These trends can inform design rules, as they show that solvating groups can be used to introduce more directional interactions, increasing the number of polymorphs available for a given cage and thus opportunities for increased porosity, and that bulkier groups on the vertices lead to less rotation and slipping between neighbouring cages.
Finding more desolvated cages whose solid-state structures match those in our phase space, particularly for structures at low σang, will lead to the development of a better relationship between results from atomistic calculations and the parameters of our coarse-grained model. This can lead to more concrete rules about the relative energies required to navigate phase space, allowing for easy prediction of the crystal structures formed by novel cages in a much less computationally expensive process than for CSP—only requiring dimer pair DFT calculations compared to many solid-state structure calculations. Through this we hope to further decipher the relationship between solvent, chemical functionality, and σang to indicate design rules for tuning the solid-state phase behaviour of POCs.
On top of information on the structural behaviour of cages, our results may hold insights on the crystallisation process of POCs. The window-to-arene simulations show that, depending on the temperature of the simulation, either an amorphous or ordered phase forms. Perhaps a similar trend could be found with synthesis temperatures, forming a link between our simulations and crystallisation effects such that our model can act as a guide for POCs crystallisation pathways. For example, our phase diagram suggests that crystallising CC9 or CC1 at high temperatures could lead to the formation of amorphous phases, whereas at low temperature an ordered crystalline structure will form. However for cages such as CC3, or any cages that pack window-to-window, the phase formed is independent of synthesis temperature as either an ordered phase forms at any temperature, as in CC3, or an amorphous phase is formed, as with cages that exist at higher σang. This is a particularly useful result as it highlights routes for targeting amorphous cages which can exhibit enhanced gas selectivity47 and increased porosity relative to their crystalline counterparts.47–49 Moreover, our results may provide insight into the polymorphic behaviour of POCs by giving an overview of how many phases exist given the dominant interaction type. For example, for window-to-window packing, the only known ordered structure is that of the 3D connected diamondoid structure found in CC3, which matches our window-to-window simulation results, whereas the window-to-arene simulations exhibit a much richer polymorphic phase space, as seen experimentally.
Through these simulations, we have shown that our coarse-grained model is able to reproduce the majority of the packing behaviour seen by the enantiopure, ordered, desolvated octahedral cages. These results are the first steps towards using computationally inexpensive coarse-grained simulations to predict the packing of POCs from their molecular structure, as well as developing design rules based on the chemical functionality of the vertices. Here we have shown how both solvent and the size of the functional group on the cages vertices can be used to direct crystal structure formation through changing the directionality of the interactions. Thus, both the addition of solvent and the inclusion of bulky functional groups can manipulate structural formation by reducing rotation and slipping between neighbouring cages, changing the packing behaviour. Additionally we have shown that synthesis temperature may be used to influence the crystallinity in cages, guiding efforts for amorphous phase formation which can lead to increased porosity.48 As far as we are aware, this is the first study that looks to coarse-grain molecules for crystal structure prediction. We hope that the results highlighted here will be expanded upon to use coarse-grained modelling for molecules with a richer experimental phase space in order to map molecular interactions onto coarse-grained phase space using processes outlined in Section 3.3 for novel molecular structure prediction.
Although the focus of this paper has been on POCs, the results outlined here have ramifications in many areas of molecular chemistry. Motifs common in POCs are also widespread in other porous materials such as elemental carbon cages50 and metal–organic cages.31 Therefore, solid-state structures predicted by methods outlined in this study may also relate to crystals formed by other molecular systems and the methodology outlined in this paper could be expanded to other areas of molecular chemistry to help predict the organic solid-state. Eventually, we aim to introduce our computationally inexpensive methodology into existing high-throughput workflows using robotic automation and computational modelling,15 creating a streamlined workflow for structural prediction of novel molecules. Applying this process to multi-component systems could help drive phase exploration at orders of magnitude currently intractable by current computational techniques.
Footnotes |
† Electronic supplementary information (ESI) available: Additional computational details, results, figures, structure files and movies of simulations. See DOI: https://doi.org/10.1039/d2sc04511g |
‡ Although plastic crystals are typically formed by spherical particles,24,38,39 particle orientations in the plastic crystal phase may not be entirely random and some orientations can be more likely to occur than others,38,40–42 similar to what we observe in these simulations. |
This journal is © The Royal Society of Chemistry 2022 |