Stephen A. Wells*a,
Naomi F. Cessfordb,
Nigel A. Seatonc and
Tina Düren*a
aCentre for Advanced Separations Engineering, Department of Chemical Engineering, University of Bath, Bath, UK. E-mail: saw42@bath.ac.uk; t.duren@bath.ac.uk
bInstitute for Materials and Processes, School of Engineering, University of Edinburgh, Edinburgh, UK
cAbertay University, Bell Street, Dundee, UK
First published on 8th May 2019
Metal–organic frameworks (MOF) comprising metal nodes bridged by organic linkers show great promise because of their guest-specific gas sorption, separation, drug-delivery, and catalytic properties. The selection of metal node, organic linker, and synthesis conditions in principle offers engineered control over both structure and function. For MOFs to realise their potential and to become more than just promising materials, a degree of predictability in the synthesis and a better understanding of the self-assembly or initial growth processes is of paramount importance. Using cobalt succinate, a MOF that exhibits a variety of phases depending on synthesis temperature and ligand to metal ratio, as proof of concept, we present a molecular Monte Carlo approach that allows us to simulate the early stage of MOF assembly. We introduce a new Contact Cluster Monte Carlo (CCMC) algorithm which uses a system of overlapping “virtual sites” to represent the coordination environment of the cobalt and both metal–metal and metal–ligand associations. Our simulations capture the experimentally observed synthesis phase distinction in cobalt succinate at 348 K. To the best of our knowledge this is the first case in which the formation of different MOF phases as a function of composition is captured by unbiased molecular simulations. The CCMC algorithm is equally applicable to any system in which short-range attractive interactions are a dominant feature, including hydrogen-bonding networks, metal–ligand coordination networks, or the assembly of particles with “sticky” patches, such as colloidal systems or the formation of protein complexes.
We make use of cobalt succinate MOFs, formed by the assembly of cobalt (Co2+) ions and succinate (COO–CH2–CH2–COO–) anions, as our proof-of-concept system to understand the influence of synthesis conditions on the formation of different frameworks. Multiple structural phases have been identified in hydrothermal syntheses14–16 using aqueous mixtures of cobalt hydroxide and succinic acid as ingredients, comprising seven distinct frameworks with markedly different topologies depending on the system temperature and the ratio of metal to ligand. Capturing the formation of these topologically distinct structures from the same components under different synthesis conditions provides an important challenge for our simulations. The cobalt ions are, in all the crystal structures, octahedrally coordinated by some combination of succinate carboxylate oxygens, solvent water oxygens, or framework oxygens or hydroxide groups. In general, phases produced at higher temperatures or with a greater proportion of cobalt are characterised by a greater degree of metal–metal association, that is, adjacent cobalt coordination octahedra with one, two or three oxygens in common.
The two phases that we investigate in this study are the low-temperature phases A and F illustrated in Fig. 1. Both are synthesised at 348 K. Phase A, produced at higher [succinate]:[cobalt] ratios, is characterised by linear chains of alternating metal and ligand units. The connectivity of these chains is one-dimensional and each cobalt ion is coordinated by two ligands and four water molecules. Adjacent, parallel chains are linked by hydrogen bonding between these water molecules. Phase F, produced at lower [succinate]:[cobalt] ratios (i.e. with a higher proportion of metal), is characterised by linear chains of linked metal octahedra; these chains are linked to one another by bridging succinate ligands.
The assembly of a MOF from its components involves the reversible formation of associations between metal and ligand units. In this paper, we develop a molecular Monte Carlo simulation approach suited to describing the assembly process. Our molecular modelling makes use of potentials of mean force (PMFs) derived from fully atomistic explicit-solvent interactions, so that we can use implicit-solvent simulations while retaining local solvent structuring effects. A system of overlapping “virtual sites” represents the coordination environment of the metal ions and allows both metal–metal and metal–ligand associations to develop spontaneously. These site overlaps also motivate a simple collective move algorithm, allowing smaller clusters to aggregate once they have formed, and are used to identify clusters with topologies appropriate to particular structural phases. We apply this Contact Cluster Monte Carlo (CCMC) approach to investigate the composition dependence of the production of phases A and F in cobalt succinates.
The system of virtual sites is similar in principle to the cationic dummy atom (CaDA) approach17–20 sometimes used in molecular dynamics simulations, as for example in the work of Yoneya et al.,10–12 to represent the coordination geometry around a metal centre. However, our virtual sites are more general, being interaction sites for PMFs rather than charge sites for Coulomb interactions. In particular, our model permits the overlap of virtual sites to represent the association of metal centres through a bridging oxygen species, allowing both metal–metal and metal–ligand interactions, whereas CaDA represents only metal–ligand interactions.
The effective energy of interaction between the various sites in our implicit-solvent model is represented by a set of PMFs derived from all-atom empirical-potential Monte Carlo simulations.21 We emphasise that our potentials are derived systematically on physical grounds, without “tuning” to favour the formation of any particular MOF structure. These simulations, carried out using a modified version of the MUSIC code,22 made use of the TIP3P water model23 and appropriate force field parameters for the divalent cobalt ion24 and for the succinate ion, using the OPLS-UA forcefield25,26 supplemented by the OPLS-AA force field where necessary.27–29 The simulation temperature was 348 K, corresponding to the experimental synthesis of phases A and F. Electrostatic interactions were handled in real space using the Wolf method.30,31 For every pair of interacting atom types from the set (Co, C, CH, O) a PMF was determined using umbrella sampling and the Weighted Histogram Analysis Method (WHAM).32–34 Force field parameters are tabulated, and additional details of the MC simulations are given in ESI S.2.†
All of the PMFs used in this study are provided as a text file and are discussed in ESI S.3.† Most of the resulting PMFs are relatively featureless, displaying strong repulsion at close distances (steric interaction) and near-zero interaction at greater distances once one or more water molecules are found between the atoms. This is as expected in cases where one or both of the atom types have small partial charges. The Co–O interaction, by contrast, displays significant structure, as shown in Fig. 3, largely due to the strong electrostatic interaction between the Co cation and the negatively charged carboxyl oxygen. The most important features are a strongly attractive well at a Co–O distance of about 1.9 Å, defining the coordination of metal by ligand oxygen, and a second, shallower attractive well at around 4 Å, corresponding to the solvent-mediated situation Co–water–O. The two wells are separated by an energy barrier at around 2.5 Å, corresponding to the unfavourable geometry where the oxygen species is not close to the metal but there is no room for a water molecule between them. The existence of this second attractive well shows how the PMFs capture solvent structuring and templating effects that can be highly significant in MOF formation. Such structuring is frequently neglected in implicit-solvent MD simulations where the solvent is represented only with a range-dependent dielectric constant.10 To make use of this potential in our virtual site model, we redefine the Co–O interaction as an interaction between vO of the cluster and O of the linker. A succinate oxygen interacts only with the closest vO site belonging to a given Co ion and the local attractive well corresponds to a zero vO–O separation. This potential therefore favours colocation of succinate oxygen and vO sites, allowing the ligand to coordinate the metal.
Our model also requires an attractive vO–vO interaction, favouring a zero vO–vO distance, in order to generate metal–metal associations as seen in the denser phases. In this case the overlap of vO sites represents the presence of a bridging oxygen species between the metal ions. This vO–vO interaction is absent from the set of calculated PMFs and so we employ a scaling of the vO–O interaction, as follows. A single vO–vO interaction represents two interactions between Co2+ and the bridging oxygen species, implying a factor of 2; and we assume that the bridging species is a hydroxyl with charge qOH = −1, whereas the charge of a carboxylate oxygen in the MC simulations is qO = −0.8. Under the assumption that the Coulomb energy is dominant in this interactions, this gives a scaling factor of 2 × (1/0.8) = 2.5. Preliminary testing (see ESI S.4†) confirmed that this vO–vO potential allows metal–metal associations to develop, whereas a smaller scaling factor does not provide a sufficient attractive interaction.
Steric radii for clash exclusion are assigned to the atomic types as specified in ESI S.5.† A vO site is, however, permitted to overlap freely with other vO or with succinate O sites. In our analysis of the results, two objects are considered to be clustered together if a vO site in one object has a centre-to-centre distance of less than 0.5 Å with a vO or O site in the other, putting the two sites within the short-range attractive well of the PMF. The resulting cluster geometries can then be assessed for their similarity to phases A and F in terms of structural motifs.
P(i)PPROP(ij)PACC(ij) = P(j)PPROP(ji)PACC(ji) |
In the general case (see for example chapter 15 of Frenkel and Smit36) the probabilities can also include a Jacobian factor reflecting non-uniform sampling of phase space. This can arise from a change of variables and/or from the introduction of holonomic constraints between interacting bodies. In the present case, however, no holonomic constraints are introduced between interacting bodies, and so no explicit Jacobian is applied. We note that for single-object moves, PPROP is just a constant representing the probability of choosing one object from the complete list of objects in the system, and so
PACC(ij)/PACC(ji) = P(j)/P(i) |
In the standard Metropolis Monte Carlo scheme, the ratio of the acceptance probabilities is set to the inverse of the Boltzmann energetic factor between j and i, that is exp(−ΔEij/kT), so as to sample microstates in proportion to their thermodynamic probability.
Many approaches to clustering and collective moves are possible, as discussed by e.g. Frenkel and Smit,36 and multiple clustering approaches for collective moves in MC have been reported in the literature. The Virtual Move Monte Carlo (VMMC) collective move approach of Whitelam and Geissler,37,38 its convenient algorithmic presentation by Ruzicka and Allen,39 and the very similar energetic clustering approach of Bhattacharyay and Troisi,40 as well as the rejection-free cluster formation algorithms of Luijten et al.,41–43 have been used to describe collective moves in, for example, simulations of colloidal fluids44 and the self-assembly of patchy particles representing proteins.45
We adopt an approach such that PPROP(ij) will equal PPROP(ji) by construction, retaining the Metropolis energetic criterion unchanged. For clarity of discussion, we define the following terms. An “object” is a single molecular entity, made up of multiple atomic sites. All Monte Carlo moves begin with the selection of one object. A “cluster” is a group of objects moving together as a single unit in a collective move. A “contact” between two different objects signifies close proximity between a site on one object and a site on the other, such that a favourable interaction would be expected. In our model, certain objects include “virtual sites” and close contacts are represented by the overlap of two virtual sites, or of a virtual site with an atomic site, as illustrated in Fig. 2 and 4. We therefore describe our model as Contact Cluster Monte Carlo (CCMC).
An attempt at a collective move begins with the selection of an individual object x in state i. All contacts between x and other objects y can then be tested for cluster formation. A contact will either form a link – so that y joins x in a moving cluster C – or a break, so that y does not join the cluster. The probability that a contact forms a link is a constant user-defined parameter PLINK. When an object joins the cluster C, it is tested for contacts with any objects that are not yet members of C until no new contacts are proposed. The end result is a cluster C bounded, in state i, by a set of breaks Bi. This set may of course be null. The probability of such a cluster being formed in state i can be notionally divided into the probability of forming links within the cluster, P(C), and the probability of not forming links in the breaks, P(Bi).
A collective move of the cluster (in general, a combination of a displacement and a rotation) generates state j. The internal geometry of the cluster is unchanged. However, its boundary is now a different set of breaks Bj (which may of course be null, if C forms no new contacts in j). We therefore penalise the probability of proposing this cluster move, using a factor P(Bj). If Bj involves n contacts, this probability is given by
P(Bj) = (1 − PLINK)n. |
The move is rejected if a random number selected uniformly from the range 0 to 1 is greater than P(Bj). Clearly if n = 0 then P(Bj) = 1 and the proposed move is not rejected. With this factor, the probability of proposing a cluster move from i to j is identical to that of the reverse move from j to i, being given by P(Bi)P(C)P(Bj) in both cases, and detailed balance is maintained.
As a result of this penalty, the results of the simulation are notably insensitive to the value of PLINK. Higher values favour the formation of larger moving clusters, but make it less likely that cluster moves will lead to new overlaps. A default of PLINK = 0.5 is effective (see ESI S.1†). We note that, in lattice Monte Carlo simulations of surfactant assembly carried out by Wu et al.,46 a similar cluster scheme with PLINK = 1.0 was used in order to forbid clusters from joining together, as an approximation of electrostatic repulsion between micelles.
Note that the energetic clustering methods37,40 allow particles to move along local potential energy gradients, and can therefore be used to approximate the dynamical evolution of a system.38 Since CCMC does not probe these local gradients it does not have this quasi-dynamic behaviour. As such, our simulations probe the energetic landscape of self-assembly rather than approximating the molecular dynamics thereof. The differences between CCMC and energetic clustering methods are discussed in more detail in ESI S.7.† CCMC is applicable to any system in which such short-range attractive interactions are a dominant feature, including hydrogen-bonding networks, metal–ligand coordination networks, or the assembly of particles with “sticky” patches, such as colloidal systems or the formation of protein complexes.
The representation of the cobalt coordination environment by the octahedron of virtual oxygen sites, and the set of vO–vO and vO–O interactions, are designed to represent the chemistry of the cobalt succinate system. By construction the system permits the formation of all the metal–metal and metal–ligand associations found in cobalt succinate structures and shown in Fig. 2, while not permitting inappropriate ligand–ligand direct associations. Other MOF systems with different chemistries would require adaptation of the scheme. For example, a representation of a Zeolitic Imidazolate Framework47 (ZIF) would involve a tetrahedral coordination of Zn ions by virtual nitrogens which can be overlapped by the real nitrogen atoms of the ligand; and in this case no attractive interaction between two virtual nitrogen sites would be required as this system does not display metal–metal association. The clustering approach shown in Fig. 4, meanwhile, is capable of moving clusters of arbitrary size and shape and is agnostic about the chemical identities of the clustered objects. It should therefore be very generally applicable without modification.
Fig. 5 Flow chart of Monte Carlo move sequence. The cluster construction loop (boxed at top right) occurs if a collective move is attempted. |
Testing for acceptance occurs in three stages. An initial steric clash test identifies and rejects cases where the move leads to a sterically forbidden overlap of non-bonded atoms. If a collective move generates new contacts, it also has a probability of being rejected to maintain detailed balance as discussed previously. The energy of the trial system is only calculated (using the interatomic PMFs) if the first two stages are passed. The move is then accepted or rejected based on the usual Metropolis energetic criterion. Each simulation run in this work consists of 2 × 108 attempted moves.
The identification of A-like clusters is based on the recognition of linear chains of alternating metal (M) and ligand (L) objects, schematically –M–L–M–L–. The identification of F-like clusters likewise proceeds by a labelling process which recognises groups of multiple M objects bridged by L objects (compare Fig. 1). In a filtering round, only those clusters which contain at least one group of three connected M objects (–M–M–M–) are counted; a strict criterion intended to avoid false-positive identification of F-like clusters. A final point to note is that this identification system results in the clusters L–M–L and M–L–M being identified as “A-like”. However, these three-object clusters can also be identified in the structure of phase F. To avoid false-positive identifications we therefore do not count such clusters of size three as truly A-like, and when assessing how many objects are members of A-like clusters, we consider only clusters of size 4 or more. The details of the phase recognition method are given in ESI S.6.†
To illustrate the character of the results, we consider simulation runs with PCOLLECT = 0.9. Over the course of the simulations, clusters of overlapping objects appear with gradually increasing size. Fig. 6 shows a census of all overlapping clusters, and of clusters identifiable as A-like or F-like, for both compositions. Clusters occur in this case with sizes from 2 to 15 molecular objects, with smaller clusters being more common.
For the ligand-poor composition C1, for which experimentally phase F is observed, there is no evidence of truly A-like clusters (note that phase identification for A-like clusters of size 3 is ambiguous and therefore only A-like clusters of size 4 or above are counted as described above). F-like clusters occur with sizes from 3 to 11; indeed the single largest cluster in the system is identifiably F-like. In contrast, for the ligand-rich composition C2, for which phase A is observed experimentally, A-like clusters are produced with sizes up to 11. Only a single F-like cluster, of size 10, is observed in this simulation run. Its appearance provides a useful confirmation that our model is capable of producing either type of cluster at any composition.
The different phase selection behaviours for the two compositions is also clear if we count up the number of molecular objects that are contained within identifiably A-like or F-like clusters, as shown in Fig. 7. This census also reveals the brief appearance of some A-like clusters in the ligand-poor composition C1, early on in the simulation, which are subsequently out-competed by F-like clusters.
Fig. 7 Number of molecular objects in identifiably A-like and F-like clusters over the course of a simulation with PCOLLECT = 0.9 at compositions (a) C1 (ligand-poor) and (b) C2 (ligand-rich). |
Fig. 8 shows illustrative examples of an F-like cluster formed at composition C1 (note the M–M–M structural motif required for this phase) and of an A-like linear chain formed at composition C2.
The preceding analysis demonstrates that our modelling approach is successful in capturing the character of syntheses with ligand-rich and ligand-poor compositions. In order to assess the effectiveness and utility of the collective move system, we now compare the results of simulations carried out at compositions C1 and C2 with and without collective moves, for PCOLLECT = 0 or 0.9. In each case we show the results of two independent Monte Carlo runs with different random seeds. Cluster census data in Fig. 9 shows that simulations without collective moves (PCOLLECT = 0; Fig. 9a and d) exhibit kinetic trapping, producing mostly small clusters of sizes 2–5. With collective moves, the simulations are able to produce larger clusters. The contrast is particularly strong at the more densely packed composition C2 (Fig. 9c and d), where the collective move scheme is clearly necessary in order to produce clusters with sizes greater than five.
Fig. 10 summarizes the sizes of the largest A-like and F-like cluster produced in each simulation (Fig. 10a) and the total number of objects contained in identifiably A-like and F-like clusters (Fig. 10b). The tendency towards F-like clusters in the ligand-poor composition C1 and towards A-like clusters in the ligand-rich composition C2 clearly exists with or without the use of collective moves. This establishes that the phase selection tendency results from the composition and the potential model, and is not dependent on the collective move system. In composition C1, the use of collective moves slightly favours the formation of larger F-like clusters, and clearly favours the inclusion of more objects into F-like clusters. In composition C2, the effect of collective moves is much more dramatic, doubling the size of the largest A-like clusters, and tripling the number of objects included in clearly A-like clusters (simulations using a PCOLLECT value of 0.5 rather than 0.9 give similar results, shown in detail in ESI S.9†). This underscores the importance of collective moves in Monte Carlo simulations of denser systems to avoid kinetic trapping effects.
Our results provide proof of principle that the CCMC approach is capable of identifying composition-dependent differences in MOF growth topologies. In principle our method can be extended to other MOF systems, although a new set of PMFs are required for each new system. Different MOF chemistries are of course a priority, as facile syntheses of stable MOFs with appropriate pore geometries are a necessity if MOFs are to fulfil their promise as microporous materials in applications.4,5,50 Possible target systems include other MOFs in which different phases can arise from the same building blocks under different reaction conditions, e.g. the chromium terephthalates MIL-101 (ref. 51) and MIL-53.52,53 The flexible chemistry of the Zr6 cluster found in structures such as UiO-66 (ref. 54) would also be of great interest; the coordination of each zinc ion can involve framework oxygen, hydroxide groups, water molecules and carboxylate oxygens, so that the adaptability of the virtual site system should be highly applicable. ZIFs are another important MOF subclass, in which strikingly different zeolite-like framework topologies arise through the use of different imidazolate derivatives as ligands. The interest in this case would be an investigation of the effect of ligand geometry on the initial generation of small clusters and rings.
Furthermore, the CCMC approach is not limited to MOFs, but could also be applied to other systems by taking a very broad definition of what constitutes a ligand. For example, the crystallisation of calcite (CaCo3) from solution is a widely studied system55–57 which could be described in our model by considering the triangular carbonate anion as a ligand.
In simulations of molecular aggregation, collective (cluster) moves are important to avoid severe computational slowdown as small clusters form and grow. We developed a Contact Cluster Monte Carlo (CCMC) algorithm which cleanly separates the cluster formation probability, based on the geometry of local attractive contacts, from the Metropolis energetic criterion for move acceptance. CCMC thus avoids the computational cost of evaluating multiple exponential terms during cluster construction and makes it easy for favourable interactions both to form and to break during cluster construction. The collective move algorithm described here is equally applicable to any model dominated by local attractive interactions, for example hydrogen bonding networks, the assembly of positively and negatively charged groups or the assembly of particles with “sticky” patches, such as colloidal systems or the formation of protein complexes. We therefore anticipate that the algorithm will be of general use in modelling the assembly of framework structures, including MOFs.
In molecular simulations of the cobalt succinate system, a MOF that exhibits different phases depending on the synthesis conditions, our methodology successfully models the aggregation of MOF building units into clusters, producing clusters with topologies characteristic of the experimentally observed A and F phases. The geometric collective move system introduced here is effective in assembling clusters with sizes that are not reached by single-object moves due to kinetic trapping. We observe different cluster topologies depending on the ligand to metal ratio: at high ligand concentration, 1D chains of alternating metal–ligand–metal form, whereas at low ligand concentration metal–metal clusters and chains are linked together by multiply coordinated ligands. This captures the experimental observation of the growth of two distinct phases (“A” and “F”) at moderate temperatures in ligand-rich and ligand-poor batch compositions. We note in particular that our PMFs were obtained systematically and were not “tuned” to produce A-like or F-like clusters. To the best of our knowledge, this is the first time that the spontaneous emergence of two different, experimentally appropriate, topologies, based only on the synthesis conditions has been predicted using molecular simulation and represents an important step forward in the simulation of MOF formation.
Footnote |
† Electronic supplementary information (ESI) available: Sections S.1: effect of PLINK parameter on cluster formation, S.2: details of explicit-solvent molecular Monte Carlo simulations and production of potentials of mean force, S.3: potentials of mean force as used in CCMC simulations, S.4: construction and testing of vO–vO potential, S.5: steric radii, S.6: on cluster identification for phases A and F, S.7: comparison of CCMC with energetic clustering schemes, S.8: statistics for Monte Carlo simulation runs, and S.9: effect of PCOLLECT value. All simulations were carried out using an in-house Monte Carlo code, developed in C++. The use of in-house code allowed us to implement the virtual site model, the use of PMFs, and the polyhedral representation of the cobalt objects as basic features. The code and input files are available through the University of Bath's research data repository, with DOI https://doi.org/10.15125/BATH-00517. See DOI: 10.1039/c9ra01504c |
This journal is © The Royal Society of Chemistry 2019 |