Qiang
Zhu
*a and
Shinnosuke
Hattori
*b
aDepartment of Mechanical Engineering and Engineering Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA. E-mail: qzhu8@uncc.edu
bAdvanced Research Laboratory, Research Platform, Sony Group Corporation, 4-14-1 Asahi-cho, Atsugi-shi 243-0014, Japan. E-mail: shinnosuke.hattori@sony.com
First published on 20th November 2024
With advancements in computational molecular modeling and powerful structure search methods, it is now possible to systematically screen crystal structures for small organic molecules. In this context, we introduce the Python package High-Throughput Organic Crystal Structure Prediction (HTOCSP), which enables the prediction and screening of crystal packing for small organic molecules in an automated, high-throughput manner. Specifically, we describe the workflow, which encompasses molecular analysis, force field generation, and crystal generation and sampling, all within customized constraints based on user input. We demonstrate the application of HTOCSP by systematically screening organic crystals for 100 molecules using different sampling strategies and force field options. Furthermore, we analyze the benchmark results to understand the underlying factors that may influence the complexity of the crystal energy landscape. Finally, we discuss the current limitations of the package and potential future extensions.
Nowadays, data-driven computer simulation has been playing an increasingly important role in materials development.9 Specifically, computational high-throughput screening of the existing organic crystals has become popular for the design of new materials with improved physical properties.10–15 However, most of the screening work has been limited by the availability of experimentally resolved structures. It has been found that crystal polymorphism can effectively modify the physical properties.16,17 Nevertheless, to experimentally synthesize new polymorphs and characterize their structural properties is rather time-demanding and expensive. From a practical standpoint, having the capability to screen likely organic crystal formations before laboratory synthesis and characterization would be highly valuable.18
In the past decade, there has been rapid development in the community of crystal structure prediction (CSP) for small organic molecules.19–24 The idea of CSP is to predict a short list of stable or metastable crystal packings that are likely to be observed in the experiment through powerful structure exploratory algorithms.25–28 Practically speaking, conducting a successful organic crystal prediction requires the integration of several different computational pipelines including force field generation, structure sampling and post-process energy ranking. Most practitioners rely on subscriptions to commercial licenses to run simulations and analyze the results. Due to the license restriction, it is often hard to reproduce previously published results, even for an experienced researcher.
In the field of CSP for inorganic materials, there have been plenty of code choices (e.g., USPEX,29 AIRSS,30 CALYPSO,31 XtalOpt,32 and GASP33) that are either completely open source or free for academic researchers. The availability of these tools has enabled significant progress in the field, allowing researchers to explore complex systems and improve their understanding of inorganic material behavior at a fundamental level.25 While the field of CSP has been progressing rapidly, the code choices29,34,35 for the prediction of organic materials are still limited. More importantly, to the best of our knowledge, there is currently no open-source CSP code specifically designed for the high-throughput prediction of organic crystals. As organic crystals are increasingly relevant in fields like pharmaceuticals, organic electronics, and molecular materials, the development of such tools is becoming ever more critical to support the growing demand for rapid and accurate crystal structure prediction.
To promote the open-source activity in organic CSP, we have developed an open source code High-Throughput Organic Crystal Structure Prediction (HTOCSP) that allows the automated prediction of organic crystal structures with a minimal input, by leveraging several existing open-source infrastructures36–40 in molecular modeling. In the following section, we will begin by detailing the workflow, covering molecular analysis, force field generation, and crystal sampling. This will be followed by a benchmark study on 100 molecules, utilizing different sampling strategies and force field options. Finally, we will analyze the test results and conclude with a discussion of the current limitations and potential future extensions.
In many cases, molecular SMILES strings and crystallographic information can be found through public online repositories such as PubChem41 and the Cambridge Crystallographic Data Center (CCDC). However, it is important to note that some erroneous entries may exist. Therefore, the user is strongly recommended to verify the correctness of the retrived data to ensure it accuracy and reliability.
The extracted force field parameters, including descriptions of charges, bonds, angles, torsions, and nonbonded van der Waals interactions, can be saved as an Extensible Markup Language (XML) file according to the OpenFF standard. Optionally, companion topology files can also be generated in different formats for later use in structural relaxation codes. While we recommend the use of fixed GAFF/OpenFF parameters for most CSP applications, it is possible that the default GAFF or OpenFF parameters may poorly describe the system. In such cases, we allow users to retrain the parameters for better accuracy, similar to the tailor-made force field concept proposed by Neumann and coworkers44 (see an extended discussion in Section V).
Finally, all the aforementioned functions have been implemented to the pyocse framework37 based on our earlier work.45 It is important to note that the current force field parameters were mainly fitted by using data under standard temperature/pressure conditions. Therefore, it is not recommended to use it for the search for polymorphs under extreme conditions.
In our earlier studies, GULP has been successfully used to address a range of practical CSP challenges.46,49–52 However, CHARMM generally offers faster performance than GULP, as it provides better support for the symmetry-adapted implementation of the Particle Mesh Ewald (PME) method.53 Consequently, CHARMM is the default FF calculator choice in HTOCSP due to its faster computational speed.
Beyond classical force field models, there are ongoing efforts to develop generic or universal machine learning force fields for material and molecular screenings. Notable examples include ANI54 and MACE.55,56 These models have been trained comprehensively on extensive datasets, and the resulting parameters are available as an open source for public access. Recent surveys28,57 indicate that machine learning force fields (MLFFs) generally reproduce experimental geometries more accurately and provide more reasonable rankings of crystal structures.
Despite these promising results, it's important to note that most generic or universal MLFFs are trained primarily on known experimental structures and their nearby basins of attraction. Consequently, they may struggle with geometry relaxation if the initial structure is far from equilibrium or has a high energy. Due to these limitations, HTOCSP currently supports the use of ANI and MACE only for post-energy re-ranking on pre-optimized crystals generated by GAFF or OpenFF. These structure optimizations can be performed using a customized ASE calculator,58 with the given space group symmetry constraints.
Recently, we have developed a Python program called PyXtal that handles various structure generation and symmetry analysis tasks for 0D/1D/2D/3D atomic and molecular crystals.59 For the module dedicated to 3D molecular crystals, PyXtal enables the generation of trial structures with a customized asymmetric unit within a specified space group. In the asymmetric unit, one can place one or multiple molecules at the general Wyckoff positions. If the symmetry of the input molecule is compatible with the site symmetry of a given Wyckoff position, PyXtal also supports the assignment of the molecule to a special Wyckoff position with higher symmetry.28 This feature is particularly useful for generating crystal structures of high-symmetry molecules with a fractional Z′ number (i.e., less than one molecule per asymmetric unit).
Additionally, it is quite common to encounter molecular crystal data labeled with a space group in non-standard settings. For instance, the standard setting for the most common molecular space group 14 is labeled as P121/c1 (shortened as P21/c). However, this space group can also be represented as P121/n1 or P121/a1, where the glide plane translates molecules in the diagonal direction or along the a-axis. Alternatively, one can choose the unique axis to be along a (P21/c11) or c (P1121/c). Taking into account variations in space group notations due to the choice of origin, there are 530 concise space group settings according to Hall.60 In PyXtal, we allow the use of Hall notations to generate crystals in non-standard space group settings.
To illustrate the use of PyXtal, we provide a few examples in the following listing.
Listing 1 PyXtal script to organic crystals.
With the aforementioned modules, one is ready to run a CSP calculation using a brute-force random search approach. This involves repeatedly generating random trial crystals and then using a structure calculator to perform geometry minimization, thereby obtaining low-energy structures. However, the success rate of finding the desired target may decrease exponentially as the search space becomes larger.
To address this challenge, HTOCSP implements several population-based methods to achieve more efficient structure sampling. In global optimization, population-based methods refer to a class of optimization techniques that maintain and evolve a population of candidate solutions throughout the optimization process. Unlike traditional optimization methods that work with a single solution (e.g., gradient descent), population-based methods explore the search space using multiple solutions simultaneously. This approach helps to avoid local minima and increases the likelihood of finding a global optimum.
In the past, several population-based methods have been successfully applied to address challenges in both inorganic and organic crystal structure prediction, utilizing genetic/evolutionary algorithms32,33,35,61 and particle swarm optimization (PSO).31 These approaches often require mechanisms to generate new solutions by sharing information from multiple sources (e.g., the crossover operation in genetic algorithms or velocity updates via the Cognitive Social Component in PSO). However, handling such processes without breaking crystal symmetry is often challenging. Therefore, we decided to adopt a simplified population-based approach that avoids information sharing between individuals within the same generation during the evolution process.
Specifically, we begin with a population of randomly generated trial crystal structures and update each individual based on its own evolution history. For a given chemical, a large number of hypothetical structures can exist. After relaxation, their trajectories form basins of attraction on the potential energy surface. The probability of successfully finding a low-energy structure depends on the shapes and sizes of the hyper-volumes of these basins of attraction and the details of the search.30,62,63 And we hypothesize that the target crystals (i.e., the likely observed crystal forms in experiments) form stable basins on the energy surface. However, due to artifacts in force fields and the constraints of symmetry and zero temperature during geometry optimization, these basins are often surrounded by many shallow energy minima. Consequently, a true basin and its surrounding shallow minima collectively form a meta-basin (as illustrated in Fig. 2).
In a purely random search, there is a good chance of sampling within the meta-basin and landing in one of the surrounding shallow minima, but this may miss the true target structure (i.e., the center of the meta-basin) within a limited number of sampling attempts. Therefore, a wise approach is to perform a further local search to locate the centers of these meta-basins based on historically explored energy minima. Depending on the extent of effort in utilizing historical information, we devised two stochastic sampling strategies and one deterministic strategy as follows.
Compared to pure random sampling, this method dedicates additional effort to exploring the surroundings of previously visited low-energy regions, thereby increasing the success rate of identifying more promising low-energy basins. Despite these extra efforts, the majority of the sampling still focuses on generating new random solutions, which is why we refer to this method as Width-First Sampling (WFS). This type of sampling is particularly useful for detecting the target structure within a narrow meta-basin with limited spread width, as illustrated in Fig. 2.
In this strategy, for each individual, we select either the most recently optimized structure (with a probability of 0.7) or the previously identified lowest-energy structure (with a probability of 0.3) to perform mutations and generate new samples. This process continues for 20 generations before restarting with new random samples. This mechanism is analogous to performing parallel basin hopping64 on multiple random samples. As a result, it achieves better sampling within each individual meta-basin, increasing the likelihood of identifying a true target surrounded by many shallow energy minima. Unlike WFS, which focuses on breadth, this approach is more focused on learning the shape of each meta-basin, which is why we refer to it as Depth-First Sampling (DFS).
To perform QRS on molecular crystals, we consider cell parameters and molecular positions separately. For example, the aspirin crystal can be represented by the following variables (Table 1).
Hall space group number | 81 |
Cell parameters | 11.43, 6.49, 11.19, 83.31 |
Wyckoff index | 4e |
Molecular XYZ | 0.77, 0.57, 0.53 |
Molecular orientation | 48.55, 24.31, 145.9 |
Molecular dihedral angles | −77.85, −4.40, 170.9 |
In the context of population-based methods, we generate a valid QRS sample for the cell parameters in each generation, followed by sampling the molecular positions, orientations, and dihedral angles with a sufficiently large population size. The user must define the lower and upper bounds for cell lengths to constrain the range of QRS samples, as well as specify a range of acceptable volume values to filter out unit cells that are either too large or too small.
In principle, all sampling algorithms should run for a sufficiently large number of generations to ensure convergence. There are two mechanisms to terminate the sampling. If the sampling is run in production mode (i.e., without knowledge of the target structure), it will stop once the maximum number of generations is reached. If the sampling is run in validation mode with the target structure provided, the sampling may stop earlier as long as a good match with the target structure is detected 10 times.
To determine if a good matched structure is found, the StructureMatcher module in Pymatgen65 is used. In this module, we ignore all H atoms and build a one-to-one map between each molecule in the unit cell, then check the largest root mean squared error (RMSE) between each atomic pair. By default, two structures are considered identical if the fractional length tolerance is 0.3, the site tolerance is less than 0.3 Å, and the angle tolerance is less than 5°. It's worth noting that Mercury66 also offers a mechanism to compute the RMSE for 15–20 packed molecules via the COMPACK algorithm.67 After extensive comparison, we found that these two functions produce nearly identical results. For ease of code implementation and accessibility, we have chosen to use Pymatgen's StructureMatcher module in HTOCSP.
For practical usage, one should first install the required Python packages via conda install. Then, download and compile the source codes of CHARMM or GULP. The executable paths and necessary libraries must be added to the environmental variables.
To run the simulation, one typically needs to setup the following Python script.
Listing 2 PyXtal script to run HTOCSP for the aspirin system.
This script will sample the aspirin crystals in WFS mode using the OpenFF force field, with a population size of 256 over 20 generations. With 128 parallel cores, the total sampling time is approximately 2.5 generations per minute. Therefore, the entire run of 20 generations can be completed within 10 minutes. When compared to the experimental aspirin form I in space group 14 (P21/c) with Z′ = 1, we found a total of 7 matches out of 5120 sampled structures, as shown in Fig. 3. We also found another structure with lower energy corresponding to the form II of aspirin. These two forms have been well known due to very close lattice energy ranking and their relative stability can only be distinguished with the advanced free energy calculation method.68 Last, it is important to note that the number of matched structures may vary due to the stochastic nature of the sampling algorithm. The confidence in these sampling results will be further discussed in the following section.
In a standard CSP run via HTOCSP, one only needs to provide a SMILES string to define the target molecule (either real or hypothetical), a list of space group choices, and a few sampling parameters related to population and generation sizes. If the target system is a cocrystal or salt, a list of SMILES strings and the compositional ratio must be provided. For further instructions, users can refer to the online documentation of HTOCSP69 to learn how to set up other types of calculations, such as structure searches with respect to the reference PXRD data.
To validate the concept of energy landscape complexity, we performed a systematic benchmark test on a set of 100 experimentally reported crystals. We selected these systems to encompass a range of challenges encountered in realistic CSP simulations. The set includes 30 systems from past blind tests,19–24 38 representative organic semiconducting (OSC) molecules, primarily consisting of thiophene rings, 14 polynuclear aromatic hydrocarbons (PAHs),70 and 18 other systems based on our previous research for various purposes (Misc). These systems vary widely, including single-component systems, multi-component cocrystals/salts, fractional to multiple Z′, and from rigid molecules to complex molecules with multiple flexible dihedral angles. In addition, a large fraction of the structures are not the most stable polymorph but are metastable. A complete list of their associated Cambridge Structural Database (CSD) entries and basic descriptions is provided in Appendix Table 2.
For each system, we performed four types of simulations: (1) WFS with GAFF (WFS-GAFF), (2) DFS with GAFF (DFS-GAFF), (3) WFS with GAFF relaxation and ANI energy evaluation (WFS-GAFF-ANI), and (4) DFS with GAFF relaxation and ANI energy evaluation (DFS-GAFF-ANI). In each run, we limited the search space based on the experimentally determined space group and Z′. If the target crystal had a fractional Z′ due to occupation of a special Wyckoff site, we followed the symmetry relation to transform it into a subgroup representation where the molecules occupy the general Wyckoff site.59 In all calculations, we set a population size of 256 with a maximum of 500 generations. However, the calculation could be terminated earlier if a total of at least 10 matched structures were found. In each simulation, we defined the success rate (SR) as the ratio of the number of matching structures to the total number of sampled structures. In each WFS run, the fraction of mutation was set to be 0.4.
(1) Tier I (SR > 0.5%). This tier consists of 56 systems that can be considered relatively easy CSP challenges. Statistically, one only needs to sample 2–3k structures to hit the target structure at least 10 times. At the molecular level, Tier I systems are mostly composed of symmetric molecules with low mass (<300) and only a few flexible dihedral angles (<3). Representative molecules in this tier include many PAH/OSC polymorphs (e.g., BENZEN: 27.5390%; ADAMAN01: 24.3164%; NAPHTA: 19.1406%; UVAHEF: 3.0835%; XELYUJ: 1.9458%) and simple targets (e.g., AXOSOW01: 10.3516%; BOQQUT: 3.2715%; GUFJOG: 0.9729%; NACJAF: 0.6925%). At the crystal level, the Z′ number is no more than 1. AFIGIH (2.2326%) is an outlier in this group, as it has only C1 symmetry, a relatively high mass (303.4), and 6 flexible dihedral angles. However, as observed in other systems, crystal structures with P symmetry are generally easier to predict than those with monoclinic or orthorhombic symmetries with more symmetry operation constraints.
(2) Tier II (0.05% < SR < 0.5%). This tier includes 33 systems that present relatively moderate CSP challenges. Typical molecules in this tier include 14 blind test targets (e.g., WIDBAO: 0.2650%; OBEQOD: 0.1755%), two famous aspirin polymorphs (ACSALA13: 0.2777%; ACSALA: 0.1617%), and many OSC polymorphs. To ensure convergence of 10 hits for the target structure, up to 20–30k structural samples are required. Most systems in this group have either fairly flexible molecules (with 2–6 rotors) or more than one molecule in the asymmetric unit (Z′ ≥ 1). Notably, the blind test target XXVI XAFQIH, previously considered a challenging example, also falls into this group with a modest SR value of 0.1486%.
(3) Tier III (0.001% < SR < 0.05%). This tier includes 6 systems (JOHSOP, WICZUF, KONTIQ, LEVJON, XAFPAY, and XAFPAY03). These systems typically require over 100k structural samples (approximately 4–6 hours with 128 parallel CPU processes). Systems in this tier are often characterized by either Z′ = 2 or a high number of rotors (4–6). As we will discuss in the following section, the sampling efficiency strongly depends on the choice of algorithms. An efficient algorithm can significantly speed up the finding of target structures.
(4) Tier IV (SR < 0.001%). This tier consists of 4 examples that represent the most challenging CSP cases. Despite running 128k samples with four different strategies, we observed only a few hits for OBEQIX (7), XAFQON (4), and YOKBIK (2), and we were unable to find the target structures of XAFPAY02 and UJIRIO02 within several runs of 500 generations. UJIRIO02 was found a couple of times when the maximum number of generations is 3000. To reliably predict these systems, one either needs to spend significantly more CPU hours or more efficient sampling algorithms are needed.
In practical CSP applications, it is popular to search for plausible crystal packing with 10 to 15 common space groups.28 As a result, the calculated success rate may decrease by 10–20 times in read-world scenarios. For clarity, we also provide the molecular diagrams and the corresponding space group/Z′ numbers for several representative systems from each tier in Fig. 4. One may use the complete data in Table 2 as a test bed to develop a predictive model to infer the CSP landscape complexity purely from the molecular and crystal information for any arbitrary system.
Comparing Fig. 5a and b, we first observe that adding ANI energy evaluation does not improve the efficiency of finding the target structures for the majority of the 100 systems. A few exceptional cases include KONTIQ, WICZUF, PYRZIN01, OBEQUJ, and XAFQON. However, the inclusion of ANI adds about 50–100% overhead to the calculation time. Therefore, it is recommended to run these samplings with pure GAFF or OpenFF to reduce computational costs.
Second, the comparison between WFS and DFS in Fig. 5a shows that the success rates do not vary significantly for Tier I and most Tier II structures. For these structures, sampling at most 10000 structures should be sufficient to obtain at least 10 hits of the target structure, ensuring successful sampling. However, there are a few exceptions in Tier II where WFS tends to outperform DFS. For example, in FLUANT (a rigid PAH molecule with Z′ = 2 in P21/n) and QQQCIG04 (the semiconducting rubrene with Z′ = 0.25 in Cmca), WFS can identify the targets about 5–10 times faster than DFS. This suggests that the experimental structures in these systems are characterized by relatively isolated, narrow meta-basins.
On the other hand, DFS tends to be more efficient for Tier III/IV systems. For instance, the success rate of DFS-GAFF for JOHSOP is 0.0551%, which is significantly higher than the 0.0095% achieved by WFS-GAFF. Additionally, DFS produces relatively decent SR values for LEVJON (0.0055%), OBEQIX (0.0016%), XAFQON (0.0008%), and YOKBIK (0.0008%), whereas WFS fails completely. These cases suggest that the target structures are likely characterized by wide meta-basins, making an in-depth exploration strategy more efficient.
Clearly, there is a trade-off between using WFS and DFS. In the future, more in-depth studies will be conducted to develop a predictive model that infers CSP complexity and further improves the success rate by enhancing sampling methods.
Additionally, data-mining-based approaches could supplement the current ab initio methods. In inorganic crystal prediction, chemical substitution on well-known prototypes is widely used to generate new candidate crystals. If the packing motifs of organic crystals can be effectively grouped,71 one could analyze the occurrence of each packing prototype and use them as templates to generate new structures more efficiently.
Additionally, one can start a massive CSP calculation with the generic FF, label the energy and forces for already sampled configurations with more accurate DFT or machine learning models, and then reparameterize the original force field parameters. Such an iterative strategy has been proved successful in past blind tests and many pharmaceutical applications.5,44,72 The current HTOCSP has begun to implement this function and it should be available in the near future.
Our systematic benchmark on 100 experimentally reported crystals demonstrates the effectiveness of different sampling strategies. We have shown that the choice of sampling strategy can significantly impact the success rate, especially for systems with varying degrees of landscape complexity. While width-first sampling proves to be more effective for systems characterized by narrow meta-basins, the depth-first sampling shows its strength in exploring wide meta-basins, particularly in more challenging CSP cases. In addition, these statistical results may serve as a ground for more efficient sampling method development in the future.
Despite these advancements, challenges remain, particularly in improving the efficiency of sampling strategies and developing robust post-analysis tools to accurately identify target structures from large datasets. The current work lays the foundation for future enhancements, including the integration of machine learning techniques to predict cell parameters, the implementation of more rigorous uncertainty quantification methods, and the development of advanced post-analysis modules to refine structure ranking. And the continuous development of HTOCSP will focus on addressing these challenges, with the aim of further improving the reliability and efficiency of CSP simulations.
CSD_entry | Space group symbol | Z′ | Rotors/unit | Mass | Success rate (%) | Energy rank | T WFS (min) | T DFS (min) | Tier | Remarks |
---|---|---|---|---|---|---|---|---|---|---|
CYANAM01 | Pbca | 1.00 | 0 | 42.0 | 27.9889 | 1/256 | 1 | 1 | 1 | Misc-cyanamide |
BENZEN | Pbca | 0.50 | 0 | 78.1 | 27.5391 | 1/256 | 1 | 1 | 1 | PAH-benzene |
ADAMAN01 | P21 | 1.00 | 0 | 136.2 | 24.3164 | 1/256 | 1 | 1 | 1 | Misc-adamantane |
NAPHTA15 | P21/c | 0.50 | 0 | 128.2 | 19.1406 | 1/256 | 1 | 1 | 1 | PAH-naphthalene |
ACEMID02 | R3c | 1.00 | 0 | 59.1 | 17.2191 | 1/256 | 1 | 1 | 1 | Misc-acetamide |
BIPHEN | P21/a | 0.50 | 1 | 154.2 | 15.1367 | 1/256 | 1 | 1 | 1 | PAH-biphenyl |
AXOSOW01 | Pbca | 1.00 | 1 | 56.1 | 10.3516 | 57/256 | 1 | 1 | 1 | BT-XI |
PHENAZ01 | P21/a | 0.50 | 0 | 180.2 | 9.3750 | 1/256 | 1 | 1 | 1 | Misc-phenazine |
NICLAN | P21 | 1.00 | 0 | 340.5 | 7.9102 | 39/256 | 1 | 1 | 1 | OSC-DNTT |
FORMAM | P21/c | 1.00 | 0 | 45.0 | 7.4219 | 54/256 | 1 | 1 | 1 | Misc-formamide |
OXALAC11 | Pcab | 0.25 | 1 | 90.0 | 7.4219 | 22/256 | 1 | 1 | 1 | Misc-oxalic acid |
RESORA03 | Pna21 | 1.00 | 0 | 110.1 | 5.9570 | 9/256 | 1 | 1 | 1 | Misc-resorcinol |
MERQUY | P21/n | 0.50 | 0 | 334.5 | 5.8594 | 17/256 | 1 | 1 | 1 | OSC-BDT |
OXALAC02 | Pcab | 0.50 | 1 | 90.0 | 5.4688 | 19/256 | 1 | 1 | 1 | Misc-oxalic acid |
AVIBEN | P21 | 1.00 | 0 | 440.6 | 5.2094 | 21/256 | 1 | 1 | 1 | OSC-DPhNDT |
COUMAR01 | Pca21 | 1.00 | 0 | 146.1 | 5.0781 | 1/256 | 1 | 1 | 1 | Misc-coumarin |
SIRMIQ01 | P21/n | 0.50 | 0 | 290.4 | 4.9020 | 1/256 | 1 | 1 | 1 | OSC-BBBT |
JAYDUI | P21/n | 1.00 | 0 | 44.1 | 4.0039 | 1/256 | 1 | 1 | 1 | BT-VII |
QUPHEN | P21/a | 0.50 | 3 | 306.4 | 3.6303 | 1/256 | 1 | 1 | 1 | PAH-p-quaterphenyl |
OKUPUG | P21/a | 0.50 | 2 | 392.5 | 3.5909 | 4/256 | 1 | 1 | 1 | OSC-DPh-NDT |
MERQOS | Pbca | 0.50 | 0 | 370.6 | 3.5156 | 15/256 | 1 | 1 | 1 | OSC-BEDT-BDT |
PAHYON01 | C2/c | 1.00 | 0 | 100.1 | 3.4180 | 1/256 | 1 | 1 | 1 | BT-VIII |
UREAXX02 |
P![]() |
0.25 | 0 | 60.1 | 3.3933 | 30/256 | 1 | 1 | 1 | Misc-urea |
BOQQUT | P21/a | 1.00 | 0 | 153.2 | 3.2715 | 12/256 | 1 | 1 | 1 | BT-IV |
UVAHEF | P21/a | 0.50 | 2 | 392.5 | 3.0835 | 6/768 | 1 | 1 | 1 | OSC-DPh-BTBT |
CRYSEN | I2/c | 0.50 | 0 | 228.3 | 3.0570 | 1/768 | 1 | 1 | 1 | PAH-chrysene |
UVAGUU | P21/c | 1.00 | 2 | 392.5 | 2.7986 | 1/1024 | 1 | 2 | 1 | OSC-Ph-NDT |
MUVMIA | P21/n | 0.50 | 0 | 368.5 | 2.6348 | 1/256 | 1 | 1 | 1 | OSC-3,10-DMeDNTT |
JUFRIO | P21/c | 0.50 | 2 | 382.6 | 2.3788 | 1/768 | 1 | 1 | 1 | OSC-MT-ADT |
AFIGIH |
P![]() |
1.00 | 6 | 303.4 | 2.2327 | 1/768 | 1 | 1 | 1 | Misc-thiophene deriv. |
DBZCOR | C2/c | 0.50 | 0 | 400.5 | 2.1304 | 1/1280 | 1 | 2 | 1 | PAH-dibenzocoronene |
DUTGIK | P21/a | 0.50 | 2 | 604.8 | 2.1016 | 1/256 | 1 | 1 | 1 | OSC-DPh-BBTNDT |
ECENAD | P21/c | 0.50 | 0 | 402.6 | 1.9637 | 82/768 | 1 | 1 | 1 | OSC-BNTBDT |
XELYUJ | P21 | 1.00 | 5 | 482.7 | 1.9458 | 1/2048 | 1 | 4 | 1 | OSC-dithiophenyl deriv. |
XULDUD | Pbca | 1.00 | 0 | 94.1 | 1.9317 | 422/768 | 1 | 1 | 1 | BT-I |
BZPHAN01 | P212121 | 1.00 | 0 | 228.3 | 1.8069 | 48/1024 | 5 | 2 | 1 | PAH-benzo(c)phenanthrene |
MERQIM | P21/a | 0.50 | 0 | 334.5 | 1.7658 | 37/768 | 1 | 1 | 1 | OSC-HTPBDT |
VOBYAN | P21/n | 0.50 | 0 | 402.6 | 1.4879 | 53/768 | 1 | 1 | 1 | OSC-anthracene-deriv. |
SITJUC | P21/c | 0.50 | 2 | 600.8 | 1.4861 | 54/3054 | 6 | 6 | 1 | OSC-IDBT-7d |
HAJJIN | P21 | 1.00 | 6 | 400.6 | 1.3669 | 1/1536 | 1 | 2 | 1 | OSC-DEP-DTT |
BETFOV |
P![]() |
1.00 | 0 | 274.4 | 1.3546 | 31/766 | 1 | 3 | 1 | Misc |
UVAHAB | P21/c | 0.50 | 2 | 392.5 | 1.3043 | 94/1280 | 4 | 2 | 1 | OSC-DPh-NDT4 |
WEXBOS | P21/c | 0.50 | 4 | 444.6 | 1.2810 | 3/256 | 3 | 1 | 1 | OSC-thiophene oligomer |
XULDUD01 | P21/c | 1.00 | 0 | 94.1 | 1.2503 | 908/2304 | 6 | 3 | 1 | BT-I |
KUPWOJ | P21/c | 0.50 | 2 | 492.7 | 1.1189 | 2/2048 | 1 | 4 | 1 | OSC-2,9DPhDNTT |
MIVDEC | Pna21 | 1.00 | 4 | 560.7 | 1.1186 | 4/3072 | 8 | 15 | 1 | OSC-rubrene-deriv. |
PYRENE07 | P21/a | 1.00 | 0 | 202.3 | 1.0785 | 162/768 | 4 | 1 | 1 | PAH-pyrene |
TIDFES |
P![]() |
1.00 | 3 | 428.5 | 1.0664 | 9/1790 | 6 | 7 | 1 | OSC-tetracene-deriv. |
GUFJOG | P21/n | 1.00 | 0 | 125.1 | 0.9729 | 7/2304 | 1 | 3 | 1 | BT-II |
SUCXIZ | P22121 | 1.00 | 2 | 286.4 | 0.8888 | 1/2048 | 1 | 3 | 1 | OSC-benzothiophene |
KUPWUP | Pbca | 0.50 | 2 | 492.7 | 0.8155 | 1/2304 | 3 | 5 | 1 | OSC-3,10-DPhDNTT |
FUNZOE | P21 | 1.00 | 4 | 268.3 | 0.8084 | 29/1024 | 4 | 1 | 1 | Misc |
ZZZDKE01 |
P![]() |
1.00 | 0 | 328.4 | 0.7102 | 22/3324 | 2 | 9 | 1 | PAH-hexacene |
NACJAF | P21/n | 1.00 | 0 | 248.3 | 0.6925 | 415/3072 | 7 | 4 | 1 | BT-XXII |
DURNAH | P21 | 1.00 | 4 | 372.5 | 0.6810 | 6/2048 | 1 | 4 | 1 | OSC-Ph-BTBT-C4 |
DOCXEA | P21/n | 1.00 | 0 | 452.6 | 0.5930 | 152/2560 | 3 | 6 | 1 | OSC-BBTNDT |
MERRAF | P21/c | 0.50 | 0 | 342.6 | 0.4413 | 213/1536 | 9 | 2 | 2 | OSC-BMBT-BDT |
CEQGEL |
P![]() |
2.00 | 0 | 252.3 | 0.4096 | 1/3072 | 7 | 8 | 2 | PAH-benzo(e)pyrene |
FOJVAC | P21 | 1.00 | 4 | 518.7 | 0.4053 | 9/2304 | 5 | 5 | 2 | OSC-BBPh-NDT |
AXIDER | P21/c | 0.50 | 4 | 604.6 | 0.3495 | 49/3840 | 10 | 8 | 2 | OSC-rubrene-deriv. |
FOJTUU | Pbca | 0.50 | 2 | 466.6 | 0.2839 | 1/4608 | 4 | 10 | 2 | OSC-DNap-NDT |
FIQSEG | P21/c | 0.50 | 2 | 282.5 | 0.2827 | 1963/4608 | 10 | 7 | 2 | OSC-MT-BDT |
ACSALA13 | P21/c | 1.00 | 3 | 180.2 | 0.2777 | 261/16![]() |
14 | 26 | 2 | Misc-aspirin |
WIDBAO | P21/c | 1.00 | 2 | 254.4 | 0.2651 | 1775/11![]() |
5 | 17 | 2 | BT-XIV |
TBZPYR | Pn21m | 0.50 | 0 | 352.4 | 0.2413 | 2107/4864 | 25 | 10 | 2 | PAH-b-tribenzopyrene |
XATMOV | P21/c | 2.00 | 0 | 57.1 | 0.2224 | 86/4352 | 9 | 7 | 2 | BT-XI |
FLUANT | P21/n | 2.00 | 0 | 202.3 | 0.1855 | 1/54![]() |
6 | 165 | 2 | PAH-fluoranthene |
QAXMEH53 |
P![]() |
1.00 | 3 | 259.3 | 0.1841 | 1828/4352 | 28 | 7 | 2 | Misc-ROY |
OBEQOD | P21/c | 1.00 | 2 | 237.0 | 0.1755 | 4245/7936 | 16 | 12 | 2 | BT-XVII |
KEKJEQ | P21/n | 0.50 | 2 | 418.4 | 0.1642 | 230/6912 | 7 | 13 | 2 | OSC-DP-NTCDI |
ACSALA | P21/c | 1.00 | 3 | 180.2 | 0.1617 | 753/13![]() |
9 | 21 | 2 | Misc-aspirin |
CAYKUJ | P21 | 1.00 | 5 | 472.7 | 0.1601 | 23/15![]() |
26 | 34 | 2 | Misc |
XATJOT | Pca21 | 2.00 | 3 | 246.2 | 0.1504 | 71/5376 | 54 | 8 | 2 | BT-XIX-cocrysta |
XAFQIH |
P![]() |
1.00 | 7 | 561.5 | 0.1486 | 9/24![]() |
8 | 64 | 2 | BT-XXVI |
UJIRIO01 | P21/c | 1.00 | 3 | 249.3 | 0.1406 | 560/16![]() |
19 | 27 | 2 | BT-VI |
QUATER10 | P21/a | 1.00 | 0 | 500.6 | 0.1311 | 1128/5374 | 36 | 30 | 2 | PAH-quaterrylene |
OBEQET | Pbca | 1.00 | 3 | 258.7 | 0.1227 | 166/6656 | 24 | 10 | 2 | BT-XVIII |
YIHVUI | P21/c | 0.50 | 1 | 490.7 | 0.1131 | 2729/11![]() |
26 | 21 | 2 | OSC-BTDT |
OBEQUJ | Pbca | 1.00 | 0 | 120.1 | 0.1017 | 11![]() ![]() |
31 | 17 | 2 | BT-XVI |
UJIRIO05 | Pbca | 1.00 | 3 | 249.3 | 0.0962 | 637/28![]() |
14 | 48 | 2 | BT-VI |
HAMTIZ01 | P21/n | 1.00 | 4 | 239.2 | 0.0957 | 478/14![]() |
9 | 24 | 2 | BT-X |
XAFPAY01 |
P![]() |
1.00 | 6 | 386.3 | 0.0917 | 43/13![]() |
21 | 32 | 2 | BT-XXIII |
KONTIQ09 | P21/c | 2.00 | 1 | 188.1 | 0.0910 | 1895/19![]() |
19 | 32 | 2 | BT-XXI-hydrate |
ACRDIN05 | Cc | 2.00 | 0 | 179.2 | 0.0897 | 1/18![]() |
11 | 36 | 2 | Misc-acridine |
PYRZIN01 | P21/c | 1.00 | 1 | 123.1 | 0.0770 | 50![]() ![]() |
28 | 79 | 2 | Misc-pyrazinamide |
QQQCIG04 | Cmca | 0.25 | 4 | 532.7 | 0.0680 | 256/30![]() |
44 | 113 | 2 | OSC-ruberene |
BOQQUT01 | P21/c | 2.00 | 0 | 153.2 | 0.0597 | 602/42![]() |
53 | 82 | 2 | BT-IV |
EVIDEV | P21/c | 0.50 | 2 | 344.6 | 0.0512 | 4788/13![]() |
27 | 23 | 2 | OSC-4TA-2 |
XAFQAZ | P21/c | 2.00 | 3 | 462.5 | 0.0501 | 13/113![]() |
18 | 284 | 2 | BT-XXV-cocrystal |
JOHSOP | P21 | 2.00 | 0 | 284.4 | 0.0251 | 914/19![]() |
265 | 55 | 3 | OSC-ABT |
WICZUF | P21/n | 2.00 | 1 | 245.3 | 0.0230 | 77![]() ![]() |
52 | 155 | 3 | BT-XV |
KONTIQ | P21/c | 2.00 | 1 | 188.1 | 0.0125 | 18![]() ![]() |
230 | 231 | 3 | BT-XXI-hydrate |
LEVJON | P2/c | 0.50 | 4 | 696.5 | 0.0055 | 143/127![]() |
348 | 383 | 3 | OSC |
XAFPAY | P21/c | 1.00 | 6 | 386.3 | 0.0032 | 65/127![]() |
312 | 286 | 3 | BT-XXIII |
XAFPAY03 | P21/n | 1.00 | 6 | 386.3 | 0.0013 | 53/127![]() |
312 | 284 | 3 | BT-XXIII |
OBEQIX | P21/n | 1.00 | 8 | 478.6 | 0.0009 | 5/127![]() |
362 | 349 | 4 | BT-XX |
XAFQON | P21/c | 3.00 | 4 | 201.7 | 0.0006 | 28![]() ![]() |
219 | 209 | 4 | BT-XXIV |
YOKBIK | P21/a | 0.50 | 14 | 464.8 | 0.0002 | 631/127![]() |
481 | 384 | 4 | OSC-C8-BTBT |
XAFPAY02 |
P![]() |
2.00 | 12 | 386.3 | 0.0000 | 0/127![]() |
367 | 403 | 4 | BT-XXIII |
UJIRIO02 | P21/c | 2.00 | 6 | 249.3 | 0.0000 | 0/127![]() |
273 | 297 | 4 | BT-VI |
This journal is © The Royal Society of Chemistry 2025 |