Issue 45, 2022

Identifying porous cage subsets in the Cambridge Structural Database using topological data analysis

Abstract

As rationally designable materials, the variety and number of synthesised metal–organic cages (MOCs) and organic cages (OCs) are expected to grow in the Cambridge Structural Database (CSD). In this regard, two of the most important questions are, which structures are already present in the CSD and how can they be identified? Here, we present a cage mining methodology based on topological data analysis and a combination of supervised and unsupervised learning that led to the derivation of – to the best of our knowledge – the first and only MOC dataset of 1839 structures and the largest experimental OC dataset of 7736 cages, as of March 2022. We illustrate the use of such datasets with a high-throughput screening of MOCs and OCs for xenon/krypton separation, important gases in multiple industries, including healthcare.

Graphical abstract: Identifying porous cage subsets in the Cambridge Structural Database using topological data analysis

Supplementary files

Article information

Article type
Edge Article
Submitted
06 Jun 2022
Accepted
30 Oct 2022
First published
31 Oct 2022
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY license

Chem. Sci., 2022,13, 13507-13523

Identifying porous cage subsets in the Cambridge Structural Database using topological data analysis

A. Li, R. Bueno-Perez and D. Fairen-Jimenez, Chem. Sci., 2022, 13, 13507 DOI: 10.1039/D2SC03171J

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements