Catalyst Acquisition by Data Science (CADS): a web-based catalyst informatics platform for discovering catalysts
Abstract
An innovative web-based integrated catalyst informatics platform, Catalyst Acquisition by Data Science (CADS), is developed for use towards the discovery and design of catalysts. The platform provides three main functionalities: a repository for data sharing and publishing, an analytic workspace for exploratory visual analysis, and catalyst property prediction tools with pretrained machine learning models. Access to such a platform helps decrease barriers to entry faced by researchers in catalytic chemistry when attempting to apply catalyst informatics towards data by providing analytical and visualization tools that can be simultaneously applied and easily accessed within a central space, thereby helping the advancement of catalyst informatics. The developed platform allows researchers to upload and collect data onto the platform and conduct data analysis using a system of linked workspaces consisting of interactive visualization tools and machine learning tools that simultaneously update according to the researchers' actions in real time. The platform also provides a space for collaboration where researchers can choose to publish their uploaded data and resulting analyses to the platform for collaborations with other users and groups. As an example, CADS is applied towards oxidative coupling of methane (OCM) data where use of the platform tools reveals underlying patterns and trends that are otherwise hidden within the original data. Thus, the proposed platform contributes towards the advancement of catalyst informatics for both specialists and non-specialists.