Issue 44, 2022

Optimal machine learning feature selection for assessing the mechanical properties of a zeolite framework

Abstract

In this study, 45 and 249 critical features were discovered among 896 zeolite descriptors generated by the matminer package for estimating the shear and bulk moduli of zeolites, respectively. A database containing the mechanical properties of 873 zeolite structures, calculated using density functional theory, was used to train the machine learning regression model. The results of using these critical features with the LightGBM algorithm were rigorously compared with those from other regressors as well as with different sets of features. The comparison results indicate that the surrogate model with critical features increases the prediction accuracy of the bulk and shear moduli of zeolites by 17.3% and 10.6%, respectively, and reduces the prediction uncertainty by one-third of that achieved using previously available features. The suggested features originating from several physical and chemical groups highlight the unveiled relationships between the features and mechanical properties of zeolites. The robustness of the constructed model with 356 features was confirmed by applying a set of different training-test set ratios. We believe that the suggested critical features of zeolites can help to understand the mechanical behavior of a half million unlabeled hypothetical zeolite structures and accelerate the discovery of novel zeolites with unprecedented mechanical properties.

Graphical abstract: Optimal machine learning feature selection for assessing the mechanical properties of a zeolite framework

Supplementary files

Article information

Article type
Paper
Submitted
29 Jun 2022
Accepted
23 Sep 2022
First published
26 Sep 2022

Phys. Chem. Chem. Phys., 2022,24, 27031-27037

Optimal machine learning feature selection for assessing the mechanical properties of a zeolite framework

N. Kim and K. Min, Phys. Chem. Chem. Phys., 2022, 24, 27031 DOI: 10.1039/D2CP02949A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements