Issue 4, 2021

Predicting second virial coefficients of organic and inorganic compounds using Gaussian process regression

Abstract

We show that by using intuitive and accessible molecular features it is possible to predict the temperature-dependent second virial coefficient of organic and inorganic compounds with Gaussian process regression. In particular, we built a low dimensional representation of features based on intrinsic molecular properties, topology and physical properties relevant for the characterization of molecule-molecule interactions. The featurization was used to predict second virial coefficients in the interpolative regime with a relative error ≲1% and to extrapolate the prediction to temperatures outside of the training range for each compound in the dataset with a relative error of 2.1%. Additionally, the model's predictive abilities were extended to organic molecules unseen in the training process, yielding a prediction with a relative error of 2.7%. Test molecules must be well-represented in the training set by instances of their families, which are high in variety. The method shows a generally better performance when compared to several semi-empirical procedures employed in the prediction of the quantity. Therefore, apart from being robust, the present Gaussian process regression model is extensible to a variety of organic and inorganic compounds.

Graphical abstract: Predicting second virial coefficients of organic and inorganic compounds using Gaussian process regression

Article information

Article type
Paper
Submitted
21 Oct 2020
Accepted
11 Jan 2021
First published
11 Jan 2021
This article is Open Access
Creative Commons BY license

Phys. Chem. Chem. Phys., 2021,23, 2891-2898

Predicting second virial coefficients of organic and inorganic compounds using Gaussian process regression

M. T. Cretu and J. Pérez-Ríos, Phys. Chem. Chem. Phys., 2021, 23, 2891 DOI: 10.1039/D0CP05509C

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements