Transfer learning on large datasets for the accurate prediction of material properties
Abstract
Graph neural networks trained on large crystal structure databases are extremely effective at replacing ab initio calculations in the discovery and characterization of materials. However, crystal structure datasets comprising millions of materials exist only for the Perdew–Burke–Ernzerhof (PBE) functional. In this work, we investigate the effectiveness of transfer learning to extend these models to other density functionals. We show that pre-training significantly reduces the size of the dataset required to reach chemical accuracy and beyond. We also analyze in detail how the transfer-learning performance depends on the sizes of the datasets used for the initial training of the model and for transfer learning. We confirm that the error decreases linearly with dataset size on a log–log scale, with a similar slope for both the training and the pre-training datasets. This shows that further increasing the size of the pre-training dataset, i.e., performing additional calculations with a low-cost functional, also improves, through transfer learning, machine-learning predictions at the quality of a more accurate, and possibly computationally more demanding, functional. Lastly, we compare the efficacy of interproperty and intraproperty transfer learning.
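
For concreteness, the log–log linear dependence described above corresponds to a power-law scaling of the error with dataset size; the sketch below is our own illustrative notation (with $\varepsilon$ the prediction error, $N$ the number of training or pre-training structures, $C$ a prefactor, and $\alpha$ the magnitude of the slope), not symbols taken from the text:

\[
\varepsilon(N) \approx C\, N^{-\alpha}
\quad\Longleftrightarrow\quad
\log \varepsilon \approx \log C - \alpha \log N ,
\]

so that on a log–log plot the error traces a straight line of slope $-\alpha$, whether $N$ refers to the pre-training dataset (low-cost functional) or to the dataset used for transfer learning (more accurate functional).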