Transfer learning for a foundational chemistry model

Emma King-Smith

doi:10.1039/D3SC04928K

Author affiliations

* Corresponding authors

^a Cavendish Laboratory, University of Cambridge, Cambridge, UK
E-mail: esk34@cam.ac.uk

Abstract

Data-driven chemistry has garnered much interest concurrent with improvements in hardware and the development of new machine learning models. However, obtaining sufficiently large, accurate datasets of a desired chemical outcome for data-driven chemistry remains a challenge. The community has made significant efforts to democratize and curate available information for more facile machine learning applications, but the limiting factor is usually the laborious nature of generating large-scale data. Transfer learning has been noted in certain applications to alleviate some of the data burden, but this protocol is typically carried out on a case-by-case basis, with the transfer learning task expertly chosen to fit the finetuning. Herein, I develop a machine learning framework capable of accurate chemistry-relevant prediction amid general sources of low data. First, a chemical “foundational model” is trained using a dataset of ∼1 million experimental organic crystal structures. A task-specific module is then stacked atop this foundational model and subjected to finetuning. This approach achieves state-of-the-art performance on a diverse set of tasks: toxicity prediction, yield prediction, and odor prediction.
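
The two-stage idea in the abstract (a pretrained foundational model with a small task-specific module stacked on top and finetuned on limited data) can be illustrated with a minimal, dependency-free sketch. Nothing below reproduces the article's architecture or data: the random "backbone", the linear head, the decision to keep the backbone frozen, and all names (`make_frozen_backbone`, `LinearHead`, `finetune`) are hypothetical stand-ins chosen only to show the general pattern.

```python
import math
import random


def make_frozen_backbone(in_dim, hidden_dim, seed=0):
    """Hypothetical stand-in for a pretrained foundational model.

    The weights are random and never updated; in a real workflow they
    would come from pretraining on a large dataset.
    """
    rng = random.Random(seed)
    weights = [[rng.gauss(0.0, 1.0) for _ in range(in_dim)]
               for _ in range(hidden_dim)]

    def embed(x):
        # Map a raw descriptor vector to a fixed-length embedding.
        return [math.tanh(sum(w * xi for w, xi in zip(row, x)))
                for row in weights]

    return embed


class LinearHead:
    """Task-specific module stacked on the backbone (assumed linear here)."""

    def __init__(self, hidden_dim):
        self.w = [0.0] * hidden_dim
        self.b = 0.0

    def predict(self, h):
        return sum(wi * hi for wi, hi in zip(self.w, h)) + self.b

    def sgd_step(self, h, y, lr):
        # One stochastic gradient step on the squared error (up to a factor of 2).
        err = self.predict(h) - y
        self.w = [wi - lr * err * hi for wi, hi in zip(self.w, h)]
        self.b -= lr * err
        return err * err


def finetune(embed, head, data, epochs=200, lr=0.05):
    """Train only the head; the backbone is treated as fixed (an assumption)."""
    features = [(embed(x), y) for x, y in data]  # embeddings computed once
    history = []
    for _ in range(epochs):
        total = sum(head.sgd_step(h, y, lr) for h, y in features)
        history.append(total / len(features))  # mean squared error per epoch
    return history


if __name__ == "__main__":
    # Toy low-data task: four made-up descriptor vectors with made-up targets.
    toy_data = [([0.1, 0.5, -0.3], 1.0), ([0.9, -0.2, 0.4], -0.5),
                ([-0.6, 0.3, 0.8], 0.7), ([0.2, -0.7, -0.1], 0.2)]
    backbone = make_frozen_backbone(in_dim=3, hidden_dim=8)
    head = LinearHead(hidden_dim=8)
    losses = finetune(backbone, head, toy_data)
    print(f"first-epoch MSE {losses[0]:.4f}, last-epoch MSE {losses[-1]:.4f}")
```

Freezing the backbone is only one possible design choice; the abstract does not say whether the foundational model's weights are updated during finetuning, so a variant that also trains the backbone would be equally consistent with it.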

This article is part of the themed collection: 2023 Chemical Science HOT Article Collection

Article information

DOI: https://doi.org/10.1039/D3SC04928K
Article type: Edge Article
Submitted: 19 Sep 2023
Accepted: 15 Nov 2023
First published: 24 Nov 2023
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry

Chem. Sci., 2024, 15, 5143-5151

E. King-Smith, Chem. Sci., 2024, 15, 5143 DOI: 10.1039/D3SC04928K

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Chemical Science
