Reinforcement learning in crystal structure prediction†
Abstract
Crystal Structure Prediction (CSP) is a fundamental computational problem in materials science. Basin-hopping is a prominent CSP method that combines global Monte Carlo sampling over trial structures with local energy minimisation of the sampled candidates. The sampling uses a stochastic policy to randomly choose which action (such as a swap of atoms) transforms the current structure into the next. Typically hand-tuned for a specific system before the run starts, such a policy is simply a fixed discrete probability distribution over possible actions: it neither depends on the current structure nor adapts during a CSP run. We show that reinforcement learning (RL) can generate a dynamic policy that both depends on the current structure and improves on the fly during the CSP run. We demonstrate the efficacy of our approach on two CSP codes, FUSE and MC-EMMA. Specifically, we show that, when applied to the autonomous exploration of a phase field to identify the accessible crystal structures, RL can save up to 46% of the computation time.
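To make the contrast concrete, the following minimal Python sketch compares a fixed, hand-tuned action distribution with a toy state-dependent policy that updates its action-value estimates during the run. The action names, state key, epsilon-greedy rule, and reward definition are illustrative assumptions for this sketch only, not the policies or moves implemented in FUSE or MC-EMMA.

```python
import random
from collections import defaultdict

# Hypothetical action set; the real FUSE/MC-EMMA move sets differ.
ACTIONS = ["swap_atoms", "swap_rows", "permute_layers", "new_random_structure"]

def fixed_policy(_structure):
    """Conventional basin-hopping sampling: one hand-tuned distribution,
    independent of the current structure and never updated."""
    weights = [0.4, 0.3, 0.2, 0.1]  # chosen before the run starts
    return random.choices(ACTIONS, weights)[0]

class AdaptivePolicy:
    """Toy state-dependent policy updated on the fly: epsilon-greedy over
    per-(state, action) value estimates. Illustrative only."""
    def __init__(self, epsilon=0.1, lr=0.1):
        self.q = defaultdict(float)   # (state_key, action) -> estimated value
        self.epsilon = epsilon
        self.lr = lr

    def select(self, state_key):
        # Explore with probability epsilon, otherwise pick the best-valued action.
        if random.random() < self.epsilon:
            return random.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: self.q[(state_key, a)])

    def update(self, state_key, action, reward):
        # Reward could be, e.g., the energy decrease after local minimisation.
        key = (state_key, action)
        self.q[key] += self.lr * (reward - self.q[key])
```

In this sketch, `state_key` stands in for whatever descriptor of the current structure the policy conditions on; the adaptive policy's action probabilities therefore change both with the structure and as rewards accumulate, which is the property the abstract attributes to the RL-generated policy.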