Application of large datasets to assess trends in the stability of perovskite photovoltaics through machine learning†
Abstract
Current trends in manufacturing indicate that optimised decision making using new state-of-the-art machine learning (ML) technologies will be used. ML is a versatile technique that rapidly and accurately generates new insights from multifactorial data. The ML approach has been applied to a perovskite solar cell (PSC) database to elucidate trends in stability and forecast the stability of new configurations. A database consisting of 6038 entries of device characteristics, performance, and stability data was utilised, and a sequential minimal optimisation regression (SMOreg) model was employed to determine the most influential factors governing solar cell stability. When considering sub-sections of data, it was found that pin-device architectures provided the best model fittings with a training correlation efficiency of 0.963, compared to 0.699 for all device architectures. By establishing models for each PSC architecture, the analysis allows the identification of materials that can lead to improvements in stability. This paper also attempts to summarise some key challenges and trends in the current research methodologies.