High throughput molecular design of electron donors and non-fullerene acceptors using machine learning combined with substructure importance

Abstract

The electron donor and acceptor materials in active layer critically influence organic solar cells (OSCs) performance. However, traditional experimental methods for discovering high-performance materials are often time-consuming, costly and inefficient. Herein, to address this challenge, we established the database containing 547 donor-acceptor pairs in OSCs. Each molecule in database was represented using Morgan and MACCS fingerprints. Machine learning Random Forest (RF) model was employed, with hyperparameters were optimized through grid search, to develop the predictive model for power conversion efficiency (PCE). To gain insights into the relationship between PCE and molecular substructures of both donors and non-fullerene acceptors, SHAP analysis was performed based on MACCS fingerprints. The top five important MACCS fingerprints were figured out for donor and non-fullerene acceptor molecules that positively correlate with PCE. The donor and non-fullerene acceptor molecules in constructed database were cut into molecular unit for enriching chemical space of efficient molecular design. The important donor units, acceptor units and π units were screened and selected to design donors (D-π-A-π type) and non-fullerene acceptor (A-π-D-π-A and A-D-A types) molecules, generated 4,914 donor and 701,800 acceptor molecules. Correspondingly, 3,448,645,200 donor-acceptor pairs were obtained. The PCE of newly designed donor-acceptor pairs were predicted using the optimized RF model. The 14,296 new donor-acceptor pairs were identified with the predicted PCE exceeding 14.00%. Among them, 123 pairs exhibited PCE greater than 15.50%, with the highest predicted PCE of 15.91%. This method enables the efficient molecular design of large number of potential OSCs materials.

Supplementary files

Article information

Article type
Paper
Submitted
03 Mar 2025
Accepted
02 Jun 2025
First published
03 Jun 2025

J. Mater. Chem. C, 2025, Accepted Manuscript

High throughput molecular design of electron donors and non-fullerene acceptors using machine learning combined with substructure importance

C. Zhang, L. Lv, M. Li, X. Liu, J. Gong, Z. Liu, Y. Wu and H. Chen, J. Mater. Chem. C, 2025, Accepted Manuscript , DOI: 10.1039/D5TC00931F

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements