BiBERTa: A Self-Supervised Framework for Accelerating the Discovery of Stable Organic Photovoltaic Materials

Abstract

The discovery of high-performance organic photovoltaic materials remains a time-consuming and resource-intensive process due to the combinatorial complexity of donor-acceptor pairs and the limited availability of experimental data. To address this challenge, we propose BiBERTa, a self-supervised deep learning framework that integrates large-scale pretraining (77 million SMILES) and domain-specific fine-tuning (2,449 experimental pairs) to predict power conversion efficiency (PCE) directly from molecular structures. Utilizing a bi-encoder RoBERTa architecture, BiBERTa captures critical chemical motifs, such as conjugated backbones and electron-withdrawing groups, through attention mechanisms, achieving state-of-the-art prediction accuracy (MAE = 1.67%, R² = 0.73) and generalizability across a wide range of acceptors, including emerging stable quasi-macromolecules. Leveraging this model, we designed and synthesized novel acceptors, achieving a PCE of 15.15% in PM6-based devices. Experimental validation confirmed the reliability of BiBERTa, with an MAE of 1.21% between predicted and measured PCEs. The synergy between computational screening and experimental optimization has reduced the discovery cycle compared to conventional trial-and-error approaches. A user-friendly web server (https://huggingface.co/spaces/jinysun/BiBERTa) facilitates community-driven material exploration, bridging molecular design, machine learning, and scalable synthesis. This work provides a paradigm for data-efficient discovery of energy materials under limited experimental resources.

Supplementary files

Article information

Article type
Paper
Submitted
24 feb 2025
Accepted
09 giu 2025
First published
11 giu 2025

J. Mater. Chem. A, 2025, Accepted Manuscript

BiBERTa: A Self-Supervised Framework for Accelerating the Discovery of Stable Organic Photovoltaic Materials

J. Sun, D. Li, J. Zou, X. Tan, Y. Wang, H. Zhang, Y. Zou, Z. Zhang and H. Lu, J. Mater. Chem. A, 2025, Accepted Manuscript , DOI: 10.1039/D5TA01529D

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements