SynAsk: Unleashing the Power of Large Language Models in Organic Synthesis

Abstract

The field of natural language processing (NLP) has witnessed a transformative shift with the emergence of large language models (LLMs), revolutionizing various language tasks and applications, and the integration of LLM into specialized domains enhances their capabilities for domain-specific applications. Notably, NLP has made significant strides in organic chemistry, particularly in predicting synthetic tasks, paving the way for the development of LLMs tailored to the organic chemistry field. In this work, we introduce SynAsk, a comprehensive organic chemistry domain-specific LLM platform developed by AIChemEco Inc. By finetuning an LLM with domain-specific data and integrating it with a chain of thought approach, SynAsk seamlessly accesses our knowledge base and advanced chemistry tools in a question-and-answer format. This includes functionalities such as a basic chemistry knowledge base, molecular information retrieval, reaction performance prediction, retrosynthesis prediction, chemical literature acquisition, and more. This novel methodology synergizes fine-tuning techniques with external resource integration, resulting in an organic chemistry-specific model poised to facilitate research and discovery in the field. Accessible via https://synask.aichemeco.com, SynAsk represents a significant advancement in leveraging NLP for synthetic applications.

Supplementary files

Article information

Article type
Edge Article
Submitted
17 Jul 2024
Accepted
11 Nov 2024
First published
18 Nov 2024
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY license

Chem. Sci., 2024, Accepted Manuscript

SynAsk: Unleashing the Power of Large Language Models in Organic Synthesis

C. Zhang, Q. Lin, B. Zhu, H. Yang, X. Lian, H. Deng, J. Zheng and K. Liao, Chem. Sci., 2024, Accepted Manuscript , DOI: 10.1039/D4SC04757E

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements