Accelerating the inference of string generation-based chemical reaction models for industrial applications

Mikhail Andronov*, Natalia Andronova, Michael Wand, Jürgen Schmidhuber, Djork Arné Clevert

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Transformer-based, template-free SMILES-to-SMILES translation models for reaction prediction and single-step retrosynthesis are of interest to computer-aided synthesis planning systems, as they offer state-of-the-art accuracy. However, their slow inference speed limits their practical utility in such applications. To address this challenge, we propose speculative decoding with a simple chemically specific drafting strategy and apply it to the Molecular Transformer, an encoder-decoder transformer for conditional SMILES generation. Our approach achieves over 3X faster inference in reaction product prediction and single-step retrosynthesis with no loss in accuracy, increasing the potential of the transformer as the backbone of synthesis planning systems. To accelerate the simultaneous generation of multiple precursor SMILES for a given query SMILES in single-step retrosynthesis, we introduce Speculative Beam Search, a novel algorithm tackling the challenge of beam search acceleration with speculative decoding. Our methods aim to improve transformer-based models’ scalability and industrial applicability in synthesis planning.

Original languageEnglish (US)
Article number31
JournalJournal of Cheminformatics
Volume17
Issue number1
DOIs
StatePublished - Dec 2025

Keywords

  • CASP
  • Fast inference
  • Reaction prediction
  • Single-step retrosynthesis
  • Speculative decoding

ASJC Scopus subject areas

  • Computer Science Applications
  • Physical and Theoretical Chemistry
  • Computer Graphics and Computer-Aided Design
  • Library and Information Sciences

Fingerprint

Dive into the research topics of 'Accelerating the inference of string generation-based chemical reaction models for industrial applications'. Together they form a unique fingerprint.

Cite this