Root-aligned SMILES: a tight representation for chemical reaction prediction?
Chemical Science Pub Date: 2022-07-12 DOI: 10.1039/D2SC02763A
Abstract
Chemical reaction prediction, involving forward synthesis and retrosynthesis prediction, is a fundamental problem in organic synthesis. A popular computational paradigm formulates synthesis prediction as a sequence-to-sequence translation problem, where the typical SMILES is adopted for molecule representations. However, the general-purpose SMILES neglects the characteristics of chemical reactions, where the molecular graph topology is largely unaltered from reactants to products, resulting in the suboptimal performance of SMILES if straightforwardly applied. In this article, we propose the root-aligned SMILES (R-SMILES), which specifies a tightly aligned one-to-one mapping between the product and the reactant SMILES for more efficient synthesis prediction. Due to the strict one-to-one mapping and reduced edit distance, the computational model is largely relieved from learning the complex syntax and dedicated to learning the chemical knowledge for reactions. We compare the proposed R-SMILES with various state-of-the-art baselines and show that it significantly outperforms them all, demonstrating the superiority of the proposed method.
Recommended Literature
- [1] Fatty acid positional distribution in colostrum and mature milk of women living in Inner Mongolia, North Jiangsu and Guangxi of China? Long Deng,Qian Zou,Biao Liu,Wenhui Ye,Chengfei Zhuo,Li Chen,Ze-Yuan Deng,Ya-Wei Fan,Jing LiFood Funct., 2018,9, 4234-4245 10.1039/C8FO00787J
- [2] Establishing plasmon contribution to chemical reactions: alkoxyamines as a thermal probe? Olga Guselnikova,Gérard Audran,Jean-Patrick Joly,Andrii Trelin,Evgeny V. Tretyakov,Vaclav Svorcik,Oleksiy Lyutakov,Sylvain R. A. MarqueChem. Sci., 2021,12, 4154-4161 10.1039/D0SC06470J
- [3] Excimer emission and magnetoluminescence of radical-based zinc(ii) complexes doped in host crystals? Shojiro Kimura,Tetsuro KusamotoChem. Commun., 2020,56, 11195-11198 10.1039/D0CC04830E
- [4] Excited state dynamics of symmetric and asymmetric Cr3(dpa)4Cl2 measured using femtosecond transient absorption spectroscopy? Chao-Han Cheng,Wen-Zhen Wang,Shie-Ming Peng,I-Chia ChenPhys. Chem. Chem. Phys., 2017,19, 25471-25477 10.1039/C7CP03968A
- [5] Emerging investigator series: first-principles and thermodynamics comparison of compositionally-tuned delafossites: cation release from the (001) surface of complex metal oxides? Joseph W. Bennett,Diamond T. Jones,Blake G. Hudson,Joshua Melendez-Rivera,Robert J. Hamers,Sara E. MasonEnviron. Sci.: Nano, 2020,7, 1642-1651 10.1039/C9EN01304K
- [6] Estimates of hydride ion stability in condensed systems: energy of formation and solvation in aqueous and polar-organic solvents Craig A. Kelly,David R. RosseinskyPhys. Chem. Chem. Phys., 2001,3, 2086-2090 10.1039/B010092G
- [7] Examination of the hydrogen-bonding networks in small water clusters (n = 2–5, 13, 17) using absolutely localized molecular orbital energy decomposition analysis? Erika A. Cobar,Paul R. Horn,Robert G. Bergman,Martin Head-GordonPhys. Chem. Chem. Phys., 2012,14, 15328-15339 10.1039/C2CP42522J
- [8] Exchangeability of amino acid residues with similar physicochemical properties in coiled-coil interactions? Guiying Zhang,Maosheng Cheng,Yanni Li,Keliang Liu,Lifeng CaiChem. Commun., 2013,49, 11086-11088 10.1039/C3CC46560H
- [9] Excellent energy storage performance in NaNbO3-based relaxor antiferroeic ceramics under a low electric field XuxinCheng,XiaomingChen,PengyuanFan 10.1007/s10832-022-00283-w
- [10] Enabling stable MnO2 matrix for aqueous zinc-ion battery cathodes? Yiding Jiao,Liqun Kang,Jasper Berry-Gair,Kit McColl,Jianwei Li,Haobo Dong,Hao Jiang,Ryan Wang,Furio Corà,Dan J. L. Brett,Ivan P. ParkinJ. Mater. Chem. A, 2020,8, 22075-22082 10.1039/D0TA08638J
Journal Name:Chemical Science
research_products
-
CAS no.: 89640-58-4