Generating molecules with optimized aqueous solubility using iterative graph translation
Reaction Chemistry & Engineering Pub Date: 2021-11-15 DOI: 10.1039/D1RE00315A
Abstract
While molecular discovery is critical for solving many scientific problems, the time and resource costs of experiments make it intractable to fully explore chemical space. Here, we present a generative modeling framework that proposes novel molecules that are 1) based on starting candidate structures and 2) optimized with respect to one or more objectives or constraints. We explore how this framework performs in an applied setting by focusing on the problem of optimizing molecules for aqueous solubility, using an experimental database containing data curated from the literature. The resulting model was capable of improving molecules with a range of starting solubilities. When synthetic feasibility was applied as a secondary optimization constraint (estimated using a combination of synthetic accessibility and retrosynthetic accessibility scores), the model generated synthetically feasible molecules 83.0% of the time (compared with 59.9% of the time without the constraint). To validate model performance experimentally, a set of candidate molecules was translated using the model and the solubilities of the candidate and generated molecules were verified experimentally. We additionally validated model performance via experimental measurements by holding out the top 100 most soluble molecules during training and showing that the model could rediscover 33 of those molecules. To determine the sensitivity of model performance to dataset size, we trained the model on different subsets of the initial training dataset. We found that model performance did not decrease significantly when the model was trained on a random 50% subset of the training data but did decrease when the model was trained on subsets containing only less soluble molecules (i.e., the bottom 50%). Overall, this framework serves as a tool for generating optimized, synthetically feasible molecules that can be applied to a range of problems in chemistry and chemical engineering.
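The dataset-level checks described in the abstract (holding out the most soluble molecules, counting how many are rediscovered, and training on a random or bottom 50% subset) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the `(smiles, log_solubility)` record layout, and the toy values are all assumptions made for illustration, and the comparison assumes that SMILES strings have already been canonicalized so that exact string matching is meaningful.

```python
import random

def hold_out_top_k(records, k):
    """Split (smiles, log_solubility) records into (train, held_out),
    where held_out contains the k most soluble molecules."""
    ranked = sorted(records, key=lambda r: r[1], reverse=True)
    return ranked[k:], ranked[:k]

def random_half(records, seed=0):
    """Return a random 50% subset of the records (seeded for repeatability)."""
    rng = random.Random(seed)
    return rng.sample(records, len(records) // 2)

def bottom_half(records):
    """Return the less soluble 50% of the records."""
    ranked = sorted(records, key=lambda r: r[1])
    return ranked[: len(ranked) // 2]

def rediscovery_count(generated_smiles, held_out):
    """Count held-out molecules that appear among the generated SMILES.
    Assumes both sides use the same canonical SMILES form."""
    held = {smiles for smiles, _ in held_out}
    return len(held & set(generated_smiles))

# Toy records with made-up solubility values (hypothetical, for illustration only).
records = [("CCO", 1.1), ("c1ccccc1", -1.6), ("CC(=O)O", 1.2), ("CCCCCCCC", -5.2)]
train, held_out = hold_out_top_k(records, k=2)

# Stand-in for the output of a generative model trained on `train`.
generated = ["CC(=O)O", "CCN"]
count = rediscovery_count(generated, held_out)
print(f"rediscovered {count} of {len(held_out)} held-out molecules")
```

In the setting of the abstract, `k` would be 100 and the reported rediscovery count is 33; the subset helpers correspond to the random 50% and bottom 50% training experiments.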