One class classification as a practical approach for accelerating π–π co-crystal discovery?
Chemical Science Pub Date: 2020-12-08 DOI: 10.1039/D0SC04263C
Abstract
The implementation of machine learning models has brought major changes in the decision-making process for materials design. One matter of concern for the data-driven approaches is the lack of negative data from unsuccessful synthetic attempts, which might generate inherently imbalanced datasets. We propose the application of the one-class classification methodology as an effective tool for tackling these limitations on the materials design problems. This is a concept of learning based only on a well-defined class without counter examples. An extensive study on the different one-class classification algorithms is performed until the most appropriate workflow is identified for guiding the discovery of emerging materials belonging to a relatively small class, that being the weakly bound polyaromatic hydrocarbon co-crystals. The two-step approach presented in this study first trains the model using all the known molecular combinations that form this class of co-crystals extracted from the Cambridge Structural Database (1722 molecular combinations), followed by scoring possible yet unknown pairs from the ZINC15 database (21?736 possible molecular combinations). Focusing on the highest-ranking pairs predicted to have higher probability of forming co-crystals, materials discovery can be accelerated by reducing the vast molecular space and directing the synthetic efforts of chemists. Further on, using interpretability techniques a more detailed understanding of the molecular properties causing co-crystallization is sought after. The applicability of the current methodology is demonstrated with the discovery of two novel co-crystals, namely pyrene-6H-benzo[c]chromen-6-one (1) and pyrene-9,10-dicyanoanthracene (2).
Recommended Literature
- [1] An automatic determination of thoria in thoria-urania mixtures Analyst, 1966,91, 208-210 10.1039/AN9669100208
- [2] An algal process treatment combined with the Fenton reaction for high concentrations of amoxicillin and cefradine Haitao Li,Yu Pan,Zhizhi Wang,Shan Chen,Ruixin Guo,Jianqiu ChenRSC Adv., 2015,5, 100775-100782 10.1039/C5RA21508K
- [3] An approach to the structure and vibrational analysis of cis- and trans-3-chlorostyrene through IR/Raman and INS spectroscopies and theoretical ab initio/DFT calculations? J. M. Granadino-Roldán,M. Fernández-Gómez,A. Navarro,T. Pe?a Ruiz,U. A. JayasooriyaPhys. Chem. Chem. Phys., 2004,6, 1133-1143 10.1039/B314243D
- [4] An insight into the origin of room-temperature ferromagnetism in SnO2 and Mn-doped SnO2 quantum dots: an experimental and DFT approach? Dhamodaran Manikandan,S. Amirthapandian,I. S. Zhidkov,A. I. Kukharenko,S. O. Cholakh,Ramaswamy MuruganPhys. Chem. Chem. Phys., 2018,20, 6500-6514 10.1039/C7CP07182E
- [5] An anti-ultrasonic-stripping effect in confined micro/nanoscale cavities and its applications for efficient multiscale metallic patterning? Quan Xiang,Yiqin Chen,Zhiqin Li,Kaixi Bi,Guanhua Zhang,Huigao DuanNanoscale, 2016,8, 19541-19550 10.1039/C6NR07585A
- [6] Alumina coating on 5 V lithium cobalt fluorophosphate cathode material for lithium secondary batteries – synthesis and electrochemical properties? S. Amaresh,K. Karthikeyan,K. J. Kim,Y. S. LeeRSC Adv., 2014,4, 23107-23115 10.1039/C4RA02318H
- [7] An atomically efficient, highly stable and redox active Ce0.5Tb0.5Ox (3% mol.)/MgO catalyst for total oxidation of methane? Juan J. Sánchez,Miguel López-Haro,Juan C. Hernández-Garrido,Ginesa Blanco,Miguel A. Cauqui,José M. Rodríguez-Izquierdo,José A. Pérez-Omil,José J. Calvino,María P. YesteJ. Mater. Chem. A, 2019,7, 8993-9003 10.1039/C8TA11672E
- [8] An Assessment of the Laminar Hypersonic Double-Cone Experiments in the LENS-XX Tunnel JaideepRay,PatrickBlonigan,EricT.Phipps,KathrynMaupin 10.2514/1.j062802
- [9] An artificial enzyme cascade amplification strategy for highly sensitive and specific detection of breast cancer-derived exosomes? Huiying Xu,Lu Zheng,Yu Zhou,Bang-Ce YeAnalyst, 2021,146, 5542-5549 10.1039/D1AN01071A
- [10] Aluminium alkyl and aryloxide complexes of pyrazine and bipyridines: synthesis and structure? Doug Ogrin,Laura H. van Poppel,Simon G. Bott,Andrew R. BarronDalton Trans., 2004, 3689-3694 10.1039/B410662H
Journal Name:Chemical Science
research_products
-
CAS no.: 89640-58-4