A SAR and QSAR study on cyclin dependent kinase 4 inhibitors using machine learning methods?
Digital Discovery Pub Date: 2023-05-31 DOI: 10.1039/D2DD00143H
Abstract
Cyclin dependent kinase 4 (CDK4) is a promising target for cancer treatment, and developing new effective CDK4 inhibitors is of great significance in anticancer therapy. In this study, we conducted a structure activity relationship (SAR) study on 3018 CDK4 inhibitors. We applied four machine learning methods, which were Multiple Linear Regression (MLR), Random Forest (RF), Support Vector Machine (SVM) and Deep Neural Network (DNN), to develop 18 classification models based on 3018 inhibitors (dataset 1), 18 classification models based on dataset 1 and decoys, and 24 quantitative structure–activity relationship (QSAR) models based on 1427 inhibitors (dataset 2). We obtained some optimal models. Based on dataset 1, Model A2, built by SVM and MACCS fingerprints, has a prediction accuracy (Q) of 92.68% and a Matthews correlation coefficient (MCC) of 0.874 for the test set. Based on dataset 1 and decoys, Model C2, built by SVM and MACCS fingerprints, has a Q of 98.5% and a MCC of 0.937 for the test set. Based on dataset 2, Model F7, built by SVM and MOE descriptors, has a coefficient of determination (R2) of 0.824 and a root mean squared error (RMSE) of 0.534 for the test set. For classification models, it was found that the more samples used for modelling, the more robust the models, and the better the performance of the models. Moreover, we clustered 3018 inhibitors into 12 subsets, and analysed their scaffolds and fragment features. It was found that 2-aminopyrimidine, pyridine, piperazine and cyclopentane were common scaffolds and fragments in highly active inhibitors. This study can provide guidance for the discovery and optimization of CDK4 inhibitor lead compounds.
Recommended Literature
- [1] Fatty acid eutectic mixtures and derivatives from non-edible animal fat as phase change materials? Pau Gallart-Sirvent,Marc Martín,Gemma Villorbina,Mercè Balcells,Aran Solé,Luisa F. Cabeza,Ramon Canela-GarayoaRSC Adv., 2017,7, 24133-24139 10.1039/C7RA03845C
- [2] Fast synthesis of copper nanoclusters through the use of hydrogen peroxide additive and their application for the fluorescence detection of Hg2+ in water samples? Liao Xiaoqing,Li Ruiyi,Li Zaijun,Sun Xiulan,Wang Zhouping,Liu JunkangNew J. Chem., 2015,39, 5240-5248 10.1039/C5NJ00831J
- [3] Dissociative dynamics of O2 on Ag(110)? Ivor Lon?ari?Phys. Chem. Chem. Phys., 2015,17, 9436-9445 10.1039/C4CP05900J
- [4] Evolution in surface coverage of CH3NH3PbI3?XClXvia heat assisted solvent vapour treatment and their effects on photovoltaic performance of devices Dhirendra K. Chaudhary,Pramendra Kumar,Lokendra KumarRSC Adv., 2016,6, 94731-94738 10.1039/C6RA18729C
- [5] Establishing the accuracy of position-specific carbon isotope analysis of propane by GC-pyrolysis-GC-IRMS ChangjieLiu,PengLiu,XiaofengWang,XiaoqiangLi,JuskeHorita 10.1002/rcm.9494
- [6] Enabling chloride salts for thermal energy storage: implications of salt purity? J. Matthew Kurley,Phillip W. Halstenberg,Abbey McAlister,Stephen Raiman,Richard T. MayesRSC Adv., 2019,9, 25602-25608 10.1039/C9RA03133B
- [7] Excellent energy storage performance in NaNbO3-based relaxor antiferroeic ceramics under a low electric field XuxinCheng,XiaomingChen,PengyuanFan 10.1007/s10832-022-00283-w
- [8] Emerging investigator series: bacteriophages as nano engineering tools for quality monitoring and pathogen detection in water and wastewater Fereshteh BayatEnviron. Sci.: Nano, 2021,8, 367-389 10.1039/D0EN00962H
- [9] Evolution of dealloying induced strain in nanoporous gold crystals? Ross Harder,David C. Dunand,Ian McNultyNanoscale, 2017,9, 5686-5693 10.1039/C6NR09635B
- [10] Excellent humidity sensor based on ultrathin HKUST-1 nanosheets? Qiaoe Wang,Meiling Lian,Xiaowen Zhu,Xu ChenRSC Adv., 2021,11, 192-197 10.1039/D0RA08354B
Journal Name:Digital Discovery
research_products
-
CAS no.: 89640-58-4