Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Datasets

Abstract

Machine learning methods have been applied to many datasets in pharmaceutical research for several decades. The relative ease and availability of fingerprint-type molecular descriptors paired with Bayesian methods resulted in the widespread use of this approach for a diverse array of endpoints relevant to drug discovery. Deep learning is the latest machine learning algorithm attracting attention for many pharmaceutical applications, from docking to virtual screening. Deep learning is based on an artificial neural network with multiple hidden layers and has found considerable traction for many artificial intelligence applications. We have previously suggested the need for a comparison of different machine learning methods with deep learning across an array of varying datasets applicable to pharmaceutical research. Endpoints relevant to pharmaceutical research include absorption, distribution, metabolism, excretion and toxicity (ADME/Tox) properties, as well as activity against pathogens and drug discovery datasets. In this study, we have used datasets for solubility, probe-likeness, hERG, KCNQ1, bubonic plague, Chagas, tuberculosis and malaria to compare different machine learning methods using FCFP6 fingerprints. These datasets represent whole cell screens, individual proteins, physicochemical properties, as well as a dataset with a complex endpoint. Our aim was to assess whether deep learning offered any improvement in testing when assessed using an array of metrics including AUC, F1 score, Cohen’s kappa, Matthews correlation coefficient and others. Based on ranked normalized scores for the metrics or datasets, Deep Neural Networks (DNN) ranked higher than SVM, which in turn ranked higher than all the other machine learning methods. Visualizing these properties for training and test sets using radar-type plots indicates when models are inferior or perhaps overtrained. These results also suggest the need for assessing deep learning further using multiple metrics with much larger scale comparisons, prospective testing, as well as assessment of different fingerprints and DNN architectures beyond those used.
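For readers unfamiliar with the metrics named in the abstract, the following is a minimal sketch of how AUC, F1 score, Cohen's kappa and the Matthews correlation coefficient can be computed for a binary classifier using their standard textbook definitions. It is not the authors' code: the function names and the toy labels and scores are assumptions made purely for illustration, and the study's own ranking and normalization procedure is not reproduced here.

```python
from math import sqrt


def confusion_counts(y_true, y_pred):
    """Return (tp, fp, tn, fn) for binary labels coded as 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def f1_score(tp, fp, tn, fn):
    """Harmonic mean of precision and recall."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def cohen_kappa(tp, fp, tn, fn):
    """Observed agreement corrected for agreement expected by chance."""
    n = tp + fp + tn + fn
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    return (observed - expected) / (1 - expected) if expected != 1 else 0.0


def matthews_corrcoef(tp, fp, tn, fn):
    """Correlation between predicted and true binary labels."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def roc_auc(y_true, scores):
    """Probability that a random positive outscores a random negative.

    Ties count as 0.5. Quadratic in the number of samples, so this is
    only suitable for small illustrative inputs.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (not from the study): 4 actives, 4 inactives.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.1, 0.2, 0.3, 0.6]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

counts = confusion_counts(y_true, y_pred)
print("F1   :", f1_score(*counts))
print("kappa:", cohen_kappa(*counts))
print("MCC  :", matthews_corrcoef(*counts))
print("AUC  :", roc_auc(y_true, scores))
```

AUC is computed from the continuous scores, whereas the other three depend on the chosen decision threshold, which is one reason a study might report several metrics side by side rather than relying on a single one.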

Bibliographic details

Journal name: other
Authors
Alexandru Korotcov; Valery Tkachenko; Daniel P Russo; Sean Ekins;
Year (volume), issue: -1 (14), 12
Year: -1
Pages: 4462–4475
Total pages: 33
Original format: PDF
Keywords
Deep Learning; Drug Discovery; Machine learning; Pharmaceutics; Support Vector Machine

Similar literature

1. Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets [J]. Korotcov Alexandru, Tkachenko Valery, Russo Daniel P., Molecular pharmaceutics. 2017, issue 12
2. Combining deep residual neural network features with supervised machine learning algorithms to classify diverse food image datasets [J]. McAllister Patrick, Zheng Huiru, Bond Raymond, Computers in Biology and Medicine. 2018
3. From machine learning to deep learning: progress in machine intelligence for rational drug discovery [J]. Lu Zhang, Jianjun Tan, Dan Han, Drug discovery today. 2017, issue 11
4. Video Captioning using Deep Learning: An Overview of Methods, Datasets and Metrics [C]. M. Amaresh, S. Chitrakala, International Conference on Communication and Signal Processing. 2019
5. Using Machine Learning on Diverse Datasets to Predict Drug-Induced Liver Injury [D]. Adeluwa, Temidayo Peter. 2021
6. Recent applications of deep learning and machine intelligence on in silico drug discovery: methods tools and databases [O]. Ahmet Sureyya Rifaioglu, Heval Atas, Maria Jesus Martin, -1
7. A Very Large-Scale Bioactivity Comparison of Deep Learning and Multiple Machine Learning Algorithms for Drug Discovery [O]. Thomas R. Lane, Daniel H. Foil, Eni Minerali, 2020
