Weighted similarity-based clustering of chemical structures and bioactivity data in early drug discovery

Perualila-Tan Nolen Joy; Shkedy Ziv; Talloen Willem; Gohlmann Hinrich W. H.; Van Moerbeke Marijke; Kasim Adetayo

Abstract

The modern process of discovering candidate molecules in the early drug discovery phase includes a wide range of approaches to extract vital information from the intersection of biology and chemistry. A typical strategy in compound selection involves clustering compounds by chemical similarity to obtain representative, chemically diverse compounds, without incorporating potency information. In this paper, we propose an integrative clustering approach that makes use of both biological (compound efficacy) and chemical (structural features) data sources to discover a subset of compounds with aligned structural and biological properties. The datasets are integrated at the similarity level by assigning complementary weights to produce a weighted similarity matrix, which can serve as a generic input to any clustering algorithm. This new analysis workflow is a semi-supervised method because, after the clusters are determined, a secondary analysis identifies differentially expressed genes associated with the derived integrated cluster(s) to further explain the compound-induced biological effects inside the cell. Datasets from two drug development oncology projects are used to illustrate the usefulness of the weighted similarity-based clustering approach for integrating multi-source, high-dimensional information to aid drug discovery. Compounds that are structurally and biologically similar to the reference compounds are discovered using the proposed integrative approach.
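The integration step described in the abstract can be illustrated with a small sketch. It reads "complementary weights" as S = w * S_chem + (1 - w) * S_bio with 0 <= w <= 1. Everything else here is an assumption made for illustration and is not taken from the paper: both data sources are represented as sets of binary features, Tanimoto similarity is used for both, and a naive average-linkage procedure stands in for "any clustering algorithm". All function names and the toy data are hypothetical.

```python
from itertools import combinations


def tanimoto(a: set, b: set) -> float:
    """Tanimoto (Jaccard) similarity between two sets of binary features."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similarity_matrix(features: list) -> list:
    """Pairwise Tanimoto similarity for a list of feature sets."""
    n = len(features)
    return [[tanimoto(features[i], features[j]) for j in range(n)] for i in range(n)]


def weighted_similarity(s_chem: list, s_bio: list, w: float) -> list:
    """Combine two similarity matrices with complementary weights w and 1 - w."""
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    n = len(s_chem)
    return [[w * s_chem[i][j] + (1 - w) * s_bio[i][j] for j in range(n)] for i in range(n)]


def average_linkage(sim: list, k: int) -> list:
    """Naive agglomerative clustering on a similarity matrix (assumed choice).

    Repeatedly merges the two clusters with the highest mean pairwise
    similarity until k clusters remain.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    clusters = [[i] for i in range(len(sim))]

    def mean_sim(pair):
        a, b = clusters[pair[0]], clusters[pair[1]]
        return sum(sim[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > k:
        a, b = max(combinations(range(len(clusters)), 2), key=mean_sim)
        clusters[a] = clusters[a] + clusters[b]  # a < b, so index a stays valid
        del clusters[b]
    return clusters


# Toy data (hypothetical): four compounds, each with structural-feature bits
# and a set of bioactivity flags.
chem = [{1, 2, 3}, {1, 2, 4}, {7, 8, 9}, {7, 8}]
bio = [{10, 11}, {10, 11}, {20, 21}, {20}]

s_weighted = weighted_similarity(similarity_matrix(chem), similarity_matrix(bio), w=0.5)
print(average_linkage(s_weighted, k=2))
```

Setting w = 1 reduces the sketch to purely chemical clustering and w = 0 to purely biological clustering; intermediate values trade off the two sources. How the weight is chosen, and which similarity measures are appropriate for each data type, are left open here.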


Bibliographic details

Source: Journal of Bioinformatics and Computational Biology, 2016, Issue 4, 22 pages
Format: PDF
Language: English
Chinese Library Classification: Cell biology
Keywords: Bioactivity; chemical structure; clustering; transcriptomic

