Efficient layered density-based clustering of categorical data.

Andreopoulos B; An A; Wang X; Labudde D

首页> 外文期刊>Journal of biomedical informatics. >Efficient layered density-based clustering of categorical data.

【24h】

Efficient layered density-based clustering of categorical data.

机译：基于分层密度的高效分类数据聚类。

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A challenge involved in applying density-based clustering to categorical biomedical data is that the "cube" of attribute values has no ordering defined, making the search for dense subspaces slow. We propose the HIERDENC algorithm for hierarchical density-based clustering of categorical data, and a complementary index for searching for dense subspaces efficiently. The HIERDENC index is updated when new objects are introduced, such that clustering does not need to be repeated on all objects. The updating and cluster retrieval are efficient. Comparisons with several other clustering algorithms showed that on large datasets HIERDENC achieved better runtime scalability on the number of objects, as well as cluster quality. By fast collapsing the bicliques in large networks we achieved an edge reduction of as much as 86.5%. HIERDENC is suitable for large and quickly growing datasets, since it is independent of object ordering, does not require re-clustering when new data emerges, and requires no user-specified input parameters.

机译：将基于密度的聚类应用于分类生物医学数据所涉及的挑战是，属性值的“多维数据集”没有定义顺序，从而使得对密集子空间的搜索变慢。我们提出了用于基于分层密度的分类数据聚类的HIERDENC算法，以及用于高效搜索密集子空间的互补索引。引入新对象时，将更新HIERDENC索引，这样就不必在所有对象上重复进行聚类。更新和群集检索效率很高。与其他几种聚类算法的比较表明，在大型数据集上，HIERDENC在对象数量以及聚类质量方面实现了更好的运行时可伸缩性。通过在大型网络中快速折叠Biclique，我们减少了多达86.5％的边缘。 HIERDENC适用于大型且快速增长的数据集，因为它独立于对象排序，在出现新数据时不需要重新聚类，并且不需要用户指定的输入参数。

著录项

来源
《Journal of biomedical informatics.》 |2009年第2期|共12页
作者
Andreopoulos B; An A; Wang X; Labudde D;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类医用一般科学;
关键词
physical density; Efficient; Exertional dyspnea; Algorithms; *算法;

机译：物理密度;有效;劳力性呼吸困难;算法;*算法;

相似文献

外文文献
中文文献
专利

1. Efficient layered density-based clustering of categorical data. [J] . Andreopoulos B, An A, Wang X, Journal of biomedical informatics. . 2009,第2期

机译：基于分层密度的高效分类数据聚类。
2. Tests for 2 x K contingency tables with clustered ordered categorical data. [J] . Jung SH, Kang SH Statistics in medicine . 2001,第5期

机译：使用聚集的有序分类数据测试2 x K列联表。
3. An efficient automated incremental density-based algorithm for clustering and classification [J] . Elham Azhir, Nima Jafari Navimipour, Mehdi Hosseinzadeh, Future generation computer systems . 2021,第Jana期

机译：基于群集和分类的高效自动增量密度算法
4. Hierarchical Density-Based Clustering of Categorical Data and a Simplification [C] . Bill Andreopoulos, Aijun An, Xiaogang Wang Advances in Knowledge Discovery and Data Mining; Lecture Notes in Artificial Intelligence; 4426 . 2007

机译：基于层次密度的分类数据聚类和简化
5. A cohesion-based clustering technique for categorical data. [D] . Nemalhabib, Aida. 2006

机译：基于凝聚力的分类数据聚类技术。
6. Enhanced Three Layer Hybrid Clustering Mechanism for Energy Efficient Routing in IoT [O] . Muhammad Faizan Ullah, Junaid Imtiaz, Khawaja Qasim Maqbool 2019

机译：物联网中节能路由的增强型三层混合集群机制
7. Efficient layered density-based clustering of categorical data [O] . Andreopoulos Bill, An Aijun, Wang Xiaogang, 2009

机译：基于分层的基于密度的有效分类数据聚类
8. Application of Cluster Analysis to Aerometric Data. Volume I. Part 1: Clustering, Validation, and Classification of Data. Part 2: Investigation and Report of Cluster Analysis [R] . Crutcher, H. L. , Nelson, C. , Fairbairn, B. , 1980

机译：聚类分析在航空数据中的应用。第一部分：数据的聚类，验证和分类。第2部分：聚类分析的调查和报告

Efficient layered density-based clustering of categorical data.

摘要

著录项

相似文献

相关主题

期刊订阅