Fast and Effective Active Clustering Ensemble Based on Density Peak

Shi Yifan; Yu Zhiwen; Cao Wenming; Chen C. L. Philip; Wong Hau-San; Han Guoqiang

首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Fast and Effective Active Clustering Ensemble Based on Density Peak

【24h】

Fast and Effective Active Clustering Ensemble Based on Density Peak

机译：基于密度峰值的快速有效的活动聚类集群

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Semisupervised clustering methods improve performance by randomly selecting pairwise constraints, which may lead to redundancy and instability. In this context, active clustering is proposed to maximize the efficacy of annotations by effectively using pairwise constraints. However, existing methods lack an overall consideration of the querying criteria and repeatedly run semisupervised clustering to update labels. In this work, we first propose an active density peak (ADP) clustering algorithm that considers both representativeness and informativeness. Representative instances are selected to capture data patterns, while informative instances are queried to reduce the uncertainty of clustering results. Meanwhile, we design a fast-update-strategy to update labels efficiently. In addition, we propose an active clustering ensemble framework that combines local and global uncertainties to query the most ambiguous instances for better separation between the clusters. A weighted voting consensus method is introduced for better integration of clustering results. We conducted experiments by comparing our methods with state-of-the-art methods on real-world data sets. Experimental results demonstrate the effectiveness of our methods.

机译：半质化聚类方法通过随机选择成对约束来提高性能，这可能导致冗余和不稳定性。在这种情况下，提出了主动聚类，以通过有效地使用成对约束来最大化注释的功效。但是，现有方法缺乏对查询标准的总体考虑，并反复运行半质量群集以更新标签。在这项工作中，我们首先提出了一种积极的密度峰值（ADP）聚类算法，其考虑了代表性和信息。选择代表实例以捕获数据模式，而查询信息实例以降低聚类结果的不确定性。同时，我们设计快速更新策略以有效更新标签。此外，我们提出了一个有效的聚类集群集群框架，将本地和全局不确定性结合起来查询最模糊的实例，以便在集群之间更好地分离。引入了加权投票共识方法，以便更好地集成聚类结果。我们通过将我们的方法与现实世界数据集的最新方法进行比较来进行实验。实验结果表明了我们方法的有效性。

著录项

来源
《Neural Networks and Learning Systems, IEEE Transactions on》 |2021年第8期|3593-3607|共15页
作者
Shi Yifan; Yu Zhiwen; Cao Wenming; Chen C. L. Philip; Wong Hau-San; Han Guoqiang;
展开▼
作者单位

South China Univ Technol Sch Comp Sci & Engn Guangzhou 510006 Peoples R China;

South China Univ Technol Sch Comp Sci & Engn Guangzhou 510006 Peoples R China|Guangdong Univ Technol Sch Comp Guangzhou 510006 Peoples R China;

Univ Hong Kong Dept Stat & Actuarial Sci Hong Kong Peoples R China;

South China Univ Technol Sch Comp Sci & Engn Guangzhou 510006 Peoples R China;

City Univ Hong Kong Dept Comp Sci Hong Kong Peoples R China;

South China Univ Technol Sch Comp Sci & Engn Guangzhou 510006 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Uncertainty; Clustering algorithms; Learning systems; Measurement uncertainty; Toy manufacturing industry; Redundancy; Clustering methods; Active clustering; clustering ensemble; density peak (DP) clustering;

机译：不确定性;聚类算法;学习系统;测量不确定性;玩具制造业;冗余;聚类方法;活动聚类;聚类集群;密度峰值（DP）聚类;

相似文献

外文文献
中文文献
专利

1. Fast density peak-based clustering algorithm for multiple extended target tracking [J] . SHEN Xinglin, SONG Zhiyong, FAN Hongqi, 系统工程与电子技术（英文版） . 2019,第003期
2. Clustering by Fast Search and Find of Density Peaks with Data Field [J] . WANG Shuliang, WANG Dakui, LI Caoyuan, 电子学报（英文版） . 2016,第003期
3. Clustering by Fast Search and Find of Density Peaks with Data Field [J] . WANG Shuliang12, WANG Dakui1, LI Caoyuan2, 电子学报：英文版 . 2016,第003期
4. A Multi-Granularity Density Peak Clustering Algorithm Based on Variational Mode Decomposition [J] . GU Ziwen, LI Peng, LANG Xun, 电子学报：英文版 . 2021,第004期
5. A Multi-Granularity Density Peak Clustering Algorithm Based on Variational Mode Decomposition [J] . GU Ziwen, LI Peng, LANG Xun, 电子学报（英文版） . 2021,第004期
6. Unsupervised novelty detection-based structural damage localization using a density peaks-based fast clustering algorithm [J] . Cha Young-Jin, Wang Zilong Structural health monitoring . 2018,第2期

机译：基于密度峰值的快速聚类算法基于无监督新颖性检测的结构损伤定位
7. Extended Fast Search Clustering Algorithm : Widely Density Clusters, No Density Peaks [J] . Zhang WenKai, Li Jing Computer Science & Information Technology . 2015,第7期

机译：扩展的快速搜索聚类算法：宽密度簇，无密度峰值
8. Effective pattern recognition and find-density-peaks clustering based blind identification for underdetermined speech mixing systems [J] . Xiangdong Huang, Lin Yang, Runan Song, Multimedia Tools and Applications . 2018,第17期

机译：不确定语音混合系统的基于有效模式识别和查找密度峰聚类的盲识别
9. The Improvement on Self-Adaption Select Cluster Centers Based on Fast Search and Find of Density Peaks Clustering [C] . Hui Du, Yiyang Ni International Conference on Computational Intelligence and Security . 2020

机译：基于快速搜索和查找密度峰值聚类的自适应选择群集中心的改进
10. Relationship-based clustering and cluster ensembles for high-dimensional data mining. [D] . Strehl, Alexander. 2002

机译：用于高维数据挖掘的基于关系的聚类和聚类集成。
11. flowPeaks: a fast unsupervised clustering for flow cytometry data via K-means and density peak finding [O] . Yongchao Ge, Stuart C. Sealfon -1

机译：flowPeaks：通过K均值和密度峰发现对流式细胞术数据进行快速无监督的聚类
12. Extended fast search clustering algorithm: widely density clusters, no density peaks [O] . Zhang, Wenkai, Li, Jing 2015

机译：扩展的快速搜索聚类算法：广泛的密度聚类，没有密度峰值
13. Predictability and Ensemble Forecast Skill Enhancement Based on the Probability Density Function Estimation [R] . Ide, K. 2005

机译：基于概率密度函数估计的可预测性和集合预测技巧增强

Fast and Effective Active Clustering Ensemble Based on Density Peak

摘要

著录项

相似文献

相关主题

期刊订阅