...
首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >Automatic Determination of Clustering Centers for “Clustering by Fast Search and Find of Density Peaks”
【24h】

Automatic Determination of Clustering Centers for “Clustering by Fast Search and Find of Density Peaks”

机译:自动确定“通过快速搜索和查找密度峰值聚类的聚类中心”

获取原文
           

摘要

Dividing abstract object sets into multiple groups, called clustering, is essential for effective data mining. Clustering can find innate but unknown real-world knowledge that is inaccessible by any other means. Rodriguez and Laio have published a paper about a density-based fast clustering algorithm in Science called CFSFDP. CFSFDP is a highly efficient algorithm that clusters objects by using fast searching of density peaks. But with CFSFDP, the essential second step of finding clustering centers must be done manually. Furthermore, when the amount of data objects increases or a decision graph is complicated, determining clustering centers manually is difficult and time consuming, and clustering accuracy reduces sharply. To solve this problem, this paper proposes an improved clustering algorithm, ACDPC, that is based on data detection, which can automatically determinate clustering centers without manual intervention. First, the algorithm calculates the comprehensive metrics and sorts them based on the CFSFDP method. Second, the distance between the sorted objects is used to judge whether they are the correct clustering centers. Finally, the remaining objects are grouped into clusters. This algorithm can efficiently and automatically determine clustering centers without calculating additional variables. We verified ACDPC using three standard datasets and compared it with other clustering algorithms. The experimental results show that ACDPC is more efficient and robust than alternative methods.
机译:将抽象对象集分为多个组,称为群集,对于有效的数据挖掘至关重要。聚类可以找到任何其他手段无法访问的先天但是未知的真实知识。 Rodriguez和Laio发表了一篇关于基于密度的快速聚类算法的纸张,称为CFSFDP。 CFSFDP是一种高效的算法,可以使用快速搜索密度峰值来群集对象。但是通过CFSFDP,必须手动完成聚类中心的基本第二步。此外,当数据对象的量增加或决策图是复杂的时,手动确定聚类中心是困难且耗时的,并且聚类精度急剧减少。为了解决这个问题,本文提出了一种改进的聚类算法ACDPC,即基于数据检测,可以自动确定没有手动干预的聚类中心。首先,该算法根据CFSFDP方法计算综合度量并对其进行排序。其次,排序对象之间的距离用于判断它们是正确的聚类中心。最后,剩下的对象被分组成簇。此算法可以有效地自动确定聚类中心,而无需计算额外的变量。我们使用三个标准数据集验证了ACDPC,并将其与其他聚类算法进行比较。实验结果表明,ACDPC比替代方法更有效且坚固。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号