【24h】

A Coherent and Heterogeneous Approach to Clustering

机译:聚类的相干和异质方法

获取原文

摘要

Despite outstanding successes of the state-of-the-art clustering algorithms, many of them still suffer from shortcomings. Mainly, these algorithms do not capture coherency and homogeneity of clusters simultaneously. We show that some of the best performing spectral as well as hierarchical clustering algorithms can lead to incorrect clustering when the data is comprised of clusters with different densities or includes outliers. We introduce algorithms based on variants of geodesic distance that capture both coherency and homogeneity of clusters. Such choice of distance measure empowers simple clustering algorithms such as K-medoids to outperform spectral and hierarchical clustering algorithms. To show the theoretical merits of our approach, we present theoretical analysis of a simplified version of the algorithm. We also provide remarkable experimental evidence on the performance of our algorithms on a number of challenging clustering problems.
机译:尽管最先进的聚类算法取得了突出的成功,但其中许多仍然遭受缺点。主要是,这些算法不会同时捕获簇的一致性和均匀性。我们表明,当数据由具有不同密度或包括异常值的群集组成时,一些最佳的频谱以及分层聚类算法可以导致群集不正确。我们基于捕获簇的一致性和均匀性的测地距的变体来引入算法。这种距离测量选择使得简单的聚类算法,例如K-METOIDS以优于频谱和分层聚类算法。为了展示我们方法的理论优点,我们呈现了算法简化版本的理论分析。我们还提供了关于我们对多项具有挑战性的聚类问题的算法表现的显着实验证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号