...
首页> 外文期刊>Journal of supercomputing >Ensemble-based clustering of large probabilistic graphs using neighborhood and distance metric learning
【24h】

Ensemble-based clustering of large probabilistic graphs using neighborhood and distance metric learning

机译:基于集群的大型概率图使用邻域和距离度量学习

获取原文
获取原文并翻译 | 示例
           

摘要

Graphs are commonly used to express the communication of various data. Faced with uncertain data, we have probabilistic graphs. As a fundamental problem of such graphs, clustering has many applications in analyzing uncertain data. In this paper, we propose a novel method based on ensemble clustering for large probabilistic graphs. To generate ensemble clusters, we develop a set of probable possible worlds of the initial probabilistic graph. Then, we present a probabilistic co-association matrix as a consensus function to integrate base clustering results. It relies on co-occurrences of node pairs based on the probability of the corresponding common cluster graphs. Also, we apply two improvements in the steps before and after of ensembles generation. In the before step, we append neighborhood information based on node features to the initial graph to achieve a more accurate estimation of the probability between the nodes. In the after step, we use supervised metric learning-based Mahalanobis distance to automatically learn a metric from ensemble clusters. It aims to gain crucial features of the base clustering results. We evaluate our work using five real-world datasets and three clustering evaluation metrics, namely the Dunn index, Davies-Bouldin index, and Silhouette coefficient. The results show the impressive performance of clustering large probabilistic graphs.
机译:图通常用于表达各种数据的通信。面对不确定的数据,我们有概率图。作为此类图表的基本问题,聚类在分析不确定数据时具有许多应用。本文提出了一种基于大型概率图的集群的新方法。要生成集群,我们开发了一组可能的初始概率图世界。然后,我们向概率合作矩阵作为共识函数,以集成基本聚类结果。它依赖于基于相应的公共集群图的概率的节点对的共发生。此外,我们在合奏生成之前和之后的步骤中应用了两种改进。在之前的步骤中,我们将基于节点特征附加到初始图的邻居信息,以实现节点之间的概率的更准确估计。在后续步骤中,我们使用受监督的基于度量学习的Mahalanobis距离来自动学习来自集群集群的度量标准。它旨在获得基础聚类结果的关键特征。我们使用五个现实世界数据集和三个聚类评估指标来评估我们的工作,即Dunn指数,Davies-Bouldin指数和轮廓系数。结果显示了聚类大概率图的令人印象深刻的性能。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号