【24h】

Parallel Graph Clustering Based on Minhash

机译:基于minhash的并行图形聚类

获取原文

摘要

Graph clustering is a technique for grouping vertices having similar characteristics into the same cluster. It is widely used to analyze graph data and identify its characteristics. Recently, a large-capacity large-scale graph data is being generated in a variety of applications such as a social network service, a world wide web, and a telephone network. Therefore, the importance of clustering technique for efficiently processing large capacity graph data is increasing. In this paper, we propose a clustering algorithm that efficiently generates clusters of large capacity graph data. Our proposed method efficiently estimates the similarity between clusters in the graph using Min-Hash and generates clusters according to the calculated similarity. In the experiment using real world data, we show the efficiency of the proposed method compared with the proposed method and existing graph clustering methods.
机译:图表聚类是一种用于将具有与相同群集中具有相似特性的顶点的技术。它广泛用于分析图形数据并识别其特征。最近,在各种应用程序中产生大容量的大规模图形数据,例如社交网络服务,万维网和电话网络。因此,集群技术用于有效处理大容量图数据的重要性正在增加。在本文中,我们提出了一种聚类算法,其有效地生成大容量图数据的集群。我们所提出的方法有效地估计了使用MIN-HASH在图中簇之间的相似性,并根据计算的相似性生成群集。在使用现实世界数据的实验中,我们展示了所提出的方法的效率与所提出的方法和现有的图形聚类方法相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号