Journal: 《软件学报》 (Journal of Software)

A MapReduce-Based Graph Structural Clustering Algorithm

         

Abstract

Graph clustering is a fundamental task in graph mining and is widely used in applications related to social network analysis. Graph structural clustering (SCAN) is a well-known density-based graph clustering algorithm that not only finds the clusters in a graph but also identifies hub nodes and outliers. However, the time complexity of the traditional SCAN algorithm is O(m^1.5), where m is the number of edges in the graph, so it struggles to handle massive graph data as graph sizes grow. To overcome this scalability issue, this paper proposes MRSCAN, a MapReduce-based graph structural clustering algorithm. Specifically, the paper develops a MapReduce-based similarity computation, a core node computation, and two cluster merging algorithms. Extensive experiments on several large real-world graph datasets demonstrate the accuracy, effectiveness, and scalability of the proposed algorithm.
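The abstract names a similarity computation and a core node computation but does not define them. The following is a minimal single-machine sketch, assuming the standard SCAN definitions: the structural similarity of adjacent nodes u and v is |Γ(u) ∩ Γ(v)| / sqrt(|Γ(u)| · |Γ(v)|), where Γ(v) is the neighborhood of v including v itself, and a node is a core if at least mu nodes in its neighborhood have similarity of at least eps. The function names and the edge-list input are illustrative assumptions; this is not the paper's MapReduce implementation.

```python
from math import sqrt


def closed_neighborhoods(edges):
    """Build Gamma(v) = neighbors of v plus v itself from an undirected edge list."""
    gamma = {}
    for u, v in edges:
        gamma.setdefault(u, {u}).add(v)
        gamma.setdefault(v, {v}).add(u)
    return gamma


def structural_similarity(gamma, u, v):
    """Standard SCAN similarity: shared closed neighbors, normalized by the geometric mean of sizes."""
    return len(gamma[u] & gamma[v]) / sqrt(len(gamma[u]) * len(gamma[v]))


def core_nodes(edges, eps, mu):
    """Return nodes whose eps-neighborhood (which includes the node itself) has at least mu members."""
    gamma = closed_neighborhoods(edges)
    cores = set()
    for u in gamma:
        eps_neighborhood = [
            v for v in gamma[u] if structural_similarity(gamma, u, v) >= eps
        ]
        if len(eps_neighborhood) >= mu:
            cores.add(u)
    return cores


# Hypothetical toy graph: a triangle (0, 1, 2) with a pendant node 3 attached to 2.
example_edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
example_cores = core_nodes(example_edges, eps=0.8, mu=3)
```

In a distributed setting, the per-edge similarity and the per-node threshold check are natural candidates for separate map and reduce steps, but how MRSCAN partitions this work is not stated in the abstract.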
