Placing big graph into cloud for parallel processing with a two-phase community-aware approach

Hu Kekun; Zeng Guosun

首页> 外文期刊>Future generation computer systems >Placing big graph into cloud for parallel processing with a two-phase community-aware approach

【24h】

Placing big graph into cloud for parallel processing with a two-phase community-aware approach

机译：使用两阶段社区感知方法将大图放入云中进行并行处理

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Big graphs are so large that their analysis often rely on the cloud for parallel processing. Data placement, as a key pre-processing step, has a profound impact on the performance of parallel processing. Traditional placement methods fail to preserve graph topologies, leading to poor performance. As the community is the most common structure of big graphs, in this work, we present a two-phase community-aware placement algorithm to place big graphs into the cloud for parallel processing. It can obtain a placement scheme that preserves the community structure well by maximizing the modularity density of the scheme under memory capacity constraints of computational nodes of the cloud in two phases. In the first phase, we design a streaming partitioning heuristic to detect communities based on partial and incomplete graph information. They form an initial placement scheme with relatively high modularity density. To improve it further, in the second phase, we put forward a scale-constrained kernel k-means algorithm. It takes as input the initial placement scheme and iteratively redistributes graph vertices across computational nodes under scale constraints until the modularity density cannot be improved any further. Finally, experiments show that our algorithm can preserve graph topologies well and greatly support parallel processing of big graphs in the cloud. (C) 2019 Elsevier B.V. All rights reserved.

机译：大图是如此之大，以至于其分析经常依赖于云进行并行处理。数据放置作为关键的预处理步骤，对并行处理的性能产生深远的影响。传统的放置方法无法保留图形拓扑，从而导致性能不佳。由于社区是大图的最常见结构，因此在本文中，我们提出了一种两阶段的社区感知放置算法，将大图放置到云中以进行并行处理。它可以通过在两个阶段的云计算节点的存储容量约束下最大化方案的模块密度来获得一种能够很好地保留社区结构的布局方案。在第一阶段，我们设计了一种流分区启发法，以基于部分和不完整的图信息检测社区。它们形成具有较高模块密度的初始放置方案。为了进一步改进它，在第二阶段，我们提出了一个尺度受限的核k均值算法。它以初始放置方案为输入，并在比例约束下跨计算节点迭代地重新分布图顶点，直到无法进一步提高模块密度为止。最后，实验表明我们的算法可以很好地保留图拓扑，并极大地支持云中大图的并行处理。（C）2019 Elsevier B.V.保留所有权利。

著录项

来源
《Future generation computer systems》 |2019年第12期|1187-1200|共14页
作者
Hu Kekun; Zeng Guosun;
展开▼
作者单位

Tongji Univ Dept Comp Sci & Technol Shanghai 201804 Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Cloud computing; Big graph processing; Data placement; Community detection; Scale constraints; Modularity density;

机译：云计算;大图处理;数据放置;社区检测;规模约束;模块化密度;

相似文献

外文文献
中文文献
专利

1. Simulations for parallel processing of ultrasound reflection-mode tomography with applications to two-phase flow measurement [J] . Wiegand F., Hoyle B.S. IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control . 1989,第6期

机译：超声反射模式层析成像并行处理的仿真及其在两相流量测量中的应用
2. Parallel map projection of vector-based big spatial data: Coupling cloud computing with graphics processing units [J] . Tang Wenwu, Feng Wenpeng Computers，environment and urban systems . 2017,第ptaB期

机译：基于矢量的大空间数据的并行地图投影：将云计算与图形处理单元耦合
3. An image processing approach for two-phase interfaces visualized by a real time neutron radiography technique [J] . Keiichi Uchimura, Glenn D. Harvel, Takaaki Matsumoto, Flow Measurement and Instrumentation . 1998,第4期

机译：实时中子射线照相技术可视化的两相界面图像处理方法
4. Improving Attack Graph Scalability for the Cloud Through SDN-Based Decomposition and Parallel Processing [C] . Oussama Mjihil, Dijiang Huang, Abdelkrim Haqiq International symposium on ubiquitous networking . 2017

机译：通过基于SDN的分解和并行处理提高云的攻击图可扩展性
5. Parallel Algorithms and Dynamic Data Structures on the Graphics Processing Unit: a Warp-Centric Approach [D] . Ashkiani, Saman. 2017

机译：图形处理单元上的并行算法和动态数据结构：以翘曲为中心的方法
6. Simulating spiking neural networks on massively parallel graphical processing units using a code generation approach with GeNN [O] . Esin Yavuz, James Turner, Thomas Nowotny 2014

机译：使用带有GeNN的代码生成方法在大规模并行图形处理单元上模拟尖刺神经网络
7. A parallel meshless dynamic cloud method on graphic processing units for unsteady compressible flows past moving boundaries [O] . Z.H. Ma, H. Wang, S.H. Pu 2015

机译：用于非稳定压缩流过移动边界的图形处理单元上的平行网状动态云方法

Placing big graph into cloud for parallel processing with a two-phase community-aware approach

摘要

著录项

相似文献

相关主题

期刊订阅