Graphine: Programming Graph-Parallel Computation of Large Natural Graphs for Multicore Clusters

Jie Yan; Guangming Tan; Zeyao Mo; Ninghui Sun

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Graphine: Programming Graph-Parallel Computation of Large Natural Graphs for Multicore Clusters

【24h】

Graphine: Programming Graph-Parallel Computation of Large Natural Graphs for Multicore Clusters

机译：Graphine：用于多核群集的大型自然图的编程图并行计算

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Graph-parallel computation has become a crucial component in emerging applications of web search, data analytics and machine learning. In practice, most graphs derived from real-world phenomena are very large and scale-free. Unfortunately, distributed graph-parallel computation of these graphs still suffers strong scalability issues on contemporary multicore clusters. To embrace the multicore architecture in distributed graph-parallel computation, we propose the framework Graphine, which features (i) A Scatter-Combine computation abstraction that is evolved from the traditional vertex-centric approach by fusing the paired scatter and gather operations, executed separately on two edge sides, into a one-sided scatter. Further coupled with active message mechanism, it potentially reduces intermediate message cost and enables fine-grained parallelism on multicore architecture. (ii) An Agent-Graph data model, which leverages an idea similar to vertex-cut but conceptually splits the remote replica into two agent types of scatter and combiner, resulting in less communication. We implement the Graphine framework and evaluate it using several representative algorithms on six large real-world graphs and a series of synthetic graphs with power-law degree distributions. We show that Graphine achieves sublinear scalability with the number of cores per node, number of nodes, and graph sizes (up to one billion vertices), and is 215 times faster than the state-of-the-art PowerGraph on a cluster of 16 multicore nodes.

机译：图并行计算已成为新兴的Web搜索，数据分析和机器学习应用程序中的重要组成部分。在实践中，大多数来自现实世界现象的图非常大且无比例。不幸的是，这些图形的分布式图形并行计算在现代多核集群上仍然遭受着严重的可伸缩性问题。为了在分布式图并行计算中包含多核体系结构，我们提出了Graphine框架，该框架具有（i）一种Scatter-Combine计算抽象，它是通过将成对的散布和聚集操作融合在一起而从传统的以顶点为中心的方法演变而来的，分别执行在两个边缘侧，形成一个单侧分散。再加上主动消息机制，它有可能降低中间消息成本，并在多核体系结构上实现细粒度的并行性。（ii）一个Agent-Graph数据模型，该模型利用了类似于顶点切割的思想，但在概念上将远程副本分为散布和合并两种代理类型，从而减少了通信。我们实现了Graphine框架，并使用几种代表性算法对六个大型现实世界图和一系列具有幂律度分布的合成图进行了评估。我们展示了Graphine通过每个节点的核心数量，节点数量和图形大小（最多十亿个顶点）实现了亚线性可扩展性，并且比16个集群上最新的PowerGraph快215倍多核节点。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems》 |2016年第6期|1647-1659|共13页
作者
Jie Yan; Guangming Tan; Zeyao Mo; Ninghui Sun;
展开▼
作者单位

State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Computational Model; Graph-Parallel; Graph-parallel; Parallel Framework; computational model; parallel framework;

机译：计算模型;图并行;图并行;并行框架;计算模型;并行框架;

相似文献

外文文献
中文文献
专利

1. LazyGraph: Lazy Data Coherency for Replicas in Distributed Graph-Parallel Computation [J] . Lei Wang, Liangji Zhuang, Junhang Chen, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2018,第1期

机译：LazyGraph：分布式图形计算中的副本的惰性数据一致性
2. SYNC or ASYNC: Time to Fuse for Distributed Graph-Parallel Computation [J] . Xie Chenning, Chen Rong, Guan Haibing, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2015,第8期

机译：SYNC或ASYNC：融合图分布式计算的时间
3. Parallel programming: for multicore and cluster systems [J] . Charles Morgan Computing reviews . 2014,第4期

机译：并行编程：用于多核和集群系统
4. Object-oriented Environment for Parallel Programming of Multicore Clusters Based on Flowgraph Stream Parallel Programming Language [C] . Yan-lan GUO, Li-rong CUI International Symposium on Distributed Computing and Applications to Business, Engineering and Science;DCABES 2007 . 2007

机译：基于流程图流并行编程语言的面向对象的多核集群并行编程环境
5. Graph Partitioning, Ordering, and Clustering for Multicore Architectures. [D] . LaSalle, Dominique. 2016

机译：多核架构的图形分区，排序和聚类。
6. An Improved Distance Matrix Computation Algorithm for Multicore Clusters [O] . Mohammed W. Al-Neama, Naglaa M. Reda, Fayed F. M. Ghaleb -1

机译：一种改进的多核集群距离矩阵计算算法
7. New Scheduling Strategies and Hybrid Programming for a Parallel Right-looking Sparse LU Factorization Algorithm on Multicore Cluster Systems [O] . Ichitaro Yamazaki, Xiaoye S. Li 2014

机译：多核集群系统并行右视稀疏LU分解算法的新调度策略与混合编程

Graphine: Programming Graph-Parallel Computation of Large Natural Graphs for Multicore Clusters

摘要

著录项

相似文献

相关主题

期刊订阅