Memory-Efficient and Skew-Tolerant MapReduce Over MPI for Supercomputing Systems

Gao Tao; Guo Yanfei; Zhang Boyu; Cicotti Pietro; Lu Yutong; Balaji Pavan; Taufer Michela


Abstract

Data analytics has become an integral part of large-scale scientific computing. Among various data analytics frameworks, MapReduce has gained the most traction. Although some efforts have been made to enable efficient MapReduce for supercomputing systems, they are often limited to fairly homogeneous workloads, where partitioning the input data equally across tasks results in essentially equal amounts of output or temporary data on each task. For more skewed workloads, however, current implementations can result in imbalanced memory usage and, consequently, can cause a slowdown in execution time and a loss in data scalability. To tackle this problem, we enhance a previously published memory-conscious MapReduce over MPI framework called Mimir. Our enhancements to Mimir include combiner and dynamic repartition optimizations that minimize memory usage, achieve close-to-optimal balance of memory usage across processes, and reduce execution time by up to 12 times. Experimental results show that Mimir can scale to at least 3072 processes on the Tianhe-2 supercomputer on skewed datasets.
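The two optimizations named in the abstract can be illustrated with a small single-process sketch. This is not Mimir's API or its actual algorithm: the function names (`hash_partition`, `shuffle_with_combiner`, `greedy_repartition`) are hypothetical, the "processes" are plain Python lists rather than MPI ranks, and the repartition step uses a simple greedy heaviest-key-first heuristic as a stand-in for whatever policy the paper implements.

```python
import zlib
from collections import Counter


def hash_partition(key, nprocs):
    """Map a key to a process rank with a deterministic hash.

    zlib.crc32 is used because Python's built-in hash() is salted per run
    for strings.
    """
    return zlib.crc32(key.encode("utf-8")) % nprocs


def map_words(lines):
    """Word-count map function: emit (word, 1) for every token."""
    for line in lines:
        for word in line.split():
            yield word, 1


def shuffle_without_combiner(lines, nprocs):
    """Buffer every (key, value) pair on its destination process."""
    buffers = [[] for _ in range(nprocs)]
    for key, value in map_words(lines):
        buffers[hash_partition(key, nprocs)].append((key, value))
    return buffers


def shuffle_with_combiner(lines, nprocs):
    """Merge values of identical keys as they are buffered.

    Each buffer then holds one entry per distinct key instead of one
    entry per emitted pair, which is the memory saving a combiner targets.
    """
    buffers = [Counter() for _ in range(nprocs)]
    for key, value in map_words(lines):
        buffers[hash_partition(key, nprocs)][key] += value
    return buffers


def greedy_repartition(key_loads, nprocs):
    """Assign keys to processes, heaviest key first, least-loaded process first.

    Assumed heuristic for illustration only. Returns the key-to-rank
    assignment and the resulting per-process load.
    """
    loads = [0] * nprocs
    assignment = {}
    ordered = sorted(key_loads.items(), key=lambda kv: (-kv[1], kv[0]))
    for key, load in ordered:
        target = loads.index(min(loads))  # first least-loaded process
        assignment[key] = target
        loads[target] += load
    return assignment, loads
```

For the skewed input `["a a a a a a b c", "a a d e"]`, the version without a combiner buffers 12 pairs in total, while the combiner version buffers 5 entries, one per distinct word. Note that key-level repartitioning of this kind cannot split a single very hot key across processes; it only rebalances which keys each process owns.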

Bibliographic details

Source
IEEE Transactions on Parallel and Distributed Systems, 2020, Issue 12, pp. 2734-2748 (15 pages)
Authors
Gao Tao; Guo Yanfei; Zhang Boyu; Cicotti Pietro; Lu Yutong; Balaji Pavan; Taufer Michela;
Author affiliations

Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN 37996 USA|Natl Univ Def Technol Changsha 410073 Peoples R China;

Argonne Natl Lab Math & Comp Sci Div Lemont IL 60439 USA;

Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN 37996 USA;

NVIDIA San Diego CA 95051 USA;

Natl Supercomp Ctr Guangzhou Guangzhou Peoples R China|Sun Yat Sen Univ Guangzhou 510275 Peoples R China;

Argonne Natl Lab Math & Comp Sci Div Lemont IL 60439 USA;

Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN 37996 USA;

Indexing information
Original format: PDF
Language of text: English
Keywords
Supercomputers; Data models; Optimization; Programming; Operating systems; Aggregates; Data analysis; Skew mitigation; load balancing; high-performance computing; data analytics; MapReduce; memory efficiency; performance and scalability;

Similar literature

1. Skew-Tolerant Key Distribution for Load Balancing in MapReduce [J]. Jihoon SON, Hyunsik CHOI, Yon Dohn CHUNG. IEICE Transactions on Information and Systems. 2012, Issue 2
2. MRO-MPI: MapReduce overlapping using MPI and an optimized data exchange policy [J]. Hisham Mohamed, Stephane Marchand-Maillet. Parallel Computing. 2013, Issue 12
3. Mimir: Memory-Efficient and Scalable MapReduce for Large Supercomputing Systems [C]. Tao Gao, Yanfei Guo, Boyu Zhang, IEEE International Parallel and Distributed Processing Symposium. 2017
4. Ultra-Fast and Memory-Efficient Lookups for Cloud, Networked Systems, and Massive Data Management [D]. Yu, Ye. 2018
5. Phase analysis single-photon emission computed tomography (SPECT) myocardial perfusion imaging (MPI) detects dyssynchrony in myocardial scar and increases specificity of MPI [O]. John P. Bois, Chris Scott, Panithaya Chareonthaitawee, 2019
6. Skew-Tolerant Key Distribution for Load Balancing in MapReduce [O]. Jihoon SON, Hyunsik CHOI, Yon Dohn CHUNG. 2012
7. Interactive Query Processing in Big Data Systems: A Cross Industry Study of MapReduce Workloads [R]. R. H. Katz, S. Alspaugh, Y. Chen. 2012
