MrPhi: An Optimized MapReduce Framework on Intel Xeon Phi Coprocessors

Lu Mian; Liang Yun; Huynh Huynh Phung; Ong Zhongliang; He Bingsheng; Goh Rick Siow Mong

首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >MrPhi: An Optimized MapReduce Framework on Intel Xeon Phi Coprocessors

【24h】

MrPhi: An Optimized MapReduce Framework on Intel Xeon Phi Coprocessors

机译：MrPhi：英特尔至强融核协处理器上的优化MapReduce框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, we develop , an optimized MapReduce framework on a heterogeneous computing platform, particularly equipped with multiple Intel Xeon Phi coprocessors. To the best of our knowledge, this is the first work to optimize the MapReduce framework on the Xeon Phi. We first focus on employing advanced features of the Xeon Phi to achieve high performance on a single coprocessor. We propose a vectorization friendly technique and SIMD hash computation algorithms to utilize the SIMD vectors. Then we pipeline the map and reduce phases to improve the resource utilization. Furthermore, we eliminate multiple local arrays but use low cost atomic operations on the global array to improve the thread scalability. For a given application, our framework is able to automatically detect suitable techniques to apply. Moreover, we extend our framework to a heterogeneous platform to utilize all hardware resource effectively. We adopt non-blocking data transfer to hide the communication overhead. We also adopt aligned memory transfer in order to fully utilize the PCIe bandwidth between the host and coprocessor. We conduct comprehensive experiments to benchmark the Xeon Phi and compare our optimized MapReduce framework with a state-of-the-art multi-core based MapReduce framework (Phoenix++). By evaluating six real-world applications, the experimental results show that our optimized framework is 1.2 to 38 faster than Phoenix++ for various applications on a single Xeon Phi. Additionally, the performance of four applications is able to achieve linear scalability on a platform equipped with up to four Xeon Phi coprocessors.

机译：在这项工作中，我们在异构计算平台上开发了一个优化的MapReduce框架，该框架特别配备了多个Intel Xeon Phi协处理器。据我们所知，这是在Xeon Phi上优化MapReduce框架的第一项工作。我们首先专注于利用至强融核的高级功能在单个协处理器上实现高性能。我们提出了一种矢量化友好技术和SIMD哈希计算算法来利用SIMD向量。然后，我们对地图进行管线处理并减少阶段以提高资源利用率。此外，我们消除了多个局部数组，但对全局数组使用低成本的原子操作来提高线程可伸缩性。对于给定的应用程序，我们的框架能够自动检测适用的技术。此外，我们将框架扩展到异构平台，以有效利用所有硬件资源。我们采用无阻塞数据传输来隐藏通信开销。我们还采用对齐的内存传输，以充分利用主机和协处理器之间的PCIe带宽。我们进行了全面的实验，以对至强融核进行基准测试，并将经过优化的MapReduce框架与基于最新的多核MapReduce框架（Phoenix ++）进行比较。通过评估六个实际应用程序，实验结果表明，针对单个Xeon Phi上的各种应用程序，我们的优化框架比Phoenix ++快1.2到38。此外，在配备多达四个Xeon Phi协处理器的平台上，四个应用程序的性能能够实现线性可扩展性。

著录项

来源
《Parallel and Distributed Systems, IEEE Transactions on》 |2015年第11期|3066-3078|共13页
作者
Lu Mian; Liang Yun; Huynh Huynh Phung; Ong Zhongliang; He Bingsheng; Goh Rick Siow Mong;
展开▼
作者单位

Institute of High Performance Computing, A*STAR, Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Intel Many Integrated Core architecture (MIC); MapReduce; Xeon Phi; coprocessors; heterogeneous computing; high performance computing; parallel programming;

机译：英特尔众筹集成核心架构（MIC）;MapReduce;至强融核;协处理器;异构计算;高性能计算;并行编程;

相似文献

外文文献
中文文献
专利

1. Optimizing Purdue-Lin Microphysics Scheme for Intel Xeon Phi Coprocessor [J] . Mielikainen Jarno, Huang Bormin, Huang Hung-Lung Allen Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of . 2016,第1期

机译：优化英特尔至强融核协处理器的Purdue-Lin微物理方案
2. Parallel BRDF-based infrared radiation simulation of aerial targets implemented on Intel Xeon processor and Xeon Phi coprocessor [J] . Guo Xing, Wu Zhensen, Wu Jiaji, Journal of Real-Time Image Processing . 2019,第1期

机译：在英特尔至强处理器和至强融核协处理器上实现的基于BRDF的空中目标的并行红外辐射仿真
3. Explicit Fourth-Order Runge-Kutta Method on Intel Xeon Phi Coprocessor [J] . Beata Bylina, Joanna Potiopa International journal of parallel programming . 2017,第5期

机译：英特尔至强融核协处理器上的显式四阶Runge-Kutta方法
4. Optimizing the MapReduce framework on Intel Xeon Phi coprocessor [C] . Lu Mian, Zhang Lei, Huynh Huynh Phung, 2013 IEEE International Conference on Big Data . 2013

机译：在英特尔至强融核协处理器上优化MapReduce框架
5. Advancing LAMMPS Performance on Intel Xeon Phi Processors Coprocessors [D] . Vorsu, Sandeep Kumar. 2017

机译：在英特尔Xeon Phi处理器协处理器上推进LAMMPS性能
6. Efficient irregular wavefront propagation algorithms on Intel® Xeon Phi™ [O] . Jeremias M. Gomes, George Teodoro, Alba de Melo, -1

机译：英特尔®至强融核™上的高效不规则波前传播算法
7. Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor [O] . 2016

机译：在英特尔至强融核协处理器上优化mapReduce框架

MrPhi: An Optimized MapReduce Framework on Intel Xeon Phi Coprocessors

摘要

著录项

相似文献

相关主题

期刊订阅