Geometric Mapping of Tasks to Processors on Parallel Computers with Mesh or Torus Networks

Deveci Mehmet; Devine Karen D.; Pedretti Kevin; Taylor Mark A.; Rajamanickam Sivasankaran; Catalyurek Umit V.

Abstract

We present a new method for reducing parallel applications' communication time by mapping their MPI tasks to processors in a way that lowers the distance messages travel and the amount of congestion in the network. Assuming geometric proximity among the tasks is a good approximation of their communication interdependence, we use a geometric partitioning algorithm to order both the tasks and the processors, assigning task parts to the corresponding processor parts. In this way, interdependent tasks are assigned to "nearby" cores in the network. We also present a number of algorithmic optimizations that exploit specific features of the network or application to further improve the quality of the mapping. We specifically address the case of sparse node allocation, where the nodes assigned to a job are not necessarily located in a contiguous block nor within close proximity to each other in the network. However, our methods generalize to contiguous allocations as well, and results are shown for both contiguous and non-contiguous allocations. We show that, for the structured finite difference mini-application MiniGhost, our mapping methods reduced communication time up to 75 percent relative to MiniGhost's default mapping on 128K cores of a Cray XK7 with sparse allocation. For the atmospheric modeling code E3SM/HOMME, our methods reduced communication time up to 31% on 16K cores of an IBM BlueGene/Q with contiguous allocation.
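The core idea in the abstract (order tasks and processors with the same geometric partitioner, then pair them up part by part) can be illustrated with a minimal sketch. This is not the authors' implementation: it substitutes plain recursive coordinate bisection for whatever partitioner the paper uses, assumes exactly one task per processor, and the names `rcb_order` and `map_tasks` are hypothetical.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Point = Tuple[float, ...]


def rcb_order(points: Sequence[Point], ids: Optional[List[int]] = None) -> List[int]:
    """Order point indices by recursive coordinate bisection (illustrative only)."""
    if ids is None:
        ids = list(range(len(points)))
    if len(ids) <= 1:
        return list(ids)
    dims = len(points[ids[0]])

    def extent(d: int) -> float:
        vals = [points[i][d] for i in ids]
        return max(vals) - min(vals)

    # Cut along the dimension with the largest extent (ties go to the lowest dimension).
    cut_dim = max(range(dims), key=extent)
    # Sort along that dimension; the index breaks ties so the order is deterministic.
    ordered = sorted(ids, key=lambda i: (points[i][cut_dim], i))
    half = len(ordered) // 2
    # Recurse on each half and concatenate, so nearby points end up adjacent in the order.
    return rcb_order(points, ordered[:half]) + rcb_order(points, ordered[half:])


def map_tasks(task_coords: Sequence[Point], proc_coords: Sequence[Point]) -> Dict[int, int]:
    """Pair the k-th task in geometric order with the k-th processor in geometric order."""
    if len(task_coords) != len(proc_coords):
        raise ValueError("this sketch assumes one task per processor")
    task_order = rcb_order(task_coords)
    proc_order = rcb_order(proc_coords)
    return dict(zip(task_order, proc_order))


if __name__ == "__main__":
    # A 4x4 grid of tasks mapped onto 16 processors at stretched mesh coordinates.
    tasks = [(float(x), float(y)) for x in range(4) for y in range(4)]
    procs = [(2.0 * x, 3.0 * y) for x in range(4) for y in range(4)]
    print(map_tasks(tasks, procs))
```

Because both orderings come from the same procedure, tasks that are geometrically close tend to land on processors that are close in the network's coordinate space, including when the processor coordinates come from a non-contiguous allocation.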


Bibliographic details

Source: IEEE Transactions on Parallel and Distributed Systems, 2019, no. 9, pp. 2018-2032 (15 pages)
Authors: Deveci Mehmet; Devine Karen D.; Pedretti Kevin; Taylor Mark A.; Rajamanickam Sivasankaran; Catalyurek Umit V.
Author affiliations:

Google Mountain View CA 94043 USA;

Sandia Natl Labs Ctr Res Comp Albuquerque NM 87185 USA;

Georgia Inst Technol Sch Computat Sci & Engn Atlanta GA 30332 USA;

Original format: PDF
Language: English
Keywords: Task mapping; geometric partitioning; spatial partitioning; recursive bisection; jagged partitioning; load balancing;


Similar literature

1. Scalable rank-mapping algorithm for an icosahedral grid system on the massive parallel computer with a 3-D torus network [J]. Chihiro Kodama, Masaaki Terai, Akira T. Noda. Parallel Computing, 2014, no. 8.
2. EIGENANALYSIS-BASED TASK MAPPING ON PARALLEL COMPUTERS WITH CELLULAR NETWORKS [J]. PENG ZHANG, YUXIANG GAO, JANET FIERSON. Mathematics of Computation, 2014, no. 288.
3. Mapping computer-vision-related tasks onto reconfigurable parallel-processing systems [J]. Siegel H.J., Armstrong J.B. Computer, 1992, no. 2.
4. Exploiting Geometric Partitioning in Task Mapping for Parallel Computers [C]. Deveci Mehmet, Rajamanickam Sivasankaran, Leung Vitus J. IEEE International Parallel Distributed Processing Symposium, 2014.
5. Parallel image processing with image algebra on SIMD mesh-connected computers [D]. Shi, Hongchi. 1994.
6. One-to-One Embedding between Honeycomb Mesh and Petersen-Torus Networks [O]. Jung-Hyun Seo, Hyun Sim, Dae-Heon Park. 2011.
7. Exploiting Geometric Partitioning in Task Mapping for Parallel Computers [O]. Stephen L. Olivier, David P. Bunde, Karen Devine. 2015.
