Introducing and Implementing the Allpairs Skeleton for Programming Multi-GPU Systems

Michel Steuwer; Malte Friese; Sebastian Albers; Sergei Gorlatch

首页> 外文期刊>International journal of parallel programming >Introducing and Implementing the Allpairs Skeleton for Programming Multi-GPU Systems

【24h】

Introducing and Implementing the Allpairs Skeleton for Programming Multi-GPU Systems

机译：介绍和实现用于对多GPU系统进行编程的Allpairs骨架

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Algorithmic skeletons simplify software development: they abstract typical patterns of parallelism and provide their efficient implementations, allowing the application developer to focus on the structure of algorithms, rather than on implementation details. This becomes especially important for modern parallel systems with multiple graphics processing units (GPUs) whose programming is complex and error-prone, because state-of-the-art programming approaches like CUDA and OpenCL lack high-level abstractions. We define a new algorithmic skeleton for allpairs computations which occur in real-world applications, ranging from bioinformatics to physics. We develop the skeleton's generic parallel implementation for multi-GPU Systems in OpenCL. To enable the automatic use of the fast GPU memory, we identify and implement an optimized version of the allpairs skeleton with a customizing function that follows a certain memory access pattern. We use matrix multiplication as an application study for the allpairs skeleton and its two implementations and demonstrate that the skeleton greatly simplifies programming, saving up to 90% of lines of code as compared to OpenCL. The performance of our optimized implementation is up to 6.8 times higher as compared with the generic implementation and is competitive to the performance of a manually written optimized OpenCL code.

机译：算法框架简化了软件开发：它们抽象出典型的并行模式并提供有效的实现，从而使应用程序开发人员可以专注于算法的结构，而不是实现细节。这对于具有多个图形处理单元（GPU）的现代并行系统尤其重要，这些图形系统的编程非常复杂且容易出错，因为CUDA和OpenCL等最新的编程方法缺少高级抽象。我们为在现实世界中发生的从生物信息学到物理学的所有对计算定义了一个新的算法框架。我们为OpenCL中的多GPU系统开发框架的通用并行实现。为了能够自动使用快速GPU内存，我们使用遵循某些内存访问模式的自定义功能来识别并实现allpairs骨架的优化版本。我们使用矩阵乘法作为allpairs骨架及其两个实现的应用研究，并证明了该骨架极大地简化了编程，与OpenCL相比，最多可节省90％的代码行。与通用实现相比，我们的优化实现的性能高达6.8倍，与手动编写的优化OpenCL代码的性能相比具有竞争力。

著录项

来源
《International journal of parallel programming》 |2014年第4期|601-618|共18页
作者
Michel Steuwer; Malte Friese; Sebastian Albers; Sergei Gorlatch;
展开▼
作者单位

Department of Mathematics and Computer Science. University of Muenster, Muenster, Germany;

Department of Mathematics and Computer Science. University of Muenster, Muenster, Germany;

Department of Mathematics and Computer Science. University of Muenster, Muenster, Germany;

Department of Mathematics and Computer Science. University of Muenster, Muenster, Germany;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
High-level programming models; Algorithmic skeletons; GPU computing; Allpairs computation; SkelCL;

机译：高级编程模型;算法框架;GPU计算;Allpairs计算;斯凯克;

相似文献

外文文献
中文文献
专利

1. Algorithmic skeletons for multi-core, multi-GPU systems and clusters [J] . Steffen Ernsting, Herbert Kuchen International Journal of High Performance Computing and Networking . 2012,第2期

机译：多核，多GPU系统和集群的算法框架
2. HPSM: a programming framework to exploit multi-CPU and multi-GPU systems simultaneously [J] . João Vicente Ferreira Lima, Daniel Di Domenico International Journal of Grid and Utility Computing . 2019,第3期

机译：HPSM：一种编程框架，可同时利用多CPU和多GPU系统
3. High-Level Programming of Stencil Computations on Multi-GPU Systems Using the SkelCL Library [J] . Michel Steuwer, Michael Haidl, Stefan Breuer, Parallel Processing Letters . 2014,第3期

机译：使用SkelCL库在多GPU系统上进行模板计算的高级编程
4. SkePU: A Multi-Backend Skeleton Programming Library for Multi-GPU Systems [C] . Johan Enmyren, Christoph W. Kessler 4th workshop on high-level parallel programming and applications 2010 . 2010

机译：SkePU：用于多GPU系统的多后端骨架编程库
5. An Investigation of Teachers’ Perceptions of Standards-Based Grading and Traditional Grading Systems and the Impact on Gifted Programming Standard Implementation [D] . Sult, Mary Ann. 2021

机译：教师教师对基于标准的评分和传统评分系统的看法以及对天赋规划标准实施的影响
6. To the bone: Comment on I wanted a skeleton … they brought a prince: A qualitative investigation of factors mediating the implementation of a Performance Based Incentive program in Malawi [O] . Dennis Pérez, Patrick Van der Stuyft, Valéry Ridde, 2019

机译：骨子里：评论我想要一个骨骼……他们带来了一位王子：对调解马拉维基于绩效的奖励计划实施的因素的定性研究
7. Introducing and implementing the allpairs skeleton for programming multi-GPU systems [O] . Steuwer Michel, Friese Malte, Albers Sebastian, 2014

机译：介绍和实现用于编程多GpU系统的allpairs骨架

Introducing and Implementing the Allpairs Skeleton for Programming Multi-GPU Systems

摘要

著录项

相似文献

相关主题

期刊订阅