Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators Based on a Domain-Specific Language for Medical Imaging

机译：基于领域特定语言的医学成像，用于GPU加速器的飞行中内存事务自动优化

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

An efficient memory bandwidth utilization for GPU accelerators is crucial for memory bound applications. In medical imaging, the performance of many kernels is limited by the available memory bandwidth since only a few operations are performed per pixel. For such kernels only a fraction of the compute power provided by GPU accelerators can be exploited and performance is predetermined by memory bandwidth. As a remedy, this paper investigates the optimal utilization of available memory bandwidth by means of increasing in-flight memory transactions. Instead of doing this manually for different GPU accelerators, the required CUDA and OpenCL code is automatically generated from descriptions in a Domain-Specific Language (DSL) for the considered application domain. Moreover, the DSL is extended to also support global reduction operators. We show that the generated target-specific code improves bandwidth utilization for memory-bound kernels significantly. Moreover, competitive performance compared to the GPU back end of the widely used image processing library OpenCV can be achieved.

机译：GPU加速器的有效内存带宽利用率对于内存绑定应用程序至关重要。在医学成像中，许多内核的性能受到可用内存带宽的限制，因为每个像素仅执行少量操作。对于此类内核，只能利用GPU加速器提供的一部分计算能力，而性能由内存带宽预先确定。作为补救措施，本文通过增加飞行中的内存事务来研究可用内存带宽的最佳利用。代替手动为不同的GPU加速器执行此操作，所需的CUDA和OpenCL代码是根据所考虑的应用程序域的特定于域的语言（DSL）的描述自动生成的。此外，DSL已扩展为也支持全局还原运营商。我们表明，生成的特定于目标的代码可显着提高内存绑定内核的带宽利用率。而且，与广泛使用的图像处理库OpenCV的GPU后端相比，可以实现竞争性能。

著录项

来源
《2012 11th International Symposium on Parallel and Distributed Computing.》|2012年|p.211- 218|共8页
会议地点 Munich/Garching(DE);Munich/Garching(DE)
作者
Membarth Richard; Hannig Frank; Teich Jurgen; Korner Mario; Eckert Wieland;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序设计、软件工程;程序设计、软件工程;
关键词
入库时间 2022-08-26 14:06:11

相似文献

外文文献
中文文献
专利

1. Automatic generation of Truffle-based interpreters for Domain-Specific Languages [J] . Manuel Leduc, Gwendal Jouneaux, Thomas Degueule, The Journal of object technology . 2020,第2期

机译：自动生成域特定语言的基于Truffle的解释器
2. Automatic Semantic Indexing Of Medical Images Using A Web Ontologylanguage For Case-based Image Retrieval [J] . Gowri Allampalli-Nagaraj, Isabelle Bichindaritz Engineering Applications of Artificial Intelligence . 2009,第1期

机译：使用Web本体语言进行基于案例的图像检索的医学图像自动语义索引
3. Towards High-Performance Code Generation for Multi-GPU Clusters Based on a Domain-Specific Language for Algorithmic Skeletons [J] . Fabian Wrede, Herbert Kuchen International journal of parallel programming . 2020,第4期

机译：基于算法骨架的域特定语言，对多GPU集群的高性能代码生成
4. Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators Based on a Domain-Specific Language for Medical Imaging [C] . Membarth Richard, Hannig Frank, Teich Jurgen, International Symposium on Parallel and Distributed Computing . 2012

机译：基于用于医学成像的域特定语言，自动优化GPU加速器的飞行内存交易
5. Language support and compiler optimizations for object-based software transactional memory. [D] . Eddon, Guy. 2008

机译：基于对象的软件事务存储器的语言支持和编译器优化。
6. DOPA: GPU-based protein alignment using database and memory access optimizations [O] . Laiq Hasan, Marijn Kentie, Zaid Al-Ars 2011

机译：DOPA：使用数据库和内存访问优化的基于GPU的蛋白质比对
7. Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators based on a Domain-Specific Language for Medical Imaging [O] . Richard Membarth, Frank Hannig, Jürgen Teich, 2014

机译：基于医学成像领域特定语言的GPU加速器的机内内存事务自动优化

Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators Based on a Domain-Specific Language for Medical Imaging

摘要

著录项

相似文献

相关主题

期刊订阅