首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Autogeneration and Autotuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters
【24h】

Autogeneration and Autotuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters

机译:在均质和异构GPU群集上自动生成和自动调整3D模具代码

获取原文
获取原文并翻译 | 示例
           

摘要

This paper develops and evaluates search and optimization techniques for autotuning 3D stencil (nearest neighbor) computations on GPUs. Observations indicate that parameter tuning is necessary for heterogeneous GPUs to achieve optimal performance with respect to a search space. Our proposed framework takes a most concise specification of stencil behavior from the user as a single formula, autogenerates tunable code from it, systematically searches for the best configuration and generates the code with optimal parameter configurations for different GPUs. This autotuning approach guarantees adaptive performance for different generations of GPUs while greatly enhancing programmer productivity. Experimental results show that the delivered floating point performance is very close to previous handcrafted work and outperforms other autotuned stencil codes by a large margin. Furthermore, heterogeneous GPU clusters are shown to exhibit the highest performance for dissimilar tuning parameters leveraging proportional partitioning relative to single-GPU performance.
机译:本文开发并评估了搜索和优化技术,以在GPU上自动调整3D模具(最近邻)计算。观察结果表明,参数调整对于异构GPU实现相对于搜索空间的最佳性能是必需的。我们提出的框架将用户最简单的模板行为规范作为一个公式,从中自动生成可调代码,系统地搜索最佳配置,并为不同的GPU生成具有最佳参数配置的代码。这种自动调整方法保证了不同代GPU的自适应性能,同时大大提高了程序员的生产率。实验结果表明,所提供的浮点性能与以前的手工制作非常接近,并且在很大程度上优于其他自动调整的模板代码。此外,相对于单GPU性能,异构GPU集群在利用比例划分的不同调节参数方面表现出最高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号