...
首页> 外文期刊>Computer physics communications >Efficient utilization of launched threads on GPUs: The spherical harmonic transform as a case study
【24h】

Efficient utilization of launched threads on GPUs: The spherical harmonic transform as a case study

机译:有效利用GPU上的启动线程:以球谐函数为例

获取原文
获取原文并翻译 | 示例
           

摘要

Maximum utilization of hardware resources is crucial to leverage the enormous computational power of graphics processing units (GPUs). However, there lacks an effective metric to denote whether the launched threads are kept busy. To address this issue, we propose a metric called ETU to describe the efficiency of threads utilization. First, we execute several CUDA-SDK sample codes, with(out) double precision arithmetic, on two generations of GPUs so as to perform a preliminary validation of the ETU metric. Taking the spherical harmonic transform as an example, we then give two GPU implementations for Legendre transforms and check the relationship between ETU and application performance. Experimental results show that applications with larger ETU can usually achieve better performance, which is more accurate than occupancy proposed by NVIDIA. Finally, we select the GPU implementations with better performance to accelerate Legendre transforms in STSWM, which is a spectral transform shallow water model.
机译:充分利用硬件资源对于利用图形处理单元(GPU)的巨大计算能力至关重要。但是,缺少一个有效的指标来表示启动的线程是否保持繁忙。为了解决此问题,我们提出了一种称为ETU的度量标准,用于描述线程利用的效率。首先,我们在两代GPU上执行多个(不带)双精度算法的CUDA-SDK样本代码,以便对ETU指标进行初步验证。以球形谐波变换为例,然后给出两种用于Legendre变换的GPU实现,并检查ETU与应用程序性能之间的关系。实验结果表明,具有较大ETU的应用程序通常可以实现更好的性能,比NVIDIA提出的占用率更准确。最后,我们选择性能更好的GPU实现来加速STSWM中的Legendre变换,STSWM是一种频谱变换浅水模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号