首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >GPGPU Performance Estimation With Core and Memory Frequency Scaling
【24h】

GPGPU Performance Estimation With Core and Memory Frequency Scaling

机译:GPGPU性能估计与核心和内存频率缩放

获取原文
获取原文并翻译 | 示例
           

摘要

Contemporary graphics processing units (GPUs) support dynamic voltage and frequency scaling to balance computational performance and energy consumption. However, accurate and straightforward performance estimation for a given GPU kernel under different frequency settings is still lacking for real hardware, which is essential to determine the best frequency configuration for energy saving. In this article, we reveal a fine-grained analytical model to estimate the execution time of GPU kernels with both core and memory frequency scaling. Compared to the cycle-level simulators, which are too slow to apply on real hardware, our model only needs simple and one-off micro-benchmarks to extract a set of hardware parameters and kernel performance counters without any source code analysis. Our experimental results show that the proposed performance model can capture the kernel performance scaling behaviors under different frequency settings and achieve decent accuracy (average errors of 3.85, 8.6, 8.82, and 8.83 percent on a set of 20 GPU kernels with four modern Nvidia GPUs).
机译:当代图形处理单元(GPU)支持动态电压和频率缩放,以平衡计算性能和能耗。然而,在不同频率设置下对给定GPU内核的准确和直接的性能估计仍然缺乏真正的硬件,这对于确定节能的最佳频率配置至关重要。在本文中,我们揭示了一种细粒度的分析模型,以估算GPU内核与核心和内存频率缩放的执行时间。与循环级模拟器相比,在真实硬件上应用太慢,我们的型号仅需要简单和一次性的微基准,以提取一组硬件参数和内核性能计数器,而无需任何源代码分析。我们的实验结果表明,所提出的性能模型可以在不同的频率设置下捕获内核性能缩放行为,实现体面的精度(3.85,8.6,8.82和有四个现代NVIDIA GPU的一套20 GPU内核的平均误差) 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号