GPGPU Performance Estimation With Core and Memory Frequency Scaling

Wang Qiang; Chu Xiaowen


Abstract

Contemporary graphics processing units (GPUs) support dynamic voltage and frequency scaling to balance computational performance and energy consumption. However, accurate and straightforward performance estimation for a given GPU kernel under different frequency settings is still lacking for real hardware, although it is essential for determining the best frequency configuration for energy saving. In this article, we present a fine-grained analytical model to estimate the execution time of GPU kernels under both core and memory frequency scaling. Unlike cycle-level simulators, which are too slow to apply on real hardware, our model only needs simple, one-off micro-benchmarks to extract a set of hardware parameters, plus kernel performance counters, without any source code analysis. Our experimental results show that the proposed performance model can capture kernel performance scaling behaviors under different frequency settings and achieve decent accuracy (average errors of 3.85, 8.6, 8.82, and 8.83 percent on a set of 20 GPU kernels across four modern Nvidia GPUs).
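To make the idea of estimating execution time under core and memory frequency scaling concrete, here is a minimal Python sketch. It is not the model from the article, whose equations are not given in this record. It assumes a deliberately simplified bound: compute time scales with the inverse of the core frequency, memory time scales with the inverse of the memory frequency, the slower of the two dominates, and a fixed overhead is frequency-independent. The names `KernelProfile`, `estimate_time_s`, and `sweep` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KernelProfile:
    """Hypothetical per-kernel quantities (illustrative, not the paper's parameters)."""
    compute_cycles: float    # core-clock cycles spent on computation
    memory_cycles: float     # memory-clock cycles spent on DRAM traffic
    fixed_overhead_s: float  # time that does not scale with either frequency


def estimate_time_s(profile, core_mhz, mem_mhz):
    """Toy estimate: the slower of the compute and memory phases bounds the kernel."""
    if core_mhz <= 0 or mem_mhz <= 0:
        raise ValueError("frequencies must be positive")
    compute_s = profile.compute_cycles / (core_mhz * 1e6)
    memory_s = profile.memory_cycles / (mem_mhz * 1e6)
    return max(compute_s, memory_s) + profile.fixed_overhead_s


def sweep(profile, core_freqs_mhz, mem_freqs_mhz):
    """Return {(core_mhz, mem_mhz): estimated seconds} for every frequency pair."""
    return {
        (fc, fm): estimate_time_s(profile, fc, fm)
        for fc in core_freqs_mhz
        for fm in mem_freqs_mhz
    }
```

Under these assumptions, a memory-bound kernel shows almost no speedup when only the core frequency rises, which is the kind of scaling behavior a frequency-aware model has to capture before a frequency configuration can be chosen for energy saving.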


Bibliographic record

Source: IEEE Transactions on Parallel and Distributed Systems, 2020, no. 12, pp. 2865-2881 (17 pages)
Authors: Wang Qiang; Chu Xiaowen
Affiliation: Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong, People's Republic of China
Original format: PDF
Language: English
Keywords: Graphics processing units; dynamic voltage and frequency scaling; GPU performance modeling

Similar literature

1. GPGPU Power Estimation with Core and Memory Frequency Scaling [J]. Qiang Wang, Xiaowen Chu. Performance Evaluation Review, 2017, no. 2.
2. Resource Sharing Centric Dynamic Voltage and Frequency Scaling for CMP Cores, Uncore, and Memory [J]. Won Jae-Yeon, Gratz Paul V., Shakkottai Srinivas. ACM Transactions on Design Automation of Electronic Systems, 2016, no. 4.
3. Parsimonious Estimation of the Wechsler Memory Scale, Fourth Edition Demographically Adjusted Index Scores: Immediate and Delayed Memory [J]. Justin B. Miller, Bradley N. Axelrod, Christian Schutte. The Clinical Neuropsychologist, 2012, no. 3.
4. GPGPU Performance Estimation with Core and Memory Frequency Scaling [C]. Qiang Wang, Xiaowen Chu. IEEE International Conference on Parallel and Distributed Systems, 2018.
5. Performance analysis and fitness of GPGPU and multicore architectures for scientific applications [D]. Bhuiyan, Mohammad Ashraf Uddin. 2011.
6. Learning-Directed Dynamic Voltage and Frequency Scaling Scheme with Adjustable Performance for Single-Core and Multi-Core Embedded and Mobile Systems [O]. Yen-Lin Chen, Ming-Feng Chang, Chao-Wei Yu. 2018.
7. GPGPU Performance Estimation with Core and Memory Frequency Scaling [O]. Qiang Wang, Xiaowen Chu. 2018.
