首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >A Runtime and Non-Intrusive Approach to Optimize EDP by Tuning Threads and CPU Frequency for OpenMP Applications
【24h】

A Runtime and Non-Intrusive Approach to Optimize EDP by Tuning Threads and CPU Frequency for OpenMP Applications

机译:通过调整线程和CPU频率优化EDP的运行时和非侵入式方法,用于OpenMP应用程序

获取原文
获取原文并翻译 | 示例
           

摘要

Efficiently exploiting thread-level parallelism has been challenging. Many parallel applications are not sufficiently balanced or CPU-bound to take advantage of the increasing number of cores and the highest possible operating frequency. Moreover, many variables may change according to the system (input set, microarchitecture, and number of cores) or during execution, influencing each parallel region in different ways. Therefore, the task of rightly choosing the ideal configuration (number of threads and DVFS) for each parallel region to deliver the best Energy-Delay Product (EDP) is not straightforward. While the significant number of variables prevents the use of exhaustive search methods, the changing nature of the problem precludes offline strategies. Few solutions are online and synergistically consider thread throttling and DVFS. However, they lack transparency (demand changes in the original code) and/or adaptability (do not automatically adjust to applications at run-time). Our proposed Hoder covers all the characteristics above, optimizing at run-time any dynamically linked OpenMP application, without requiring any code transformation or recompilation. We show Hoder's efficiency by comparing it to two exhaustive offline and two online search approaches, three state-of-the-art techniques, and regular OpenMP execution, considering different setups (Intel 44-, 16- and 12-core; AMD 8- and 12-core).
机译:有效利用线程平行度是具有挑战性的。许多并行应用程序不充分平衡或CPU键,以利用越来越多的核和最高可能的工作频率。此外,许多变量可以根据系统(输入集,微架构和核心数)或在执行期间改变,以不同方式影响每个并行区域。因此,正确选择每个平行区域的理想配置(线程和DVF的数量)的任务,以提供最佳的能量延迟产品(EDP)并不直接。虽然大量变量阻止了使用详尽的搜索方法,但问题的变化性质排除了离线策略。很少有解决方案都在线,协同考虑线程限制和DVF。但是,它们缺乏透明度(原始代码的需求变化)和/或适应性(不要在运行时自动调整到应用程序)。我们提出的HODER涵盖了上述所有特性,在运行时优化任何动态链接的OpenMP应用程序,而无需任何代码转换或重新编译。通过将其与两个详尽的离线和两个在线搜索方法,三种最先进的技术以及定期的OpenMP执行,考虑不同的设置,我们展示了HODER的效率和12核)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号