首页> 外文会议>IEEE/ACM International Symposium on Code Generation and Optimization >YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures
【24h】

YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures

机译:Yasksite:模板优化技术应用于现代架构上的显式颂法方法

获取原文

摘要

The landscape of multi-core architectures is growing more complex and diverse. Optimal application performance tuning parameters can vary widely across CPUs, and finding them in a possibly multidimensional parameter search space can be time consuming, expensive and potentially infeasible. In this work, we introduce YaskSite, a tool capable of tackling these challenges for stencil computations. YaskSite is built upon Intel's YASK framework. It combines YASK's flexibility to deal with different target architectures with the Execution-Cache-Memory performance model, which enables identifying optimal performance parameters analytically without the need to run the code. Further we show that YaskSite's features can be exploited by external tuning frameworks to reliably select the most efficient kernel(s) for the application at hand. To demonstrate this, we integrate YaskSite into Offsite, an offline tuner for explicit ordinary differential equation methods, and show that the generated performance predictions are reliable and accurate, leading to considerable performance gains at minimal code generation time and autotuning costs on the latest Intel Cascade Lake and AMD Rome CPUs.
机译:多核架构的景观正在增长更复杂和多样化。最佳应用程序性能调谐参数可以跨CPU的广泛变化,并在可能的多维参数搜索空间中找到它们可能是耗时,昂贵且潜在的不可行的。在这项工作中,我们介绍了Yasksite,一种能够解决模板计算的这些挑战的工具。 Yasksite是在英特尔的Yask框架之上建造的。它将YASK的灵活性与执行缓存 - 内存性能模型进行处理不同的目标架构,这使得能够分析地识别最佳性能参数,而无需运行代码。此外,我们表明,亚盖塔的特征可以通过外部调整框架利用,以可靠地选择手头应用的最有效的内核。为了证明这一点,我们将Yaskesite集成到Offsite中,一个离线调谐器,用于显式常微分方程方法,并表明所生成的性能预测是可靠和准确的,导致最新的代码生成时间和最新英特尔级联的自动成本上的相当大的性能增益湖和罗马CPU。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号