Journal: ACM Transactions on Architecture and Code Optimization

Performance Optimization of the HPCG Benchmark on the Sunway TaihuLight Supercomputer


Abstract

In this article, we present some key techniques for optimizing HPCG on Sunway TaihuLight and demonstrate how to achieve high performance in memory-bound applications by exploiting specific characteristics of the hardware architecture. In particular, we utilize a block multicoloring approach for parallelization and propose methods such as requirement-based data mapping and customized gather collective to enhance the effective memory bandwidth. Experiments indicate that the optimized HPCG code can sustain 77% of the theoretical memory bandwidth and scale to the full system of more than 10 million cores, with an aggregated performance of 480.8 Tflop/s and a weak scaling efficiency of 87.3%.
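
The abstract names block multicoloring but does not describe the scheme, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a 3D grid with HPCG's 27-point stencil, cubic blocks of edge `b`, and an 8-color assignment based on the parity of each block's coordinates; all function names here are hypothetical. Blocks that share a color are never adjacent, so a Gauss-Seidel-style sweep could process them in parallel, one color at a time.

```python
from itertools import product


def block_color(bx, by, bz):
    """Assumed scheme: color a block by the parity of its block coordinates (8 colors)."""
    return (bx % 2) + 2 * (by % 2) + 4 * (bz % 2)


def blocks_by_color(nx, ny, nz, b):
    """Group the blocks of an nx x ny x nz grid (block edge b) by color."""
    counts = [-(-n // b) for n in (nx, ny, nz)]  # ceiling division per dimension
    groups = {c: [] for c in range(8)}
    for bx, by, bz in product(*(range(n) for n in counts)):
        groups[block_color(bx, by, bz)].append((bx, by, bz))
    return groups


def points_in_block(block, dims, b):
    """List the grid points owned by one block, clipped to the grid boundary."""
    ranges = [range(c * b, min((c + 1) * b, n)) for c, n in zip(block, dims)]
    return list(product(*ranges))


def are_stencil_neighbors(p, q):
    """True if p and q are distinct points coupled by a 27-point stencil."""
    return p != q and all(abs(a - c) <= 1 for a, c in zip(p, q))


def same_color_blocks_independent(nx, ny, nz, b):
    """Brute-force check that no two same-colored blocks contain coupled points."""
    dims = (nx, ny, nz)
    for blocks in blocks_by_color(nx, ny, nz, b).values():
        for i, first in enumerate(blocks):
            for second in blocks[i + 1:]:
                for p in points_in_block(first, dims, b):
                    for q in points_in_block(second, dims, b):
                        if are_stencil_neighbors(p, q):
                            return False
    return True
```

Calling `same_color_blocks_independent(6, 6, 6, 2)` checks the property on a small grid by brute force. The parity coloring is chosen here only because it is easy to verify; the paper may use a different number of colors or block shape.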
