首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Accounting for memory bank contention and delay in high-bandwidth multiprocessors
【24h】

Accounting for memory bank contention and delay in high-bandwidth multiprocessors

机译:解决高带宽多处理器中的存储体争用和延迟

获取原文
获取原文并翻译 | 示例
           

摘要

For years, the computation rate of processors has been much faster than the access rate of memory banks, and this divergence in speeds has been constantly increasing in recent years. As a result, several shared-memory multiprocessors consist of more memory banks than processors. The object of this paper is to provide a simple model (with only a few parameters) for the design and analysis of irregular parallel algorithms that will give a reasonable characterization of performance on such machines. For this purpose, we extend Valiant's bulk-synchronous parallel (BSP) model with two parameters: a parameter for memory bank delay, the minimum time for servicing requests at a bank, and a parameter for memory bank expansion, the ratio of the number of banks to the number of processors. We call this model the (d, x)BSP. We show experimentally that the (d, x)-BSP captures the impact of bank contention and delay on the CRAY C90 and J90 for irregular access patterns, without modeling machine-specific details of these machines. The model has clarified the performance characteristics of several unstructured algorithms on the CRAY C90 and J90, and allowed us to explore tradeoffs and optimizations for these algorithms. In addition to modeling individual algorithms directly, we also consider the use of the (d, x)-BSP as a bridging model for emulating a very high-level abstract model, the Parallel Random Access Machine (PRAM). We provide matching upper and lower bounds for emulating the EREW and QRQW PRAMs on the (d, X)-BSP.
机译:多年来,处理器的计算速度比存储库的访问速度快得多,并且近年来这种速度差异一直在不断增加。结果,几个共享内存多处理器比处理器具有更多的存储体。本文的目的是为不规则并行算法的设计和分析提供一个简单的模型(只有几个参数),从而可以合理地表征此类机器的性能。为此,我们使用两个参数扩展了Valiant的批量同步并行(BSP)模型:一个用于存储体延迟的参数,一个用于服务于存储体的最小时间,一个用于存储体扩展的参数,存储到处理器数量。我们将此模型称为(d,x)BSP。我们通过实验表明,(d,x)-BSP可以捕获银行争用和延迟对CRAY C90和J90的不规则访问模式的影响,而无需对这些机器的机器特定细节进行建模。该模型阐明了CRAY C90和J90上几种非结构化算法的性能特征,并允许我们探索这些算法的权衡和优化。除了直接为单个算法建模之外,我们还考虑将(d,x)-BSP用作桥接模型,以模拟非常高级的抽象模型,并行随机存取机(PRAM)。我们提供了匹配的上下边界,用于仿真(d,X)-BSP上的EREW和QRQW PRAM。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号