首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >TC-Release++: An Efficient Timestamp-Based Coherence Protocol for Many-Core Architectures
【24h】

TC-Release++: An Efficient Timestamp-Based Coherence Protocol for Many-Core Architectures

机译:TC-Release ++:适用于多核体系结构的基于时间戳的高效一致性协议

获取原文
获取原文并翻译 | 示例
           

摘要

As we enter the era of many-core, providing the shared memory abstraction through cache coherence has become progressively difficult. The standard directory-based coherence does not scale well with increasing core count. Timestamp-based hardware coherence protocols introduced recently offer an attractive alternative solution. This paper proposes a timestamp-based coherence protocol, called TC-Release++ , that efficiently supports cache coherence in large-scale systems. Our approach is inspired by TC-Weak, a recently proposed timestamp-based coherence protocol targeting GPU architectures. We first design TC-Release in an attempt to straightforwardly port TC-Weak to general-purpose many-cores. But re-purposing TC-Weak for general-purpose many-core architectures is challenging due to significant differences both in architecture and the programming model. Indeed the performance of TC-Release turns out to be worse than conventional directory protocols. We overcome the limitations and overheads of TC-Release by exploiting simple hardware support to eliminate frequent memory stalls, and an optimized lifetime prediction mechanism to improve cache performance. The resulting optimized coherence protocol TC-Release++ is highly scalable (storage scales logarithmically with core count) and shows better performance (3.0 percent) and comparable network traffic (within 1.3 percent) relative to the baseline MESI directory protocol. We use Murphi to formally verify that TC-Release ++ is error-free and imposes small verification cost.
机译:随着我们进入多核时代,通过缓存一致性提供共享内存抽象已变得越来越困难。随着核心数量的增加,基于目录的标准一致性无法很好地扩展。最近引入的基于时间戳的硬件一致性协议提供了一种有吸引力的替代解决方案。本文提出了一种基于时间戳的一致性协议,称为TC-Release ++,该协议可有效支持大规模系统中的缓存一致性。我们的方法受到TC-Weak的启发,TC-Weak是最近提出的针对GPU架构的基于时间戳的一致性协议。我们首先设计TC-Release,以尝试将TC-Weak直接移植到通用多核。但是由于架构和编程模型上的巨大差异,将TC-Weak用于通用的多核架构面临很大挑战。实际上,事实证明,TC-Release的性能比常规目录协议差。通过利用简单的硬件支持来消除频繁的内存停顿以及优化的寿命预测机制来提高缓存性能,我们克服了TC-Release的局限性和开销。产生的优化的一致性协议TC-Release ++具有高度可扩展性(存储量与核心数成对数扩展),并且相对于基线MESI目录协议,具有更好的性能(3.0%)和可比的网络流量(1.3%之内)。我们使用Murphi正式验证TC-Release ++是否没有错误,并且验证成本很小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号