首页> 外文OA文献 >Scalable directoryless shared memory coherence using execution migration
【2h】

Scalable directoryless shared memory coherence using execution migration

机译:使用执行迁移的可扩展无目录共享内存一致性

摘要

We introduce the concept of deadlock-free migration-based coherent shared memory to the NUCA family of architectures. Migration-based architectures move threads among cores to guarantee sequential semantics in large multicores. Using a execution migration (EM) architecture, we achieve performance comparable to directory-based architectures without using directories: avoiding automatic data replication significantly reduces cache miss rates, while a fast network-level thread migration scheme takes advantage of shared data locality to reduce remote cache accesses that limit traditional NUCA performance. EM area and energy consumption are very competitive, and, on the average, it outperforms a directory-based MOESI baseline by 6.8% and a traditional S-NUCA design by 9.2%. We argue that with EM scaling performance has much lower cost and design complexity than in directory-based coherence and traditional NUCA architectures: by merely scaling network bandwidth from 128 to 256 (512) bit flits, the performance of our architecture improves by an additional 8% (12%), while the baselines show negligible improvement.
机译:我们将基于无死锁迁移的一致性共享内存的概念引入到NUCA系列体系结构中。基于迁移的体系结构在内核之间移动线程,以确保大型多核中的顺序语义。使用执行迁移(EM)架构,我们无需使用目录即可获得与基于目录的架构相当的性能:避免自动数据复制可显着降低缓存未命中率,而快速的网络级线程迁移方案则利用共享数据的本地性来减少远程限制传统NUCA性能的高速缓存访​​问。 EM区域和能耗非常有竞争力,平均而言,它比基于目录的MOESI基准要高出6.8%,比传统的S-NUCA设计要高9.2%。我们认为,与基于目录的一致性和传统NUCA架构相比,使用EM扩展性能具有更低的成本和设计复杂度:仅通过将网络带宽从128位flit扩展到256(512)位flits,我们的体系结构的性能可额外提高8 %(12%),而基线显示的改善可忽略不计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号