首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >A Loosely-Coupled Full-System Multicore Simulation Framework
【24h】

A Loosely-Coupled Full-System Multicore Simulation Framework

机译:松耦合的全系统多核仿真框架

获取原文
获取原文并翻译 | 示例
       

摘要

Full-system simulation is critical in evaluating design alternatives for multicore processors. However, state-of-the-art multicore simulators either lack good extensibility due to their tightly-coupled design between functional model (FM) and timing model (TM), or cannot guarantee cycle-accuracy. This paper conducts a comprehensive study on factors affecting cycle-accuracy and uncovers several contributing factors less studied before. Based on these insights, we propose a loosely-coupled functional-driven full-system simulator for multicore, namely Transformer. To ensure extensibility and cycle-accuracy, Transformer leverages an architecture-independent interface between FM and TM and uses a lightweight scheme to detect and recover from execution divergence between FM and TM. Built upon Transformer and its foundational simulator components, a graduate student only needed to write about 180 lines of code to extend an X86 functional model (QEMU) in Transformer. Moreover, the loosely-coupled design also removes the complex interaction between FM and TM and opens the opportunity to parallelize FM and TM to improve performance. Experimental results show that Transformer achieves an average of 8.4 and 7.0 percent performance improvement over GEMS in 4-core and 8-core configuration while guaranteeing cycle-accuracy. A further parallelization between FM and TM leads to 35.3 and 29.7 percent performance improvement respectively.
机译:完整的系统仿真对于评估多核处理器的设计选择至关重要。但是,最新的多核模拟器要么由于功能模型(FM)和时序模型(TM)之间紧密耦合的设计而缺乏良好的可扩展性,要么不能保证周期精度。本文对影响循环精度的因素进行了全面研究,并揭示了一些以前研究较少的影响因素。基于这些见解,我们为多核提出了一种松耦合的功能驱动的全系统模拟器,即Transformer。为了确保可扩展性和周期准确性,Transformer利用FM和TM之间的体系结构无关接口,并使用轻量级方案来检测FM和TM之间的执行差异并从中恢复。以Transformer及其基础模拟器组件为基础,一名研究生只需编写约180行代码即可在Transformer中扩展X86功能模型(QEMU)。此外,松耦合设计还消除了FM和TM之间的复杂交互,并为FM和TM并行化提供了机会,以提高性能。实验结果表明,与4核和8核配置的GEMS相比,Transformer的性能平均提高了8.4%和7.0%,同时保证了循环精度。 FM和TM之间的进一步并行化分别分别提高了35.3和29.7%的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号