首页> 外文会议>2011 International Conference on Embedded Computer Systems : Architectures, Modeling and Simulation >Scalable multi-core simulation using parallel dynamic binary translation
【24h】

Scalable multi-core simulation using parallel dynamic binary translation

机译:使用并行动态二进制翻译的可扩展多核仿真

获取原文

摘要

In recent years multi-core processors have seen broad adoption in application domains ranging from embedded systems through general-purpose computing to large-scale data centres. Simulation technology for multi-core systems, however, lags behind and does not provide the simulation speed required to effectively support design space exploration and parallel software development. While state-of-the-art instruction set simulators (Iss) for single-core machines reach or exceed the performance levels of speed-optimised silicon implementations of embedded processors, the same does not hold for multi-core simulators where large performance penalties are to be paid. In this paper we develop a fast and scalable simulation methodology for multi-core platforms based on parallel and just-in-time (Jit) dynamic binary translation (Dbt). Our approach can model large-scale multi-core configurations, does not rely on prior profiling, instrumentation, or compilation, and works for all binaries targeting a state-of-the-art embedded multi-core platform implementing the ARCompact instruction set architecture (Isa). We have evaluated our parallel simulation methodology against the industry standard Splash-2 and Eembc MultiBench benchmarks and demonstrate simulation speeds up to 25,307 Mips on a 32-core ×86 host machine for as many as 2048 target processors whilst exhibiting minimal and near constant overhead.
机译:近年来,多核处理器在从嵌入式系统到通用计算再到大型数据中心的应用领域中得到了广泛的采用。但是,用于多核系统的仿真技术滞后并且不能提供有效支持设计空间探索和并行软件开发所需的仿真速度。尽管单核计算机的最新指令集模拟器(Iss)达到或超过了嵌入式处理器速度优化的硅实现的性能水平,但多核模拟器却没有这样做,这会带来较大的性能损失。要支付。在本文中,我们基于并行和实时(Jit)动态二进制翻译(Dbt),为多核平台开发了一种快速且可扩展的仿真方法。我们的方法可以对大规模多核配置进行建模,不依赖于先前的性能分析,检测或编译,并且适用于所有二进制文件,这些二进制文件针对的是实现ARCompact指令集体系结构的最新嵌入式多核平台(伊萨)。我们已根据行业标准Splash-2和Eembc MultiBench基准评估了并行仿真方法,并在32核×86主机上为多达2048个目标处理器演示了高达25,307 Mips的仿真速度,同时展现出最小且接近恒定的开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号