首页> 外文会议>2013 8th International Design and Test Symposium >Fault tolerance on multicore processors using deterministic multithreading
【24h】

Fault tolerance on multicore processors using deterministic multithreading

机译:使用确定性多线程的多核处理器上的容错

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a software based fault tolerance approach for multithreaded programs running on multicore processors. Redundant multithreaded processes are used to detect soft errors and recover from them. Our scheme makes sure that the execution of the redundant processes is identical even in the presence of non-determinism due to shared memory accesses. This is done by making sure that the redundant processes acquire the locks for accessing the shared memory in the same order. Instead of using record/replay technique to do that, our scheme is based on deterministic multithreading, meaning that for the same input, a multithreaded program always have the same lock interleaving. Unlike record/replay systems, this eliminates the requirement for communication between the redundant processes. Moreover, our scheme is implemented totally in software, requiring no special hardware, making it very portable. Furthermore, our scheme is totally implemented at user-level, requiring no modification of the kernel. For selected benchmarks, our scheme adds an average overhead of 49% for 4 threads.
机译:本文介绍了一种基于软件的容错方法,用于在多核处理器上运行的多线程程序。冗余多线程进程用于检测软错误并从中恢复。我们的方案确保即使在由于共享内存访问而导致不确定性的情况下,冗余过程的执行也是相同的。通过确保冗余进程以相同的顺序获取用于访问共享内存的锁来完成此操作。我们的方案不是使用记录/重放技术来执行此操作,而是基于确定性多线程,这意味着对于相同的输入,多线程程序始终具有相同的锁交错。与记录/重放系统不同,这消除了冗余过程之间通信的需求。而且,我们的方案完全用软件实现,不需要特殊的硬件,因此非常易于移植。此外,我们的方案完全在用户级别实现,无需修改内核。对于选定的基准,我们的方案为4个线程平均增加了49%的开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号