Journal: International Journal of Distributed and Parallel Systems

A New Co-Ordinated Checkpointing and Rollback Recovery Scheme for Distributed Shared Memory Clusters



Abstract

This paper proposes a unified, lightweight error recovery scheme for distributed shared memory clusters, based on coordinated checkpointing and rollback. The scheme maintains multiple globally consistent checkpoints of the cluster state and, after a fault, recovers to a pre-fault checkpoint of the system. The paper also describes and evaluates the coordinated checkpointing mechanism. This mechanism neither exchanges coordination messages nor adds information to the process messages, and it accesses stable storage only when checkpoints are saved. Each process saves its state independently of the other processes. Checkpoint timers are set at the individual processes. The performance evaluation shows that the proposed scheme outperforms previously proposed checkpoint and recovery schemes for distributed shared memory clusters.
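The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea of timer-driven checkpointing with rollback to a pre-fault checkpoint, not the paper's actual scheme. The `Process` class, the integer tick model, the shared `interval`, the `keep` limit, and the functions `recovery_index` and `rollback` are all hypothetical names introduced for illustration. The sketch assumes perfectly synchronized local timers and ignores in-flight messages, both of which a real scheme would have to handle.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Process:
    """Hypothetical cluster node that checkpoints on its own local timer."""
    pid: int
    interval: int                 # checkpoint period in ticks (assumed equal on all nodes)
    keep: int = 3                 # number of recent checkpoints retained (must be >= 1)
    state: int = 0                # stand-in for the node's memory state
    checkpoints: Dict[int, int] = field(default_factory=dict)  # index -> saved state

    def tick(self, now: int) -> None:
        """Do one unit of work; save a checkpoint when the local timer expires."""
        self.state += self.pid + 1
        if now % self.interval == 0:
            # No message is sent or received here: the index comes from the timer alone.
            self.checkpoints[now // self.interval] = self.state
            for old in sorted(self.checkpoints)[:-self.keep]:
                del self.checkpoints[old]  # drop checkpoints beyond the retention limit


def recovery_index(procs: List[Process], fault_time: int) -> Optional[int]:
    """Newest checkpoint index held by every process and taken before the fault."""
    common = set.intersection(*(set(p.checkpoints) for p in procs))
    pre_fault = [k for k in common if k * procs[0].interval < fault_time]
    return max(pre_fault) if pre_fault else None


def rollback(procs: List[Process], index: int) -> None:
    """Restore every process to checkpoint `index` and discard newer checkpoints."""
    for p in procs:
        p.state = p.checkpoints[index]
        for k in [k for k in p.checkpoints if k > index]:
            del p.checkpoints[k]


def demo() -> List[int]:
    """Run three processes for 17 ticks, then recover from a fault that occurred at tick 12."""
    procs = [Process(pid=i, interval=5) for i in range(3)]
    for now in range(1, 18):
        for p in procs:
            p.tick(now)
    index = recovery_index(procs, fault_time=12)
    if index is not None:
        rollback(procs, index)
    return [p.state for p in procs]
```

In this toy run, checkpoints are taken at ticks 5, 10, and 15. A fault at tick 12 rules out the checkpoint from tick 15, so all three processes roll back to the one taken at tick 10. Keeping several checkpoints, rather than only the latest, is what makes it possible to reach a checkpoint that predates the fault.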
