...
首页> 外文期刊>Cluster computing >FRAsystem: Fault tolerant system using agents in distributed computing systems
【24h】

FRAsystem: Fault tolerant system using agents in distributed computing systems

机译:FRAsystem:在分布式计算系统中使用代理的容错系统

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present a fault tolerant and recovery system called FRASystem (Fault Tolerant & Recovery Agent System) using multi-agent in distributed computing systems. Previous rollback-recovery protocols were dependent on an inherent communication and an underlying operating system, which caused a decline of computing performance. We propose a rollback- recovery protocol that works independently on an operating system and leads to an increasing portability and extensibility. We define four types of agents: (1) a recovery agent performs a rollback-recovery protocol after a failure, (2) an information agent constructs domain knowledge as a rule of fault tolerance and information during a failure-free operation, (3) a facilitator agent controls the communication between agents, (4) a garbage collection agent performs garbage collection of the useless fault tolerance information. Since agent failures may lead to inconsistent states of a system and a domino effect, we propose an agent recovery algorithm. A garbage collection protocol addresses the performance degradation caused by the increment of saved fault tolerance information in a stable storage. We implemented a prototype of FRASystem using This work was supported by the Soon chunhyang University Research Fund 20080152. Java and CORBA and experimented the proposed roll back recovery protocol. The simulations results indicate that the performance of our protocol is better than previous roll back recovery protocols which use independent check pointing and pessimistic message logging without using agents. Our contributions are as follows: (1) this is the first rollback recovery protocol using agents, (2) FRASystem is not dependent on an operating system, and (3) FRASystem provides a portability and extensibility.
机译:在本文中,我们提出了一种在分布式计算系统中使用多代理的容错和恢复系统,称为FRASystem(容错和恢复代理系统)。以前的回滚恢复协议依赖于固有的通信和底层的操作系统,从而导致计算性能下降。我们提出了一种回滚恢复协议,该协议可在操作系统上独立运行,并导致可移植性和可扩展性不断提高。我们定义了四种类型的代理:(1)恢复代理在故障后执行回滚恢复协议;(2)信息代理根据无故障操作过程中的容错性和信息来构造域知识;(3)促进者代理控制代理之间的通信,(4)垃圾收集代理对无用的容错信息执行垃圾收集。由于代理故障可能导致系统状态不一致和多米诺骨牌效应,因此我们提出了一种代理恢复算法。垃圾回收协议可解决由于稳定存储中已保存的容错信息增加而导致的性能下降。我们使用以下代码实现了FRASystem的原型。这项工作得到了很快的淳阳大学研究基金20080152的支持。Java和CORBA进行了实验,并尝试了建议的回滚恢复协议。仿真结果表明,我们的协议的性能优于以前的回滚恢复协议,后者使用独立的检查点和悲观的消息记录而不使用代理。我们的贡献如下:(1)这是使用代理的第一个回滚恢复协议,(2)FRASystem不依赖于操作系统,并且(3)FRASystem提供了可移植性和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号