Dynamic Fault Tolerance in Distributed Simulation System


Abstract

Distributed simulation systems are widely used for forecasting, decision-making, and scientific computing, and multi-agent systems and grids have been used as simulation platforms. To survive software or hardware failures and to guarantee a high success rate during agent migration, such a system must solve the fault tolerance problem. Classic fault tolerance techniques such as checkpointing and redundancy can be applied to distributed simulation systems, but they are not efficient. We present a novel fault tolerance protocol that combines causal message logging with the primary-backup technique. The proposed protocol uses an iterative backup location scheme and an adaptive update interval to reduce overhead and to balance the cost of fault tolerance against recovery time. The protocol creates no orphan states and does not require surviving agents to roll back. Most importantly, the recovery scheme can tolerate concurrent failures, even the permanent failure of a single node. The correctness of the protocol is proved, and experiments show that the protocol is efficient.
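The abstract gives no protocol details, so the following is only a minimal sketch of the general idea of pairing a message log with a backup copy and a tunable update interval. It is not the authors' protocol: messages are logged directly at the backup, which is a simplification standing in for causal message logging, and every name here (`Backup`, `Primary`, `apply_message`, `next_interval`) as well as the halving/doubling rule for the interval is a hypothetical choice made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


def apply_message(state: int, msg: int) -> int:
    """Toy deterministic state transition (assumption: state is a running sum)."""
    return state + msg


@dataclass
class Backup:
    """Holds the last checkpoint plus the messages logged since then."""
    checkpoint: int = 0
    log: List[int] = field(default_factory=list)

    def store_checkpoint(self, state: int) -> None:
        self.checkpoint = state
        self.log.clear()  # logged messages are now covered by the checkpoint

    def append(self, msg: int) -> None:
        self.log.append(msg)

    def recover(self) -> int:
        # Rebuild the lost state: start from the checkpoint, then replay the log.
        state = self.checkpoint
        for msg in self.log:
            state = apply_message(state, msg)
        return state


@dataclass
class Primary:
    """Agent that logs each message at its backup before applying it."""
    backup: Backup
    interval: int = 4  # messages between checkpoints (hypothetical default)
    state: int = 0
    since_checkpoint: int = 0

    def deliver(self, msg: int) -> None:
        self.backup.append(msg)  # log first, so a crash cannot lose the message
        self.state = apply_message(self.state, msg)
        self.since_checkpoint += 1
        if self.since_checkpoint >= self.interval:
            self.backup.store_checkpoint(self.state)
            self.since_checkpoint = 0


def next_interval(interval: int, checkpoint_cost: float,
                  replay_cost_per_msg: float, lo: int = 1, hi: int = 64) -> int:
    """Hypothetical adaptation rule: shorten the interval when replaying a full
    log would cost more than one checkpoint, otherwise lengthen it."""
    if interval * replay_cost_per_msg > checkpoint_cost:
        interval //= 2
    else:
        interval *= 2
    return max(lo, min(hi, interval))
```

In this sketch, a shorter interval means more frequent state transfers but a shorter log to replay after a failure, which is the overhead-versus-recovery-time trade-off the abstract refers to. Because the backup can rebuild the failed agent's state by itself, no other agent has to roll back in this toy setting.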


Bibliographic details

Source
International Conference on Computational Science (ICCS 2006), pt. 1; 28-31 May 2006; Reading (GB) | 2006 | pp. 769-776 | 8 pages
Conference location: Reading (GB)
Authors
Min Ma; Shiyao Jin; Chaoqun Ye; Xiaojian Liu;
Author affiliation

School of Computer Science, National University of Defense Technology, Hunan Changsha 410073, China;

Original format: PDF
Language: English
CLC classification: Computing technology, computer technology

Similar documents


1. Dynamic Distributed Intrusion Detection System Based on Mobile Agents with Fault Tolerance [J]. Sasikumar R., D. Manjula. Journal of computer sciences. 2012, No. 7

2. Dynamic Distributed Intrusion Detection System Based on Mobile Agents with Fault Tolerance | Science Publications [J]. D. Manjula, R. Sasikumar. Journal of computer sciences. 2012, No. 7

3. A Dynamic Slack Management Technique for Real-Time Distributed Embedded System with Enhanced Fault Tolerance and Resource Constraints [J]. Santhi Baskaran, I. Gugan, A. Aswin Kumar. International Journal on Computer Science and Engineering. 2011, No. 1

4. Dynamic Fault Tolerance in Distributed Simulation System [C]. Min Ma, Shiyao Jin, Chaoqun Ye. International Conference on Computational Science pt.1. 2006

5. Runtime systems for load balancing and fault tolerance on distributed systems [D]. Arafat, Md Humayun. 2014

6. Ab initio molecular dynamics simulation of the effects of stacking faults on the radiation response of 3C-SiC [O]. M. Jiang, S. M. Peng, H. B. Zhang

7. Dynamic Distributed Intrusion Detection System Based on Mobile Agents with Fault Tolerance [O]. D. Manjula, R. Sasikumar. 2012
