首页> 外文学位 >Checkpointing for graceful degradation in distributed embedded systems.
【24h】

Checkpointing for graceful degradation in distributed embedded systems.

机译:分布式嵌入式系统中性能下降的检查点。

获取原文
获取原文并翻译 | 示例

摘要

Graceful degradation is an approach to developing dependable safety-critical embedded applications, where redundant active or standby resources are used to cope with faults through a system reconfiguration at run-time. Compared to traditional hardware and software redundancy, it is a promising technique that may achieve dependability with a significant reduction in cost, size, weight, and power requirements. Reconfiguration at run-time necessitates using proper checkpointing protocols to support state reservation to ensure correct task restarts after a system reconfiguration. One of the most common checkpointing protocols are communication induced checkpointing (CIC) protocols, which are well developed and understood for large parallel and information systems, but not much has been done for resource limited embedded systems. This work implements and evaluates some of the most common CIC protocols in a periodic resource constrained distributed embedded system for graceful degradation purposes. A test-bed has been developed and used for the evaluation of the various protocols. The implemented protocols are thoroughly studied and performances are contrasted. Specifically the periodicity property and how it benefits checkpointing in embedded systems is investigated. This work introduces a unique effort of CIC protocol implementation and evaluation in the field of distributed embedded systems. Other than providing a test-bed for graceful degradation support, this work shows that some checkpointing protocols that are not efficient in large information systems and supercomputers perform well in embedded systems. We show that a simple index-based CIC protocol, such as the BCS protocol, is more appropriate in embedded system applications compared to other protocols that piggyback a significant amount of information to reduce the number of forced checkpoints. Finally, this work proposes a whole graceful degradation approach to achieve fault tolerance in resource constrained real-time embedded systems. As a case study, the BCS protocol was used to checkpoint the avionics of an unmanned aerial vehicle. Faults were injected during run-time causing one of the system's stability control tasks to fail. The system was able to recover in a very short time by restarting the affected task on a different processor with a correct state with a time delay that did not cause any instability.
机译:正常降级是一种开发可靠的,对安全性至关重要的嵌入式应用程序的方法,其中冗余的活动或备用资源用于在运行时通过系统重新配置来应对故障。与传统的硬件和软件冗余相比,这是一种很有前途的技术,可以显着降低成本,尺寸,重量和功耗,从而实现可靠性。在运行时进行重新配置必须使用正确的检查点协议来支持状态保留,以确保正确的任务在系统重新配置后重新启动。最常见的检查点协议之一是通信诱导检查点(CIC)协议,该协议已为大型并行和信息系统很好地开发和理解,但对于资源有限的嵌入式系统却做得很少。这项工作在周期性资源受限的分布式嵌入式系统中实现并评估了一些最常见的CIC协议,以实现正常的降级目的。已经开发了一个试验台,并用于评估各种协议。对已实现的协议进行了彻底的研究,并对性能进行了对比。具体来说,研究了周期性属性及其对嵌入式系统中检查点的好处。这项工作介绍了在分布式嵌入式系统领域中CIC协议实现和评估的独特工作。除了为正常降级支持提供测试平台外,这项工作还表明,某些在大型信息系统和超级计算机中效率不高的检查点协议在嵌入式系统中表现良好。我们显示,与其他协议相比,简单的基于索引的CIC协议(如BCS协议)在嵌入式系统应用程序中更合适,因为其他协议会附带大量信息以减少强制检查点的数量。最后,这项工作提出了一种全面的降级方法,以在资源受限的实时嵌入式系统中实现容错能力。作为一个案例研究,BCS协议被用来检查无人驾驶飞机的航空电子设备。在运行时注入了故障,导致系统的稳定性控制任务之一失败。通过以正确的状态重新启动受影响的任务,并具有一定的延时(不会引起任何不稳定),系统可以在很短的时间内恢复正常。

著录项

  • 作者

    Sababha, Belal Hussein.;

  • 作者单位

    Oakland University.;

  • 授予单位 Oakland University.;
  • 学科 Engineering Computer.;Engineering Aerospace.;Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 119 p.
  • 总页数 119
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号