首页> 外文OA文献 >Feedback based Performance Management and Fault Tolerance for Networked and Embedded Computing Systems
【2h】

Feedback based Performance Management and Fault Tolerance for Networked and Embedded Computing Systems

机译:基于反馈的网络和嵌入式计算系统的性能管理和容错

摘要

Performance management and fault tolerance are two important issues faced by computing systems research. In this dissertation, we exploit the use of feedback control for performance management and fault tolerance. Specifically, we propose Queueing Model Based Feedback Control scheme to achieve performance regulation. Traditionally, queueing theory was used for modeling computing system's performance. It usually serves as an offline capacity planning tool. On the other hand, feedback control theory was used for dynamically controlling the performance of electro-mechanical systems. How to utilize the ``descriptive'' power of queueing theory and the ``prescriptive'' power of feedback control to control computing system's performance was an open problem. Queueing Model Based Feedback Control answers this problem by integrating the strengths of both queueing model and feedback control into one unified framework. It provides better performance regulation compared with other control-theoretic approaches. We show the advantages of Queueing Model Based Feedback Control for two networked server applications: one is response time regulation for Apache Web server via dynamic resource allocations; the other is response time regulation for a multi-tiered Web service application via dynamic admission control. In the second part of this dissertation, we further exploit the use of feedback control to achieve fault tolerance for real-time embedded control systems. We propose ORTGA (On-demand Real-Time GuArd), a new fault tolerance architecture which utilizes feedback control based software execution. ORTGA delivers the same functionalities as previously proposed Simplex architecture, with the same high fault coverage and reliability but with much more efficient resource utilization and flexibility. Hence it can be deployed in a wide range of real-time embedded applications to provide fault tolerance. We implemented ORTGA in an inverted pendulum testbed to demonstrate its efficacy and efficiency. Based on the ORTGA design, we discussed the fault tolerance and scheduling co-design problem and its solutions.
机译:性能管理和容错是计算系统研究面临的两个重要问题。本文利用反馈控制技术进行绩效管理和容错。具体来说,我们提出了基于排队模型的反馈控制方案来实现性能调节。传统上,使用排队论对计算系统的性能进行建模。它通常用作离线容量规划工具。另一方面,反馈控制理论被用于动态地控制机电系统的性能。如何利用排队论的``描述性''能力和反馈控制的``描述性''能力来控制计算系统的性能是一个悬而未决的问题。基于排队模型的反馈控制通过将排队模型和反馈控制的优势整合到一个统一的框架中来解决此问题。与其他控制理论方法相比,它提供了更好的性能调节。我们展示了基于排队模型的反馈控制对于两个网络服务器应用程序的优势:一个是通过动态资源分配来调整Apache Web服务器的响应时间;另一个是通过动态资源分配对Apache Web服务器进行响应时间调节。另一个是通过动态许可控制对多层Web服务应用程序的响应时间进行调节。在本文的第二部分,我们进一步利用反馈控制来实现实时嵌入式控制系统的容错能力。我们提出ORTGA(按需实时GuArd),这是一种新的容错体系结构,它利用基于反馈控制的软件执行功能。 ORTGA提供与以前提出的Simplex体系结构相同的功能,具有相同的高故障覆盖率和可靠性,但具有更高的资源利用率和灵活性。因此,它可以部署在各种实时嵌入式应用程序中以提供容错能力。我们在倒立摆试验台上实施了ORTGA,以证明其功效和效率。在ORTGA设计的基础上,我们讨论了容错与调度协同设计问题及其解决方案。

著录项

  • 作者

    Liu Xue;

  • 作者单位
  • 年度 2006
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号