Self healing in System-S


Abstract

Faults in a cluster are inevitable. The larger the cluster, the more likely the occurrence of some failure in hardware, in software, or by human error. System-S software must detect and self-repair failures while carrying out its prime directive—enabling stream processing program fragments to be distributed and connected to form complex applications. Depending on the type of failure, System-S may be able to continue with little or no disruption to potentially tens of thousands of interdependent and heterogeneous program fragments running across thousands of nodes.

Bibliographic details

  • Source
    Cluster Computing | 2008, Issue 3 | pp. 247-257 | 11 pages
  • Author affiliations

    Center for Reliable and High-Performance Computing, University of Illinois at Urbana-Champaign, 1308 W. Main St., Urbana, IL 61820, USA;

    IBM T.J. Watson Research Center, IBM Research, 19 Skyline Dr., Hawthorne, NY 10532, USA;

    IBM T.J. Watson Research Center, IBM Research, 19 Skyline Dr., Hawthorne, NY 10532, USA;

    IBM T.J. Watson Research Center, IBM Research, 19 Skyline Dr., Hawthorne, NY 10532, USA;

    IBM T.J. Watson Research Center, IBM Research, 19 Skyline Dr., Hawthorne, NY 10532, USA;

  • Original format: PDF
  • Language: English
  • Keywords

    Fault-tolerance; Stream processing systems; Distributed recovery;

