首页> 外文会议>Proof of Designed Reliability >High-availability algorithms for distributed stream processing
【24h】

High-availability algorithms for distributed stream processing

机译:分布式流处理的高可用性算法

获取原文
获取原文并翻译 | 示例

摘要

Stream-processing systems are designed to support an emerging class of applications that require sophisticated and timely processing of high-volume data streams, often originating in distributed environments. Unlike traditional data-processing applications that require precise recovery for correctness, many stream-processing applications can tolerate and benefit from weaker recovery guarantees. In this paper, we study various recovery guarantees and pertinent recovery techniques that can meet the correctness and performance requirements of stream-processing applications. We discuss the design and algorithmic challenges associated with the proposed recovery techniques and describe how each can provide different guarantees with proper combinations of redundant processing, checkpointing, and remote logging. Using analysis and simulations, we quantify the cost of our recovery guarantees and examine the performance and applicability of the recovery techniques. We also analyze how the knowledge of query network properties can help decrease the cost of high availability.
机译:流处理系统旨在支持新兴的一类应用程序,这些应用程序需要对大量数据流进行复杂,及时的处理,这些数据流通常起源于分布式环境。与要求精确恢复以确保正确性的传统数据处理应用程序不同,许多流处理应用程序可以忍受较弱的恢复保证并从中受益。在本文中,我们研究了各种可满足流处理应用程序正确性和性能要求的恢复保证和相关恢复技术。我们讨论与提出的恢复技术相关的设计和算法挑战,并描述每种方法如何通过冗余处理,检查点和远程日志记录的适当组合来提供不同的保证。通过分析和模拟,我们可以量化恢复保证的成本,并检查恢复技术的性能和适用性。我们还将分析查询网络属性的知识如何帮助降低高可用性的成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号