首页> 外文会议>British National Conference on Databases(BNCOD 23); 20060718-23; Belfast(GB) >Scheduling Strategies and Their Evaluation in a Data Stream Management System
【24h】

Scheduling Strategies and Their Evaluation in a Data Stream Management System

机译:数据流管理系统中的调度策略及其评估

获取原文
获取原文并翻译 | 示例

摘要

MavStream, a Data Stream Management System (DSMS), has been developed for processing stream data from applications such as network monitoring, sensor monitoring and traffic management systems that require near-real time results and have to process unbounded streams of data. In order to be useful, a result produced by MavStream has to meet certain Quality of Service (QoS) requirements on tuple latency, memory usage, and throughput. Strategies used for scheduling the operators of continuous query (CQ) significantly affect the QoS metrics and hence are of interest. This paper discusses scheduling strategies used in MavStream, their design, implementation, and evaluation. Scheduling is done in MavStream at the operator level. The scheduler maintains a ready queue of operators and decides on the operators to be scheduled based on the scheduling strategy. We first introduce the path capacity scheduling strategy with the goal of minimizing tuple latency by scheduling operator paths with maximum processing capacity. Later we discuss segment-scheduling strategy that aims at minimization of total memory requirement by scheduling operator segments with maximum memory release capacity. We then discuss simplified segment strategy, which splits operator path into just two segments providing better tuple latency performance than segment scheduling strategy and lower memory utilization than path capacity scheduling strategy. Extensive set of experiments have been designed and performed to evaluate the proposed scheduling strategies by simulating real time streams. The performance metrics of average tuple latency, memory utilization and throughput are compared with each other for different strategies and with round robin strategy to validate the analytical conclusions.
机译:MavStream是一种数据流管理系统(DSMS),用于处理来自应用程序(例如网络监控,传感器监控和流量管理系统)的流数据,这些应用需要近乎实时的结果,并且必须处理无限制的数据流。为了有用,MavStream产生的结果必须满足有关元组延迟,内存使用和吞吐量的某些服务质量(QoS)要求。用于调度连续查询(CQ)的运算符的策略会显着影响QoS指标,因此受到关注。本文讨论了MavStream中使用的调度策略,其设计,实现和评估。调度是在MavStream中的操作员级别完成的。调度程序维护操作员准备就绪的队列,并根据调度策略决定要调度的操作员。我们首先介绍路径容量调度策略,其目的是通过调度具有最大处理能力的操作员路径来最大程度地减少元组等待时间。稍后,我们将讨论分段调度策略,该策略旨在通过调度具有最大内存释放容量的运算符分段来最大程度地减少总内存需求。然后,我们讨论简化的段策略,该策略将操作员路径分为两个段,比段调度策略提供更好的元组等待时间性能,比路径容量调度策略提供更低的内存利用率。已经设计并进行了广泛的实验,以通过模拟实时流来评估所提出的调度策略。将平均元组等待时间,内存利用率和吞吐量的性能指标针对不同的策略进行比较,并使用循环策略进行比较以验证分析结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号