首页> 外文期刊>International Journal of Networking and Computing >Checkpointing Strategies for Scheduling Computational Workflows
【24h】

Checkpointing Strategies for Scheduling Computational Workflows

机译:调度计算工作流的检查点策略

获取原文
           

摘要

We study the scheduling of computational workflows on compute resources that experience exponentially distributed failures. When a failure occurs, rollback and recovery is used to resume the execution from the last checkpointed state. The scheduling problem is to minimize the expected execution time by deciding in which order to execute the tasks in the workflow and deciding for each task whether to checkpoint it or not after it completes. We give a polynomial-time optimal algorithm for fork DAGs (Directed Acyclic Graphs) and show that the problem is NP-complete with join DAGs. We also investigate the complexity of the simple case in which no task is checkpointed. Our main result is a polynomial-time algorithm to compute the expected execution time of a workflow, with a given task execution order and specified to-be-checkpointed tasks. Using this algorithm as a basis, we propose several heuristics for solving the scheduling problem. We evaluate these heuristics for representative workflow configurations.?
机译:我们研究在经历指数分布的故障的计算资源上计算工作流的调度。发生故障时,将使用回滚和恢复从上一个检查点状态恢复执行。调度问题是通过确定以什么顺序执行工作流中的任务并确定每个任务在完成后是否对其进行检查来使期望的执行时间最小化。我们给出了叉DAG(有向无环图)的多项式时间最优算法,并证明了问题是连接DAG的NP完全问题。我们还研究了没有任务被检查点的简单情况的复杂性。我们的主要结果是多项式时间算法,用于计算工作流的预期执行时间,其中包含给定的任务执行顺序和指定的待检查任务。以该算法为基础,提出了几种启发式算法来解决调度问题。我们评估这些启发式方法是否具有代表性的工作流程配置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号