首页> 外文期刊>Reliability Engineering & System Safety >Optimal task replication considering reliability, performance, and energy consumption for parallel computing in cloud systems
【24h】

Optimal task replication considering reliability, performance, and energy consumption for parallel computing in cloud systems

机译:考虑云系统中并行计算的可靠性,性能和能量消耗的最佳任务复制

获取原文
获取原文并翻译 | 示例
           

摘要

In a cloud-based cyber-physical system, many jobs consist of multiple parallel tasks. The cloud system usually adopts active task replication to improve performance and guarantee the reliability of a job. This technology creates redundant replicas for each task and then executes the replicas concurrently. In the cloud system, each replica is a virtual machine (VM) image that can be easily assigned to different physical machines (PMs) to overcome resource heterogeneity. However, how to design a rational task replication strategy (including replica creation and VM assignment) is indeed a complex issue. It should comprehensively consider correlations and tradeoffs among reliability, performance, and energy consumption. This paper first proposes a reliability-performance correlation model for a job executed by using active task replication. We design a general method to avoid analyzing complex failure correlations and give a Bayesian approach to calculate the performability metric of the job. The paper also proposes a reliability-energy correlation model to evaluate random energy consumption of a PM hosting multiple VMs by using mixed random variables. Finally, an expected net profit optimization model and a genetic algorithm are developed to search for an optimal task replication strategy balancing tradeoffs among reliability, performance, and energy consumption. Illustrative examples are provided.
机译:在基于云的网络物理系统中,许多作业包括多个并行任务。云系统通常采用活动任务复制,以提高性能并保证作业的可靠性。此技术为每个任务创建冗余副本,然后同时执行副本。在云系统中,每个副本是一个虚拟机(VM)图像,可以很容易地分配给不同的物理机器(PMS)以克服资源异质性。但是,如何设计Rational Task Replication策略(包括副本创建和VM分配)确实是一个复杂的问题。它应该全面考虑可靠性,性能和能耗之间的相关性和权衡。本文首先提出了通过使用Active Task Replication执行的作业的可靠性性能相关模型。我们设计一种普遍的方法,以避免分析复杂的故障相关性,并给出贝叶斯方法来计算作业的可执行性度量。本文还提出了一种可靠性 - 能量相关模型来评估通过混合随机变量来评估托管多个VM的PM的随机能耗。最后,开发了预期的净利润优化模型和遗传算法,以搜索可靠性,性能和能耗之间的最佳任务复制策略平衡权衡。提供了说明性实例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号