Exploiting Replication and Data Reuse to Efficiently Schedule Data-Intensive Applications on Grids

机译：利用复制和数据重用在网格上有效地调度数据密集型应用程序

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data-intensive applications executing over a computational grid demand large data transfers. These are costly operations. Therefore, taking them into account is mandatory to achieve efficient scheduling of data-intensive applications on grids. Further, within a heterogeneous and ever changing environment such as a grid, better schedules are typically attained by heuristics that use dynamic information about the grid and the applications. However, this information is often difficult to be accurately obtained. On the other hand, although there are schedulers that attain good performance without requiring dynamic information, they were not designed to take data transfer into account. This paper presents Storage Affinity, a novel scheduling heuristic for bag-of-tasks data-intensive applications running on grid environments. Storage Affinity exploits a data reuse pattern, common on many data-intensive applications, that allows it to take data transfer delays into account and reduce the makespan of the application. Further, it uses a replication strategy that yields efficient schedules without relying upon dynamic information that is difficult to obtain. Our results show that Storage Affinity may attain better performance than the state-of-the-art knowledge-dependent schedulers. This is achieved at the expense of consuming more CPU cycles and network bandwidth.

机译：在计算网格上执行的数据密集型应用程序需要大量数据传输。这些都是昂贵的操作。因此，必须考虑这些因素，以实现网格上数据密集型应用程序的有效调度。此外，在诸如网格之类的异构且不断变化的环境中，通常通过使用关于网格及其应用程序的动态信息的试探法来获得更好的调度。但是，该信息通常难以准确获得。另一方面，尽管有些调度程序可以在不需要动态信息的情况下达到良好的性能，但是它们并不是为了考虑数据传输而设计的。本文介绍了Storage Affinity，这是一种新颖的调度启发式方法，适用于在网格环境中运行的任务包数据密集型应用程序。 Storage Affinity利用了许多数据密集型应用程序中常见的数据重用模式，该模式允许它考虑数据传输延迟并减少应用程序的有效期。此外，它使用一种复制策略，该策略可产生有效的计划，而不必依赖于难以获取的动态信息。我们的结果表明，存储亲和性可能比最新的知识相关调度器具有更好的性能。以消耗更多的CPU周期和网络带宽为代价来实现。

著录项

来源
《International Workshop on Job Scheduling Strategies for Parallel Processing(JSSPP 2004); 20040613; New York,NY(US)》|2004年|P.210-232|共23页
会议地点 New YorkNY(US)
作者
Elizeu Santos-Neto; Walfredo Cirne; Francisco Brasileiro; Aliandro Lima;
展开▼
作者单位

Universidade Federal de Campina Grande;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类在其他方面的应用;
关键词

相似文献

外文文献
中文文献
专利

1. An architecture and a dynamic scheduling algorithm of grid for providing security for real-time data-intensive applications1 [J] . Mohd Rafiqul Islam1, *, Mohd Toufiq Hasan2 and G. M. Ashaduzzaman2 International Journal of Network Management . 2011,第5期

机译：网格的架构和动态调度算法，可为实时数据密集型应用程序提供安全性1
2. An SCP-based heuristic approach for scheduling distributed data-intensive applications on global grids [J] . Srikumar Venugopal, Rajkumar Buyya Journal of Parallel and Distributed Computing . 2008,第4期

机译：基于SCP的启发式方法，用于在全局网格上调度分布式数据密集型应用程序
3. Security-driven scheduling for data-intensive applications on grids [J] . Tao Xie, Xiao Qin Cluster computing . 2007,第2期

机译：安全驱动的调度，用于网格上的数据密集型应用
4. Exploiting Replication and Data Reuse to Efficiently Schedule Data-Intensive Applications on Grids [C] . International Workshop on Job Scheduling Strategies for Parallel Processing . 2005

机译：利用复制和数据重用以有效地安排网格上的数据密集型应用程序
5. Efficient data scheduling for real-time large-scale data-intensive distributed applications. [D] . Eltayeb, Mohammed Soleiman. 2004

机译：针对实时大规模数据密集型分布式应用程序的高效数据调度。
6. Efficient Dynamic Replication Algorithm Using Agent for Data Grid [O] . Priyanka Vashisht, Rajesh Kumar, Anju Sharma -1

机译：使用代理进行数据网格的高效动态复制算法
7. Exploiting Replication and Data Reuse to Efficiently Schedule Data-intensive Applications on Grids [O] . Elizeu Santos-neto, Walfredo Cirne, Francisco Brasileiro, 2004

机译：利用复制和数据重用在网格上有效地调度数据密集型应用程序

Exploiting Replication and Data Reuse to Efficiently Schedule Data-Intensive Applications on Grids

摘要

著录项

相似文献

相关主题

期刊订阅