【24h】

Data Placement in Widely Distributed Environments

机译:在广泛分布的环境中的数据放置

获取原文

摘要

The increasing computation and data requirements of scientific applications, especially in the areas of bioinformatics, astronomy, high energy physics, and earth sciences, have necessitated the use of distributed resources owned by collaborating parties. While existing distributed systems work well for compute-intensive applications that require limited data movement, they fail in unexpected ways when the application accesses, creates, and moves large amounts of data over wide-area networks. Existing systems closely couple data movement and computation, and consider data movement as a side effect of computation. In this chapter, we propose a framework that de-couples data movement from computation, allows queuing and scheduling of data movement apart from computation, and acts as an I/O subsystem for distributed systems. This system provides a uniform interface to heterogeneous storage systems and data transfer protocols; permits policy support and higher-level optimization; and enables reliable, efficient scheduling of compute and data resources.
机译:越来越多的计算和数据要求的科学应用,特别是在生物信息学,天文学,高能量物理和地球科学领域,都需要使用合作缔约方所拥有的分布式资源。虽然现有的分布式系统适用于需要有限数据移动的计算密集型应用程序,但在应用程序访问,创建和移动广域网中的大量数据时,它们以意外的方式失败。现有系统紧密地耦合数据移动和计算,并将数据移动视为计算的副作用。在本章中,我们提出了一种框架,该框架将数据移动从计算执行,允许除了计算之外的数据移动的排队和调度,并充当分布式系统的I / O子系统。该系统为异构存储系统和数据传输协议提供统一的接口;允许政策支持和更高级别的优化;并实现计算和数据资源的可靠,有效的调度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号