首页> 外文会议>International Workshop on Experimental and Efficient Algorithms(WEA 2005); 20050510-13; Santorini Island(GR) >Integrating Coordinated Checkpointing and Recovery Mechanisms into DSM Synchronization Barriers
【24h】

Integrating Coordinated Checkpointing and Recovery Mechanisms into DSM Synchronization Barriers

机译:将协调的检查点和恢复机制集成到DSM同步屏障中

获取原文
获取原文并翻译 | 示例

摘要

Distributed Shared Memory (DSM) creates an abstraction of a physical shared memory that parallel programmers can access. Most recent software DSMs provide relaxed memory models that guarantee consistency only at synchronization operations. As the main goal of DSM systems is to provide support for long term computation intensive applications, checkpointing and recovery mechanisms are highly desirable. This article presents and evaluates the integration of a coordinated checkpointing mechanism to the barrier primitive that is usually provided with many DSM systems. Our results on some popular benchmarks and a real parallel application show that the overhead introduced during the failure-free execution is often small.
机译:分布式共享内存(DSM)创建了并行程序员可以访问的物理共享内存的抽象。最新的软件DSM提供宽松的内存模型,仅在同步操作时才保证一致性。由于DSM系统的主要目标是为长期的计算密集型应用程序提供支持,因此非常需要检查点和恢复机制。本文介绍并评估了协调检查点机制与通常由许多DSM系统提供的屏障原语的集成。我们在一些流行的基准测试和实际的并行应用程序上的结果表明,在无故障执行过程中引入的开销通常很小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号