Conference: Annual IEEE/IFIP International Conference on Dependable Systems and Networks

Reconsidering Single Failure Recovery in Clustered File Systems


Abstract

Improving the performance of single failure recovery has been an active research topic because single failures are prevalent in large-scale storage systems. We argue that when erasure coding is deployed in a clustered file system (CFS), existing single failure recovery designs are limited in several respects: they neglect the bandwidth diversity of a CFS architecture, target specific erasure code constructions, and give no special treatment to load balancing during recovery. In this paper, we reconsider the single failure recovery problem in a CFS setting and propose CAR, a cross-rack-aware recovery algorithm. For each stripe, CAR finds a recovery solution that retrieves data from the minimum number of racks. It also reduces the amount of cross-rack repair traffic by performing intra-rack data aggregation prior to cross-rack transmission. Furthermore, by considering multi-stripe recovery, CAR balances the amount of cross-rack repair traffic across multiple racks. Evaluation results show that CAR can effectively reduce the amount of cross-rack repair traffic and the resulting recovery time.
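The per-stripe idea in the abstract (read from as few racks as possible, then aggregate inside each rack before sending across racks) can be illustrated with a small sketch. This is not the authors' CAR algorithm, whose details the abstract does not give. It assumes an MDS code in which any k surviving chunks can rebuild the lost one, a replacement node located in the failed chunk's rack, and linear aggregation so that each remote rack sends exactly one combined chunk. The function name `min_rack_recovery` and its parameters are hypothetical.

```python
def min_rack_recovery(chunk_racks, failed, k):
    """Choose k surviving chunks of one stripe while touching as few remote racks as possible.

    chunk_racks: chunk_racks[i] is the rack id holding chunk i of the stripe.
    failed:      index of the lost chunk.
    k:           number of surviving chunks needed (assumption: MDS code).
    Returns (chosen, cross_rack_chunks), where chosen maps rack -> chunk indices
    and cross_rack_chunks counts chunks crossing racks after aggregation.
    """
    # Assumption: the replacement node sits in the rack of the failed chunk.
    home = chunk_racks[failed]

    # Group surviving chunks by rack.
    survivors = {}
    for idx, rack in enumerate(chunk_racks):
        if idx != failed:
            survivors.setdefault(rack, []).append(idx)

    chosen = {}
    need = k

    # Chunks in the home rack cost no cross-rack traffic, so use them first.
    local = survivors.pop(home, [])[:need]
    if local:
        chosen[home] = local
        need -= len(local)

    # Greedy: visit remote racks holding the most surviving chunks first,
    # which minimizes the number of racks needed to collect the rest.
    for rack, chunks in sorted(survivors.items(), key=lambda kv: (-len(kv[1]), kv[0])):
        if need == 0:
            break
        take = chunks[:need]
        chosen[rack] = take
        need -= len(take)

    if need > 0:
        raise ValueError("not enough surviving chunks to recover the stripe")

    # Assumption: each remote rack aggregates its chunks into one before sending.
    cross_rack_chunks = sum(1 for rack in chosen if rack != home)
    return chosen, cross_rack_chunks


if __name__ == "__main__":
    racks = ["A", "A", "A", "B", "B", "B", "C", "C", "C"]
    print(min_rack_recovery(racks, failed=0, k=6))
```

For a stripe of nine chunks spread evenly over three racks with k = 6, the sketch uses the two local chunks and reads from two remote racks, so two aggregated chunks cross racks instead of the four that would cross without aggregation. Balancing this traffic over many stripes, as the abstract describes, is outside the scope of the sketch.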
