Lazy exact deduplication

Abstract

During data deduplication, on-disk fingerprint lookups generate heavy disk traffic, which becomes a bottleneck. In this paper, we propose a “lazy” data deduplication method that buffers incoming fingerprints and performs on-disk lookups in batches, aiming to reduce the disk bottleneck. Deduplication systems generally use prefetching to improve the cache hit rate by exploiting locality within the incoming fingerprint stream. For lazy deduplication, we design a buffering strategy that preserves this locality so that prefetching remains effective. Experimental results indicate that the lazy method improves fingerprint identification performance by over 50% compared with an “eager” method using the same data layout.
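
The batching idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the names `DiskIndex`, `LazyDeduplicator`, `lookup_batch`, and `batch_size` are hypothetical, the "disk" is an in-memory set with a counter, SHA-1 is an assumed fingerprint function, and the locality-preserving buffering and prefetching that the paper describes are not modelled. It only shows how buffering fingerprints turns one lookup per fingerprint into one lookup round per batch while still identifying every duplicate.

```python
import hashlib


class DiskIndex:
    """Hypothetical stand-in for an on-disk fingerprint index (an in-memory set)."""

    def __init__(self):
        self._stored = set()
        self.lookup_rounds = 0  # number of simulated disk lookup rounds

    def lookup_batch(self, fingerprints):
        """Return the subset of `fingerprints` already stored, in one round."""
        self.lookup_rounds += 1
        return {fp for fp in fingerprints if fp in self._stored}

    def insert_batch(self, fingerprints):
        self._stored.update(fingerprints)


class LazyDeduplicator:
    """Buffers fingerprints and resolves them against the index in batches."""

    def __init__(self, index, batch_size=4):
        self.index = index
        self.batch_size = batch_size  # assumed tuning knob, not from the paper
        self._buffer = []             # pending fingerprints, in arrival order
        self.unique = []              # fingerprints identified as new
        self.duplicates = []          # fingerprints identified as duplicates

    def add_chunk(self, data: bytes):
        fp = hashlib.sha1(data).hexdigest()
        self._buffer.append(fp)
        if len(self._buffer) >= self.batch_size:
            self.flush()

    def flush(self):
        if not self._buffer:
            return
        # One lookup round covers the whole buffered batch.
        known = self.index.lookup_batch(set(self._buffer))
        seen_in_batch = set()
        for fp in self._buffer:
            if fp in known or fp in seen_in_batch:
                self.duplicates.append(fp)
            else:
                self.unique.append(fp)
                seen_in_batch.add(fp)
        self.index.insert_batch(seen_in_batch)
        self._buffer.clear()


def run_demo():
    index = DiskIndex()
    dedup = LazyDeduplicator(index, batch_size=4)
    for chunk in [b"a", b"b", b"a", b"c", b"b", b"d", b"e", b"a"]:
        dedup.add_chunk(chunk)
    dedup.flush()  # resolve anything still buffered
    return len(dedup.unique), len(dedup.duplicates), index.lookup_rounds
```

In this toy run, eight chunks are resolved in two lookup rounds, where an eager scheme with one lookup per fingerprint would need eight. Duplicates inside a single batch are caught by the `seen_in_batch` set, so the result stays exact.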
