首页> 外文会议>Symposium on Mass Storage Systems and Technologies >A long-term user-centric analysis of deduplication patterns
【24h】

A long-term user-centric analysis of deduplication patterns

机译:以用户为中心的长期重复数据删除模式分析

获取原文

摘要

Deduplication has become essential in disk-based backup systems, but there have been few long-term studies of backup workloads. Most past studies either were of a small static snapshot or covered only a short period that was not representative of how a backup system evolves over time. For this paper, we collected 21 months of data from a shared user file system; 33 users and over 4,000 snapshots are covered. We analyzed the data set for a variety of essential characteristics. However, our primary focus was individual user data. Despite apparently similar roles and behavior in all of our users, we found significant differences in their deduplication ratios. Moreover, the data that some users share with others had a much higher deduplication ratio than average. We analyze this behavior and make recommendations for future deduplication systems design.
机译:重复数据删除在基于磁盘的备份系统中已变得至关重要,但是对备份工作负载的长期研究很少。过去的大多数研究要么只是一个小的静态快照,要么仅涵盖了很短的时间,这并不代表备份系统随着时间的发展而变化。在本文中,我们从共享用户文件系统中收集了21个月的数据;涵盖了33个用户和4,000多个快照。我们分析了数据集的各种基本特征。但是,我们的主要重点是个人用户数据。尽管我们所有用户的角色和行为显然相似,但我们发现他们的重复数据删除率存在显着差异。此外,一些用户与其他用户共享的数据具有比平均更高的重复数据删除率。我们分析了这种行为,并为将来的重复数据删除系统设计提出了建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号