首页>
外国专利>
Utilizing global digests caching in similarity based data deduplication
Utilizing global digests caching in similarity based data deduplication
展开▼
机译:在基于相似性的重复数据删除中利用全局摘要缓存
展开▼
页面导航
摘要
著录项
相似文献
摘要
Input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. The processor prefers to match the input digests of the input data with the repository digests contained in the global digests cache which are of the similar repository data, rather than repository digests which are of other repository data that was not determined as similar to the input data chunks. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data.
展开▼