首页> 中文期刊> 《计算机应用与软件》 >海量图片文件存储去重技术研究

海量图片文件存储去重技术研究

         

摘要

提出一种基于分布式数据库与分布式文件系统相结合的海量图片文件存储去重技术。该技术通过提取图片文件二进制流的特征段计算文件 MD5码签名,依据签名对图片文件进行存储去重。结合实验数据分析验证该技术不仅能够准确地去重图片,有较高的删除率,且经对比得到该技术在计算签名时间、上传速度等方面均优于文件级去重和块级去重技术,是对海量图片数据存储的一种优化。同时针对该技术的不足提出了改进方案。%In this paper we present a deduplication technology for massive image files storage.This technology,which is based on the combination of distributed database and distributed file system,calculates file’s of MD5 signature by extracting the feature segment of binary stream of image files,and deduplicates the storage in regard to image files according to the signature.It has been analysed and verified in combination with the experimental data that this technology is accurate in deduplicating images,besides,it has a high deletion rate.What’s more,compared with file-level deduplication and block-level deduplication technology,this technology is superior in calculating the time of signature and uploading speed,and offers an optimisation to massive image files storage.Meanwhile,we also put forward in this paper an improved scheme aiming at the deficiency of this technology.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号