首页> 外文会议>2014 IEEE/ACM Joint Conference on Digital Libraries >Bridging the gap between real world repositories and Scalable Preservation Environments
【24h】

Bridging the gap between real world repositories and Scalable Preservation Environments

机译:缩小现实世界存储库与可扩展的保存环境之间的差距

获取原文
获取原文并翻译 | 示例

摘要

Integrating large scale processing environments, such as Hadoop, with traditional repository systems, such as Fedora Commons 3, have long proved a daunting task. In this paper we show how this integration can be achieved using software developed in the SCAPE project. The SCAPE integration is based on four steps: retrieving the metadata records from the repository, reading the records and their references to data files, updating the records, and storing them back in the repository. This allows full use of the Hadoop system for massively distributed processing without causing excessive load on the repository.
机译:长期以来,将Hadoop等大规模处理环境与Fedora Commons 3等传统存储系统集成在一起一直是一项艰巨的任务。在本文中,我们展示了如何使用SCAPE项目中开发的软件来实现这种集成。 SCAPE集成基于四个步骤:从存储库中检索元数据记录,读取记录及其对数据文件的引用,更新记录并将它们存储回存储库中。这样可以充分利用Hadoop系统进行大规模分布式处理,而不会导致存储库上的过多负载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号