首页> 外文会议>IEEE/ACM Joint Conference on Digital Libraries >Bridging the gap between real world repositories and Scalable Preservation Environments
【24h】

Bridging the gap between real world repositories and Scalable Preservation Environments

机译:弥合现实世界存储库和可扩展保存环境之间的差距

获取原文

摘要

Integrating large scale processing environments, such as Hadoop, with traditional repository systems, such as Fedora Commons 3, have long proved a daunting task. In this paper we show how this integration can be achieved using software developed in the SCAPE project. The SCAPE integration is based on four steps: retrieving the metadata records from the repository, reading the records and their references to data files, updating the records, and storing them back in the repository. This allows full use of the Hadoop system for massively distributed processing without causing excessive load on the repository.
机译:整合大规模处理环境,例如Hadoop,具有传统的存储库系统,例如Fedora Commons 3,长期以来一直证明了一个艰巨的任务。在本文中,我们展示了如何使用Scape项目中开发的软件实现该集成。 Scape集成基于四个步骤:从存储库中检索元数据记录,读取记录及其对数据文件的引用,更新记录,并将其存储在存储库中。这允许充分利用Hadoop系统进行大规模分布式处理,而不会导致存储库上的过度负载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号