Journal: Information Services & Use

Deduplication of metadata harvested from Open Archives Initiative repositories

Abstract

Open access (OA) is a way of providing unrestricted access via the Internet to peer-reviewed journal articles as well as theses, monographs and book chapters. Many open access repositories have been created in the last decade, and a number of registry websites index these repositories. This article analyzes the repositories indexed by the Open Archives Initiative (OAI) organization in terms of record duplication. Based on a sample of 958 metadata files containing records modified in 2012, we provide an estimate of the number of duplicates in the entire collection of repositories indexed by OAI. In addition, this work describes several open source tools that form a generic workflow suitable for deduplication of bibliographic records.
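The abstract does not name the tools or the matching criteria used, so the following is only a minimal illustrative sketch of what deduplicating bibliographic records and estimating a duplicate rate from a sample could look like. The record fields (`id`, `title`, `year`), the function names, and the exact-match rule on a normalized title plus year are all assumptions made for illustration, not the authors' method.

```python
import re
import unicodedata
from collections import defaultdict


def normalize_title(title: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", title)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    alnum_only = re.sub(r"[^a-z0-9]+", " ", no_accents.lower())
    return alnum_only.strip()


def find_duplicates(records):
    """Group records whose normalized title and year match (assumed rule).

    Returns the groups of record ids that have more than one member.
    """
    groups = defaultdict(list)
    for rec in records:
        key = (normalize_title(rec["title"]), rec.get("year"))
        groups[key].append(rec["id"])
    return [ids for ids in groups.values() if len(ids) > 1]


def duplicate_rate(records):
    """Fraction of records that are redundant copies of an earlier record."""
    if not records:
        return 0.0
    redundant = sum(len(ids) - 1 for ids in find_duplicates(records))
    return redundant / len(records)


# Hypothetical sample records, not taken from the study's data.
sample = [
    {"id": "a1", "title": "Deduplication of Metadata", "year": 2012},
    {"id": "b7", "title": "deduplication of metadata.", "year": 2012},
    {"id": "c3", "title": "Open Access Repositories", "year": 2012},
]

print(find_duplicates(sample))
print(duplicate_rate(sample))
```

Exact matching on a normalized key is the simplest possible criterion; a real workflow would likely also need fuzzy matching on titles and authors, which this sketch leaves out.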