2017 ACM/IEEE Joint Conference on Digital Libraries

Impact of URI Canonicalization on Memento Count


Abstract

Memento TimeMaps list identifiers for archival web captures (URI-Ms). When dereferenced, some URI-Ms redirect to a different URI-M instead of returning a unique representation for the requested datetime. This suggests that an accurate count of the non-forwarding captures for an Original Resource URI (URI-R) cannot be obtained with confidence from a TimeMap alone, and that the size of a TimeMap is not equivalent to the number of representations it identifies. This work is an abbreviated version of the full technical report, which describes this phenomenon in depth. For google.com, we found that 84.9% of the URI-Ms in a TimeMap result in an HTTP redirect when dereferenced. The full study applies this technique to seven other URI-Rs of large web sites and to 13 academic institutions. We use a ratio metric: the number of URI-Ms that do not redirect divided by the number that do redirect when dereferenced. The TimeMaps of five of the eight large web sites and two of the thirteen academic institutions had a ratio of less than one, indicating that more than half of the URI-Ms in these TimeMaps result in redirects when dereferenced.
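The ratio metric described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `count_redirects` and `redirect_ratio` are hypothetical, the input is assumed to be a list of HTTP status codes already collected by dereferencing each URI-M in a TimeMap, and the choice of which status codes count as a redirect is an assumption made here for illustration.

```python
from typing import Iterable, Tuple

# Assumption: these status codes are treated as redirects. The abstract
# does not state which codes the study counted.
REDIRECT_CODES = {301, 302, 303, 307, 308}


def count_redirects(status_codes: Iterable[int]) -> Tuple[int, int]:
    """Return (non_redirecting, redirecting) counts for a TimeMap's URI-Ms."""
    non_redirecting = 0
    redirecting = 0
    for code in status_codes:
        if code in REDIRECT_CODES:
            redirecting += 1
        else:
            # Assumption: every other response counts as non-redirecting.
            non_redirecting += 1
    return non_redirecting, redirecting


def redirect_ratio(status_codes: Iterable[int]) -> float:
    """Ratio of non-redirecting URI-Ms to redirecting URI-Ms.

    A value below 1 means more than half of the URI-Ms redirect.
    """
    non_redirecting, redirecting = count_redirects(status_codes)
    if redirecting == 0:
        return float("inf")  # no redirects at all
    return non_redirecting / redirecting


# Synthetic data shaped like the google.com figure: 84.9% of URI-Ms redirect.
synthetic_codes = [302] * 849 + [200] * 151
ratio = redirect_ratio(synthetic_codes)
print(f"ratio = {ratio:.3f}, majority redirect: {ratio < 1}")
```

With 84.9% of URI-Ms redirecting, the ratio is 15.1 / 84.9, well below one, which matches the abstract's reading that a ratio under one means most URI-Ms in the TimeMap are redirects.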
