【24h】

How Much of the Web Is Archived?

机译:多少Web已存档?

获取原文
获取原文并翻译 | 示例

摘要

The Memento Project's archive access additions to HTTP have enabled development of new web archive access user interfaces. After experiencing this web time travel, the inevitable question that comes to mind is "How much of the Web is archived?" This question is studied by approximating the Web via sampling URIs from DMOZ. Delicious, Bitly. and search engine indexes and measuring number of archive copies available in various public web archives. The results indicate that 35%-90% of URIs have at least one archived copy, 17%-49% have two to five copies, l%-8% have six to ten copies, and 8%-63% at least ten copies. The number of URI copies varies as a function of time, but only 14.6-31.3% of URIs are archived more than once per month.
机译:Memento项目对HTTP的存档访问权限的新增功能使开发新的Web存档访问用户界面成为可能。经历了这段Web时间旅行之后,想到的一个不可避免的问题是“多少Web已存档?”通过从DMOZ采样URI来近似Web来研究这个问题。真好吃搜索引擎索引,并测量各种公共Web档案中可用的档案副本数量。结果表明,35%-90%的URI具有至少一个存档副本,17%-49%的具有2至5个副本,l%-8%的具有6至10个副本,8%-63%的具有至少10个副本。 URI副本的数量随时间而变化,但是每个月仅存档14.6-31.3%的URI超过一次。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号