Journal: BMC Medical Informatics and Decision Making

An update on Uniform Resource Locator (URL) decay in MEDLINE abstracts and measures for its mitigation



Abstract

Background

For years, Uniform Resource Locator (URL) decay, or "link rot", has been a growing concern in the biomedical sciences. This paper addresses the issue by examining the status of URLs published in MEDLINE abstracts, establishing their current availability and estimating URL decay in these records from 1994 to 2006. We also reviewed the content at each URL to determine whether the information the authors cited when writing the paper is the same as what is presently available there. Lastly, of the documented methods recommended for preserving URLs, we determined which have gained acceptance among authors and publishers.

Methods

MEDLINE records from 1994 to 2006, obtained from the National Library of Medicine in Extensible Markup Language (XML) format, were processed, yielding 10,208 URL addresses. These were accessed once daily at random times for 30 days. Titles and abstracts were also searched for the presence of archival tools such as WebCite, Persistent URL (PURL) and Digital Object Identifier (DOI).

Results

URL length ranged from 13 to 425 characters, with a mean length of 35 characters [Standard Deviation (SD) = 13.51; 95% confidence interval (CI) 13.25 to 13.77]. The most common top-level domains were ".org" and ".edu", each with 34%. About 81% of the URL pool was available 90% to 100% of the time, but only 78% of these contained the actual information mentioned in the MEDLINE record. "Dead" URLs constituted 16% of the total. Finally, a survey of archival tool usage showed that, since the introduction of the DOI in 1998, only 519 of all abstracts reviewed had incorporated DOI addresses.

Conclusion

URL persistence parallels previous studies, showing approximately 81% general availability during the 1-month study period. As peer-reviewed literature remains the main source of information in biomedicine, we need to ensure the accuracy and preservation of these links.
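The availability measure described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the URL pattern (which only matches URLs with an explicit `http`, `https` or `ftp` scheme), and the definition of a "dead" URL as one that never responded are all assumptions made for illustration. Only the 90% threshold and the 30 daily checks come from the abstract. The sketch does no network access; it takes the daily check outcomes as given.

```python
import re

# Assumed pattern: only URLs with an explicit scheme are matched.
URL_PATTERN = re.compile(r"(?:https?|ftp)://[^\s<>()\[\]]+", re.IGNORECASE)


def extract_urls(abstract: str) -> list[str]:
    """Return URLs found in an abstract, stripping trailing punctuation."""
    return [match.rstrip(".,;:") for match in URL_PATTERN.findall(abstract)]


def availability(daily_results: list[bool]) -> float:
    """Fraction of daily checks on which the URL responded."""
    if not daily_results:
        raise ValueError("no checks recorded")
    return sum(daily_results) / len(daily_results)


def classify(daily_results: list[bool]) -> str:
    """Label a URL from its daily check outcomes.

    'available' uses the 90%-100% band reported in the abstract;
    'dead' (never responded) is an assumed definition.
    """
    rate = availability(daily_results)
    if rate >= 0.9:
        return "available"
    if rate == 0.0:
        return "dead"
    return "intermittent"


if __name__ == "__main__":
    text = "Data are available at http://example.org/db."
    for url in extract_urls(text):
        # Hypothetical outcomes: reachable on 28 of 30 daily checks.
        checks = [True] * 28 + [False] * 2
        print(url, availability(checks), classify(checks))
```

A real replication would also need to decide how to treat redirects, timeouts and URLs printed without a scheme, none of which the abstract specifies.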
