
MeSH-based dataset for measuring the relevance of text retrieval

Abstract

Creating simulated search environments has been of significant interest in information retrieval, in both the general and biomedical search domains. Existing collections include a modest number of queries and are constructed by manually evaluating retrieval results. In this work we propose leveraging MeSH term assignments to create synthetic test beds. We select a suitable subset of MeSH terms as queries and use the MeSH term assignments as relevance labels for retrieval evaluation. Using well-studied retrieval functions, we show that their performance on the proposed data is consistent with findings in previous work. We further use the proposed evaluation framework to better understand how to combine heterogeneous sources of textual information.
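
The evaluation idea in the abstract can be illustrated with a minimal sketch: a MeSH term serves as the query, and every document annotated with that term counts as relevant. The toy corpus, the term-count scoring function, and all names below (`DOCS`, `score`, `rank`, `average_precision`, `evaluate`) are hypothetical and chosen only for illustration; the paper's actual term selection and retrieval functions are not specified here.

```python
from collections import Counter

# Hypothetical toy corpus: each document has free text and assigned MeSH terms.
DOCS = {
    "d1": {"text": "aspirin reduces risk of myocardial infarction",
           "mesh": {"Aspirin", "Myocardial Infarction"}},
    "d2": {"text": "aspirin dosage in children with fever",
           "mesh": {"Aspirin", "Fever"}},
    "d3": {"text": "insulin therapy for diabetes",
           "mesh": {"Insulin", "Diabetes Mellitus"}},
    "d4": {"text": "acetylsalicylic acid and stroke prevention",
           "mesh": {"Aspirin", "Stroke"}},
}


def score(query, text):
    """Count occurrences of query tokens in the text.

    This is a simple stand-in for a real retrieval function such as BM25.
    """
    counts = Counter(text.lower().split())
    return sum(counts[tok] for tok in query.lower().split())


def rank(query, docs):
    """Return document ids by descending score; ties are broken by id."""
    return sorted(docs, key=lambda d: (-score(query, docs[d]["text"]), d))


def average_precision(ranking, relevant):
    """Average precision of a ranking against a set of relevant ids."""
    hits, total = 0, 0.0
    for position, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / position
    return total / len(relevant) if relevant else 0.0


def evaluate(mesh_query, docs):
    """Use the MeSH term as the query and its assignments as relevance labels."""
    relevant = {d for d, doc in docs.items() if mesh_query in doc["mesh"]}
    return average_precision(rank(mesh_query, docs), relevant)


if __name__ == "__main__":
    print(evaluate("Aspirin", DOCS))
```

In this toy example, document `d4` is labeled with `Aspirin` but never mentions the word, so a purely lexical ranker places it last and the average precision falls below 1. Gaps of this kind are what label-based evaluation can expose without manual relevance judgments.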
