Journal: 《情报学报》

A Relevance Feedback Algorithm Based on Citation Context and the Citation Network

Abstract

Relevance feedback reformulates the initial retrieval query according to relevance judgments made by the user or the system, and it has been shown to improve retrieval results effectively. For academic literature, citation relationships reflect relatedness in content, so they can supply useful auxiliary information for relevance feedback. This paper proposes an improved relevance feedback algorithm based on citation context, co-citation, and bibliographic coupling. A citation context is the text surrounding a reference marker that points to another scientific work. The algorithm rests on three ideas: (1) citation contexts supply additional terms for representing a document, so they are used to extend the bag-of-words model; (2) in the relevance judgment stage, the co-citation strength and bibliographic coupling strength between relevant documents and other documents in the citation network are used to expand the set of relevant documents; (3) a clustering-based relevance feedback approach is used to extract query expansion terms from the relevant documents. Experiments show that the algorithm improves relevance feedback performance. In addition, correlation analysis shows that co-citation strength and bibliographic coupling strength are significantly correlated with the similarity of document content.
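The two citation-network measures named in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the strengths are computed or combined, so the function names, the toy citation data, the simple sum of the two counts, and the `threshold` parameter are all assumptions made for illustration. Here co-citation strength is taken as the number of documents citing both papers, and bibliographic coupling strength as the number of references the two papers share.

```python
def bibliographic_coupling(refs, a, b):
    """Number of references shared by documents a and b."""
    return len(refs.get(a, set()) & refs.get(b, set()))


def co_citation(refs, a, b):
    """Number of documents whose reference lists contain both a and b."""
    return sum(1 for cited in refs.values() if a in cited and b in cited)


def expand_relevant_set(refs, relevant, candidates, threshold=1):
    """Add candidates that are linked strongly enough to any relevant document.

    Assumption: the link strength is the plain sum of the co-citation and
    coupling counts; the paper may weight or normalise them differently.
    """
    expanded = set(relevant)
    for cand in candidates:
        if cand in expanded:
            continue
        for rel in relevant:
            strength = (co_citation(refs, rel, cand)
                        + bibliographic_coupling(refs, rel, cand))
            if strength >= threshold:
                expanded.add(cand)
                break
    return expanded


# Hypothetical citation data: each key cites the documents in its set.
refs = {
    "A": {"X", "Y"},
    "B": {"X", "Y", "Z"},
    "C": {"W"},
    "D": {"A", "C"},
    "E": {"A", "C"},
}

expanded = expand_relevant_set(refs, {"A"}, ["B", "C", "D"], threshold=2)
```

In this toy data, B joins the relevant set through bibliographic coupling with A (two shared references), C joins through co-citation with A (both are cited by D and E), and D is left out because it has neither kind of link to A.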
