首页> 外文会议>Proceedings of the 2011 ACM/IEEE on joint conference on digital libraries. >An Exploration of Pattern-based Subtopic Modeling for Search Result Diversification
【24h】

An Exploration of Pattern-based Subtopic Modeling for Search Result Diversification

机译:基于模式的搜索结果多样化子主题建模的探索

获取原文
获取原文并翻译 | 示例

摘要

'Traditional information retrieval models do not necessarily provide users with optimal search experience because the top ranked documents may contain the same piece of relevant information, i.e., the same subtopic of a emery. The goal of search result diversification is to return search results that not only are relevant to the query but also cover different subtopics. Therefore, the subtopic modeling is an important research topic in search result diversification. In this paper, we propose a novel pattern based method to extract subtopics from retrieved documents. The basic idea is to explicitly model a query subtopic as a scmanticaJly meaningful text, unit in relevant documents. We apply a frequent pattern mining algorithm to efficiently extract these text units (patterns) from retrieved documents. We then model a query subtopic with a single pattern and rank subtopics based on their similarity with the query. These pattern based subtopics are then used to diversify search results.
机译:传统信息检索模型不一定为用户提供最佳搜索体验,因为排名最高的文档可能包含相同的相关信息,即金刚砂的相同子主题。搜索结果多样化的目标是返回不仅与查询相关而且涵盖不同子主题的搜索结果。因此,子主题建模是搜索结果多样化的重要研究课题。在本文中,我们提出了一种基于模式的新方法,该方法可以从检索到的文档中提取子主题。基本思想是将查询子主题明确建模为相关文档中的语义有意义的文本单元。我们应用一种频繁的模式挖掘算法来从检索到的文档中有效地提取这些文本单元(模式)。然后,我们使用单个模式对查询子主题进行建模,并根据子主题与查询的相似性对子主题进行排名。这些基于模式的子主题随后用于使搜索结果多样化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号