...
首页> 外文期刊>The Journal of Artificial Intelligence Research >The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters
【24h】

The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters

机译:平滑的对立面:一种用于对查询特定的文档簇进行排名的语言模型方法

获取原文
获取原文并翻译 | 示例
           

摘要

Exploiting information induced from (query-specific) clustering of top-retrieved documents has long been proposed as a means for improving precision at the very top ranks of the returned results. We present a novel language model approach to ranking query-specific clusters by the presumed percentage of relevant documents that they contain. While most previous cluster ranking approaches focus on the cluster as a whole, our model utilizes also information induced from documents associated with the cluster. Our model substantially outperforms previous approaches for identifying clusters containing a high relevant-document percentage. Furthermore, using the model to produce document ranking yields precision-at-top-ranks performance that is consistently better than that of the initial ranking upon which clustering is performed. The performance also favorably compares with that of a state-of-the-art pseudo-feedback-based retrieval method.
机译:长期以来,一直有人提出利用从(特定于查询的)顶部检索文档的聚类中诱发的信息,作为提高返回结果的最高级精度的一种方法。我们提出了一种新颖的语言模型方法,可根据所包含的相关文档的假定百分比对查询特定的群集进行排名。尽管大多数以前的聚类排名方法都将重点放在整个聚类上,但我们的模型还利用了与聚类相关联的文档中引入的信息。我们的模型大大优于先前的方法,该方法可以识别包含较高相关文档百分比的聚类。此外,使用该模型产生文档排名可产生最高排名精度的性能,该性能始终优于执行聚类的初始排名。与最新的基于伪反馈的检索方法相比,该性能也具有优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号