首页> 外文会议>International Symposium of Information Technology >Query Expansion using Information Scent
【24h】

Query Expansion using Information Scent

机译:使用信息的查询扩展

获取原文

摘要

Web has grown to a huge mass of information resource and is diverse in content. To search such rich source of information one has to be very precise in using keywords in queries to retrieve the relevant documents. Most of the queries issued to search engines are short and have ambiguous context. One way to produce effective queries is by automatic query expansion. Work has been done in this field to use the local and global techniques. The global techniques examine word occurrences and relationships in the corpus as a whole and use this information to expand a particular query. Local context analysis examines the concept occurrences and relationship in top ranked documents retrieved by the original input query to expand the same query. Query log of search engines is used by researchers to expand the input queries using the clicked documents related to any of the terms of input query in query session of query log. In this paper a new local analysis technique is proposed which make use of information need of query sessions modeled using Information Scent and content of clicked documents to select the clicked documents for query expansion. Information scent is the subjective sense of value and cost of accessing a page based on perceptual cues with respect to the information need of the user. The input query issued in a particular domain is used to select the set of documents associated with the information need of the query sessions in the same domain and used as local corpora to provide related set of terms to be added to the input query. The resulting expanded query is used to retrieve the relevant documents from the same retrieval system. This approach is unique as it is using those documents in local corpora which belong to the information need associated with the domain in which input query is issued using Information Scent and content of clicked pages in the query sessions and direct the search in a fruitful direction by expanding initial input query using set of related terms. Experimental study of the proposed approach is done on the data set extracted from Web history of "Google" search engine and improvement in the information retrieval precision with low computation complexity during online processing of input queries confirms the effectiveness of the proposed approach.
机译:Web已经发展到大量信息资源,内容多样化。为了搜索这种丰富的信息来源,必须非常精确地在查询中使用关键字来检索相关文档。发出到搜索引擎的大多数查询都很短,并且具有模糊的背景。一种生成有效查询的方法是自动查询扩展。在此字段中已经完成了工作以使用本地和全局技术。全局技术作为整个语料库中的词出现和关系,并使用此信息来扩展特定查询。本地上下文分析检查由原始输入查询检索的顶部排名文档中的概念出现和关系,以扩展相同的查询。研究人员使用查询搜索引擎的日志来使用与查询日志查询会话中的任何输入查询相关的单击文档来扩展输入查询。在本文中,提出了一种新的本地分析技术,该技术利用使用信息香味和点击文档的内容建模的查询会话的信息来选择点击的文档进行查询扩展。信息香味是基于用户的信息需要基于感知提示访问页面的主观价值和成本。在特定域中发出的输入查询用于选择与同一域中查询会话的信息相关联的文件集,并用作本地语料库以提供要添加到输入查询的相关术语集。生成的扩展查询用于从同一检索系统中检索相关文档。这种方法是唯一的,因为它在本地语境中使用的那些文档属于使用查询会话中的信息香味和点击页面的点击页面发出输入查询的域相关联的信息,并通过富有成效的方向指导搜索使用相关术语扩展初始输入查询。所提出的方法的实验研究是在从“谷歌”搜索引擎的网络历史中提取的数据集上完成的,并且在线处理在输入查询的在线处理期间具有低计算复杂性的信息检索精度的信息证实了所提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号