Index ordering by query-independent measures

Paul Ferguson; Alan F. Smeaton

首页> 外文期刊>Information Processing & Management >Index ordering by query-independent measures

【24h】

Index ordering by query-independent measures

机译：通过与查询无关的措施对索引进行排序

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Conventional approaches to information retrieval search through all applicable entries in an inverted file for a particular collection in order to find those documents with the highest scores. For particularly large collections this may be extremely time consuming. A solution to this problem is to only search a limited amount of the collection at querytime, in order to speed up the retrieval process. In doing this we can also limit the loss in retrieval efficacy (in terms of accuracy of results). The way we achieve this is to firstly identify the most "important" documents within the collection, and sort documents within inverted file lists in order of this "importance". In this way we limit the amount of information to be searched at query time by eliminating documents of lesser importance, which not only makes the search more efficient, but also limits loss in retrieval accuracy. Our experiments, carried out on the TREC Terabyte collection, report significant savings, in terms of number of postings examined, without significant loss of effectiveness when based on several measures of importance used in isolation, and in combination. Our results point to several ways in which the computation cost of searching large collections of documents can be significantly reduced.

机译：信息检索的常规方法是搜索反向文件中所有特定条目的特定集合，以查找得分最高的那些文档。对于特别大的收藏，这可能会非常耗时。解决此问题的方法是在查询时仅搜索有限数量的集合，以加快检索过程。通过这样做，我们还可以限制检索效率的损失（就结果的准确性而言）。我们实现这一目标的方法是，首先确定集合中最“重要”的文档，然后按照这种“重要性”的顺序对反向文件列表中的文档进行排序。这样，通过消除重要性较低的文档，我们限制了查询时要搜索的信息量，这不仅使搜索效率更高，而且还限制了检索准确性的损失。我们的实验是在TREC Terabyte集合上进行的，根据所审查的发布数量，发现了可观的节省，而当基于单独使用和组合使用的几种重要衡量指标时，其有效性没有显着下降。我们的结果指出了可以大幅度减少搜索大量文档的计算成本的几种方法。

著录项

来源
《Information Processing & Management》 |2012年第3期|p.569-586|共18页
作者
Paul Ferguson; Alan F. Smeaton;
展开▼
作者单位

CLARITY: Centre for Sensor Web Technologies, Dublin City University, Dublin, Ireland;

CLARITY: Centre for Sensor Web Technologies, Dublin City University, Dublin, Ireland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
indexing; efficiency/effectiveness tradeoffs; query-independent search;

机译：索引效率/效果权衡;独立查询;

相似文献

外文文献
中文文献
专利

1. Effective sentence retrieval based on query-independent evidence [J] . Ronald T. Fernandez, David E. Losada Information Processing & Management . 2012,第6期

机译：基于独立于查询的证据的有效句子检索
2. Multigraph-Based Query-Independent Learning for Video Search [J] . Liu Y., Mei T., Wu X., Circuits and Systems for Video Technology, IEEE Transactions on . 2009,第12期

机译：用于视频搜索的基于多图的查询独立学习
3. Query-Independent Evidence in Home Page Finding [J] . TRYSTAN UPSTILL, NICK CRASWELL, DAVID HAWKING ACM Transactions on Information Systems . 2003,第3期

机译：查找主页中与查询无关的证据
4. Multiple Query-Independent Values Based Asymmetric Ranking for Approximate Nearest Neighbor Search [C] . Yuan Cao, Heng Qi, Keqiu Li, IEEE International Conference on Big Data Science and Engineering;IEEE International Conference on Trust, Security and Privacy in Computing and Communications;IEEE International Symposium on Parallel and Distributed Processing with Applications . 2016

机译：近似最近邻居搜索的基于多个查询独立值的不对称排序
5. Measuring legal fictions: Law and sovereignty in "Measure for Measure". [D] . Funk, James. 2012

机译：衡量法律小说：“衡量措施”中的法律和主权。
6. Measure for Measure: Measuring the Usefulness of Measuring AntiseizureMedication Levels [O] . Edward Faught 2020

机译：衡量措施：衡量抗癫痫发作的有用性药物水平
7. Index ordering by query-independent measures [O] . Ferguson, Paul, Smeaton, Alan F. 2012

机译：通过与查询无关的措施对索引进行排序

Index ordering by query-independent measures

摘要

著录项

相似文献

相关主题

期刊订阅