Document Retrieval on Repetitive Collections

机译：重复馆藏文献检索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document retrieval aims at finding the most important documents where a pattern appears in a collection of strings. Traditional pattern-matching techniques yield brute-force document retrieval solutions, which has motivated the research on tailored indexes that offer near-optimal performance. However, an experimental study establishing which alternatives are actually better than brute force, and which perform best depending on the collection characteristics, has not been carried out. In this paper we address this shortcoming by exploring the relationship between the nature of the underlying collection and the performance of current methods. Via extensive experiments we show that established solutions are often beaten in practice by brute-force alternatives. We also design new methods that offer superior time/space tradeoffs, particularly on repetitive collections.

机译：文档检索旨在查找最重要的文档，其中某个模式出现在字符串集合中。传统的模式匹配技术产生了蛮力的文档检索解决方案，这激发了对提供接近最佳性能的定制索引的研究。但是，尚未进行实验研究，以确定哪些替代品实际上比蛮力好，哪些替代品根据收集特性表现最佳。在本文中，我们通过探索基础集合的性质与当前方法的性能之间的关系来解决此缺点。通过广泛的实验，我们表明，在实践中，通常在解决方案上往往会遭到蛮力替代。我们还设计了新的方法，可提供出色的时间/空间权衡，尤其是在重复性收藏中。

著录项

来源
《Annual European symposium on algorithms》|2014年|725-736|共12页
会议地点
作者
Gonzalo Navarro; Simon J. Puglisi; Jouni Siren;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Document retrieval on repetitive string collections [J] . Gagie Travis, Hartikainen Aleksi, Karhu Kalle, Information retrieval . 2017,第3期

机译：重复字符串集合的文档检索
2. On the reproducibility of experiments of indexing repetitive document collections [J] . Farina Antonio, Martinez-Prieto Miguel A., Claude Francisco, Information Systems . 2019,第JULa期

机译：关于索引重复性文档集的实验的可重复性
3. On the reproducibility of experiments of indexing repetitive document collections [J] . Farina Antonio, Martinez-Prieto Miguel A., Claude Francisco, Information Systems . 2019,第Jula期

机译：论索引重复文件收集实验的再现性
4. Document Retrieval on Repetitive Collections [C] . Gonzalo Navarro, Simon J. Puglisi, Jouni Siren Annual European Symposium on Algorithms . 2014

机译：记录重复收集的文件检索
5. Combinatoric models of information retrieval ranking methods and performance measures for weakly-ordered document collections. [D] . Church, Lewis. 2010

机译：信息检索排序方法和性能度量的组合模型，用于弱序文档收集。
6. Document retrieval on repetitive string collections [O] . Travis Gagie, Aleksi Hartikainen, Kalle Karhu, -1

机译：重复字符串集合的文档检索
7. Document retrieval on repetitive string collections [O] . Travis Gagie, Aleksi Hartikainen, Kalle Karhu, 2017

机译：重复字符串集合的文档检索
8. RETRIEVAL SYSTEMS FOR NON-STATIC DOCUMENT COLLECTIONS. [R] . hillman,donald j. 1965

机译：非静态文件收集的检索系统。

Document Retrieval on Repetitive Collections

摘要

著录项

相似文献

相关主题

期刊订阅