...
首页> 外文期刊>Information Systems >Inverted index maintenance strategy for flashSSDs: Revitalization of in-place index update strategy
【24h】

Inverted index maintenance strategy for flashSSDs: Revitalization of in-place index update strategy

机译:flashSSD的反向索引维护策略:现场索引更新策略的复兴

获取原文
获取原文并翻译 | 示例
           

摘要

An inverted index is a core data structure of Information Retrieval systems, especially in search engines. Since the search environments have become more dynamic many on-line index maintenance strategies have been proposed. Previous strategies were designed for HDDs. Consequently, in order to avoid expensive random access cost Merge-based strategies have been preferred to In-place index update strategies on HDDs. However, flashSSDs have become solid alternatives to HDDs. FlashSSDs currently are adopted in a wide range of areas due to their superior features such as the short access latency, energy efficiency, and high bandwidth. In this article, we first reexamined potentials of In-place index update strategies on flashSSDs. Thanks to the insignificant access latency of flashSSDs, we discovered that In-place index update strategies outperform Merge-based strategies, since In-place index update strategies generate much less amount of I/O than Merge-based strategies despite inducing frequent random accesses. Based on this discovery, we suggest a new inverted index maintenance strategy based on an In-place index update strategy for flashSSDs, called Mulu'path Flash In-place Strategy (MFIS). To enhance the index maintenance performance, MFIS stores the posting list of each term non-contiguously and exploits the internal parallelism of flashSSDs. Thus, MFIS not only induces the minimum amount of I/O but also utilizes the maximum bandwidth of flashSSDs. Furthermore, MFIS is designed to show high query processing performance by utilizing the internal parallelism of flashSSDs even though the posting list of each term is stored non-contiguously. In our experiments, the index maintenance performance of MFIS was considerably better than other previous maintenance strategies. The index maintenance performance was up to 1433, 4.04, 5.12, and 233 times higher than Merge-based strategies such as Immediate Merge, Geometric Partitioning, Hybrid, and SSD-aware Hybrid, respectively. The query processing performance of MFIS was up to 1.62 times higher than non-contiguous In-place. In addition, MFIS showed almost the best query processing performance as Merge-based strategies did. In conclusion, MFIS is the best on-line inverted index maintenance strategy on flashSSDs in terms of both index maintenance and query processing performance.
机译:倒排索引是信息检索系统的核心数据结构,尤其是在搜索引擎中。由于搜索环境变得更加动态,因此提出了许多在线索引维护策略。以前的策略是为硬盘设计的。因此,为了避免昂贵的随机访问成本,基于合并的策略已被优先于HDD上的就地索引更新策略。但是,flashSSD已成为HDD的可靠替代品。由于FlashSSD具有优越的功能,例如较短的访问等待时间,能效和高带宽,目前已被广泛应用于各个领域。在本文中,我们首先重新检查了FlashSSD上就地索引更新策略的潜力。由于flashSSD的访问延迟不明显,因此我们发现就地索引更新策略的性能优于基于合并的策略,因为尽管会导致频繁的随机访问,但就地索引更新策略所产生的I / O量要比基于合并的策略少得多。基于此发现,我们建议一种基于FlashSSD的就地索引更新策略的新的反向索引维护策略,称为Mulu'path Flash就地策略(MFIS)。为了增强索引维护性能,MFIS不连续地存储每个术语的发布列表,并利用flashSSD的内部并行性。因此,MFIS不仅引起最少的I / O量,而且还利用flashSSD的最大带宽。此外,MFIS被设计为通过利用flashSSD的内部并行性来显示高查询处理性能,即使每个术语的发布列表不连续存储也是如此。在我们的实验中,MFIS的索引维护性能明显优于其他先前的维护策略。索引维护性能分别比即时合并,几何分区,混合和可识别SSD的基于合并的策略分别高1433、4.04、5.12和233倍。 MFIS的查询处理性能比不连续的In-place高出1.62倍。此外,MFIS显示了几乎最佳的查询处理性能,这与基于合并的策略一样。总之,就索引维护和查询处理性能而言,MFIS是flashSSD上最好的在线反向索引维护策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号