Effect of Inverted Index Partitioning Schemes on Performance of Query Processing in Parallel Text Retrieval Systems

Abstract

Shared-nothing parallel text retrieval systems require the inverted index that represents a document collection to be partitioned among a number of processors. In general, the index can be partitioned by either the terms or the documents in the collection, and the choice of partitioning scheme greatly affects the query processing performance of the parallel system. In this work, we investigate the effect of these two index partitioning schemes on query processing. We conduct experiments on a 32-node PC cluster, considering the case where the index is stored entirely on disk. Performance results are reported for a large (30 GB) document collection using an MPI-based parallel query processing implementation.
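
The two partitioning schemes named in the abstract can be illustrated with a small in-memory sketch. This is not the paper's implementation: the toy documents, the function names, the round-robin assignment of terms, and the modulo assignment of documents are all assumptions chosen for illustration, and the abstract does not say how terms or documents are actually assigned to nodes.

```python
from collections import defaultdict

def build_index(docs):
    """Map each term to the sorted list of IDs of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}

def partition_by_term(index, num_nodes):
    """Term-based: each node stores the complete posting lists of some terms.
    Round-robin over sorted terms is an illustrative assumption."""
    parts = [dict() for _ in range(num_nodes)]
    for i, term in enumerate(sorted(index)):
        parts[i % num_nodes][term] = index[term]
    return parts

def partition_by_document(index, num_nodes):
    """Document-based: each node stores a local index over some documents.
    Assigning a document by doc_id % num_nodes is an illustrative assumption."""
    parts = [dict() for _ in range(num_nodes)]
    for term, postings in index.items():
        for doc_id in postings:
            parts[doc_id % num_nodes].setdefault(term, []).append(doc_id)
    return parts

def nodes_for_query(parts, query_terms):
    """IDs of the nodes holding postings for at least one query term."""
    return [n for n, part in enumerate(parts)
            if any(t in part for t in query_terms)]

# Hypothetical toy collection, not data from the paper.
docs = {
    0: "parallel text retrieval",
    1: "inverted index partitioning",
    2: "parallel query processing",
    3: "text index",
}
index = build_index(docs)
term_parts = partition_by_term(index, 2)
doc_parts = partition_by_document(index, 2)

print(nodes_for_query(term_parts, ["text"]))
print(nodes_for_query(doc_parts, ["text"]))
```

In this toy setup, a query for "text" touches a single node under term-based partitioning, because that node holds the term's entire posting list, while under document-based partitioning both nodes hold part of the list and must take part. This difference in which nodes a query involves is one reason the choice of scheme can affect query processing performance.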