首页> 外文期刊>Information Processing & Management >Analyzing imbalance among homogeneous index servers in a web search system
【24h】

Analyzing imbalance among homogeneous index servers in a web search system

机译:分析Web搜索系统中同类索引服务器之间的不平衡

获取原文
获取原文并翻译 | 示例
           

摘要

The performance of parallel query processing in a cluster of index servers is crucial for modern web search systems. In such a scenario, the response time basically depends on the execution time of the slowest server to generate a partial ranked answer. Previous approaches investigate performance issues in this context using simulation, analytical modeling, experimentation, or a combination of them. Nevertheless, these approaches simply assume balanced execution times among homogeneous servers (by uniformly distributing the document collection among them, for instance)—a scenario that we did not observe in our experimentation. On the contrary, we found that even with a balanced distribution of the document collection among index servers, correlations between the frequency of a term in the query log and the size of its corresponding inverted list lead to imbalances in query execution times at these same servers, because these correlations affect disk caching behavior. Further, the relative sizes of the main memory at each server (with regard to disk space usage) and the number of servers participating in the parallel query processing also affect imbalance of local query execution times. These are relevant findings that have not been reported before and that, we understand, are of interest to the research community.
机译:索引服务器群集中并行查询处理的性能对于现代Web搜索系统至关重要。在这种情况下,响应时间基本上取决于最慢服务器生成部分排名答​​案的执行时间。先前的方法使用模拟,分析模型,实验或它们的组合来研究这种情况下的性能问题。但是,这些方法只是假设同类服务器之间的执行时间是平衡的(例如,通过在其中均匀地分布文档集合),这是我们在实验中未曾观察到的情况。相反,我们发现即使索引服务器之间的文档集合分布均衡,查询日志中术语的频率与其相应倒排列表的大小之间的相关性也会导致这些相同服务器上查询执行时间的不平衡,因为这些相关性会影响磁盘缓存行为。此外,每个服务器上主存储器的相对大小(关于磁盘空间使用情况)和参与并行查询处理的服务器数量也会影响本地查询执行时间的不平衡。这些是相关的发现,以前没有被报道过,据我们了解,研究界对此很感兴趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号