首页> 外文会议>ACM SIGMOD international conference on Management of data >A performance analysis of alternative multi-attribute declustering strategies
【24h】

A performance analysis of alternative multi-attribute declustering strategies

机译:替代多属性聚类策略的性能分析

获取原文

摘要

During the past decade, parallel database systems have gained increased popularity due to their high performance, scalability and availability characteristics. With the predicted future database sizes and the complexity of queries, the scalability of these systems to hundreds and thousands of processors is essential for satisfying the projected demand. Several studies have repeatedly demonstrated that both the performance and scalability of a paralel database system is contingent on the physical layout of data across the processors of the system. If the data is not declustered properly, the execution of an operator might waste resources, reducing the overall processing capability of the system.

With earlier, single attribute declustering strategies, such as those found in Tandem, Teradata, Gamma, and Bubba parallel database systems, a selection query including a range predicate on any attribute other than the partitioning attribute must be sent to all processors containing tuples of the relation. By directing a query with minimal resource requirements to processors that contain no relevant tuples, the system wastes CPU cycles, communication bandwidth, and I/O bandwidth, reducing its overall processing capability. As a solution, several multi-attribute declustering strategies have been proposed. However, the performance of these declustering techniques have not previously been compared to one another nor with a single attribute partitioning strategy. This paper, compares the performance of Multi-Attribute GrId deClustering (MAGIC) strategy and Bubba's Extended Range Declustering (BERD) strategy with one another and with the range partitioning strategy. Our results indicate that MAGIC outperforms both range and BERD in all experiments conducted in this study.

机译:

在过去的十年中,并行数据库系统由于其高性能,可伸缩性和可用性特性而越来越受欢迎。由于预计未来的数据库大小和查询的复杂性,这些系统对成千上万个处理器的可伸缩性对于满足计划的需求至关重要。几项研究反复证明并行数据库系统的性能和可伸缩性都取决于系统处理器中数据的物理布局。如果无法正确整理数据,操作员的执行可能会浪费资源,从而降低系统的整体处理能力。

使用早期的单属性分簇策略(例如在Tandem,Teradata,Gamma和Bubba并行数据库系统中发现的策略),必须将包含除分区属性以外的任何属性的范围谓词的选择查询发送给包含以下内容的所有处理器:关系的元组。通过将具有最少资源需求的查询定向到不包含相关元组的处理器,系统浪费了CPU周期,通信带宽和I / O带宽,从而降低了其整体处理能力。作为解决方案,已经提出了几种多属性分簇策略。但是,这些去簇技术的性能以前没有相互比较过,也没有与单个属性分区策略进行过比较。本文比较了多属性GrId去聚类(MAGIC)策略和Bubba的扩展范围去聚类(BERD)策略以及范围划分策略的性能。我们的结果表明,在这项研究中进行的所有实验中,MAGIC的性能均优于量程和BERD。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号