首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >MAGIC: a multiattribute declustering mechanism for multiprocessor database machines
【24h】

MAGIC: a multiattribute declustering mechanism for multiprocessor database machines

机译:MAGIC:用于多处理器数据库机器的多属性分簇机制

获取原文
获取原文并翻译 | 示例
           

摘要

During the past decade, parallel database systems have gained increased popularity due to their high performance, scalability, and availability characteristics. With the predicted future database sizes and complexity of queries, the scalability of these systems to hundreds and thousands of processors is essential for satisfying the projected demand. Several studies have repeatedly demonstrated that both the performance and scalability of a parallel database system are contingent on the physical layout of the data across the processors of the system. If the data are not declustered appropriately, the execution of an operation might waste system resources, reducing the overall processing capability of the system. With earlier, single-attribute partitioning mechanisms such as those found in the Tandem, Teradata, Gamma, and Bubba parallel database systems, range selections on any attribute other than the partitioning attribute must be sent to all processors containing tuples of the relation, while range selections on the partitioning attribute can be directed to only a subset of the processors. Although using all the processors for an operation is reasonable for resource intensive operations, directing a query with minimal resource requirements to processors that contain no relevant tuples wastes CPU cycles, communication bandwidth, and I/O bandwidth. As a solution, this paper describes a new partitioning strategy, multiattribute grid declustering (MAGIC), which can use two or more attributes of a relation to decluster its tuples across multiple processors and disks. In addition, MAGIC declustering, unlike other multiattribute partitioning mechanisms that have been proposed, is able to support range selections as well as exact match selections on each of the partitioning attributes. This capability enables a greater variety of selection operations to be directed to a restricted subset of the processors in the system. Finally, MAGIC partitions each relation based on the resource requirements of the queries that constitute the workload for the relation and the processing capacity of the system in order to ensure that the proper number of processors are used to execute queries that reference the relation.
机译:在过去的十年中,并行数据库系统由于其高性能,可伸缩性和可用性特性而越来越受欢迎。有了预期的未来数据库大小和查询的复杂性,这些系统对成千上万个处理器的可伸缩性对于满足计划的需求至关重要。几项研究反复证明,并行数据库系统的性能和可伸缩性均取决于系统处理器中数据的物理布局。如果没有适当地对数据进行分簇,则操作的执行可能会浪费系统资源,从而降低系统的整体处理能力。使用早期的单属性分区机制(例如在Tandem,Teradata,Gamma和Bubba并行数据库系统中发现的机制),必须将除分区属性之外的任何属性上的范围选择发送到所有包含关系元组的处理器,而range可以将分区属性上的选择仅定向到处理器的子集。尽管将所有处理器用于一个操作对于资源密集型操作是合理的,但是将具有最少资源需求的查询定向到不包含相关元组的处理器会浪费CPU周期,通信带宽和I / O带宽。作为解决方案,本文介绍了一种新的分区策略,即多属性网格分簇(MAGIC),它可以使用关系的两个或多个属性来分簇其跨多个处理器和磁盘的元组。另外,与已经提出的其他多属性分区机制不同,MAGIC聚类能够支持每个分区属性上的范围选择以及精确匹配选择。此功能使更多选择操作可以定向到系统中处理器的受限子集。最后,MAGIC基于构成关系工作量的查询的资源需求和系统的处理能力,对每个关系进行分区,以确保使用适当数量的处理器来执行引用该关系的查询。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号