Journal: 系统科学与信息学报:英文版 (Journal of Systems Science and Information, English edition)

Optimization in Data Cube System Design



Abstract

The design of an OLAP system for supporting real-time queries is one of the major research issues. One approach is to use data cubes, which are pre-computed multidimensional views of data in the data warehouse. An initial set of data cubes can be derived, from which the answer to each frequently asked query can be retrieved directly. However, there are two practical problems concerning the design of a cube-based system: 1) the maintenance cost of the data cubes, and 2) the query cost to answer a selected set of frequently asked queries. Maintaining a data cube requires disk storage and CPU computation, so the maintenance cost is related to the total size of the data cubes materialized, and thus keeping all data cubes is impractical. The total size of cubes may be reduced by merging some cubes. However, the resulting larger cubes will increase the query cost of answering some queries. If the bounds on maintenance cost and query cost are strict, some of the queries need to be sacrificed. An optimization problem in data cube system design has been defined. With a maintenance-cost bound and a query-cost bound given by the user, it is necessary to optimize the initial set of data cubes such that the system can answer a maximum number of queries and satisfy the bounds. This is an NP-complete problem. Approximate algorithms Greedy Removing (GR) and 2-Greedy Merging with Multiple paths (2GGM) are proposed. Experiments have been done on a census database and the results show that our approach is both effective and efficient.
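To make the trade-off in the abstract concrete, the following is a minimal Python sketch of a removal-style greedy heuristic on a toy instance. It is not the paper's GR or 2GGM algorithm, whose details the abstract does not give. Everything here is an assumption made for illustration: a cube is modelled as a set of dimensions, its size as the product of the dimension cardinalities, the maintenance cost as the total size of the materialized cubes, and the query cost as the size of the cube used to answer the query. The names `cube_size`, `answered`, and `greedy_remove`, and the census-like dimensions, are hypothetical.

```python
from math import prod


def cube_size(dims, cardinality):
    """Assumed size model: number of cells = product of dimension cardinalities."""
    return prod(cardinality[d] for d in dims)


def answered(queries, cubes, cardinality, query_bound):
    """Queries covered by some materialized cube whose size is within the query-cost bound."""
    return [
        q for q in queries
        if any(q <= c and cube_size(c, cardinality) <= query_bound for c in cubes)
    ]


def greedy_remove(queries, cardinality, maintenance_bound, query_bound):
    """Toy heuristic: start with one cube per query, then drop cubes until the
    total size fits the maintenance bound, each time removing the cube that
    loses the fewest answerable queries per unit of size freed."""
    cubes = set(queries)

    def total_size(cs):
        return sum(cube_size(c, cardinality) for c in cs)

    while cubes and total_size(cubes) > maintenance_bound:
        base = len(answered(queries, cubes, cardinality, query_bound))

        def key(c):
            remaining = cubes - {c}
            lost = base - len(answered(queries, remaining, cardinality, query_bound))
            size = cube_size(c, cardinality)
            # Ties: prefer freeing the larger cube, then a fixed name order.
            return (lost / size, -size, sorted(c))

        cubes.remove(min(cubes, key=key))
    return cubes


# Hypothetical census-like dimensions and frequently asked queries.
cardinality = {"age": 10, "sex": 2, "state": 50}
queries = [
    frozenset({"age"}),
    frozenset({"sex"}),
    frozenset({"age", "sex"}),
    frozenset({"state"}),
    frozenset({"age", "state"}),
]

kept = greedy_remove(queries, cardinality, maintenance_bound=75, query_bound=60)
still_answered = answered(queries, kept, cardinality, query_bound=60)
total_kept_size = sum(cube_size(c, cardinality) for c in kept)
```

In this toy instance the query on `{"age", "state"}` is sacrificed because its cube alone exceeds the query-cost bound, while the query on `{"age"}` remains answerable from the larger `{"age", "sex"}` cube after its own cube is dropped. The sketch only removes cubes; it does not attempt the merging step that the abstract also mentions.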
