首页> 外文会议>International Conference on Database and Expert Systems Applications >Optimizing I/O Costs of Multi-dimensional Queries Using Bitmap Indices
【24h】

Optimizing I/O Costs of Multi-dimensional Queries Using Bitmap Indices

机译:使用位图索引优化多维查询的I / O成本

获取原文

摘要

Bitmap indices are efficient data structures for processing complex, multi-dimensional queries in data warehouse applications and scientific data analysis. For high-cardinality attributes, a common approach is to build bitmap indices with binning. This technique partitions the attribute values into a number of ranges, called bins, and uses bitmap vectors to represent bins (attribute ranges) rather than distinct values. In order to yield exact query answers, parts of the original data values have to be read from disk for checking against the query constraint. This process is referred to as candidate check and usually dominates the total query processing time. In this paper we study several strategies for optimizing the candidate check cost for multi-dimensional queries. We present an efficient candidate check algorithm based on attribute value distribution, query distribution as well as query selectivity with respect to each dimension. We also show that re-ordering the dimensions during query evaluation can be used to reduce I/O costs. We tested our algorithm on data with various attribute value distributions and query distributions. Our approach shows a significant improvement over traditional binning strategies for bitmap indices.
机译:位图指数是用于在数据仓库应用中处理复杂的多维查询和科学数据分析的高效数据结构。对于高基数属性,一种常见的方法是用衬砌构建位图索引。此技术将属性值分区为多个范围,称为仓库,并使用位图向量来表示垃圾箱(属性范围)而不是不同的值。为了产生精确的查询答案,必须从磁盘读取原始数据值的部分以检查查询约束。此过程称为候选检查,通常占主导地位的总查询处理时间。在本文中,我们研究了多个策略,以优化多维查询的候选检查成本。我们基于属性值分布,查询分布以及关于每个维度的查询选择性的有效候选候选算法。我们还显示在查询评估期间重新订购维度可用于降低I / O成本。我们在具有各种属性值分布和查询分布的数据上测试了我们的算法。我们的方法显示出对位图指数的传统融合策略的重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号