...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >The design and implementation of seeded trees: an efficient method for spatial joins
【24h】

The design and implementation of seeded trees: an efficient method for spatial joins

机译:种子树的设计和实现:一种有效的空间连接方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Existing methods for spatial joins require pre-existing spatial indices or other precomputation, but such approaches are inefficient and limited in generality. Operand data sets of spatial joins may not all have precomputed indices, particularly when they are dynamically generated by other selection or join operations. Also, existing spatial indices are mostly designed for spatial selections, and are not always efficient for joins. This paper explores the design and implementation of seeded trees, which are effective for spatial joins and efficient to construct at join time. Seeded trees are R-tree-like structures, but divided into seed levels and grown levels. This structure facilitates using information regarding the join to accelerate the join process, and allows efficient buffer management. In addition to the basic structure and behavior of seeded trees we present techniques for efficient seeded tree construction, a new buffer management strategy to lower I/O costs, and theoretical analysis for choosing algorithmic parameters. We also present methods for reducing space requirements and improving the stability of seeded tree performance with no additional I/O costs. Our performance studies show that the seeded tree method outperforms other tree-based methods by far both in terms of the number disk pages accessed and weighted I/O costs. Further, its performance gain is stable across different input data, and its incurred CPU penalties are also lower.
机译:现有的用于空间连接的方法需要预先存在的空间索引或其他预先计算,但是这样的方法效率低下并且普遍性受到限制。空间连接的操作数数据集可能不都具有预先计算的索引,尤其是当它们通过其他选择或连接操作动态生成时。同样,现有的空间索引主要是为空间选择而设计的,并且对于连接并不总是有效的。本文探讨了种子树的设计和实现,这些树对于空间连接是有效的,并且在连接时可以有效地进行构造。种子树是R树状的结构,但分为种子水平和生长水平。此结构有助于使用有关连接的信息来加速连接过程,并允许有效的缓冲区管理。除了种子树的基本结构和行为之外,我们还提供有效的种子树构建技术,降低I / O成本的新缓冲区管理策略以及选择算法参数的理论分析。我们还提出了减少空间需求和提高种子树性能稳定性的方法,而没有额外的I / O成本。我们的性能研究表明,就访问的磁盘页数和加权的I / O成本而言,种子树方法远胜于其他基于树的方法。此外,其性能增益在不同的输入数据上是稳定的,并且所产生的CPU损失也较低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号