首页> 外文学位 >Indexing of multidimensional discrete data spaces and hybrid extensions.
【24h】

Indexing of multidimensional discrete data spaces and hybrid extensions.

机译:多维离散数据空间和混合扩展的索引。

获取原文
获取原文并翻译 | 示例

摘要

In this thesis various indexing techniques are developed and evaluated to support efficient queries in different vector data spaces.;Various indexing techniques have been introduced for the (ordered) Continuous Data Space (CDS) and the Non-ordered Discrete Data Space (NDDS). All these techniques rely on special properties of the CDS or the NDDS to optimize data accesses and storage in their corresponding structures.;Besides conventional exact match queries, the similarity queries and the box queries are two types of fundamental operations widely supported by modern indexing techniques. A box query is different from a similarity query in that the box query in multidimensional spaces tries to look up indexed data which meet query conditions on each and every dimension. The difference between similarity queries and box queries suggests that indexing techniques which work well for similarity queries may not necessarily support efficient box queries. In this thesis, we propose the BoND-tree, a new indexing technique designed for supporting box queries in an NDDS. Both our theoretical analysis and experimental results demonstrate that the new heuristics proposed for the BoND-tree improve the performance of box queries in an NDDS significantly.;The Hybrid Data Space (HDS) is a multidimensional data space which contains both (ordered) continuous and non-ordered discrete dimensions. In this thesis a novel indexing structure, the C-ND tree, has been developed to support efficient similarity queries in HDSs. To do so, some geometric concepts in the HDS are introduced. Novel node splitting heuristics which exploit characteristics of both CDS and NDDS are proposed. Our extensive experimental results show that the C-ND tree is quite promising in supporting similarity queries in HDSs.;To support box queries in the HDS, we extended the original ND-tree to the HDS to evaluate the effectiveness of the ND-tree heuristics on supporting box queries in an HDS. A novel power value adjustment strategy is used to make the continuous and discrete dimensions comparable and controllable in the HDS. An estimation model is developed to predict the box query performance of the hybrid indexing. Our experimental results show that the original ND-tree heuristics are effective in supporting box queries in an HDS, and could be further improved with our power adjustment strategies to address the characteristics of the HDS.
机译:本文开发和评估了各种索引技术,以支持在不同的向量数据空间中进行有效的查询。;已经为(有序的)连续数据空间(CDS)和无序的离散数据空间(NDDS)引入了各种索引技术。所有这些技术都依靠CDS或NDDS的特殊属性来优化其相应结构中的数据访问和存储。除了常规的完全匹配查询之外,相似性查询和框查询是现代索引技术广泛支持的两种基本操作类型。盒式查询与相似性查询的不同之处在于,多维空间中的盒式查询试图查找满足每个维度上查询条件的索引数据。相似性查询与框式查询之间的差异表明,对相似性查询有效的索引编制技术不一定支持有效的框式查询。在本文中,我们提出了BoND树,这是一种新的索引技术,旨在支持NDDS中的框查询。我们的理论分析和实验结果均表明,针对BoND树提出的新启发式方法显着提高了NDDS中的框查询性能。;混合数据空间(HDS)是一个多维数据空间,它既包含(有序)连续又包含无序离散尺寸。本文提出了一种新颖的索引结构,即C-ND树,以支持HDS中的高效相似性查询。为此,引入了HDS中的一些几何概念。提出了利用CDS和NDDS特性的新型节点分裂启发式算法。我们广泛的实验结果表明,C-ND树在支持HDS中的相似性查询方面很有前途。为了支持HDS中的盒查询,我们将原始ND树扩展到HDS以评估ND树启发式算法的有效性在HDS中支持盒查询。一种新颖的功率值调整策略用于使连续尺寸和离散尺寸在HDS中具有可比性和可控性。开发了一个估计模型来预测混合索引的箱查询性能。我们的实验结果表明,原始的ND树启发式方法可有效支持HDS中的框查询,并且可以通过我们的功率调整策略来进一步改善HDS的特性,从而进一步改进。

著录项

  • 作者

    Chen, Changqing.;

  • 作者单位

    Michigan State University.;

  • 授予单位 Michigan State University.;
  • 学科 Engineering Computer.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 172 p.
  • 总页数 172
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号