首页> 外文学位 >Efficient storage and query processing of set-valued attributes.
【24h】

Efficient storage and query processing of set-valued attributes.

机译:集值属性的有效存储和查询处理。

获取原文
获取原文并翻译 | 示例

摘要

In order to better support complex applications, object relational systems provide features that are absent in relational systems. The main features a new type by providing collections of existing types. It is well known that sets are useful in modeling a great deal of real world data. However, such powerful modeling comes at a price; without an efficient implementation, using sets can yield a performance much worse than that obtained using only traditional relational constructs. This dissertation explores novel ways of implementing set-valued attributes in an object relational system. Specifically, it considers various options for storing set-valued attributes, and ways of computing the challenging set containment join operation.; We first address the problem of storing set-valued attributes. Using the orthogonal attributes of nesting and location we identify four options for representing sets: nested internal and external, and unnested external and internal. These representations can be combined with the creation of various indices to create various classes of indexed representations. We evaluate each of these representations with respect to conjunctive and disjunctive queries. Our results show that overall the nested implementations perform better than the unnested implementations because: (a) they exploit grouping semantics while fetching the members of a set instance and (b) they allow the evaluation of set predicates directly on the set instance.; Next we consider the problem of efficiently evaluating set containment joins. For unnested external representation, the set containment join can be expressed directly in SQL. By contrast, the most obvious algorithm for computing set containment joins on nested representations is the signature nested loops algorithm, which computes set signatures and compares each signature in a relation with all the signatures in the other relation. To improve on the performance of this algorithm we propose a new partitioned set join algorithm (PSJ), which uses a multi-level scheme of partitioning by replicating the inner relation. Our performance study shows that for extremely small relation and small set cardinalities, the SQL query approach and signature nested loops perform comparably to PSJ. However, as the size of the data sets increase (in both relation and set cardinality), PSJ clearly dominates.
机译:为了更好地支持复杂的应用程序,对象关系系统提供了关系系统中缺少的功能。主要功能通过提供现有类型的集合来提供一种新类型。众所周知,集合对于建模大量现实世界数据很有用。但是,如此强大的建模需要付出一定的代价。在没有有效实现的情况下,使用集会产生比仅使用传统关系构造获得的性能差得多的性能。本文探讨了在对象关系系统中实现集值属性的新颖方法。具体来说,它考虑了用于存储集合值属性的各种选项,以及计算具有挑战性的集合包含联接操作的方式。我们首先解决存储集合值属性的问题。使用嵌套和位置的正交属性,我们确定了四个表示集合的选项:嵌套的内部和外部,以及未嵌套的外部和内部。这些表示可以与各种索引的创建结合以创建各种类别的索引表示。我们针对合取和析取查询评估每种表示形式。我们的结果表明,嵌套的实现总体上比未嵌套的实现更好,这是因为:(a)它们在获取集合实例的成员时利用分组语义,并且(b)它们允许直接在集合实例上评估集合谓词。接下来,我们考虑有效评估集合包含联接的问题。对于未嵌套的外部表示,可以直接在SQL中表示集合包含联接。相比之下,用于计算嵌套表示上的集合包含联接的最明显的算法是签名嵌套循环算法,该算法计算集合签名并将一个关系中的每个签名与另一个关系中的所有签名进行比较。为了提高该算法的性能,我们提出了一种新的分区集联接算法(PSJ),该算法通过复制内部关系使用多级分区方案。我们的性能研究表明,对于极小的关系和很小的基数,SQL查询方法和签名嵌套循环的性能与PSJ相当。但是,随着数据集大小的增加(关系和集基数),PSJ显然占主导地位。

著录项

  • 作者

    Ramasamy, Karthikeyan.;

  • 作者单位

    The University of Wisconsin - Madison.;

  • 授予单位 The University of Wisconsin - Madison.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 144 p.
  • 总页数 144
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号