Efficient storage and query processing of set-valued attributes.

机译：集值属性的有效存储和查询处理。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In order to better support complex applications, object relational systems provide features that are absent in relational systems. The main features a new type by providing collections of existing types. It is well known that sets are useful in modeling a great deal of real world data. However, such powerful modeling comes at a price; without an efficient implementation, using sets can yield a performance much worse than that obtained using only traditional relational constructs. This dissertation explores novel ways of implementing set-valued attributes in an object relational system. Specifically, it considers various options for storing set-valued attributes, and ways of computing the challenging set containment join operation.; We first address the problem of storing set-valued attributes. Using the orthogonal attributes of nesting and location we identify four options for representing sets: nested internal and external, and unnested external and internal. These representations can be combined with the creation of various indices to create various classes of indexed representations. We evaluate each of these representations with respect to conjunctive and disjunctive queries. Our results show that overall the nested implementations perform better than the unnested implementations because: (a) they exploit grouping semantics while fetching the members of a set instance and (b) they allow the evaluation of set predicates directly on the set instance.; Next we consider the problem of efficiently evaluating set containment joins. For unnested external representation, the set containment join can be expressed directly in SQL. By contrast, the most obvious algorithm for computing set containment joins on nested representations is the signature nested loops algorithm, which computes set signatures and compares each signature in a relation with all the signatures in the other relation. To improve on the performance of this algorithm we propose a new partitioned set join algorithm (PSJ), which uses a multi-level scheme of partitioning by replicating the inner relation. Our performance study shows that for extremely small relation and small set cardinalities, the SQL query approach and signature nested loops perform comparably to PSJ. However, as the size of the data sets increase (in both relation and set cardinality), PSJ clearly dominates.

机译：为了更好地支持复杂的应用程序，对象关系系统提供了关系系统中缺少的功能。主要功能通过提供现有类型的集合来提供一种新类型。众所周知，集合对于建模大量现实世界数据很有用。但是，如此强大的建模需要付出一定的代价。在没有有效实现的情况下，使用集会产生比仅使用传统关系构造获得的性能差得多的性能。本文探讨了在对象关系系统中实现集值属性的新颖方法。具体来说，它考虑了用于存储集合值属性的各种选项，以及计算具有挑战性的集合包含联接操作的方式。我们首先解决存储集合值属性的问题。使用嵌套和位置的正交属性，我们确定了四个表示集合的选项：嵌套的内部和外部，以及未嵌套的外部和内部。这些表示可以与各种索引的创建结合以创建各种类别的索引表示。我们针对合取和析取查询评估每种表示形式。我们的结果表明，嵌套的实现总体上比未嵌套的实现更好，这是因为：（a）它们在获取集合实例的成员时利用分组语义，并且（b）它们允许直接在集合实例上评估集合谓词。接下来，我们考虑有效评估集合包含联接的问题。对于未嵌套的外部表示，可以直接在SQL中表示集合包含联接。相比之下，用于计算嵌套表示上的集合包含联接的最明显的算法是签名嵌套循环算法，该算法计算集合签名并将一个关系中的每个签名与另一个关系中的所有签名进行比较。为了提高该算法的性能，我们提出了一种新的分区集联接算法（PSJ），该算法通过复制内部关系使用多级分区方案。我们的性能研究表明，对于极小的关系和很小的基数，SQL查询方法和签名嵌套循环的性能与PSJ相当。但是，随着数据集大小的增加（关系和集基数），PSJ显然占主导地位。

著录项

作者
Ramasamy, Karthikeyan.;
展开▼
作者单位

The University of Wisconsin - Madison.;

展开▼
授予单位 The University of Wisconsin - Madison.;
学科 Computer Science.
学位 Ph.D.
年度 2001
页码 144 p.
总页数 144
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Efficient processing of probabilistic set-containment queries on uncertain set-valued data [J] . Xiaolong Zhanga, Ke Chena, Lidan Shoua, Information Sciences: An International Journal . 2012,第Null期

机译：对不确定集值数据的概率集包含查询的高效处理
2. A link-based storage scheme for efficient aggregate query processing on clustered road networks [J] . Engin Demir, Cevdet Aykanat, B. Barla Cambazoglu Information Systems . 2010,第1期

机译：基于链接的存储方案，可在集群道路网络上进行有效的聚合查询处理
3. Efficient spatial query processing for KNN queries using well organised net-grid partition indexing approach [J] . K. Geetha, A. Kannan International journal of data mining, modelling and management . 2018,第4期

机译：使用组织良好的网络网格分区索引方法对KNN查询进行有效的空间查询处理
4. Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems [C] . Chen Luo, Michael J. Carey International conference on very large data bases . 2019

机译：基于LSM的存储系统的有效数据提取和查询处理
5. Energy-efficient query-informed routing for query processing in sensor networks [D] . Zhang, Zhiguo 2008

机译：用于传感器网络中查询处理的高能效查询通知路由
6. Skyline Query Processing in Sensor Network Based on Data Centric Storage [O] . Seokil Song, Yunsik Kwak, Seokhee Lee 2012

机译：基于数据中心存储的传感器网络天际线查询处理
7. Efficient data ingestion and query processing for LSM-based storage systems [O] . Chen Luo, Michael J. Carey 2019

机译：基于LSM的存储系统的高效数据摄取和查询处理

Efficient storage and query processing of set-valued attributes.

摘要

著录项

相似文献

相关主题

期刊订阅