Understanding and Optimizing Conjunctive Predicates Under Memory-Efficient Storage Layouts

Wang Zeke; Liu Xue; Zhang Kai; Zhou Haihang; He Bingsheng

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Understanding and Optimizing Conjunctive Predicates Under Memory-Efficient Storage Layouts

【24h】

Understanding and Optimizing Conjunctive Predicates Under Memory-Efficient Storage Layouts

机译：在内存高效的存储布局下了解和优化联合谓词

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Database queries can contain multiple predicates. The optimization of conjunctive predicates is still vital to the overall performance of analytic data processing tasks. Prior work proposes several memory-efficient storage layouts, e.g., BitWeaving and ByteSlice, to significantly accelerate predicate evaluation, as circuit-level intra-cycle parallelism available in modern CPUs can be exploited such that the total number of instructions can be dramatically reduced. However, the performance potential of conjunctive predicates has not been harvested yet under such storage layouts as there is no accurate cost model to provide necessary insights that guide the optimization process. In this paper, we propose a hybrid empirical/analytical cost model (Understanding) to unveil the performance characteristics of such storage layouts when applying to predicate evaluation. Our cost model takes into account effect of non-linear factors, e.g., cache miss and branch misprediction, and easily applies to different CPUs. The main finding from our cost model is to distinguish high-cost instruction (which suffers from cache miss and/or branch misprediction) from low-cost instruction (which enjoys cache hit and correct branch prediction) in the context of predicate evaluation under these storage layouts. Guided by such a finding, we propose a simple execution scheme Hebe (Optimizing), which is order-oblivious while maintaining high performance. Hebe is attractive to the query optimizer (QO), as the QO does not need to go through a sampling process to decide the optimal evaluation order in advance. The intuition behind Hebe is to significantly reduce the number of high-cost instructions while keeping low-cost instructions unchanged. Our finding from Hebe sheds light on the importance of accurate cost model that guide us to derive an efficient execution scheme for query processing on modern CPUs.

机译：数据库查询可以包含多个谓词。联合谓词的优化对分析数据处理任务的整体性能仍然至关重要。事先工作提出了几个记忆有效的存储布局，例如，位织造和Byteslice，以显着加速谓词评估，因为可以利用现代CPU中可用的电路级内循环并行性，以便可以显着降低指令总数。然而，在这种存储布局之下尚未收获联合谓词的性能潜力，因为没有准确的成本模型，以提供指导优化过程的必要洞察。在本文中，我们提出了一种混合实证/分析成本模型（理解），以在申请谓词评估时揭示这种存储布局的性能特征。我们的成本模型考虑了非线性因素的影响，例如缓存未命中和分支错误规定，并轻松适用于不同的CPU。我们的成本模型的主要发现是将高成本指令（遭受高速缓存未命中和/或分支错误规定）在这些存储下的谓词评估的上下文中，从低成本指令（在高速缓存命中和正确的分支预测）中布局。通过这样的发现，我们提出了一个简单的执行方案Hebe（优化），这是令人满意的，同时保持高性能。 Hebe对查询优化器（Qo）有吸引力，因为Qo不需要通过采样过程来提前决定最佳评估顺序。 Hebe背后的直觉是显着减少高成本指令的数量，同时保持低成本指令不变。我们从Hebbe Sheds阐明了准确成本模型的重要性，指导我们在现代CPU上获得了高效执行方案进行查询处理。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2021年第6期|2803-2817|共15页
作者
Wang Zeke; Liu Xue; Zhang Kai; Zhou Haihang; He Bingsheng;
展开▼
作者单位

Zhejiang Univ Collaborat Innovat Ctr Artificial Intelligence MO Hangzhou 310027 Zhejiang Peoples R China;

Northeastern Univ Dept Comp Sci Shenyang 110819 Peoples R China;

Fudan Univ Sch Comp Sci Key Lab Data Sci Shanghai 200433 Peoples R China;

Natl Univ Singapore Sch Comp Singapore 119077 Singapore;

Natl Univ Singapore Sch Comp Singapore 119077 Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Layout; Memory management; Optimization; Acceleration; Space exploration; Computer science; Dictionaries; Database; conjunctive predicates; storage layout; CPU;

机译：布局;记忆管理;优化;加速;太空探索;计算机科学;词典;数据库;结合谓词;存储布局;CPU;

相似文献

外文文献
中文文献
专利

1. A multi-period optimization model for conjunctive surface water-ground water use via aquifer storage and recovery in Corpus Christi, Texas [J] . E. Annette Hernandez, Venkatesh Uddameri, Marcelo A. Arreola Jr. Environmental earth sciences . 2014,第6期

机译：得克萨斯州科珀斯克里斯蒂市通过含水层存储和回收利用的地表水-地下水联合利用的多周期优化模型
2. Optimization of S-CO_2 power conversion layouts with energy storage for the pulsed DEMO reactor [J] . Syblik Jan, Entler Slavomir, Stepanek Jan, Fusion Engineering and Design . 2021,第Auga期

机译：S-CO_2电源转换布局对脉冲演示反应器的能量存储器的优化
3. On comparison of two-level and global optimization schemes for layout design of storage ponds [J] . Lu Wei, Qin Xiaosheng, Yu Jianjun Journal of Hydrology . 2019,第期

机译：关于储存池布局设计的两级和全局优化方案的比较
4. Optimization of Conjunctive Predicates for Main Memory Column Stores [C] . Fisnik Kastrati, Guido Moerkotte International conference on very large data bases . 2016

机译：主存储列存储的合取谓语的优化
5. Memory-Efficient Optimization Over Positive Semidefinite Matrices [D] . Naber, Andrew Thomas. 2020

机译：积极的半纤维矩阵上的记忆有效优化
6. P01.077 Optimizing array layouts for glioblastoma therapy with tumor treating fields (TTFields) - Use of oblique array layouts surpass default left-right/anterior-posterior positions in a computer simulation model [O] . A R Korshoej, N Mikic, J H Sørensen, 2018

机译：P01.077使用肿瘤治疗场（TTFields）优化胶质母细胞瘤治疗的阵列布局-倾斜阵列布局的使用超过了计算机仿真模型中默认的左右/前后位置
7. Improving Storage Performance Through Layout Optimizations [O] . Bhadkamkar Medha 2009

机译：通过布局优化提高存储性能

Understanding and Optimizing Conjunctive Predicates Under Memory-Efficient Storage Layouts

摘要

著录项

相似文献

相关主题

期刊订阅