Parallel Processing of Dynamic Continuous Queries over Streaming Data Flows

Ze Deng; Xiaoming Wu; Lizhen Wang; Xiaodao Chen; Ranjan Rajiv; Zomaya Albert; Dan Chen

首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Parallel Processing of Dynamic Continuous Queries over Streaming Data Flows

【24h】

Parallel Processing of Dynamic Continuous Queries over Streaming Data Flows

机译：流数据流上动态连续查询的并行处理

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

More and more real-time applications need to handle dynamic continuous queries over streaming data of high density. Conventional data and query indexing approaches generally do not apply for excessive costs in either maintenance or space. Aiming at these problems, this study first proposes a new indexing structure by fusing an adaptive cell and KDB-tree, namely CKDB-tree. A cell-tree indexing approach has been developed on the basis of the CKDB-tree that supports dynamic continuous queries. The approach significantly reduces the space costs and scales well with the increasing data size. Towards providing a scalable solution to filtering massive steaming data, this study has explored the feasibility to utilize the contemporary general-purpose computing on the graphics processing unit (GPGPU). The CKDB-tree-based approach has been extended to operate on both the CPU (host) and the GPU (device). The GPGPU-aided approach performs query indexing on the host while perform streaming data filtering on the device in a massively parallel manner. The two heterogeneous tasks execute in parallel and the latency of streaming data transfer between the host and the device is hidden. The experimental results indicate that (1) CKDB-tree can reduce the space cost comparing to the cell-based indexing structure by 60 percent on average, (2) the approach upon the CKDB-tree outperforms the traditional counterparts upon the KDB-tree by 66, 75 and 79 percent in average for uniform, skewed and hyper-skewed data in terms of update costs, and (3) the GPGPU-aided approach greatly improves the approach upon the CKDB-tree with the support of only a single Kepler GPU, and it provides real-time filtering of streaming data with 2.5M data tuples per second. The massively parallel computing technology exhibits great potentials in streaming data monitoring.

机译：越来越多的实时应用程序需要处理高密度流数据上的动态连续查询。传统的数据和查询索引方法通常不会在维护或空间上花费过多的成本。针对这些问题，本研究首先通过将自适应单元和KDB树（即CKDB树）融合，提出了一种新的索引结构。在支持动态连续查询的CKDB树的基础上开发了一种单元树索引方法。该方法显着降低了空间成本，并随着数据大小的增加而很好地扩展。为了提供可扩展的解决方案以过滤大量的蒸汽数据，本研究探索了在图形处理单元（GPGPU）上利用当代通用计算的可行性。基于CKDB树的方法已扩展为可以在CPU（主机）和GPU（设备）上运行。 GPGPU辅助方法在主机上执行查询索引，同时以大规模并行方式在设备上执行流数据过滤。这两个异构任务并行执行，并且主机和设备之间的流数据传输延迟被隐藏。实验结果表明：（1）与基于单元的索引结构相比，CKDB树可将空间成本平均降低60％；（2）CKDB树的方法比KDB树的方法要好于传统方法。就更新成本而言，均匀，偏斜和超偏斜数据的平均值分别为66％，75％和79％；（3）GPGPU辅助方法仅通过单个Kepler GPU的支持就大大改进了CKDB树上的方法。，并以每秒250万个数据元组的速度对流数据进行实时过滤。大规模并行计算技术在流数据监视中显示出巨大的潜力。

著录项

来源
《Parallel and Distributed Systems, IEEE Transactions on》 |2015年第3期|834-846|共13页
作者
Ze Deng; Xiaoming Wu; Lizhen Wang; Xiaodao Chen; Ranjan Rajiv; Zomaya Albert; Dan Chen;
展开▼
作者单位

Sch. of Comput. Sci., China Univ. of Geosci., Wuhan, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
graphics processing units; indexing; parallel processing; query processing; CKDB-tree structure; GPGPU; Kepler GPU; cell-tree indexing approach; data indexing approach; dynamic continuous queries; general-purpose computing; graphics processing unit; indexing structure; massively parallel computing technology; parallel processing; query indexing approach; streaming data filtering; streaming data flow; streaming data monitoring; Computer architecture; Graphics processing units; Indexing; Maintenance engineering; Monitoring; Real-time systems; GPGPU; KDB-Tree; Streaming data; big data computing; cell-tree query indexing structure; data-intensive computing;

机译：图形处理单元;索引;并行处理;查询处理;CKDB-树结构;GPGPU;开普勒GPU;单元树索引方法;数据索引方法;动态连续查询;通用计算;图形处理单元;索引结构;大规模并行计算技术;并行处理;查询索引方法;流数据过滤;流数据流;流数据监视;计算机体系结构;图形处理单元;索引;维护工程;监控;实时系统;GPGPU;KDB-Tree;流数据;大数据计算;单元树查询索引结构;数据密集型计算;

相似文献

外文文献
中文文献
专利

1. Parallel processing of continuous queries over data streams [J] . Ali A. Safaei, Mostafa S. Haghjoo Distributed and Parallel Databases . 2010,第2a3期

机译：并行处理数据流中的连续查询
2. Dynamic routing of data stream tuples among parallel query plan running on multi-core processors [J] . Ali A. Safari, AH Sharifrazavian, Mohsen Sharifi, Distributed and Parallel Databases . 2012,第2期

机译：在多核处理器上运行的并行查询计划中数据流元组的动态路由
3. Parallel Continuous Preference Queries over Out-of-Order and Bursty Data Streams [J] . Gabriele Mencagli, Massimo Torquati, Marco Danelutto, IEEE Transactions on Parallel and Distributed Systems . 2017,第9期

机译：乱序和突发数据流上的并行连续首选项查询
4. Dynamic continuous query processing over streaming Data [C] . Ananthi M, Sreedhevi D K, Sumalatha M R International Conference on Computation of Power, Energy Information and Commuincation . 2016

机译：通过流数据进行动态连续查询处理
5. Processing continuous queries over streaming data with limited system resources. [D] . Babcock, Brian. 2006

机译：使用有限的系统资源处理流数据上的连续查询。
6. An Adaptive Parallel Processing Strategy for Complex Event Processing Systems over Data Streams in Wireless Sensor Networks [O] . Fuyuan Xiao, Masayoshi Aritsugi 2018

机译：无线传感器网络中数据流上复杂事件处理系统的自适应并行处理策略
7. Continuous query processing in data streams using duality of data and queries [O] . Hyo-sang Lim, Jae-gil Lee, Min-jae Lee, 2006

机译：使用数据和查询的二元性在数据流中进行连续查询处理

Parallel Processing of Dynamic Continuous Queries over Streaming Data Flows

摘要

著录项

相似文献

相关主题

期刊订阅