The DBMS - your big data sommelier

机译：DBMS-您的大数据侍酒师

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

When addressing the problem of “big” data volume, preparation costs are one of the key challenges: the high costs for loading, aggregating and indexing data leads to a long data-to-insight time. In addition to being a nuisance to the end-user, this latency prevents real-time analytics on “big” data. Fortunately, data often comes in semantic chunks such as files that contain data items that share some characteristics such as acquisition time or location. A data management system that exploits this trait can significantly lower the data preparation costs and the associated data-to-insight time by only investing in the preparation of the relevant chunks. In this paper, we develop such a system as an extension of an existing relational DBMS (MonetDB). To this end, we develop a query processing paradigm and data storage model that are partial-loading aware. The result is a system that can make a 1.2 TB dataset (consisting of 4000 chunks) ready for querying in less than 3 minutes on a single server-class machine while maintaining good query processing performance.

机译：在解决“大”数据量的问题时，准备成本是主要挑战之一：加载，聚合和索引数据的高成本导致较长的数据获取时间。这种延迟不仅给最终用户带来麻烦，而且还阻止了对“大”数据的实时分析。幸运的是，数据通常以语义块的形式出现，例如文件，其中包含共享某些特征（例如采集时间或位置）的数据项。利用此特征的数据管理系统仅投资相关块的准备工作即可显着降低数据准备成本和相关的数据收集时间。在本文中，我们开发了这样的系统，作为现有关系DBMS（MonetDB）的扩展。为此，我们开发了部分加载感知的查询处理范例和数据存储模型。结果是系统可以在不超过3分钟的时间内在一台服务器级计算机上准备好1.2 TB数据集（由4000个数据块组成）的查询，同时保持良好的查询处理性能。

著录项

来源
《IEEE international conference on data engineering》|2015年|1119-1130|共12页
会议地点
作者
Kargin Yagiz; Kersten Martin; Manegold Stefan; Pirk Holger;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Data in the time of COVID-19: a general methodology to select and secure a NoSQL DBMS for medical data [J] . Kamal A. ElDahshan, AbdAllah A. AlHabshy, Gaber E. Abutaleb PeerJ Computer Science . 2020,第1期

机译：Covid-19中的数据：用于为医疗数据选择和保护NoSQL DBMS的一般方法
2. Necessity to Design of New DBMS Platforms for Data Analysis in Market-oriented Cloud Computing: Properties and Limitations of Data Analysis [J] . Liladhar R. Rewatkar, Ujwal A. Lanjewar Journal of Computational Intelligence in Bioinformatics . 2016,第2期

机译：设计面向市场的云计算中用于数据分析的新DBMS平台的必要性：数据分析的属性和局限性
3. Using Data Compression for Increasing Efficiency of Data Transfer Between Main Memory and Intel Xeon Phi Coprocessor or NVidia GPU in Parallel DBMS [J] . Konstantin Y. Besedin, Pavel S. Kostenetskiy, Stepan O. Prikazchikov Procedia Computer Science . 2015,第1期

机译：使用数据压缩来提高并行DBMS中主内存与Intel Xeon Phi协处理器或NVidia GPU之间的数据传输效率
4. The DBMS - your big data sommelier [C] . Kargin Yagiz, Kersten Martin, Manegold Stefan, IEEE international conference on data engineering . 2015

机译：DBMS - 您的大数据索莫尔
5. Novel Selectivity Estimation Strategy for Modern DBMS [D] . Shin, Jun Hyung 2018

机译：现代DBMS的新型选择性估计策略
6. Evaluation of bone healing in canine tibial defects filled with cortical autograft commercial-DBM calf fetal DBM omentum and omentum-calf fetal DBM [O] . Amin Bigham-Sadegh, Iraj Karimi, Mahsa Alebouye, 2013

机译：评估自体皮质移植物市售DBM小牛胎儿DBM大网膜和大网膜小牛胎儿DBM填充的胫胫骨缺损的骨愈合情况
7. The DBMS - your Big Data Sommelier [O] . Kargın, Y., Kersten, M., Manegold, S., 2015

机译：DBMS-您的大数据侍酒师
8. Develop an Automated Data Base Management System (DBMS): Report on DBMS Software and User's Guide: Final Report, Task 2 [R] . 1987

机译：开发自动数据库管理系统（DBms）：DBms软件和用户指南报告：最终报告，任务2

The DBMS - your big data sommelier

摘要

著录项

相似文献

相关主题

期刊订阅