首页> 外文期刊>Software, IET >Improving semantic compression specification in large relational database
【24h】

Improving semantic compression specification in large relational database

机译:改进大型关系数据库中的语义压缩规范

获取原文
获取原文并翻译 | 示例
           

摘要

The large-scale relational databases normally have a large size and a high degree of sparsity. This has made database compression very important to improve the performance and save storage space. Using standard compression techniques (syntactic) such as Gzip or Zip does not take advantage of the relational properties, as these techniques do not look at the nature of the data. Since semantic compression accounts for and exploits both the meanings and dynamic ranges of error for individual attributes (lossy compression); and existing data dependencies and correlations between attributes in the table (lossless compression), it is very effective for table-data compression. Inspired by semantic compression, this study proposes a novel independent lossless compression system through utilising data-mining model to find the frequent pattern with maximum gain (representative row) in order to draw attribute semantics, besides a modified version of an augmented vector quantisation coder to increase total throughput of the database compression. This algorithm enables more granular and suitable for every kind of massive data tables after synthetically considering compression ratio, space, and speed. The experimentation with several very large real-life datasets indicates the superiority of the system with respect to previously known lossless semantic techniques.
机译:大型关系数据库通常具有较大的规模和高度的稀疏性。这使得数据库压缩对于提高性能和节省存储空间非常重要。使用标准压缩技术(语法)(例如Gzip或Zip)不会利用关系属性,因为这些技术不会考虑数据的性质。由于语义压缩考虑并利用了单个属性错误的含义和动态范围(有损压缩);以及表中属性之间现有的数据依赖性和相关性(无损压缩),这对于表数据压缩非常有效。受语义压缩的启发,本研究提出了一种新颖的独立无损压缩系统,它利用数据挖掘模型来查找具有最大增益的频繁模式(代表行)以绘制属性语义,此外还提供了增强矢量量化编码器的改进版本。增加数据库压缩的总吞吐量。综合考虑压缩率,空间和速度后,该算法可以使粒度更细小,并且适合各种海量数据表。对几个非常大的现实数据集的实验表明,该系统相对于先前已知的无损语义技术而言具有优越性。

著录项

  • 来源
    《Software, IET》 |2016年第4期|108-115|共8页
  • 作者

    Saad Mohamed Darwish;

  • 作者单位

    Institute of Graduate Studies and Research, Alexandria University, Egypt;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号