A Distributed Storage Model for Healthcare Big Data Designed on HBase

Abstract

With the explosive growth of healthcare data, traditional relational database management systems (RDBMS) are limited in scalability, storage of unstructured data, concurrency, and cost. We therefore proposed a snowflake model based on HBase, with a multi-table structure and three kinds of index tables, for efficient access to large-scale health records. We also proposed a guideline for designing index tables that covers the demands of most healthcare data processing applications. A benchmark test with six types of queries was carried out on a large dataset of 750 million records to compare the performance of the proposed model against the traditional tall-table model on HBase. The snowflake model was more efficient than the tall-table model. Adopting index tables greatly improved query speed and enabled real-time queries for both models. In general, the snowflake model can serve as an advantageous alternative for managing large-scale healthcare data.
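The abstract does not give the schema, so the following is only a minimal sketch of the general index-table idea, not the authors' design. It uses an in-memory stand-in for a table sorted by row key rather than any HBase client API. The table names, the `diagnosis|record_id` key layout, and the record fields are all hypothetical, chosen for illustration. The sketch shows how a separate index table can turn a query on a non-key attribute into a short prefix scan followed by point lookups, instead of a full scan of the main table.

```python
from bisect import bisect_left


class SortedTable:
    """Toy stand-in for a table whose rows are kept sorted by row key."""

    def __init__(self):
        self._keys = []   # row keys in sorted order
        self._rows = {}   # row key -> stored value

    def put(self, row_key, value):
        if row_key not in self._rows:
            self._keys.insert(bisect_left(self._keys, row_key), row_key)
        self._rows[row_key] = value

    def get(self, row_key):
        return self._rows.get(row_key)

    def prefix_scan(self, prefix):
        # Jump to the first key >= prefix, then read while the prefix matches.
        i = bisect_left(self._keys, prefix)
        while i < len(self._keys) and self._keys[i].startswith(prefix):
            yield self._keys[i], self._rows[self._keys[i]]
            i += 1

    def full_scan(self):
        for key in self._keys:
            yield key, self._rows[key]


# Hypothetical tables: a main record table and one index table.
records = SortedTable()
diagnosis_index = SortedTable()


def put_record(record_id, patient_id, diagnosis):
    """Write the record and its index entry (assumed key: diagnosis|record_id)."""
    records.put(record_id, {"patient_id": patient_id, "diagnosis": diagnosis})
    diagnosis_index.put(f"{diagnosis}|{record_id}", record_id)


def find_by_diagnosis_indexed(diagnosis):
    """Prefix-scan the index table, then fetch each record by its row key."""
    hits = diagnosis_index.prefix_scan(f"{diagnosis}|")
    return [records.get(record_id) for _, record_id in hits]


def find_by_diagnosis_scan(diagnosis):
    """Baseline without an index: inspect every row of the main table."""
    return [row for _, row in records.full_scan() if row["diagnosis"] == diagnosis]


put_record("r001", "p1", "E11")
put_record("r002", "p2", "I10")
put_record("r003", "p3", "E11")
```

Both lookup functions return the same records. The indexed version reads only the index entries that share the requested prefix, which is the property that makes such queries cheap on a store sorted by row key. The cost is an extra write per record and the need to keep the index consistent with the main table.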