...
首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Enabling Massive XML-Based Biological Data Management in HBase
【24h】

Enabling Massive XML-Based Biological Data Management in HBase

机译:在HBase中启用基于XML的基于XML的生物数据管理

获取原文
获取原文并翻译 | 示例
           

摘要

Publishing biological data in XML formats is attractive for organizations who would like to provide their bioinformatics resources in an extensible and machine-readable format. In the era of big data, massive XML-based biological data management is emerged as a challengeable issue. With the continuous growth of the XML-based biological data sets, it is usually frustrating to use traditional declarative query languages to provide efficient query capabilities in terms of processing speed and scale. In this study, we report a novel platform to store and query massive XML-based biological data collections. A prototype tool for constructing HBase tables from XML-based biological data collections is first developed, and then a formal approach to transform the XML query model into the MapReduce query model is proposed. Finally, an evaluation of the query performance of the proposed approach on the existing XML-based biological databases is presented, showing that the performance advantages of the proposed solution. The source code of the massive XML-based biological data management platform is freely available at https://github.com/lyotvincent/X2H.
机译:以XML格式出版生物数据对于希望以可扩展和机器可读格式提供生物信息化资源的组织具有吸引力。在大数据的时代,基于大规模的XML的生物数据管理被出现为有挑战性的问题。随着基于XML的生物数据集的连续增长,使用传统的声明性查询语言通常令人沮丧,以便在处理速度和比例方面提供有效的查询功能。在这项研究中,我们报告了一个新颖的平台来存储和查询基于XML的生物数据收集。首先开发用于构造基于XML的生物数据收集的HBase表的原型工具,然后提出了一种将XML查询模型转换为MapReduce查询模型的正式方法。最后,提出了对现有XML的生物数据库上提出的方法的查询性能的评估,显示了所提出的解决方案的性能优势。基于XML的基于XML的生物数据管理平台的源代码在HTTPS://github.com/lyotvincent/x2h自由使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号