首页> 外文会议>International Convention on Information and Communication Technology, Electronics and Microelectronics >Cloudflow - A framework for MapReduce pipeline development in Biomedical Research
【24h】

Cloudflow - A framework for MapReduce pipeline development in Biomedical Research

机译:Cloudflow-生物医学研究中MapReduce管道开发的框架

获取原文

摘要

The data-driven parallelization framework Hadoop MapReduce allows analysing large data sets in a scalable way. Since the development of MapReduce programs can be a time-intensive and challenging task, the application and usage of Hadoop in Biomedical Research is still limited. Here we present Cloudflow, a high-level framework to hide the implementation details of Hadoop and to provide a set of building blocks to create biomedical pipelines in a more intuitive way. We demonstrate the benefit of Cloudflow on three different genetic use cases. It will be shown how the framework can be combined with the Hadoop workflow system Cloudgene and the cloud orchestration platform CloudMan to provide Hadoop pipelines as a service to everyone.
机译:数据驱动的并行化框架Hadoop MapReduce允许以可扩展的方式分析大型数据集。由于MapReduce程序的开发可能是一项耗时且具有挑战性的任务,因此Hadoop在生物医学研究中的应用和使用仍然受到限制。在这里,我们介绍Cloudflow,这是一个高级框架,用于隐藏Hadoop的实现细节并提供一组构建块,以更直观的方式创建生物医学管道。我们证明了Cloudflow在三种不同的遗传用例上的好处。将展示如何将该框架与Hadoop工作流系统Cloudgene和云编排平台CloudMan组合在一起,以将Hadoop管道作为服务提供给每个人。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号