Journal: International Journal of Web Services Research

Karma2: Provenance Management for Data-Driven Workflows


Abstract

The growing ability of the sciences to sense the world around us is driving demand for data-driven e-Science applications controlled by workflows composed of services on the Grid. Our work focuses on provenance collection for these workflows, which is necessary to validate a workflow and to determine the quality of the data products it generates. The challenge we address is to record uniform and usable provenance metadata that meets domain needs while minimizing both the modification burden on service authors and the performance overhead on the workflow engine and the services. The framework generates discrete provenance activities during the lifecycle of a workflow execution; these can be aggregated into complex data and process provenance graphs that can span workflows. The implementation propagates these activities through a loosely coupled publish-subscribe architecture, and the capabilities of the system satisfy the needs of detailed provenance collection. A performance evaluation of a prototype finds minimal overhead (in the range of 1% for an eight-service workflow using 271 data products).
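The idea of publishing discrete provenance activities and aggregating them into a graph that spans workflows can be illustrated with a minimal sketch. This is not Karma2's actual API or schema: the `Activity` fields, the in-process `Bus`, the `ProvenanceCollector` class, the topic name, and the sample data product names are all hypothetical, chosen only to show the pattern.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Activity:
    """One discrete provenance activity (hypothetical schema)."""
    kind: str      # "consumed" or "produced"
    workflow: str  # workflow instance ID
    service: str   # service that emitted the activity
    data: str      # data product ID


class Bus:
    """Toy in-process publish-subscribe broker (assumption, not Karma2's)."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, callback):
        self._subscribers[topic].append(callback)

    def publish(self, topic, activity):
        # Publishers never reference the collector directly (loose coupling).
        for callback in self._subscribers[topic]:
            callback(activity)


class ProvenanceCollector:
    """Aggregates activities into a data provenance graph."""

    def __init__(self):
        self.inputs = defaultdict(set)        # (workflow, service) -> consumed IDs
        self.derived_from = defaultdict(set)  # data ID -> IDs it was derived from

    def on_activity(self, a):
        key = (a.workflow, a.service)
        if a.kind == "consumed":
            self.inputs[key].add(a.data)
        elif a.kind == "produced":
            self.derived_from[a.data] |= self.inputs[key]

    def lineage(self, data):
        """Return all transitive ancestors of a data product."""
        seen, stack = set(), [data]
        while stack:
            for parent in self.derived_from.get(stack.pop(), ()):
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        return seen


bus = Bus()
collector = ProvenanceCollector()
bus.subscribe("provenance", collector.on_activity)

# Two workflows; the second consumes a product of the first.
events = [
    Activity("consumed", "wf1", "ingest", "raw.dat"),
    Activity("produced", "wf1", "ingest", "clean.dat"),
    Activity("consumed", "wf2", "model", "clean.dat"),
    Activity("produced", "wf2", "model", "forecast.dat"),
]
for e in events:
    bus.publish("provenance", e)

ancestors = collector.lineage("forecast.dat")
```

Because services only publish small activity records and the collector does the aggregation, the lineage of `forecast.dat` can be traced back through `clean.dat` to `raw.dat` even though the two products were created in different workflow runs.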
