Dynamic-ETL: a hybrid approach for health data extraction, transformation and loading

Toan C. Ong; Michael G. Kahn; Bethany M. Kwan; Traci Yamashita; Elias Brandt; Patrick Hosokawa; Chris Uhrich; Lisa M. Schilling

Abstract

Background: Electronic health records (EHRs) contain detailed clinical data stored in proprietary formats with non-standard codes and structures. Participating in multi-site clinical research networks requires EHR data to be restructured and transformed into a common format and standard terminologies, and optimally linked to other data sources. The expertise and scalable solutions needed to transform data to conform to network requirements are beyond the scope of many health care organizations, and there is a need for practical tools that lower the barriers to contributing data to clinical research networks.

Methods: We designed and implemented a health data transformation and loading approach, which we refer to as Dynamic ETL (Extraction, Transformation and Loading) (D-ETL), that automates part of the process through use of scalable, reusable and customizable code, while retaining manual aspects of the process that require knowledge of complex coding syntax. This approach provides the flexibility required for the ETL of heterogeneous data, accommodates variations in semantic expertise, and keeps the transformation logic transparent, all of which are essential to implementing ETL conventions across clinical research sharing networks. Processing workflows are directed by the ETL specifications guideline, developed by ETL designers with extensive knowledge of the structure and semantics of health data (i.e., “health data domain experts”) and of the target common data model.

Results: D-ETL was implemented to perform ETL operations that load data from various sources with different database schema structures into the Observational Medical Outcomes Partnership (OMOP) common data model. The results showed that ETL rule composition methods and the D-ETL engine offer a scalable solution for health data transformation via automatic query generation to harmonize source datasets.

Conclusions: D-ETL supports a flexible and transparent process to transform and load health data into a target data model. This approach lowers the technical barriers that prevent data partners from participating in research data networks, and therefore promotes the advancement of comparative effectiveness research using secondary electronic health data.
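
The idea of automatic query generation from ETL rules can be illustrated with a minimal sketch. It is not the D-ETL engine and does not reproduce the paper's rule format: the `EtlRule` fields, the `generate_insert` helper, the source table `patient_src` and its columns are all hypothetical names chosen for illustration. Only the target column names (`person_id`, `gender_concept_id`, `year_of_birth`) and the gender concept IDs 8507 and 8532 come from the OMOP common data model.

```python
import sqlite3
from dataclasses import dataclass


@dataclass(frozen=True)
class EtlRule:
    """One hypothetical ETL rule: fill a target column from a source expression."""
    target_table: str
    target_column: str
    source_table: str
    source_expr: str  # SQL expression evaluated against the source table


def generate_insert(rules: list[EtlRule]) -> str:
    """Compose rules that share one target and one source table into a single
    INSERT ... SELECT statement."""
    if not rules:
        raise ValueError("at least one rule is required")
    targets = {r.target_table for r in rules}
    sources = {r.source_table for r in rules}
    if len(targets) != 1 or len(sources) != 1:
        raise ValueError("rules must share one target table and one source table")
    columns = ", ".join(r.target_column for r in rules)
    exprs = ", ".join(f"{r.source_expr} AS {r.target_column}" for r in rules)
    return (f"INSERT INTO {rules[0].target_table} ({columns}) "
            f"SELECT {exprs} FROM {rules[0].source_table}")


# Hypothetical rules mapping an invented source table onto OMOP-style person columns.
PERSON_RULES = [
    EtlRule("person", "person_id", "patient_src", "pid"),
    EtlRule("person", "gender_concept_id", "patient_src",
            "CASE sex WHEN 'M' THEN 8507 WHEN 'F' THEN 8532 ELSE 0 END"),
    EtlRule("person", "year_of_birth", "patient_src",
            "CAST(substr(dob, 1, 4) AS INTEGER)"),
]


def run_demo() -> list[tuple]:
    """Load two invented source rows through the generated query (in-memory SQLite)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE patient_src (pid INTEGER, sex TEXT, dob TEXT)")
    conn.executemany("INSERT INTO patient_src VALUES (?, ?, ?)",
                     [(1, "F", "1980-05-17"), (2, "M", "1975-11-02")])
    conn.execute("CREATE TABLE person (person_id INTEGER, "
                 "gender_concept_id INTEGER, year_of_birth INTEGER)")
    conn.execute(generate_insert(PERSON_RULES))
    rows = conn.execute("SELECT person_id, gender_concept_id, year_of_birth "
                        "FROM person ORDER BY person_id").fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(generate_insert(PERSON_RULES))
    print(run_demo())
```

In this sketch the rules stay human-readable and reviewable, while the SQL is assembled mechanically, which is one way to read the split between manual rule authoring and automated query generation described in the abstract.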

Bibliographic details

Source: BMC Medical Informatics and Decision Making, 2017, Issue 1 (1 page)
Authors: Toan C. Ong; Michael G. Kahn; Bethany M. Kwan; Traci Yamashita; Elias Brandt; Patrick Hosokawa; Chris Uhrich; Lisa M. Schilling
Original format: PDF
Chinese Library Classification: Medicine and health

Similar literature

1. Proses Extraction, Transformation, and Loading pada Pemodelan Data Warehouse PO. Sumber Alam Kutoarjo (The extraction, transformation, and loading process in data warehouse modeling at PO. Sumber Alam Kutoarjo) [J]. Agustinus Fritz Wijaya, Antonius Teddy Sugiarto. Jurnal Terapan Teknologi Informasi: JUTEI. 2017, Issue 1.
2. EMD: entity mapping diagram for automated extraction, transformation, and loading processes in data warehousing [J]. Abdeltawab M.A. Hendawi, Shaker H. AN El-Sappagh. International journal of intelligent information and database systems. 2012, Issue 3.
3. Investigation of Extraction, Transformation, and Loading Techniques for Traffic Data Warehouses [J]. Brian L. Smith, Simona Babiceanu. Transportation Research Record. 2004, Issue 1879.
4. Data Integration for Rubber Import and Export Information: An Extraction Transformation Load (ETL) Approach [C]. Mimi Safinaz Jamaluddin, Nurulhuda Mohd Azmi, Nazri Kama. International Conference on Applied Computer and Applied Computational Science. 2015.
5. Mercury and methylmercury in Spring Lake, Minnesota: A mass balance approach comparing redox transformations, methylmercury photodegradation, sediment loading, and watershed processes [D]. Hines, Neal Albert. 2004.
6. Dynamic-ETL: a hybrid approach for health data extraction transformation and loading [O]. Toan C. Ong, Michael G. Kahn, Bethany M. Kwan. 2017.
7. A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation [O]. Toan Ong, Rosina Pradhananga, Erin Holve. 2017.
8. InfoXtract Location Normalization: A Hybrid Approach to Geographic References in Information Extraction [R]. Li, H., Srihari, R. K., Niu, C. 2003.
