首页> 外文会议>IEEE International Congress on Big Data >Towards a Semantic Extract-Transform-Load (ETL) Framework for Big Data Integration
【24h】

Towards a Semantic Extract-Transform-Load (ETL) Framework for Big Data Integration

机译:面向大数据集成的语义提取-转换-加载(ETL)框架

获取原文

摘要

Big Data has become the new ubiquitous term used to describe massive collection of datasets that are difficult to process using traditional database and software techniques. Most of this data is inaccessible to users, as we need technology and tools to find, transform, analyze, and visualize data in order to make it consumable for decision-making. One aspect of Big Data research is dealing with the Variety of data that includes various formats such as structured, numeric, unstructured text data, email, video, audio, stock ticker, etc. Managing, merging, and governing a variety of data is the focus of this paper. This paper proposes a semantic Extract-Transform-Load (ETL) framework that uses semantic technologies to integrate and publish data from multiple sources as open linked data. This includes - creation of a semantic data model to provide a basis for integration and understanding of knowledge from multiple sources, creation of a distributed Web of data using Resource Description Framework (RDF) as the graph data model, extraction of useful knowledge and information from the combined data using SPARQL as the semantic query language.
机译:大数据已成为新的普遍使用的术语,用于描述海量数据集,而这些数据集很难使用传统的数据库和软件技术来处理。由于我们需要技术和工具来查找,转换,分析和可视化数据以使其可用于决策,因此大多数数据对于用户都是不可访问的。大数据研究的一个方面是处理各种数据,其中包括结构,数字,非结构化文本数据,电子邮件,视频,音频,股票行情等各种格式。管理,合并和管理各种数据是本文的重点。本文提出了一种语义提取-转换-加载(ETL)框架,该框架使用语义技术将来自多个源的数据集成和发布为开放链接数据。这包括-创建语义数据模型以提供集成和理解来自多个来源的知识的基础,使用资源描述框架(RDF)作为图形数据模型创建分布式Web数据,从中提取有用的知识和信息使用SPARQL作为语义查询语言的组合数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号