首页> 外文学位 >Information brokering over heterogeneous digital data: A metadata-based approach.
【24h】

Information brokering over heterogeneous digital data: A metadata-based approach.

机译:异构数字数据上的信息代理:一种基于元数据的方法。

获取原文
获取原文并翻译 | 示例

摘要

Information overload, arising from different types of heterogeneous digital data readily accessible from millions of repositories, is a critical problem on the Global Information Infrastructure (GII). We present an information brokering approach, architecture and techniques that address issues related to information overload on the GII. The approach spans three levels: representation (structure/format/type) of digital data, information content captured in the data; and the vocabulary underlying the data. Metadata (data/information about data) is used to abstract from heterogeneous representational details and capture information content. Domain specific ontologies are used to represent and interoperate across different vocabularies used to characterize information content. The approach thus suggested induces a metadata-based architecture that enables information brokering at the different levels.; The feasibility of the approach is demonstrated by using a wide variety of metadata to capture information content for textual, image and structured data. These metadata belong to a wide spectrum and may range from metadata independent of the data content to those capturing information content in a application and domain specific manner. This thesis demonstrates how metadata characterizing information in a domain specific manner may enable: (a) media-independent correlation of information across heterogeneous media; and (b) vocabulary-based interoperation of information across different domains.; Example information brokering prototypes based on metadata capturing information content to varying degrees are presented as instantiations to validate the proposed architecture. We also identify the desired ("SEA") properties of an architecture in the presence of information overload, namely, scalability, extensibility and adaptability; and discuss in what measure the prototypes display these properties. The intrinsic trade-off between scalability and extensibility is identified and discussed. Adaptability, a new proposed property, is the ability of an information brokering system to adapt to different vocabularies used to describe similar information content. We show how maximizing scalability leads to issues of adaptability and how terminological relationships across domain specific ontologies characterizing vocabularies may be used to achieve interoperation and increase adaptability.
机译:易于从数百万个存储库访问的不同类型的异构数字数据引起的信息过载是全球信息基础架构(GII)上的关键问题。我们提出一种信息代理方法,体系结构和技术,以解决与GII上的信息过载有关的问题。该方法跨越三个层次:数字数据的表示(结构/格式/类型),数据中捕获的信息内容;以及数据的基础词汇。元数据(有关数据的数据/信息)用于从异构表示细节中抽象出来并捕获信息内容。特定领域本体用于表示不同词汇表并在用于描述信息内容的词汇表上进行互操作。因此,该方法提出了一种基于元数据的体系结构,该体系结构可以在不同级别进行信息代理。通过使用各种元数据捕获文本,图像和结构化数据的信息内容,证明了该方法的可行性。这些元数据属于广泛的范围,范围可以从独立于数据内容的元数据到以应用程序和领域特定方式捕获信息内容的元数据。本论文证明了以域特定方式表征信息的元数据如何能够实现:(a)跨异构媒体的独立于媒体的信息相关性; (b)跨领域的基于词汇的信息互操作;以基于元数据的程度捕获信息内容的示例信息代理原型为例,以验证所提出的体系结构。在信息过载的情况下,我们还确定了体系结构的期望(“ SEA”)属性,即可伸缩性,可扩展性和适应性。并讨论原型以何种方式显示这些属性。确定并讨论了可伸缩性和可扩展性之间的内在折衷。适应性是一项新提议的属性,它是信息代理系统适应用于描述相似信息内容的不同词汇的能力。我们展示了最大化可扩展性如何导致适应性问题,以及跨领域特定词汇表述词汇的术语关系如何用于实现互操作和提高适应性。

著录项

  • 作者

    Kashyap, Vipul Y.;

  • 作者单位

    Rutgers The State University of New Jersey - New Brunswick.;

  • 授予单位 Rutgers The State University of New Jersey - New Brunswick.;
  • 学科 Computer Science.; Information Science.
  • 学位 Ph.D.
  • 年度 1998
  • 页码 252 p.
  • 总页数 252
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;信息与知识传播;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号