On the Midpoint of a Set of XML Documents

机译：在一组XML文档的中点上

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The WWW contains a huge amount of documents. Some of them share the subject, but are generated by different people or even organizations. To guarantee the interchange of such documents, we can use XML, which allows to share documents that do not have the same structure. However, it makes difficult to understand the core of such heterogeneous documents (in general, schema is not available). In this paper, we offer a characterization and algorithm to obtain the midpoint (in terms of a resemblance function) of a set of semi-structured, heterogeneous documents without optional elements. The trivial case of midpoint would be the common elements to all documents. Nevertheless, in cases with several heterogeneous documents this may result in an empty set. Thus, we consider that those elements present in a given amount of documents belong to the midpoint. A exact schema could always be found generating optional elements. However, the exact schema of the whole set may result in overspecialization (lots of optional elements), which would make it useless.

机译：WWW包含大量文档。其中一些共享主题，但由不同的人甚至组织生成。为了保证这些文档的交换，我们可以使用XML，允许共享没有相同结构的文档。但是，难以理解这种异构文件的核心（一般，模式不可用）。在本文中，我们提供了一种表征和算法，以获得一组半结构化的异构文档的中点（在相似函数方面），无需可选元素。中点的琐碎案将是所有文件的常见元素。尽管如此，在几个异构文件的情况下，这可能导致空集。因此，我们认为给定的文件中存在的那些元素属于中点。可以始终找到一个完全的架构生成可选元素。但是，整个集合的确切架构可能会导致超微化（许多可选元素），这将使它无用。

著录项

来源
《International Conference on Database and Expert Systems Applications》|2005年||共10页
会议地点
作者
Alberto Abello; Xavier de Palol; Mohand-Saied Hacid;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词

相似文献

外文文献
中文文献
专利

1. Research on Basic Operations for Query Probabilistic XML Document Based on Path Set [J] . Jianwei Wang, Zhongxiao Hao Journal of software . 2013,第4期

机译：基于路径集的概率XML文档的基本操作研究
2. Research on Basic Operations for Query Probabilistic XML Document Based on Path Set [J] . Jianwei Wang1, 2, Zhongxiao Hao1, Journal of software . 2013,第4期

机译：基于路径集的查询概率XML文档的基本操作研究
3. A Systematic Approach for Changing XML Namespaces in XML Schemas and Managing their Effects on Associated XML Documents under Schema Versioning [J] . Zouhaier Brahmia, Fabio Grandi, Rafik Bouaziz Journal of digital information management . 2016,第5期

机译：一种在XML模式中更改XML命名空间并管理其在模式版本控制下对关联XML文档的影响的系统方法
4. On the Midpoint of a Set of XML Documents [C] . Alberto Abello, Xavier de Palol, Mohand-Saied Hacid International Conference on Database and Expert Systems Applications . 2005

机译：在一组XML文档的中点上
5. Data hiding and detection in office open XML (OOXML) documents . [D] . Raffay, Muhammad Ali. 2011

机译：Office Open XML（OOXML）文档中的数据隐藏和检测。
6. Using XML Metadata to Enable the Automatic Generation and Processing of HTML Forms from XML Documents [O] . Anil K. Dubey, Henry C. Chueh 2001

机译：使用XML元数据启用从XML文档自动生成和处理HTML表单的功能
7. Comparing Document Object Model (DOM) and simple API for XML (SAX) in processing XML document in leave application system [O] . Wahid Juliana 2008

机译：在休假申请系统中处理XML文档时，比较文档对象模型（DOM）和XML的简单API（SAX）

On the Midpoint of a Set of XML Documents

摘要

著录项

相似文献

相关主题

期刊订阅