
A Quantitative Summary of XML Structures



Abstract

Statistical summaries in relational databases mainly focus on the distribution of data values and have proven useful for various applications, such as query evaluation and data storage. As XML has become widely used, e.g. for online data exchange, the need for corresponding statistical summaries of XML has become evident. While relational techniques may be applicable to the data values in XML documents, novel techniques are required for summarizing the structures of XML documents. In this paper, we propose metrics for major structural properties of XML documents, in particular nestings of entities and one-to-many relationships. Our technique differs from existing ones in that we generate a quantitative summary of an XML structure. Using our approach, we show that some popular real-world and synthetic XML benchmark datasets are indeed highly skewed, hardly hierarchical, and contain few recursions. We hope this preliminary finding sheds light on improving the design of XML benchmarks and experiments.
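
The abstract does not define the proposed metrics, so the following is only a rough sketch of what a quantitative structural summary might measure. It assumes three stand-in statistics that are not taken from the paper: maximum nesting depth, mean fan-out per (parent tag, child tag) pair as a proxy for one-to-many relationships, and the set of tags that appear beneath an element of the same tag as a proxy for recursion. The function name `summarize_structure` and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def summarize_structure(xml_text):
    """Illustrative structural summary (not the paper's metrics)."""
    root = ET.fromstring(xml_text)
    max_depth = 0
    # (parent_tag, child_tag) -> child counts, one entry per parent
    # instance that has at least one child with that tag
    fanout = defaultdict(list)
    recursive_tags = set()

    # Iterative depth-first walk; each stack entry carries the element,
    # its depth (root = 1), and the tags of its ancestors.
    stack = [(root, 1, ())]
    while stack:
        elem, depth, ancestors = stack.pop()
        max_depth = max(max_depth, depth)
        if elem.tag in ancestors:
            recursive_tags.add(elem.tag)

        counts = defaultdict(int)
        for child in elem:
            counts[child.tag] += 1
            stack.append((child, depth + 1, ancestors + (elem.tag,)))
        for child_tag, n in counts.items():
            fanout[(elem.tag, child_tag)].append(n)

    mean_fanout = {pair: sum(ns) / len(ns) for pair, ns in fanout.items()}
    return {
        "max_depth": max_depth,
        "mean_fanout": mean_fanout,
        "recursive_tags": recursive_tags,
    }


# Hypothetical sample document, for illustration only
sample = (
    "<library>"
    "<shelf><book/><book/><book/></shelf>"
    "<shelf><book/></shelf>"
    "<section><section/></section>"
    "</library>"
)
summary = summarize_structure(sample)
```

On this sample the two `shelf` elements hold three books and one book, so the mean fan-out for the (`shelf`, `book`) pair is 2.0, while the nested `section` is flagged as recursive. A fuller summary would also look at how these counts are distributed, since the abstract reports that the benchmark datasets are highly skewed.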
