首页> 外文会议>Knowledge-Based Systems for Safety Critical Applications >A framework for the selective dissemination of XML documents based on inferred user profiles
【24h】

A framework for the selective dissemination of XML documents based on inferred user profiles

机译:一个基于推断的用户概要文件选择性分发XML文档的框架

获取原文
获取原文并翻译 | 示例

摘要

As the amount of data available online and the number of pervasive applications that take advantage of it increase, systems that support selective dissemination of information are becoming more popular. At the same time, XML is becoming the standard for document exchange over the Internet. A key capability of emerging information dissemination systems is therefore the effective filtering of a continuous stream of XML data items according to user preferences. Here we propose a model for information dissemination that integrates profile inference with data dissemination and takes advantage of the structured content in XML documents. Starting from the assumption that explicitly stating one's information interests is an inconvenient and error-prone process, we aim to automatically construct user profiles. We do this by clustering items previously deemed valuable by the user according to a novel similarity measure that takes advantage of the semantic content of XML. Furthermore, we index the profiles from all users into a multilevel index structure whose nodes naturally will be a close match to subject areas present in the document collection. Such an approach is both intuitive and efficient since the indexing structure is not primarily affected by an increasing number of users. To support our claims we experimentally validate our method and report on its effectiveness and efficiency.
机译:随着在线可用数据量的增加和利用它的无处不在的应用程序数量的增加,支持选择性传播信息的系统变得越来越流行。同时,XML正在成为Internet上文档交换的标准。因此,新兴的信息传播系统的一项关键功能是根据用户偏好对XML数据项的连续流进行有效过滤。在这里,我们提出了一种信息传播模型,该模型将配置文件推断与数据传播集成在一起,并利用了XML文档中的结构化内容。从以下假设开始:明确声明个人的信息兴趣是一个不便且容易出错的过程,我们旨在自动构建用户个人资料。我们通过利用一种利用XML语义内容的新颖的相似性度量,将以前被用户认为有价值的项目聚类,从而做到这一点。此外,我们将来自所有用户的配置文件索引到一个多级索引结构中,该结构的节点自然会与文档集中存在的主题区域紧密匹配。由于索引结构主要不受越来越多的用户影响,因此这种方法既直观又有效。为了支持我们的主张,我们通过实验验证了我们的方法并报告了其有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号