Query Rewriting for Extracting Data Behind HTML Forms


Abstract

Much of the information on the Web is stored in specialized searchable databases and can only be accessed by interacting with a form or a series of forms. As a result, enabling automated agents and Web crawlers to interact with form-based interfaces designed primarily for humans is of great value. This paper describes a system that can fill out Web forms automatically according to a given user query against an ontological description of an application domain and, to the extent possible, can extract just the relevant data behind these Web forms. Experimental results on two application domains show that the approach can work well.