Conference: International Conference on Web Information Systems Engineering

Research on Automate Discovery of Deep Web Interfaces


Abstract

The main way to obtain information from the Deep Web is to submit query conditions through the query interfaces that sites provide, so discovering these interfaces is the first problem a Deep Web data integration system must solve. At present, most researchers assume that a query interface is defined solely within the HTML form tag. This paper first proposes the concept of an interface block, then designs an interface block location method based on page and visual information, and finally treats the judgment of whether an interface block is a query interface as a special multi-class classification problem, which it solves with a classification algorithm that combines the C4.5 decision tree and SVM. The experiments use the TEL-8 dataset from UIUC, and the results indicate that the method achieves an accuracy of 97.30% and has good feasibility and practicability.
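The abstract does not list the features or the block location rules the authors use, so the following is only a minimal sketch of the general idea that candidate interface blocks need not be form elements. It assumes a block is any form, div, table, or fieldset container that directly encloses input controls, and it substitutes a hand-written rule for the paper's C4.5 and SVM classifier. The names `BlockCollector`, `candidate_blocks`, `looks_like_query_interface`, and the sample `PAGE` are hypothetical.

```python
from html.parser import HTMLParser

# Assumption: these container tags stand in for the paper's "interface blocks".
CONTAINERS = {"form", "div", "table", "fieldset"}
FEATURES = ("text", "password", "select", "submit")


class BlockCollector(HTMLParser):
    """Count input controls per nearest enclosing container element."""

    def __init__(self):
        super().__init__()
        self.stack = []   # indices into self.blocks for open containers
        self.blocks = []  # one feature dict per container seen

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in CONTAINERS:
            block = {"tag": tag}
            block.update({name: 0 for name in FEATURES})
            self.blocks.append(block)
            self.stack.append(len(self.blocks) - 1)
            return
        if not self.stack:
            return  # control outside any container: ignored in this sketch
        block = self.blocks[self.stack[-1]]
        if tag == "select":
            block["select"] += 1
        elif tag == "input":
            kind = (attrs.get("type") or "text").lower()
            if kind in ("text", "search"):
                block["text"] += 1
            elif kind == "password":
                block["password"] += 1
            elif kind in ("submit", "image"):
                block["submit"] += 1
        elif tag == "button":
            if (attrs.get("type") or "submit").lower() == "submit":
                block["submit"] += 1

    def handle_endtag(self, tag):
        if tag not in CONTAINERS:
            return
        open_tags = [self.blocks[i]["tag"] for i in self.stack]
        if tag in open_tags:
            # Pop until the matching container is closed (tolerates bad nesting).
            while self.blocks[self.stack.pop()]["tag"] != tag:
                pass


def candidate_blocks(html):
    """Return feature dicts for containers that directly hold controls."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    return [b for b in parser.blocks if any(b[name] for name in FEATURES)]


def looks_like_query_interface(block):
    """Hand-written stand-in rule, NOT the paper's C4.5 + SVM classifier."""
    has_field = block["text"] + block["select"] > 0
    return has_field and block["submit"] > 0 and block["password"] == 0


PAGE = """
<div id="search">
  <input type="text" name="title">
  <select name="format"><option>Book</option></select>
  <input type="submit" value="Search">
</div>
<form action="/login">
  <input type="text" name="user">
  <input type="password" name="pw">
  <input type="submit" value="Log in">
</form>
"""

for block in candidate_blocks(PAGE):
    print(block["tag"], looks_like_query_interface(block))
```

In this sketch the search controls sit in a div rather than a form, which is the case a form-only detector would miss, while the login form is rejected because it contains a password field. A real system would replace the rule with a trained classifier and would also use the visual layout information the abstract mentions, which this example does not model.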
