首页> 外文学位 >Querying Web pages with database query languages.
【24h】

Querying Web pages with database query languages.

机译:使用数据库查询语言查询网页。

获取原文
获取原文并翻译 | 示例

摘要

As the World Wide Web is growing at a phenomenal rate, it becomes more and more difficult to retrieve information of interest from the enormous number of resources that are available. Currently, there are two ways to retrieve information from the Web, namely, navigation/browsing and searching by search engines. However, these search methods have significant limitations, such as, the "lost-in-hyperspace" phenomenon, the ignorance of the hypertext structure, etc. These drawbacks motivated the development of a flexible and powerful web query system.; This thesis presents a prototype system developed to query the Web with database query languages. In our prototype system, the Web is modeled as a labeled directed graph which can be stored in a relational database. A parser was designed and implemented in our prototype system to extract the information of a web page from the source HTML file and store it into the database. Three query facilities are developed in the prototype system, namely, the content query, the structure query and the advanced query, which can be used to pose queries on both the content and the hypertext structure of web pages. Extensive experiments have been performed to test the prototype system. The testing results show that database query languages can be used successfully in querying the Web.
机译:随着万维网以惊人的速度增长,从大量可用资源中检索感兴趣的信息变得越来越困难。当前,有两种方法可以从Web检索信息,即导航/浏览和搜索引擎搜索。但是,这些搜索方法具有很大的局限性,例如“超空间丢失”现象,超文本结构的无知等。这些缺点促使了灵活而强大的Web查询系统的发展。本文提出了一种原型系统,该系统被开发为使用数据库查询语言来查询Web。在我们的原型系统中,Web被建模为可以存储在关系数据库中的带标签的有向图。在我们的原型系统中设计并实现了一个解析器,以从源HTML文件中提取网页信息并将其存储到数据库中。在原型系统中开发了三种查询功能,即内容查询,结构查询和高级查询,它们可用于对网页的内容和超文本结构进行查询。已经进行了广泛的实验以测试原型系统。测试结果表明,数据库查询语言可以成功用于Web查询。

著录项

  • 作者

    Yang, Xiaoyu.;

  • 作者单位

    The University of Western Ontario (Canada).;

  • 授予单位 The University of Western Ontario (Canada).;
  • 学科 Computer Science.; Information Science.
  • 学位 M.Sc.
  • 年度 1999
  • 页码 78 p.
  • 总页数 78
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;信息与知识传播;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号