首页> 外文学位 >An ontology-driven concept-based information retrieveal approach for Web documents.
【24h】

An ontology-driven concept-based information retrieveal approach for Web documents.

机译:基于本体的基于概念的Web文档信息检索方法。

获取原文
获取原文并翻译 | 示例

摘要

Building computer agents that can utilize the meanings in the text of Web documents is a promising extension of current search technology. Concept-based information retrieval applies "intelligent" agents to identify Web documents that match user queries. A new concept-based information retrieval framework, Hybrid Ontology-based Textual Information Retrieval (HOTIR), is introduced in this thesis. HOTIR accepts conventional keyword-based queries, translates them into concept-based queries, enriches definitions of concepts with supplementary knowledge from a knowledge base, and ranks documents by aggregating "equivalent" concepts identified in them. The concept-based queries in HOTIR are organized in a hierarchy of concepts (HofC) and definitions of concepts are added from a knowledge base to enhance their meanings. The knowledge base is a modified ontology (ModOnt) that can enrich the HofC with concept definitions in the form of related-concepts, terms, their importance values, and their relations. The ModOnt relies on an adaptive assignment of term importance (AATI) scheme that continuously updates the importance of terms/concepts using Web documents. The identified concepts in a Web document that match those in the HofC are evaluated using ordered weighted averaging (OWA) operators, and documents are ranked according to the degree to which they satisfy the HofC. The case studies and experiments presented in the thesis are designed to validate the performance of HOTIR.
机译:建立可以利用Web文档文本中含义的计算机代理是当前搜索技术的有希望的扩展。基于概念的信息检索应用“智能”代理来识别与用户查询匹配的Web文档。本文介绍了一种新的基于概念的信息检索框架,即基于混合本体的文本信息检索(HOTIR)。 HOTIR接受常规的基于关键字的查询,将其转换为基于概念的查询,使用来自知识库的补充知识丰富概念的定义,并通过汇总在其中识别出的“等效”概念来对文档进行排名。 HOTIR中基于概念的查询按概念层次(HofC)进行组织,并从知识库中添加概念的定义以增强其含义。知识库是一种改进的本体(ModOnt),可以用相关概念,术语,它们的重要性值和它们之间的关系的形式来丰富HofC的概念定义。 ModOnt依靠术语重要性的自适应分配(AATI)方案,该方案使用Web文档不断更新术语/概念的重要性。使用有序加权平均(OWA)运算符评估Web文档中与HofC中匹配的概念,然后根据其满足HofC的程度对文档进行排名。本文提出的案例研究和实验旨在验证HOTIR的性能。

著录项

  • 作者

    Li, Zhan.;

  • 作者单位

    University of Alberta (Canada).;

  • 授予单位 University of Alberta (Canada).;
  • 学科 Engineering Computer.Information Science.Web Studies.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 171 p.
  • 总页数 171
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 老年病学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号