A category-based similarity algorithm for semantic similarity in information sharing.

Abstract

Similarity measures are mechanisms that assign a numeric score indicating how closely two documents, or a document and a query, match. The Cosine measure is one such measure; it treats a document or a query as a vector of weighted terms or keywords. The similarity score calculated by the Cosine measure is based on exact matching of keywords, so the semantic relatedness between the keywords of the two documents is not considered.

This thesis presents a category-based similarity algorithm (CSA) to determine the semantic similarity between any two pieces of information. CSA is implemented inside the ACORN system and adds a semantic similarity feature to it. CSA is applicable to any information sharing system in which the pieces of information are represented as vectors of weighted keywords.

The CSA framework improves information sharing among agents that are semantically close to each other but whose key-phrase vectors are syntactically different. The evaluation of CSA within the ACORN system demonstrates that the number of agents with which each agent shares its information increases compared to the original ACORN system. The evaluation uses cliques of agents as the criterion for the performance of the new ACORN system after applying CSA: agents that are very similar to each other join the same clique, and the more populated the cliques are, the better the information sharing among the agents. The average number of agents inside cliques increases after applying CSA to the ACORN system.
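
The limitation of exact keyword matching that the abstract describes can be illustrated with a minimal sketch of the Cosine measure over weighted keyword vectors. This is not code from the thesis or from ACORN: the function name `cosine_similarity`, the dictionary representation, and the sample keywords and weights are all assumptions chosen for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine measure between two keyword-weight vectors stored as dicts.

    Hypothetical helper for illustration; not taken from the thesis.
    """
    # Only keywords spelled identically in both vectors contribute.
    dot = sum(weight * b[key] for key, weight in a.items() if key in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector matches nothing
    return dot / (norm_a * norm_b)


# Made-up keyword vectors (assumed data, for illustration only).
doc_car = {"car": 0.8, "engine": 0.6}
doc_auto = {"automobile": 0.8, "motor": 0.6}  # related meaning, different words
doc_car2 = {"car": 0.6, "engine": 0.8}        # same words, different weights

print(cosine_similarity(doc_car, doc_auto))  # 0.0: no keyword matches exactly
print(cosine_similarity(doc_car, doc_car2))  # approximately 0.96
```

The first pair scores zero even though the keywords are semantically related, which is the gap a semantic similarity measure such as CSA is meant to address.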

Bibliographic record

  • Author

    Miralaei, Sepideh

  • Author affiliation

    University of New Brunswick (Canada)

  • Degree-granting institution: University of New Brunswick (Canada)
  • Subject: Computer science
  • Degree: M.C.S.
  • Year: 2005
  • Pages: 157 p.
  • Total pages: 157
  • Original format: PDF
  • Language: English