首页> 外文会议>Language processing and intelligent information systems >Related Entity Finding Using Semantic Clustering Based on Wikipedia Categories
【24h】

Related Entity Finding Using Semantic Clustering Based on Wikipedia Categories

机译:基于维基百科类别的使用语义聚类的相关实体查找

获取原文
获取原文并翻译 | 示例

摘要

We present a system that performs Related Entity Finding, that is, Question Answering that exploits Semantic Information from the WWW and returns URIs as answers. Our system uses a search engine to gather all candidate answer entities and then a linear combination of Information Retrieval measures to choose the most relevant. For each one we look up its Wikipedia page and construct a novel vector representation based on the tokenization of the Wikipedia category names. This novel representation gives our system the ability to compute a measure of semantic relatedness between entities, even if the entities do not share any common category. We use this property to perform a semantic clustering of the candidate entities and show that the biggest cluster contains entities that axe closely related semantically and can be considered as answers to the query. Performance measured on 20 topics from the 2009 TREC Related Entity Finding task shows competitive results.
机译:我们提出了一个执行相关实体查找(即问题回答)的系统,该问题利用来自WWW的语义信息并返回URI作为答案。我们的系统使用搜索引擎来收集所有候选答案实体,然后使用信息检索度量的线性组合来选择最相关的实体。对于每个人,我们都查找其Wikipedia页面,并基于Wikipedia类别名称的标记化来构造一种新颖的矢量表示形式。这种新颖的表示方式使我们的系统能够计算实体之间的语义相关性度量,即使实体不共享任何公共类别也是如此。我们使用此属性对候选实体执行语义聚类,并显示最大的聚类包含语义上密切相关的实体,可以将其视为查询的答案。根据2009 TREC相关实体查找任务中的20个主题测得的绩效显示出具有竞争力的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号