首页> 外文会议>International Conference on Future Information Engineering >Distributionally Extended Network-Based Word Sense Disambiguation in Semantic Clustering of Polish Texts
【24h】

Distributionally Extended Network-Based Word Sense Disambiguation in Semantic Clustering of Polish Texts

机译:波兰语文本语义聚类中分布扩展的基于网络的词义歧义

获取原文

摘要

In the paper we present an extended version of the graph-based unsupervised Word Sense Disambiguation algorithm. The algorithm is based on the spreading activation scheme applied to the graphs dynamically built on the basis of the text words and a large wordnet. The algorithm, originally proposed for English and Princeton WordNet, was adapted to Polish and plWordNet. An extension based on the knowledge acquired from the corpus-derived Measure of Semantic Relatedness was proposed. The extended algorithm was evaluated against the manually disambiguated corpus. We observed improvement in the case of the disambiguation performed for shorter text contexts. In addition the algorithm application expressed improvement in document clustering task.
机译:在论文中,我们呈现了一种基于图的无监督字消歧歧义算法的扩展版本。该算法基于应用于基于文本单词和大写字网动态构建的图形的扩展激活方案。最初提出英语和普林斯顿Wordnet的算法适用于波兰语和PLONDNET。提出了基于从语料库衍生的语义相关性测量获得的知识的扩展。针对手动消化歧义的语料库评估扩展算法。我们观察了对短文本上下文执行歧义的情况的改进。此外,算法应用程序表达了文档聚类任务的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号