首页> 外文会议>22nd International Conference on Computational Linguistics >Using Three Way Data for Word Sense Discrimination
【24h】

Using Three Way Data for Word Sense Discrimination

机译:使用三向数据进行词义识别

获取原文
获取原文并翻译 | 示例

摘要

In this paper, an extension of a dimensionality reduction algorithm called NON-NEGATIVE MATRIX FACTORIZATION is presented that combines both 'bag of words' data and syntactic data, in order to find semantic dimensions according to which both words and syntactic relations can be classified. The use of three way data allows one to determine which dimension(s) are responsible for a certain sense of a word, and adapt the corresponding feature vector accordingly, 'subtracting' one sense to discover another one. The intuition in this is that the syntactic features of the syntax-based approach can be disambiguated by the semantic dimensions found by the bag of words approach. The novel approach is embedded into clustering algorithms, to make it fully automatic. The approach is carried out for Dutch, and evaluated against EuroWordNet.
机译:本文提出了一种称为非负矩阵分解的降维算法的扩展,该算法将“词袋”数据和句法数据结合在一起,以找到可以对词和句法关系进行分类的语义维度。三种方式的数据的使用允许人们确定哪个维度负责某个词的某种意义,并相应地调整相应的特征向量,“减去”一种意义以发现另一种意义。直觉是基于语法的方法的语法特征可以通过单词袋方法发现的语义维度来消除歧义。新颖的方法被嵌入到聚类算法中,以使其完全自动化。该方法是针对荷兰语执行的,并针对EuroWordNet进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号