首页> 外文会议>Annual meeting of the Association for Computational Linguistics >On the Compositionality Prediction of Noun Phrases using Poincare Embeddings
【24h】

On the Compositionality Prediction of Noun Phrases using Poincare Embeddings

机译:论庞加雷嵌入的名词短语的合成性预测

获取原文

摘要

The compositionality degree of multiword expressions indicates to what extent the meaning of a phrase can be derived from the meaning of its constituents and their grammatical relations. Prediction of (non)-compositionality is a task that has been frequently addressed with distributional semantic models. We introduce a novel technique to blend hierarchical information with distributional information for predicting compositionality. In particular, we use hypernymy information of the multiword and its constituents encoded in the form of the recently introduced Poincare embeddings in addition to the distributional information to detect compositionality for noun phrases. Using a weighted average of the distributional similarity and a Poincare similarity function, we obtain consistent and substantial, statistically significant improvement across three gold standard datasets over state-of-the-art models based on distributional information only. Unlike traditional approaches that solely use an unsupervised setting, we have also framed the problem as a supervised task, obtaining comparable improvements. Further, we publicly release our Poincare embeddings, which are trained on the output of handcrafted lexical-syntactic patterns on a large corpus.
机译:多字级的合成度表达表示短语的含义可以从其成分的含义和他们的语法关系中得出。 (非) - 符合性的预测是经常用分配语义模型解决的任务。我们介绍一种新颖的技术来混合具有用于预测合成性的分配信息的分级信息。特别是,除了分配信息之外,我们还使用以最近引入的Poincare Embeddings的形式编码的多字和其成分的复杂信息,以检测名词短语的组成性。使用分布相似性和庞观地相似函数的加权平均值,我们基于仅基于分布信息的最先进的模型,在三个金标准数据集中获得一致而实质的,统计上显着的改进。与单独使用无人监督设置的传统方法不同,我们还将问题框架作为监督任务,获得可比的改进。此外,我们公开发布我们的庞然军嵌入物,这些嵌入式训练在大型语料库上手工制作的词法句法模式的产出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号