首页> 外文会议>11th Workshop on multiword expressions >Multiword Expression Identification with Recurring Tree Fragments and Association Measures
【24h】

Multiword Expression Identification with Recurring Tree Fragments and Association Measures

机译:带有重复树碎片的多词表达识别和关联度量

获取原文
获取原文并翻译 | 示例

摘要

We present a novel approach for the identification of multiword expressions (MWEs). The methodology extracts a large set of recurring syntactic fragments from a given treebank using a Tree-Kernel method. Differently from previous studies, the expressions underlying these fragments are arbitrarily long and can include intervening gaps. In the initial study we use these fragments to identify MWEs as a parsing task (in a supervised manner) as proposed by Green et al. (2011). Here we obtain a small improvement over previous results. In the second part, we compare various association measures in reranking the expressions underlying these fragments in an unsupervised fashion. We show how a newly defined measure (Log Inside Ratio) based on statistical parsing techniques is able to outperform classical association measures in the French data.
机译:我们提出了一种识别多词表达(MWE)的新颖方法。该方法使用树核方法从给定的树库中提取大量重复出现的句法片段。与以前的研究不同,这些片段背后的表达任意长,并且可能包含中间的间隙。在最初的研究中,我们使用这些片段将MWE识别为解析任务(以监督方式),这是Green等人提出的。 (2011)。在这里,我们对以前的结果进行了一些改进。在第二部分中,我们比较了各种关联度量,以无监督的方式重新排列了这些片段背后的表达。我们将展示基于统计分析技术的新定义的度量(对数内比)如何能够胜过法国数据中的经典关联度量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号