首页> 外文会议>Workshop on multiword expressions >Building a Lexicon of Formulaic Language for Language Learners
【24h】

Building a Lexicon of Formulaic Language for Language Learners

机译:建立一个用于语言学习者的公式语言词典

获取原文

摘要

Though the multiword lexicon has long been of interest in computational linguistics, most relevant work is targeted at only a small portion of it. Our work is motivated by the needs of learners for more comprehensive resources reflecting formulaic language that goes beyond what is likely to be codified in a dictionary. Working from an initial sequential segmentation approach, we present two enhancements: the use of a new measure to promote the identification of lexicalized sequences, and an expansion to include sequences with gaps. We evaluate using a novel method that allows us to calculate an estimate of recall without a reference lexicon, showing that good performance in the second enhancement depends crucially on the first, and that our lexicon conforms much more with human judgment of formulaic language than alternatives.
机译:虽然众多lexicon长期以来对计算语言学感兴趣,但大多数相关的工作都仅针对它的一小部分。我们的工作受到学习者的需求,了解更全面的资源,反映了超出字典中可能编纂的公式语言的公式语言。从初始顺序分割方法工作,我们提出了两种增强功能:使用新措施来促进识别词汇化序列,以及扩展以包括具有间隙的序列。我们使用一种新的方法评估,该方法允许我们在没有参考词典的情况下计算召回的估计,表明第二个增强中的良好性能依赖于第一,并且我们的词典与替代方案相比,我们的词汇符合人为判断。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号