首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Reliability-aware Dynamic Feature Composition for Name Tagging
【24h】

Reliability-aware Dynamic Feature Composition for Name Tagging

机译:可靠性感知名称标记的动态功能组合

获取原文

摘要

While word embcddings are widely used for a variety of tasks and substantially improve the performance, their quality is not consistent throughout the vocabulary due to the long-tail distribution of word frequency. Without sufficient contexts, embeddings of rare words are usually less reliable than those of common words. However, current models typically trust all word embeddings equally regardless of their reliability and thus may introduce noise and hurt the performance. Since names often contain rare and unknown words, this problem is particularly critical for name tagging. In this paper, we propose a novel reliability-aware name tagging model to tackle this issue. We design a set of word frequency-based reliability signals to indicate the quality of each word embedding. Guided by the reliability signals, the model is able to dynamically select and compose features such as word embedding and character-level representation using gating mechanisms. For example, if an input word is rare, the model relies less on its word embedding and assigns higher weights to its character and contextual features. Experiments on OntoNotes 5.0 show that our model outperforms the baseline model, obtaining up to 6.2% absolute gain in F-score. In cross-genre experiments on six genres in OntoNotes, our model improves the performance for most genre pairs and achieves 2.3% absolute F-score gain on average.~1
机译:虽然Word Embcddings广泛用于各种任务并大大提高性能,但由于单词频率的长尾分布,它们的质量在整个词汇中都不是一致的。没有足够的背景,罕见单词的嵌入通常不如普通词的那些。然而,当前模型通常相应地信任所有单词嵌入式,无论其可靠性如何,都可能引入噪音并损害性能。由于名称通常包含稀有和未知的单词,因此此问题对于名称标记尤为重要。在本文中,我们提出了一种新颖的可靠性感知名称标记模型来解决这个问题。我们设计了一组基于词频率的可靠性信号,以指示每个单词嵌入的质量。由可靠性信号引导,模型能够动态地选择和撰写使用门控机制的字嵌入和字符级表示等特征。例如,如果输入字是罕见的,则该模型依赖于其单词嵌入并为其字符和上下文特征分配更高权重。 Ontonotes 5.0的实验表明,我们的模型优于基线模型,从得分中获得高达6.2%的绝对增益。在Ontonotes的六种类型的交叉类型实验中,我们的模型提高了大多数类型对的性能,平均实现了2.3%的绝对F分数。〜1

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号