Using Morphological Knowledge in Open-Vocabulary Neural Language Models

Abstract

Languages with productive morphology pose problems for language models that generate words from a fixed vocabulary. Although character-based models allow any possible word type to be generated, they are linguistically naive: they must discover that words exist and are delimited by spaces, basic linguistic facts that are built into the structure of word-based models. We introduce an open-vocabulary language model that incorporates more sophisticated linguistic knowledge by predicting words using a mixture of three generative processes: (1) generating words as a sequence of characters, (2) directly generating full word forms, and (3) generating words as a sequence of morphemes that are combined using a hand-written morphological analyzer. Experiments on Finnish, Turkish, and Russian show that our model outperforms character sequence models and other strong baselines on intrinsic and extrinsic measures. Furthermore, we show that our model learns to exploit morphological knowledge encoded in the analyzer, and, as a byproduct, it can perform effective unsupervised morphological disambiguation.
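To make the mixture structure concrete, the sketch below shows one way the per-word probability could be assembled from the three generative processes named in the abstract. This is a minimal illustration, not the authors' implementation: the class name, the gating layer, and the assumption that the character-level, word-level, and morpheme-level log-probabilities are computed elsewhere and passed in are all choices made here for exposition.

```python
# Minimal sketch (assumed structure, not the paper's code): combine three
# component log-probabilities for one word into a single mixture log-probability.
# In the paper, each component is its own neural generator, and the morpheme
# score would be marginalized over analyses from the morphological analyzer.
import torch
import torch.nn as nn


class ThreeWayMixture(nn.Module):
    def __init__(self, hidden_dim: int):
        super().__init__()
        # Gating layer: maps the LM context vector h to mixture weights over
        # (1) the character generator, (2) the word generator, (3) the morpheme generator.
        self.gate = nn.Linear(hidden_dim, 3)

    def forward(self, h: torch.Tensor,
                log_p_chars: torch.Tensor,
                log_p_word: torch.Tensor,
                log_p_morphs: torch.Tensor) -> torch.Tensor:
        """Return log p(word | h) = log sum_k gate_k(h) * p_k(word | h)."""
        log_gate = torch.log_softmax(self.gate(h), dim=-1)            # shape [3]
        components = torch.stack([log_p_chars, log_p_word, log_p_morphs])
        return torch.logsumexp(log_gate + components, dim=0)


# Toy usage with made-up numbers: for an out-of-vocabulary word the direct
# word-form process contributes nothing (log-prob -inf), so the mixture
# falls back on the character and morpheme processes.
mix = ThreeWayMixture(hidden_dim=8)
h = torch.randn(8)
log_p = mix(h,
            log_p_chars=torch.tensor(-12.0),         # from a character-level generator
            log_p_word=torch.tensor(float("-inf")),  # OOV: no full word form available
            log_p_morphs=torch.tensor(-9.5))         # summed over analyzer analyses
print(log_p.item())
```

Because the combination is done with `logsumexp`, a component that cannot generate the word (log-probability of negative infinity) simply drops out of the mixture, which is how an open vocabulary can coexist with a closed word-level softmax.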
