
Bayesian Learning for Neural Dependency Parsing



Abstract

While neural dependency parsers provide state-of-the-art accuracy for several languages, they still rely on large amounts of costly labeled training data. We demonstrate that in the small data regime, where uncertainty around parameter estimation and model prediction matters the most, Bayesian neural modeling is very effective. In order to overcome the computational and statistical costs of the approximate inference step in this framework, we utilize an efficient sampling procedure via stochastic gradient Langevin dynamics to generate samples from the approximated posterior. Moreover, we show that our Bayesian neural parser can be further improved when integrated into a multi-task parsing and POS tagging framework, designed to minimize task interference via an adversarial procedure. When trained and tested on 6 languages with less than 5k training instances, our parser consistently outperforms the strong BiLSTM baseline (Kiperwasser and Goldberg, 2016). Compared with the BiAFFINE parser (Dozat et al., 2017), our model achieves an improvement of up to 3% for Vietnamese and Irish, while our multi-task model achieves an improvement of up to 9% across five languages: Farsi, Russian, Turkish, Vietnamese, and Irish.
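The sampler the abstract names, stochastic gradient Langevin dynamics (SGLD), perturbs each minibatch gradient step with Gaussian noise whose variance matches the step size, so the iterates become draws from an approximate posterior rather than a point estimate. The following is a minimal sketch of that update on a toy Bayesian linear-regression model, not the parser itself; the data, prior, and hyperparameters are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: y = 2x + noise, with noise std 0.5 (variance 0.25).
N = 200
x = rng.normal(size=N)
y = 2.0 * x + rng.normal(scale=0.5, size=N)

def grad_log_post(theta, xb, yb, scale):
    """Stochastic gradient of the log posterior for a single slope theta:
    standard-normal prior plus a minibatch likelihood gradient rescaled
    by N / batch_size, as SGLD requires."""
    prior_grad = -theta                            # d/d(theta) of log N(theta; 0, 1)
    resid = yb - theta * xb
    lik_grad = scale * np.sum(resid * xb) / 0.25   # likelihood noise variance 0.25
    return prior_grad + lik_grad

eps = 1e-4      # step size; also sets the injected-noise variance
batch = 20
theta = 0.0
samples = []
for t in range(5000):
    idx = rng.choice(N, size=batch, replace=False)
    g = grad_log_post(theta, x[idx], y[idx], N / batch)
    # SGLD update: half a gradient step plus N(0, eps) noise.
    theta = theta + 0.5 * eps * g + rng.normal(scale=np.sqrt(eps))
    if t >= 2000:  # discard burn-in, keep the rest as posterior samples
        samples.append(theta)

post_mean = float(np.mean(samples))
post_std = float(np.std(samples))
```

With a fixed step size this is a biased but cheap approximation to posterior sampling; the retained iterates concentrate around the true slope (2.0 here), and their spread is the quantity a small-data parser would use to express predictive uncertainty.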
