International Conference on Computational Linguistics

Semi-supervised Domain Adaptation for Dependency Parsing via Improved Contextualized Word Representations



Abstract

In recent years, parsing performance on in-domain texts has improved dramatically thanks to the rapid progress of deep neural network models. The major challenge for current parsing research is to improve parsing performance on out-of-domain texts that differ greatly from the in-domain training data, when only a small amount of labeled out-of-domain data is available. To address this problem, we propose to improve contextualized word representations via adversarial learning and a BERT fine-tuning process. Concretely, we apply adversarial learning to three representative semi-supervised domain adaptation methods, i.e., direct concatenation (CON), feature augmentation (FA), and domain embedding (DE), together with two useful strategies, i.e., fused target-domain word representations and orthogonality constraints, enabling the model to learn purer yet more effective domain-specific and domain-invariant representations. Simultaneously, we use large-scale unlabeled target-domain data to fine-tune BERT with only the language model loss, obtaining reliable contextualized word representations that benefit cross-domain dependency parsing. Experiments on a benchmark dataset show that our proposed adversarial approaches achieve consistent improvements, and that fine-tuning BERT further boosts parsing accuracy by a large margin. Our single model achieves the same state-of-the-art performance as the top system submitted to the NLPCC-2019 shared task, which uses ensemble models and BERT.
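
To make the shared/private setup concrete, the sketch below shows one common way to combine an adversarial domain classifier with an orthogonality penalty in PyTorch. This is a minimal illustration, not the authors' implementation: the module names, linear projections, dimensions, and loss weights are assumptions (the paper's encoders operate over contextualized word representations inside a parser), and gradient reversal is one standard way to realize the adversarial objective; the ||H_sh^T H_pr||_F^2 penalty is the usual form of an orthogonality constraint between domain-invariant and domain-specific subspaces.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GradReverse(torch.autograd.Function):
    """Identity in the forward pass; negates (and scales) gradients on the
    backward pass, so the shared encoder learns to fool the domain classifier."""
    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_out):
        return -ctx.lambd * grad_out, None

class SharedPrivateEncoder(nn.Module):
    """Domain-invariant (shared) and domain-specific (private) projections
    over contextualized word vectors; all dimensions here are placeholders."""
    def __init__(self, in_dim=768, hid=256, n_domains=2):
        super().__init__()
        self.shared = nn.Linear(in_dim, hid)
        self.private = nn.Linear(in_dim, hid)
        self.domain_clf = nn.Linear(hid, n_domains)

    def forward(self, x, domain, lambd=1.0):
        h_sh = torch.tanh(self.shared(x))    # (batch, hid) domain-invariant
        h_pr = torch.tanh(self.private(x))   # (batch, hid) domain-specific
        # Adversarial loss: the classifier predicts the domain from shared
        # features, while gradient reversal pushes them to erase domain cues.
        logits = self.domain_clf(GradReverse.apply(h_sh, lambd))
        adv_loss = F.cross_entropy(logits, domain)
        # Orthogonality constraint ||H_sh^T H_pr||_F^2: keeps the private
        # subspace from duplicating what the shared subspace already encodes.
        ortho_loss = (h_sh.t() @ h_pr).pow(2).sum()
        return torch.cat([h_sh, h_pr], dim=-1), adv_loss, ortho_loss

# Toy usage: 8 source-domain and 8 target-domain word vectors.
enc = SharedPrivateEncoder()
x = torch.randn(16, 768)
domain = torch.tensor([0] * 8 + [1] * 8)
rep, adv, ortho = enc(x, domain)
loss = adv + 0.01 * ortho   # weights are placeholders; the parser loss is added here
loss.backward()
```

The concatenated output `rep` plays the role of the combined domain-specific plus domain-invariant representation that would feed the downstream parser; CON, FA, and DE differ in how source- and target-domain data share these components, not in the adversarial machinery itself.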
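The BERT fine-tuning step, continuing masked language model training on unlabeled target-domain text before plugging BERT into the parser, can be sketched with the HuggingFace transformers library. A minimal sketch, assuming one raw sentence per line in `target_domain.txt` and a Chinese BERT checkpoint (the NLPCC-2019 shared task data is Chinese); the file path, checkpoint name, and hyperparameters are placeholders, not the paper's settings. Using `BertForMaskedLM` means only the masked language model loss is optimized, matching the description above.

```python
from datasets import load_dataset
from transformers import (BertForMaskedLM, BertTokenizerFast,
                          DataCollatorForLanguageModeling,
                          Trainer, TrainingArguments)

tokenizer = BertTokenizerFast.from_pretrained("bert-base-chinese")
model = BertForMaskedLM.from_pretrained("bert-base-chinese")

# Unlabeled target-domain text, one sentence per line (hypothetical path).
raw = load_dataset("text", data_files={"train": "target_domain.txt"})["train"]
tokenized = raw.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=128),
    batched=True, remove_columns=["text"])

# Dynamic masking at the standard 15% rate; only the MLM loss is used.
collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer, mlm=True, mlm_probability=0.15)

args = TrainingArguments(output_dir="bert-target-mlm",
                         per_device_train_batch_size=32,
                         num_train_epochs=3,
                         learning_rate=5e-5)
Trainer(model=model, args=args,
        train_dataset=tokenized, data_collator=collator).train()

# The adapted encoder then supplies contextualized word representations
# to the cross-domain dependency parser.
model.save_pretrained("bert-target-mlm/final")
```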
