Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing

Few-Shot and Zero-Shot Learning for Historical Text Normalization

Abstract

Historical text normalization often relies on small training datasets. Recent work has shown that multi-task learning can lead to significant improvements by exploiting synergies with related datasets, but there has been no systematic study of different multi-task learning architectures. This paper evaluates 63 multi-task learning configurations for sequence-to-sequence-based historical text normalization across ten datasets from eight languages, using autoencoding, grapheme-to-phoneme mapping, and lemmatization as auxiliary tasks. We observe consistent, significant improvements across languages when training data for the target task is limited, but minimal or no improvements when training data is abundant. We also show that zero-shot learning outperforms the simple, but relatively strong, identity baseline.
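
As a concrete illustration of the setup the abstract describes, here is a minimal sketch (not the authors' implementation) of multi-task training for a character-level sequence-to-sequence normalizer, assuming PyTorch. The shared encoder-decoder, the per-task start symbols (SOS_NORM, SOS_AE, SOS_G2P, SOS_LEM), and the fake_batch data stub are all illustrative assumptions; the identity_accuracy function at the end sketches the identity baseline mentioned above, which simply predicts each historical form unchanged.

import random
import torch
import torch.nn as nn

PAD, SOS_NORM, SOS_AE, SOS_G2P, SOS_LEM = range(5)  # illustrative special symbols
VOCAB = 64  # hypothetical character vocabulary size

class Seq2Seq(nn.Module):
    """Character-level encoder-decoder shared across the main and auxiliary tasks."""
    def __init__(self, vocab=VOCAB, dim=128):
        super().__init__()
        self.emb = nn.Embedding(vocab, dim, padding_idx=PAD)
        self.enc = nn.GRU(dim, dim, batch_first=True)
        self.dec = nn.GRU(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, vocab)

    def forward(self, src, tgt_in):
        _, h = self.enc(self.emb(src))        # encode source characters
        y, _ = self.dec(self.emb(tgt_in), h)  # teacher-forced decoding
        return self.out(y)

def fake_batch(sos, n=8, slen=10):
    """Stand-in for a real data loader: random (src, tgt_in, tgt_out) tensors.
    tgt_in begins with the task's start symbol, so one decoder serves all tasks."""
    src = torch.randint(5, VOCAB, (n, slen))
    tgt = torch.randint(5, VOCAB, (n, slen))
    tgt_in = torch.cat([torch.full((n, 1), sos), tgt[:, :-1]], dim=1)
    return src, tgt_in, tgt

model = Seq2Seq()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss(ignore_index=PAD)
tasks = [SOS_NORM, SOS_AE, SOS_G2P, SOS_LEM]  # normalization + three auxiliaries

for it in range(100):
    sos = random.choice(tasks)  # interleave batches across tasks
    src, tgt_in, tgt_out = fake_batch(sos)
    loss = loss_fn(model(src, tgt_in).reshape(-1, VOCAB), tgt_out.reshape(-1))
    opt.zero_grad(); loss.backward(); opt.step()

def identity_accuracy(pairs):
    """Identity baseline: output each historical form unchanged and measure
    the fraction of tokens that already equal the modern form."""
    return sum(hist == modern for hist, modern in pairs) / len(pairs)

Conditioning one shared decoder on a task symbol is only one simple way to share parameters; the 63 configurations the paper evaluates cover a range of multi-task architectures, so this sketch should be read as an illustration of the general recipe rather than any specific configuration from the paper.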
