An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models

Abstract

A growing number of state-of-the-art transfer learning methods employ language models pretrained on large generic corpora. In this paper we present a conceptually simple and effective transfer learning approach that addresses the problem of catastrophic forgetting. Specifically, we combine the task-specific optimization function with an auxiliary language model objective, which is adjusted during the training process. This preserves language regularities captured by language models, while enabling sufficient adaptation for solving the target task. Our method does not require pre-training or fine-tuning separate components of the network, and we train our models end-to-end in a single step. We present results on a variety of challenging affective and text classification tasks, surpassing well-established transfer learning methods of greater complexity.
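The joint objective described in the abstract, a task-specific loss combined with an auxiliary language-modeling loss whose contribution is adjusted over training, can be sketched as follows. This is a minimal PyTorch illustration of that idea under stated assumptions, not the authors' implementation: the exponential decay schedule and the names `gamma_0` and `decay` are hypothetical choices for how the auxiliary weight might be annealed.

```python
import torch.nn as nn

# Two loss terms: one for the downstream task (e.g. sentence-level
# classification) and one for next-token language modeling.
task_criterion = nn.CrossEntropyLoss()
lm_criterion = nn.CrossEntropyLoss()

def joint_loss(task_logits, task_labels, lm_logits, lm_targets, step,
               gamma_0=0.2, decay=0.99):
    """Task loss plus an exponentially decayed auxiliary LM loss.

    gamma_0 and decay are illustrative hyperparameters: the LM term
    starts at weight gamma_0 and shrinks each step, so the model keeps
    the pretrained language regularities early on while gradually
    focusing on the target task.
    """
    gamma = gamma_0 * (decay ** step)
    l_task = task_criterion(task_logits, task_labels)
    # Flatten (batch, seq_len, vocab) logits to token-level predictions.
    l_lm = lm_criterion(lm_logits.view(-1, lm_logits.size(-1)),
                        lm_targets.view(-1))
    return l_task + gamma * l_lm
```

Because both terms are summed into a single loss, the network trains end-to-end in one step, with no separately pretrained or fine-tuned components, matching the training setup the abstract describes.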
