【24h】

Continual and Multi-Task Architecture Search

机译:持续和多任务架构搜索

获取原文

摘要

Architecture search is the process of automatically learning the neural model or cell structure that best suits the given task. Recently, this approach has shown promising performance improvements (on language modeling and image classification) with reasonable training speed, using a weight sharing strategy called Efficient Neural Architecture Search (ENAS). In our work, we first introduce a novel continual architecture search (CAS) approach, so as to continually evolve the model parameters during the sequential training of several tasks, without losing performance on previously learned tasks (via block-sparsity and orthogonality constraints), thus enabling life-long learning. Next, we explore a multi-task architecture search (MAS) approach over ENAS for finding a unified, single cell structure that performs well across multiple tasks (via joint controller rewards), and hence allows more generalizable transfer of the cell structure knowledge to an unseen new task. We empirically show the effectiveness of our sequential continual learning and parallel multi-task learning based architecture search approaches on diverse sentence-pair classification tasks (GLUE) and multimodal-generation based video captioning tasks. Further, we present several ablations and analyses on the learned cell structures.~1
机译:建筑搜索是自动学习神经网络模型或细胞结构,最适合给定任务的过程。近来,这种方法已经显示出有前途的性能改进(对语言模型和图片分类)有合理的训练速度,采用一种叫做高效的神经结构搜索(ENAS)重量共享战略。在我们的工作中,我们首先引入新的持续的架构搜索(CAS)的方式,以几个任务的顺序训练中不断演变的模型参数,而不会失去对以前学过的任务的性能(通过块稀疏性和正交性约束),从而使终身学习。接下来,我们探索了多任务架构搜索超过ENAS(MAS)的方法寻找一个统一的,单细胞的结构,以及跨多个任务执行(通过联合控制器奖励),从而使细胞结构知识更普及转移到看不见的新任务。我们经验上展示我们的顺序不断学习的有效性和并行多任务学习基础架构的搜索方法不同的上句对分类任务(胶)和基于多代视频字幕任务。此外,我们提出几个消融和分析上了解到的细胞结构。〜1

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号