Transfer learning by prototype generation in continuous spaces

Munoz de Cote Enrique; Garcia Esteban O.; Morales Eduardo F.

Abstract

In machine learning, learning a task is expensive (many training samples are needed), and it is therefore of general interest to be able to reuse knowledge across tasks. This is the case in aerial robotics applications, where an autonomous aerial robot cannot interact with the environment hazard-free. Prototype generation is a well-known technique commonly used in supervised learning to help reduce the number of samples needed to learn a task. However, little is known about how such techniques can be used in a reinforcement learning task. In this work we propose an algorithm that, in order to learn a new (target) task, first generates new samples (prototypes) based on samples acquired previously in a known (source) task. The proposed approach uses Gaussian processes to learn a continuous multidimensional transition function, rendering the method capable of reasoning directly in continuous (state and action) domains. We base the prototype generation on a careful selection of a subset of samples from the source task (based on known filtering techniques) and on transforming such samples using the (little) knowledge acquired in the target task. Our experimental evidence, gathered in known reinforcement learning benchmark tasks as well as a challenging quadcopter-to-helicopter transfer task, suggests that prototype generation is feasible and, furthermore, that the filtering technique used is not as important as a correct transformation model.
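The pipeline the abstract describes (fit a Gaussian process transition model on a few target samples, filter the source samples, then transform them into prototypes) can be illustrated with a minimal sketch. The abstract does not give the kernel, the filtering technique, or the transformation model, so everything below is an assumption made for illustration: the names `rbf_kernel`, `GPTransitionModel`, and `generate_prototypes` are hypothetical, uniform subsampling stands in for the "known filtering techniques", and relabelling source state-action pairs with the target model's predicted next state stands in for the transformation. The toy 1-D dynamics are made up as well.

```python
import numpy as np


def rbf_kernel(A, B, length_scale=1.0, variance=1.0):
    """Squared-exponential kernel between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return variance * np.exp(-0.5 * np.maximum(sq_dists, 0.0) / length_scale**2)


class GPTransitionModel:
    """GP regression from (state, action) to the change in state.

    Hypothetical helper: fixed hyperparameters, posterior mean only.
    """

    def __init__(self, length_scale=1.0, noise=1e-2):
        self.length_scale = length_scale
        self.noise = noise

    def fit(self, states, actions, next_states):
        self.X = np.hstack([states, actions])
        deltas = next_states - states
        K = rbf_kernel(self.X, self.X, self.length_scale)
        K += self.noise * np.eye(len(self.X))
        # alpha = K^{-1} deltas, one column per state dimension
        self.alpha = np.linalg.solve(K, deltas)
        return self

    def predict(self, states, actions):
        X_new = np.hstack([states, actions])
        K_star = rbf_kernel(X_new, self.X, self.length_scale)
        # Posterior mean of the next state
        return states + K_star @ self.alpha


def generate_prototypes(source_states, source_actions, target_model, keep_every=2):
    """Filter source samples, then relabel them with the target model."""
    # Assumption: uniform subsampling as a stand-in for the paper's filters
    s = source_states[::keep_every]
    a = source_actions[::keep_every]
    # Assumption: the "transformation" is the target model's predicted outcome
    s_next = target_model.predict(s, a)
    return s, a, s_next


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy tasks: many source state-action pairs, only 20 target transitions.
    # In the made-up target task an action moves the state by 0.2 * action.
    src_s = rng.uniform(-1, 1, size=(200, 1))
    src_a = rng.uniform(-1, 1, size=(200, 1))
    tgt_s = rng.uniform(-1, 1, size=(20, 1))
    tgt_a = rng.uniform(-1, 1, size=(20, 1))
    tgt_next = tgt_s + 0.2 * tgt_a

    model = GPTransitionModel(noise=1e-4).fit(tgt_s, tgt_a, tgt_next)
    s, a, s_next = generate_prototypes(src_s, src_a, model, keep_every=4)
    print("prototypes:", s.shape[0])
    print("mean abs error vs. target dynamics:",
          np.abs(s_next - (s + 0.2 * a)).mean())
```

In this sketch the source task contributes only coverage of the state-action space, while the outcomes come from the target model. A reinforcement learner could then train on the generated tuples in addition to the few real target samples; how the paper actually combines them is not stated in the abstract.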


Bibliographic details

Source
Adaptive Behavior | 2016, Issue 6 | pp. 464-478 | 15 pages
Authors
Munoz de Cote Enrique; Garcia Esteban O.; Morales Eduardo F.
Author affiliations
Inst Nacl Astrofis Opt & Electr, Dept Comp Sci, Luis Enrique Erro 1, Puebla 72840, Mexico (all three authors)
Original format: PDF
Language: English
Keywords
Transfer learning; reinforcement learning; Gaussian processes; prototype generation
