The forward modeling approach of M.I. Jordan and D.E. Rumelhart (1990) makes supervised learning methods applicable to reinforcement learning tasks: a learned model of the environment supplies the error gradients that supervised training requires. Because such tasks are also natural candidates for reinforcement learning methods, the relative merits of the two approaches need to be evaluated on reinforcement learning tasks. The author presents one such comparison, on a task that involves learning to control an unstable, nonminimum-phase dynamic system. In this comparison, the reinforcement learning method outperforms the supervised learning method. An examination of the learning behavior of the two methods indicates that the difference in performance stems from the underlying mechanics of the two learning methods, which suggests that similar performance differences can be expected on other reinforcement learning tasks as well.
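To make the forward-modeling idea concrete, the following is a minimal sketch, not the paper's actual experiment: a one-dimensional linear plant (the true coefficients `a_true`, `b_true`, the targets, and the learning rates are all illustrative assumptions) on which a forward model is first fit by supervised regression, and a controller is then trained by backpropagating the distal error through the frozen model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 1-D plant: next_x = a*x + b*u.
# The true coefficients are unknown to the learner.
a_true, b_true = 1.2, 0.5

def plant(x, u):
    return a_true * x + b_true * u

# Stage 1: fit a forward model (a_hat, b_hat) of the plant
# by ordinary supervised regression on observed transitions.
a_hat, b_hat = 0.0, 0.0
lr = 0.05
for _ in range(2000):
    x = rng.uniform(-1, 1)
    u = rng.uniform(-1, 1)
    y = plant(x, u)                # observed next state
    err = (a_hat * x + b_hat * u) - y
    a_hat -= lr * err * x          # LMS gradient step
    b_hat -= lr * err * u

# Stage 2: train a linear controller u = k*x through the frozen
# forward model, backpropagating the distal error (next_x - target)^2
# via the model rather than via the (unknown) plant.
k, target = 0.0, 0.0
for _ in range(2000):
    x = rng.uniform(-1, 1)
    u = k * x
    err = (a_hat * x + b_hat * u) - target
    # d(err^2)/dk = 2 * err * b_hat * x   (gradient through the model)
    k -= lr * 2 * err * b_hat * x

# If the model is accurate, the closed loop |a_true + b_true*k| < 1
# is stabilizing even though the learner never saw a_true or b_true.
```

The two-stage structure is the point of contrast with reinforcement learning: the supervised route needs the intermediate model to convert a distal outcome error into a parameter gradient, whereas a reinforcement learner adjusts the controller directly from an evaluative signal.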