Reinforcement Learning for RoboCup Soccer Keepaway

Peter Stone; Richard S. Sutton; Gregory Kuhlmann


Abstract

RoboCup simulated soccer presents many challenges to reinforcement learning methods, including a large state space, hidden and uncertain state, multiple independent agents learning simultaneously, and long and variable delays in the effects of actions. We describe our application of episodic SMDP Sarsa(λ) with linear tile-coding function approximation and variable λ to learning higher-level decisions in a keepaway subtask of RoboCup soccer. In keepaway, one team, "the keepers," tries to keep control of the ball for as long as possible despite the efforts of "the takers." The keepers learn individually when to hold the ball and when to pass to a teammate. Our agents learned policies that significantly outperform a range of benchmark policies. We demonstrate the generality of our approach by applying it to a number of task variations including different field sizes and different numbers of players on each team.
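
To illustrate the kind of learner the abstract names, the following is a minimal sketch of episodic Sarsa(λ) with linear tile-coding function approximation. It is not the authors' implementation: the names `TileCoder`, `SarsaLambdaAgent`, and `run_episode`, the one-dimensional corridor task, the fixed (not variable) λ, the undiscounted return, the replacing traces, and all parameter values are assumptions chosen for brevity. The SMDP aspect appears only in that `step` receives the total reward accumulated since the previous decision point.

```python
import random


class TileCoder:
    """Hypothetical 1-D tile coder: several offset tilings over [low, high)."""

    def __init__(self, low, high, tiles_per_tiling=8, num_tilings=4):
        self.low = low
        self.width = (high - low) / tiles_per_tiling
        self.tiles_per_tiling = tiles_per_tiling + 1  # extra tile absorbs the offset
        self.num_tilings = num_tilings
        self.num_features = self.tiles_per_tiling * num_tilings

    def active_tiles(self, x):
        """Return one active binary feature index per tiling."""
        indices = []
        for t in range(self.num_tilings):
            offset = t * self.width / self.num_tilings
            tile = int((x - self.low + offset) / self.width)
            tile = min(max(tile, 0), self.tiles_per_tiling - 1)
            indices.append(t * self.tiles_per_tiling + tile)
        return indices


class SarsaLambdaAgent:
    """Sarsa(lambda) with a linear value function over tile features.

    Assumptions: undiscounted episodic task, replacing traces, fixed lambda.
    """

    def __init__(self, coder, num_actions, alpha=0.1, lam=0.9, epsilon=0.05, seed=0):
        self.coder = coder
        self.num_actions = num_actions
        self.alpha = alpha / coder.num_tilings  # step size shared across tilings
        self.lam = lam
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.weights = [[0.0] * coder.num_features for _ in range(num_actions)]
        self.traces = {}  # sparse eligibility traces: (action, feature) -> value

    def q(self, tiles, action):
        return sum(self.weights[action][i] for i in tiles)

    def choose(self, tiles):
        """Epsilon-greedy selection; ties go to the lowest action index."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.num_actions)
        values = [self.q(tiles, a) for a in range(self.num_actions)]
        return values.index(max(values))

    def start_episode(self, state):
        self.traces.clear()
        self.tiles = self.coder.active_tiles(state)
        self.action = self.choose(self.tiles)
        return self.action

    def step(self, reward, next_state, done):
        """Update from the reward accumulated since the last decision point."""
        delta = reward - self.q(self.tiles, self.action)
        for i in self.tiles:
            self.traces[(self.action, i)] = 1.0  # replacing traces
        next_tiles, next_action = None, None
        if not done:
            next_tiles = self.coder.active_tiles(next_state)
            next_action = self.choose(next_tiles)
            delta += self.q(next_tiles, next_action)  # no discounting
        for (a, i), e in list(self.traces.items()):
            self.weights[a][i] += self.alpha * delta * e
            self.traces[(a, i)] = e * self.lam
        if not done:
            self.tiles, self.action = next_tiles, next_action
        return next_action


def run_episode(agent, start=0.5, step=0.1, max_steps=1000):
    """Toy corridor on [0, 1): action 1 moves right, action 0 moves left.

    Each decision costs -1 and the episode ends at the right edge.
    Returns the number of decisions taken (capped at max_steps).
    """
    x = start
    action = agent.start_episode(x)
    for t in range(1, max_steps + 1):
        x = max(0.0, x + (step if action == 1 else -step))
        done = x >= 1.0 - 1e-9
        action = agent.step(-1.0, x, done)
        if done:
            return t
    return max_steps
```

Calling `run_episode(SarsaLambdaAgent(TileCoder(0.0, 1.0), num_actions=2))` repeatedly on the same agent trains it on the toy task. Keepaway itself would replace the corridor with the keepers' state variables and hold/pass actions, which this sketch does not attempt to model.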


Bibliographic details

Source
Adaptive Behavior | 2005, No. 3 | pp. 165-188 | 24 pages
Authors
Peter Stone; Richard S. Sutton; Gregory Kuhlmann
Affiliation

Department of Computer Sciences, The University of Texas at Austin, 1 University Station C0500, Austin, TX 78712-0233, USA

Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Biological sciences
Keywords
multiagent systems; machine learning; multiagent learning; reinforcement learning; robot soccer


Similar literature


1. マルチエージェント連続タスクにおける報酬設計の実験的考察－RoboCup Soccer Keepaway タスクを例として [J]. 荒井幸代, 田中信行, Sachiyo Arai. 人工知能学会論文誌. 2006, No. 6

Translation: An Experimental Study of Reward Design in Multi-Agent Continuing Tasks: The RoboCup Soccer Keepaway Task as an Example
2. Learning RoboCup-Keepaway with Kernels [J]. Daniel Polani, Tobias Jung. JMLR: Workshop and Conference Proceedings. 2007, No. 2007

3. Reinforcement Learning with Case-Based Heuristics for RoboCup Soccer Keepaway [C]. Celiberto Jr. Luiz A., Matsuura Jackson P., Mantaras Ramon Lopez de. 2012 Brazilian Robotics Symposium and Latin American Robotics Symposium. 2012

4. A scene learning and recognition framework for RoboCup clients [D]. Lam, Kevin. 2005

5. Structure-Preserving Imitation Learning With Delayed Reward: An Evaluation Within the RoboCup Soccer 2D Simulation Environment [O]. Quang Dang Nguyen, Mikhail Prokopenko. 2020

6. Two steps reinforcement learning en robocup-soccer keepaway [O]. López-Bueno Hernández Iván. 2009
