Partner Approximating Learners (PAL): Simulation-Accelerated Learning with Explicit Partner Modeling in Multi-Agent Domains

机译：合作伙伴近似学习者（PAL）：在多代理商域中使用明确的合作伙伴建模进行模拟加速学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mixed cooperative-competitive control scenarios where the interacting partners exhibit individual goals are very challenging for reinforcement learning agents. An example of such scenarios is given by human-machine interaction. In order to contribute towards intuitive human-machine collaboration, this work focuses on problems in the continuous state and control domain and prohibits explicit communication. More precisely, the agents do not know the others' goals or control laws but only sense their control inputs retrospectively. The proposed framework combines a partner model learned from online data with a reinforcement learning agent that is trained in a simulated environment including the partner model. This procedure overcomes drawbacks of independent learners and benefits from a reduced amount of real world data required for reinforcement learning—an aspect that is vital in the human-machine context. Experimental results reveal that the method learns fast due to the simulated environment and adapts to the constantly changing partner due of the partner model.

机译：对于强化学习代理而言，互动合作伙伴展现出各自目标的混合合作竞争控制场景是非常具有挑战性的。人机交互给出了这种情况的一个例子。为了促进直观的人机协作，这项工作着眼于连续状态和控制领域中的问题，并禁止进行明确的交流。更准确地说，代理人不知道其他人的目标或控制律，而只能追溯地感觉到他们的控制输入。提出的框架将从在线数据中学习到的合作伙伴模型与在包括合作伙伴模型的模拟环境中训练的强化学习代理相结合。此过程克服了独立学习者的弊端，并从减少的强化学习所需的真实世界数据中受益（这在人机环境中至关重要）。实验结果表明，该方法在模拟环境下学习很快，并且由于伙伴模型而适应不断变化的伙伴。

著录项

来源
《International Conference on Control, Automation and Robotics》|2020年|746-752|共7页
会议地点
作者
Florian Köpf; Alexander Nitsch; Michael Flad; Sören Hohmann;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
reinforcement learning; mixed cooperative-competitive control; machine learning in control; opponent modeling;

机译：强化学习;混合竞争竞争控制;控制中的机器学习;对手建模;

相似文献

外文文献
中文文献
专利

1. Modeling of Human Fatty Acid Synthase and &ITin Silico&IT Docking of Acyl Carrier Protein Domain and Its Partner Catalytic Domains [J] . Viegas Matilde F., Neves Rui P. P., Ramos Maria J., The journal of physical chemistry, B. Condensed matter, materials, surfaces, interfaces & biophysical . 2018,第1期

机译：人脂肪酸合成酶的建模及乙基载体蛋白域及其伴伴催化结构域的含硅及其对接
2. Multi-agent reinforcement learning with approximate model learning for competitive games [J] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim PLoS One . 2019,第9期

机译：竞争游戏近似模型学习多功能辅助加固学习
3. Shared domains of competence of approximate learning models using measures of separability of classes [J] . Luengo J., Herrera F. Information Sciences: An International Journal . 2012,第Null期

机译：使用类可分性度量的近似学习模型的能力共享域
4. Homology modeling of the DNA binding and dimerization partner domains of E2F1 transcription factor protein in homo sapiens [C] . Nayan Mohd Yasser, Jusoh Siti Azma, Mutalip Siti Syairah Mohd, 2012 IEEE Symposium on Business, Engineering and Industrial Applications. . 2012

机译：智人E2F1转录因子蛋白的DNA结合和二聚体伴侣结构域的同源性建模
5. How Trait and State Social Anxiety Impact Perceptions of Support when Sharing Good News with Romantic Partners: Using the Actor-Partner Interdependence Model to Explore Self-reports, Partner-reports, and Behavioral Observations. [D] . Ferssizidis, Panagiota. 2013

机译：与浪漫伴侣分享好消息时，特质和状态社交焦虑如何影响对支持感的认识：使用演员-伴侣相互依存模型探索自我报告，伴侣报告和行为观察。
6. Multi-agent reinforcement learning with approximate model learning for competitive games [O] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim 2012

机译：多主体强化学习和近似模型学习的竞技游戏
7. Di-XL – 4. setkání partnerů a konference “Libraries as Powerful Partners in Promoting Results of Lifelong Learning Projects”: Di-XL - 4th partners' meeting and conference “Libraries as Powerful Partners in Promoting Results of Lifelong Learning Projects” [O] . 2014

机译：Di-XL – 4.setkání合作伙伴ů纪念“图书馆作为促进终身学习项目成果的有力合作伙伴”：Di-XL-第四届合作伙伴会议和会议“图书馆作为促进终身学习项目成果的有力合作伙伴”

Partner Approximating Learners (PAL): Simulation-Accelerated Learning with Explicit Partner Modeling in Multi-Agent Domains

摘要

著录项

相似文献

相关主题

期刊订阅