首页> 外文期刊>AI communications >Why the 'selfish' optimizing agents could solve the decentralized reinforcement learning problems
【24h】

Why the 'selfish' optimizing agents could solve the decentralized reinforcement learning problems

机译:为什么“自私”优化代理可以解决分散的强化学习问题

获取原文
获取原文并翻译 | 示例
       

摘要

Multidisciplinary Design Optimization (MDO) is a computational approach for optimizing design of a complex system of systems that require knowledge from multiple disciplines. In a former study, we explored and found that the individual discipline feasible (IDF), a type of MDO design technique, performed well in several benchmark test cases of decentralized Reinforcement Learning (RL) problems, in particular, stabilizing an unknown system. However, the earlier study was not able to resolve as to why the overall system of systems, even with strongly coupled systems, could be stabilized when each agent just focused on stabilizing itself. In this work, we make significant extension in resolving this behavior by conducting a theoretical analysis of the MDO solution of RL problems. Through the analysis, we show that with the proper control law, each MDO agent should be able to bring its state closer to the 0-stable point regardless of how the other agents' states impact the state of the whole system. This is the main reason why the 'selfish' MDO-IDF agents are successful in learning to stabilize the overall system. The simulation results, including benchmark test cases, verify our analysis. Therefore, we propose that the MDO would be a promising solution in many other decentralized RL problems.
机译:多学科设计优化(MDO)是一种用于优化需要从多个学科知识的复杂系统的设计的计算方法。在前面的研究中,我们探索并发现个体学科可行(IDF),一种MDO设计技术,在分散的加强学习(RL)问题的几个基准测试用例中表现良好,特别是稳定未知系统。然而,早期的研究无法解决为什么系统的整体系统,即使具有强烈耦合的系统,也可以稳定当每个代理专注于稳定本身时稳定。在这项工作中,我们通过对RL问题的MDO解决方案进行理论分析,在解决这种行为方面取得了重大的扩展。通过分析,我们表明,通过适当的控制法,每种MDO代理都应该能够将其州更接近0稳定的点,而不管其他代理商的国家如何影响整个系统的状态。这是“自私”MDO-IDF代理人成功学习稳定整个系统的主要原因。仿真结果,包括基准测试用例,验证我们的分析。因此,我们建议MDO将在许多其他分散的RL问题中成为一个有希望的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号