Self-organizing cognitive agents and reinforcement learning in multi-agent environment

机译：自组织认知智能体和多智能体环境中的强化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value functions of the state-action space estimated through a temporal difference (TD) method. The learned value functions are then used to determine the optimal actions based on an action selection policy. We present a specific instance of TD-FALCON based on an e-greedy action policy and a Q-learning value estimation formula. Experiments based on a minefield navigation task and a minefield pursuit task show that TD-FALCON systems are able to adapt and function well in a multi-agent environment without an explicit mechanism for collaboration.

机译：本文提出了一种自组织的认知架构，称为TD-FALCON，该架构通过与环境的相互作用来学习其功能。 TD-FALCON学习通过时间差（TD）方法估计的状态-作用空间的值函数。然后，将学习值函数用于基于操作选择策略来确定最佳操作。我们基于电子贪婪行为策略和Q学习价值估计公式，给出了TD-FALCON的特定实例。基于雷场导航任务和雷场追踪任务的实验表明，TD-FALCON系统能够在多主体环境中适应并正常运行，而无需明确的协作机制。

著录项

来源
《Intelligent Agent Technology, IEEE/WIC/ACM International Conference on》|2005年|P.351-357|共7页
会议地点
作者
Tan A.-H.; Xiao D.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词
cognition; learning (artificial intelligence); multi-agent systems; self-organising feature maps; Q-learning value estimation formula; TD-FALCON; e-greedy action policy; minefield navigation task; minefield pursuit task; multiagent environment; reinforcement learn;

机译：认知;学习（人工智能）;多主体系统;自组织特征图; Q学习价值估计公式; TD-FALCON;电子贪婪行动策略;雷区导航任务;雷场追踪任务;多主体环境;强化学习;

相似文献

外文文献
中文文献
专利

1. Learning adversarial policy in multiple scenes environment via multi-agent reinforcement learning [J] . Li Yang, Wang Xinzhi, Wang Wei, Connection Science . 2021,第3期

机译：通过多功能钢筋学习在多个场景环境中学习对抗性政策
2. Energy-Efficient Resource Allocation in Cognitive Radio Networks Under Cooperative Multi-Agent Model-Free Reinforcement Learning Schemes [J] . Kaur Amandeep, Kumar Krishan IEEE transactions on network and service management . 2020,第3期

机译：在合作多剂型无代理模型加强学习计划下认知无线电网络中的节能资源分配
3. Multi-Agent Reinforcement Learning Based Opportunistic Routing and Channel Assignment for Mobile Cognitive Radio Ad Hoc Network [J] . Sunita S. Barve, Parag Kulkarni Mobile networks & applications . 2014,第6期

机译：基于多智能强化学习的移动认知无线电自组织网络的机会路由和信道分配
4. Self-organizing cognitive agents and reinforcement learning in multi-agent environment [C] . Tan A.-H., Xiao D. IEEE/WIC/ACM International Conference on Intelligent Agent Technology . 2005

机译：多助理环境中的自组织认知代理和加强学习
5. A Coordinated Reinforcement Learning Framework for Multi-Agent Virtual Environments. [D] . Sause, William J. 2013

机译：多代理虚拟环境的协作强化学习框架。
6. Multi-agent reinforcement learning with approximate model learning for competitive games [O] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim 2012

机译：多主体强化学习和近似模型学习的竞技游戏
7. Selforganizing cognitive agents and reinforcement learning in multi-agent environment [O] . Ah-hwee Tan, Dan Xiao 2005

机译：在多主体环境中自组织认知主体并加强学习

Self-organizing cognitive agents and reinforcement learning in multi-agent environment

摘要

著录项

相似文献

相关主题

期刊订阅