Heuristic dynamic programming with internal goal representation

Ni Z.; He H.

首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Heuristic dynamic programming with internal goal representation

【24h】

Heuristic dynamic programming with internal goal representation

机译：具有内部目标表示的启发式动态规划

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we analyze an internal goal structure based on heuristic dynamic programming, named GrHDP, to tackle the 2-D maze navigation problem. Classical reinforcement learning approaches have been introduced to solve this problem in literature, yet no intermediate reward has been assigned before reaching the final goal. In this paper, we integrated one additional network, namely goal network, into the traditional heuristic dynamic programming (HDP) design to provide the internal reward/goal representation. The architecture of our proposed approach is presented, followed by the simulation of 2-D maze navigation (10*10) problem. For fair comparison, we conduct the same simulation environment settings for the traditional HDP approach. Simulation results show that our proposed GrHDP can obtain faster convergent speed with respect to the sum of square error, and also achieve lower error eventually.

机译：在本文中，我们分析了基于启发式动态规划的内部目标结构GrHDP，以解决二维迷宫导航问题。为了解决文学中的这一问题，引入了经典的强化学习方法，但是在达到最终目标之前尚未分配任何中间奖励。在本文中，我们将另外一个网络（目标网络）集成到传统的启发式动态规划（HDP）设计中，以提供内部奖励/目标表示。介绍了我们提出的方法的体系结构，然后模拟了二维迷宫导航（10 * 10）问题。为了公平地比较，我们对传统的HDP方法进行相同的仿真环境设置。仿真结果表明，相对于平方误差之和，我们提出的GrHDP收敛速度更快，最终误差也较小。

著录项

来源
《Soft computing: A fusion of foundations, methodologies and applications》 |2013年第11期|共8页
作者
Ni Z.; He H.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词
Adaptive dynamic programming (ADP); Goal representation heuristic dynamic programming (GrHDP); Maze navigation/path planning; Reinforcement learning (RL);

机译：自适应动态规划（ADP）;目标表示启发式动态规划（GrHDP）;迷宫导航/路径规划;强化学习（RL）;

相似文献

外文文献
中文文献
专利

1. Heuristic dynamic programming with internal goal representation [J] . Ni Z., He H. Soft computing: A fusion of foundations, methodologies and applications . 2013,第11期

机译：具有内部目标表示的启发式动态规划
2. Goal Representation Heuristic Dynamic Programming on Maze Navigation [J] . Ni Z., He H., Wen J., Neural Networks and Learning Systems, IEEE Transactions on . 2013,第12期

机译：迷宫导航目标表示启发式动态规划
3. HEURISTIC MODELING FOR A DYNAMIC AND GOAL PROGRAMMING IN PRODUCTION PLANNING OF CONTINUOUS MANUFACTURING SYSTEMS [J] . JAHAN A, ABDOLSHAH M Chinese Journal of Mechanical Engineering . 2007,第5期

机译：连续制造系统生产计划中动态和目标规划的启发式建模
4. Supplementary damping control of VSC-HVDC for interarea oscillation using goal representation heuristic dynamic programming [C] . Yu Shen, Weibiao Chen, Wei Yao, International Conference on AC and DC Power Transmission . 2016

机译：使用目标表示启发式动态规划的VSC-HVDC补充阻尼控制区域间振荡
5. Goal representation adaptive dynamic programming for machine intelligence [D] . Ni, Zhen 2015

机译：用于机器智能的目标表示自适应动态编程
6. Dynamic integration of forward planning and heuristic preferences during multiple goal pursuit [O] . Florian Ott, Dimitrije Marković, Alexander Strobel, 2020

机译：在多目标追求中动态整合前瞻性计划和启发式偏好
7. Heuristic allocation based on a dynamic programming state-space representation [O] . Dragut, AB Andreea 2002

机译：基于动态编程状态空间表示的启发式分配

Heuristic dynamic programming with internal goal representation

摘要

著录项

相似文献

相关主题

期刊订阅