首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming
【24h】

GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming

机译:GrDHP:双重启发式动态规划的通用效用函数表示

获取原文
获取原文并翻译 | 示例
           

摘要

A general utility function representation is proposed to provide the required derivable and adjustable utility function for the dual heuristic dynamic programming (DHP) design. Goal representation DHP (GrDHP) is presented with a goal network being on top of the traditional DHP design. This goal network provides a general mapping between the system states and the derivatives of the utility function. With this proposed architecture, we can obtain the required derivatives of the utility function directly from the goal network. In addition, instead of a fixed predefined utility function in literature, we conduct an online learning process for the goal network so that the derivatives of the utility function can be adaptively tuned over time. We provide the control performance of both the proposed GrDHP and the traditional DHP approaches under the same environment and parameter settings. The statistical simulation results and the snapshot of the system variables are presented to demonstrate the improved learning and controlling performance. We also apply both approaches to a power system example to further demonstrate the control capabilities of the GrDHP approach.
机译:提出了一种通用效用函数表示法,为双重启发式动态规划(DHP)设计提供所需的可推导和可调效用函数。目标表示DHP(GrDHP)的目标网络位于传统DHP设计之上。该目标网络提供了系统状态与效用函数的导数之间的一般映射。利用这种提议的体系结构,我们可以直接从目标网络获得效用函数的所需导数。另外,代替文献中固定的预定义效用函数,我们对目标网络进行在线学习,以便可以随时间自适应地调整效用函数的派生。我们在相同的环境和参数设置下提供了建议的GrDHP和传统DHP方法的控制性能。给出了统计仿真结果和系统变量的快照,以演示改进的学习和控制性能。我们还将这两种方法都应用于电力系统示例,以进一步证明GrDHP方法的控制能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号