...
首页> 外文期刊>Electric power systems research >Actor-critic learning for optimal building energy management with phase change materials
【24h】

Actor-critic learning for optimal building energy management with phase change materials

机译:演员 - 评论评论对相变材料的最佳建筑能源管理学习

获取原文
获取原文并翻译 | 示例
           

摘要

Energy management in buildings using phase change materials (PCM) to improve thermal performance is challenging due to the nonlinear thermal capacity of the PCM. To address this problem, this paper adopts a model-free actor-critic on-policy reinforcement learning method based on deep deterministic policy gradient (DDPG). The proposed approach overcomes the major weakness of model-based approaches, such as approximate dynamic programming (ADP), which require an explicit thermal model of the building under control. This requirement makes a plug-and-play implementation of the energy management algorithm in an existing smart meter difficult due to the wide variety of building design and construction types. To overcome this difficulty, we use a DDPG algorithm that can learn policies in continuous action spaces without access to the full dynamics of the building. We demonstrate the competitive performance of DDPG by benchmarking it against an ADP-based approach with access to the full thermal dynamics of the building.
机译:由于PCM的非线性热容量,使用相变材料(PCM)来提高热性能的建筑物中的能源管理是具有挑战性的。为了解决这个问题,本文采用基于深度确定性政策梯度(DDPG)的无模型演员批评者对政策加强学习方法。该方法克服了基于模型的方法的主要弱点,例如近似动态编程(ADP),这需要控制的建筑物的明确热模型。由于各种建筑设计和施工类型,这一要求在现有的智能仪表中进行了能量管理算法的即插即用实施。为了克服这种困难,我们使用DDPG算法,可以在连续动作空间中学习策略而无需访问建筑物的完整动态。我们通过基于基于ADP的方法来展示DDPG的竞争性能,可以访问建筑物的全部热动态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号