首页> 外国专利> MULTI-AGENT DEEP REINFORCEMENT LEARNING FOR DYNAMICALLY CONTROLLING ELECTRICAL EQUIPMENT IN BUILDINGS

MULTI-AGENT DEEP REINFORCEMENT LEARNING FOR DYNAMICALLY CONTROLLING ELECTRICAL EQUIPMENT IN BUILDINGS

机译：多功能深度加固，用于在建筑物中动态控制电气设备的深度钢筋学习

页面导航

摘要
著录项
相似文献

摘要

Reinforcement Learning agent interacting with a real-world building to determine optimal policy may not be viable due to comfort constraints. Embodiments of the present disclosure provide multi-deep agent RL for dynamically controlling electrical equipment in buildings, wherein a simulation model is generated using design specification of (i) controllable electrical equipment (or subsystem) and (ii) building. Each RL agent is trained using simulation model and deployed in the subsystem. Reward function for each subsystem includes some portion of reward from other subsystem(s). Based on reward function of each RL agent, each RL agent learns an optimal control parameter during execution of RL agent in subsystem. Further, a global optimal control parameter list is generated using the optimal control parameter. The control parameters in the global optimal control parameters list are fine-tuned to improve subsystem's performance. Information on fine-tuning parameters of the subsystem and reward function are used for training RL agents.

机译：由于舒适的限制，加固学习代理与真实建筑物的互动以确定最佳政策可能不会是可行的。本公开的实施例提供了用于在建筑物中动态控制电气设备的多深代理R1，其中使用（i）可控电气设备（或子系统）和（ii）建筑物的设计规范产生仿真模型。每个RL代理使用仿真模型进行培训并在子系统中部署。每个子系统的奖励函数包括来自其他子系统的一些奖励。基于每个RL代理的奖励功能，每个RL代理在子系统中执行RL代理期间的最佳控制参数。此外，使用最优控制参数生成全局最佳控制参数列表。全局最优控制参数列表中的控制参数是微调的，以提高子系统的性能。有关子系统和奖励功能的微调参数的信息用于培训RL代理。

著录项

公开/公告号US2021200163A1

专利类型
公开/公告日2021-07-01

原文格式PDF
申请/专利权人 TATA CONSULTANCY SERVICES LIMITED;
展开▼

申请/专利号US202017029788
发明设计人 SRINARAYANA NAGARATHINAM;AVINASH ACHAR;ARUNCHANDAR VASAN;
展开▼

申请日2020-09-23
分类号G05B13/02;F24F11/62;
国家 US
入库时间 2022-08-24 19:42:26

相似文献

专利
外文文献
中文文献