首页> 外文期刊>Performance evaluation review >Information Aggregation for Constrained Online Control
【24h】

Information Aggregation for Constrained Online Control

机译:约束在线控制的信息聚合

获取原文
获取原文并翻译 | 示例
           

摘要

We consider a two-controller online control problem where a central controller chooses an action from a feasible set that is determined by time-varying and coupling constraints, which depend on all past actions and states. The central controller's goal is to minimize the cumulative cost; however, the controller has access to neither the feasible set nor the dynamics directly, which are determined by a remote local controller. Instead, the central controller receives only an aggregate summary of the feasibility information from the local controller, which does not know the system costs. We show that it is possible for an online algorithm using feasibility information to nearly match the dynamic regret of an online algorithm using perfect information whenever the feasible sets satisfy a causal invariance criterion and there is a sufficiently large prediction window size. To do so, we use a form of feasibility aggregation based on entropic maximization in combination with a novel online algorithm, named Penalized Predictive Control (PPC).
机译:我们考虑一个双控制器在线控制问题,其中中央控制器从可行的集合选择由时变和耦合约束确定的可行集合,这取决于所有过去的动作和状态。中央控制力局的目标是最大限度地减少累计成本;然而,控制器可以通过远程本地控制器决定,控制器可以直接访问可行的SET和动态。相反,中央控制器仅接收来自本地控制器的可行性信息的聚合概述,这不知道系统成本。我们表明,只要可行的组满足因果不变性标准,可以使用可行性信息使用可行性信息与在线算法的动态遗憾地匹配的动态遗忘。为此,我们使用基于熵最大化的一种可行性聚合,结合新颖的在线算法,命名为惩罚预测控制(PPC)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号