首页> 外国专利> SYSTEM AND METHOD FOR TASK CONTROL BASED ON BAYESIAN META-REINFORCEMENT LEARNING

SYSTEM AND METHOD FOR TASK CONTROL BASED ON BAYESIAN META-REINFORCEMENT LEARNING

机译:基于贝叶斯元强化学习的任务控制系统和方法

摘要

Methods, systems, and apparatus, including computer programs encoded on computer storage media for task control based on Bayesian Meta-Reinforcement learning. An exemplary method includes obtaining a base machine learning (ML) model trained based on historical data collected from historical tasks. The base ML model represents a prior distribution of model parameters in a neural network representing control policies. The exemplary method further includes receiving observed data from a new control task; training a task-level ML model based on the base ML model and the observed data, wherein the task-level ML model represents a posterior distribution of the model parameters; sampling, based on the posterior distribution of the model parameters, a set of the model parameters that represent a control policy; and applying the control policy in performing the new control task.
机译:方法、系统和装置,包括在计算机存储介质上编码的用于基于贝叶斯元强化学习的任务控制的计算机程序。一种示例性方法包括获得基于从历史任务收集的历史数据训练的基本机器学习(ML)模型。基本ML模型表示表示控制策略的神经网络中模型参数的先验分布。该示例性方法还包括从新的控制任务接收观察到的数据;基于基本ML模型和观察数据训练任务级ML模型,其中任务级ML模型表示模型参数的后验分布;基于模型参数的后验分布,采样表示控制策略的一组模型参数;以及在执行新的控制任务时应用控制策略。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号