SYSTEM AND METHOD FOR TASK CONTROL BASED ON BAYESIAN META-REINFORCEMENT LEARNING

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for task control based on Bayesian meta-reinforcement learning. An exemplary method includes obtaining a base machine learning (ML) model trained on historical data collected from historical tasks. The base ML model represents a prior distribution of model parameters in a neural network representing control policies. The exemplary method further includes receiving observed data from a new control task; training a task-level ML model based on the base ML model and the observed data, wherein the task-level ML model represents a posterior distribution of the model parameters; sampling, based on the posterior distribution of the model parameters, a set of the model parameters that represents a control policy; and applying the control policy in performing the new control task.
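
The prior, posterior, sample, apply sequence in the abstract can be illustrated with a deliberately small sketch. It is not the patented method: a single scalar gain `w` stands in for the neural-network parameters, and a conjugate Gaussian update stands in for whatever training procedure the patent uses. The function names, the noise variance, and all data values below are hypothetical and chosen only for illustration.

```python
import random
import statistics
from dataclasses import dataclass


@dataclass
class Gaussian:
    """Belief over one scalar policy parameter w (toy stand-in for NN weights)."""
    mean: float
    var: float


def fit_prior(historical_weights):
    """Base model: summarize per-task weights from historical tasks as a prior."""
    return Gaussian(statistics.mean(historical_weights),
                    statistics.variance(historical_weights))


def posterior(prior, observations, noise_var=0.25):
    """Task-level model: conjugate update for y = w * x + noise,
    with noise ~ N(0, noise_var). noise_var is an assumed constant."""
    precision = 1.0 / prior.var + sum(x * x for x, _ in observations) / noise_var
    mean = (prior.mean / prior.var
            + sum(x * y for x, y in observations) / noise_var) / precision
    return Gaussian(mean, 1.0 / precision)


def sample_policy(belief, rng):
    """Draw one parameter value from the belief and return the policy it defines."""
    w = rng.gauss(belief.mean, belief.var ** 0.5)
    return lambda state: w * state


rng = random.Random(0)
prior = fit_prior([0.8, 1.0, 1.2, 0.9, 1.1])      # hypothetical historical tasks
observed = [(1.0, 1.9), (2.0, 4.1), (0.5, 1.0)]  # hypothetical (state, action) data
post = posterior(prior, observed)                 # belief after seeing the new task
policy = sample_policy(post, rng)                 # one sampled control policy
action = policy(1.5)                              # apply it to a new state
```

In this toy setting the posterior variance is smaller than the prior variance, and the posterior mean moves from the historical average toward the gain suggested by the new task's data. Sampling a parameter value, rather than always using the posterior mean, keeps some of the remaining uncertainty in the policy that gets applied.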

Bibliographic data

Publication No.: US2022180744A1
Patent type:
Publication date: 2022-06-09
Original format: PDF
Applicant/Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO. LTD.
Application No.: US202017116062
Inventors: YAYI ZOU; ZHIWEI QIN
Filing date: 2020-12-09
Classification: G08G1/081; G08G1/01; G06K9/62; G06N3/08
Country: US
Date added to database: 2022-08-25 01:29:46
