首页> 外国专利> Control method determination apparatus, control policy determination method, control policy determination program

Control method determination apparatus, control policy determination method, control policy determination program

机译:控制方法确定装置,控制策略确定方法,控制策略确定程序

摘要

PROBLEM TO BE SOLVED: To provide a control measure determination device capable of performing high-speed approximation of a value iteration calculation.SOLUTION: A control measure determination device comprises: an initial linear function generation section 1 which generates a candidate group of linear functions which give linear components to a value function in a belief space on the basis of environment sensing information including indeterminacy; a dual transformation section 2 which transforms the candidate group in the belief space into a plurality of points in a dual space; a convex hull approximate calculation section 32 which calculates an approximate convex hull of the plurality of points; a membership determination section 34 which determines a membership function of apexes of the approximate convex hull; a convex hull upper side extraction section 36 which extracts an upper side of the approximate convex hull; an inverse dual transformation section 4 which inversely transforms the apexes belonging to the upper side into the linear function in the belief space; a linear function updating section 6 which updates the linear function in accordance with back-up step numbers on the basis of the obtained linear function and outputs the updated linear function to the dual transformation section 2; and a value function determination section 5 which obtains a plurality of linear components of an approximate value function on the basis of the linear function obtained through the inverse transformation after updating the linear function.
机译:解决的问题:提供一种能够进行值迭代计算的高速近似的控制量度确定装置。解决方案:控制量度确定装置包括:初始线性函数生成部1,其生成候选线性函数组,根据包括不确定性在内的环境感知信息,将线性分量赋予信念空间中的值函数;对偶变换部分2,将信念空间中的候选组变换为对偶空间中的多个点;凸包近似计算部32,计算多个点的近似凸包。隶属度确定部分34,其确定近似凸包的顶点的隶属度函数;凸包上侧提取部36,提取近似凸包的上侧。逆对偶变换部4,将属于上侧的顶点逆变换为置信空间中的线性函数。线性函数更新部6,其基于所获得的线性函数,根据备份步数更新线性函数,并将更新后的线性函数输出至对偶变换部2。值函数确定部分5,其在更新线性函数之后,基于通过逆变换获得的线性函数,获得近似值函数的多个线性分量。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号