首页> 外文会议>AAAI Conference on Artificial Intelligence >Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal
【24h】

Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal

机译:通用价值迭代网络:当空间不变不是通用时

获取原文

摘要

In this paper, we first formally define the problem set of spatially invariant Markov Decision Processes (MDPs), and show that Value Iteration Networks (VIN) and its extensions are computationally bounded to it due to the use of the convolution kernel. To generalize VIN to spatially variant MDPs, we propose Universal Value Iteration Networks (UVIN). In comparison with VIN, UVIN automatically leams a flexible but compact network structure to encode the transition dynamics of the problems and support the differentiable planning module. We evaluate UVIN with both spatially invariant and spatially variant tasks, including navigation in regular maze, chessboard maze, and Mars, and Minecraft item syntheses. Results show that UVIN can achieve similar performance as VIN and its extensions on spatially invariant tasks, and significantly outperforms other models on more general problems.
机译:在本文中,我们首先正式定义了空间不变的马尔可夫决策过程(MDP)的问题集,并显示了由于使用卷积内核而计算到它的值迭代网络(VIN)及其扩展。 为了将VIN概括到空间变量MDP,我们提出了通用价值迭代网络(UVIN)。 与VIN相比,UVIN自动培养灵活但紧凑的网络结构以编码问题的转换动态,并支持可分辨率的计划模块。 我们在空间不变和空间变体任务中评估Uvin,包括常规迷宫,棋盘迷宫和火星的导航,以及MINECRAFT项目合成。 结果表明,uvin可以在空间不变任务上实现类似的性能及其对空间不变的任务的扩展,并显着优于其他模型更普遍的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号