Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal

机译：通用价值迭代网络：当空间不变不是通用时

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we first formally define the problem set of spatially invariant Markov Decision Processes (MDPs), and show that Value Iteration Networks (VIN) and its extensions are computationally bounded to it due to the use of the convolution kernel. To generalize VIN to spatially variant MDPs, we propose Universal Value Iteration Networks (UVIN). In comparison with VIN, UVIN automatically leams a flexible but compact network structure to encode the transition dynamics of the problems and support the differentiable planning module. We evaluate UVIN with both spatially invariant and spatially variant tasks, including navigation in regular maze, chessboard maze, and Mars, and Minecraft item syntheses. Results show that UVIN can achieve similar performance as VIN and its extensions on spatially invariant tasks, and significantly outperforms other models on more general problems.

机译：在本文中，我们首先正式定义了空间不变的马尔可夫决策过程（MDP）的问题集，并显示了由于使用卷积内核而计算到它的值迭代网络（VIN）及其扩展。为了将VIN概括到空间变量MDP，我们提出了通用价值迭代网络（UVIN）。与VIN相比，UVIN自动培养灵活但紧凑的网络结构以编码问题的转换动态，并支持可分辨率的计划模块。我们在空间不变和空间变体任务中评估Uvin，包括常规迷宫，棋盘迷宫和火星的导航，以及MINECRAFT项目合成。结果表明，uvin可以在空间不变任务上实现类似的性能及其对空间不变的任务的扩展，并显着优于其他模型更普遍的问题。

著录项

来源
《AAAI Conference on Artificial Intelligence》|2020年|6243-7030p|共8页
会议地点
作者
Li Zhang; Xin Li; Sen Chen; Hongyu Zang; Jie Huang; Mingzhong Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Representation of Universal Quantifier in Bulgarian Language with Universal Networking Language [J] . Velislava Stoykova Procedia - Social and Behavioral Sciences . 2015,第期

机译：具有通用网络语言的保加利亚语的通用量词表示
2. Strategy for universal access to health and universal health coverage and the contribution of the International Nursing Networks [J] . Silvia Helena De Bortoli Cassiani Revista Latino-Americana de Enfermagem . 2014,第6期

机译：普遍获得卫生和全民健康覆盖的战略以及国际护理网络的贡献
3. Universality and Non-Universality in Behavior of Self-Repairing Random Networks [J] . A. S. Ioselevich, D. S. Lyubshin JETP Letters . 2009,第8期

机译：自修复随机网络行为的普遍性和非普遍性
4. Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal [C] . Li Zhang, Xin Li, Sen Chen, AAAI Conference on Artificial Intelligence . 2020

机译：通用价值迭代网络：当空间不变不是通用时
5. The Case of the Missing Universal: The British Universal History (1736-1766) and the Evolution of Universal History [D] . Beelen, Tim Antonius Lambertus. 2020

机译：失踪普遍的案例：英国普遍历史（1736-1766）和普遍历史的演变
6. Strategy for universal access to health and universal health coverage andthe contribution of the International Nursing Networks [O] . Silvia Helena De Bortoli Cassiani 2014

机译：普遍获得卫生和全民健康覆盖的战略以及国际护理网络的贡献
7. Representation of Universal Quantifier in Bulgarian Language with Universal Networking Language [O] . Stoykova Velislava 2015

机译：通用语言在保加利亚语中的通用量词表示
8. Universal Grids: Universal Transverse Mercator (UTM) and Universal Polar Stereographic (UPS). Edition 1. [R] . Hager, J. W., Behensky, J. F., Drew, B. W. 1989

机译：通用网格：通用横轴墨卡托（UTm）和通用极地立体（Ups）。第1版

Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal

摘要

著录项

相似文献

相关主题

期刊订阅