ARKAQ-Learning: Autonomous State Space Segmentation and Policy Generation

Abstract

Agents can often observe a real-world environment only partially, because of noisy sensors or incomplete perception. Autonomous strategy planning under uncertainty poses two major challenges: first, autonomous segmentation of the state space for a given task; second, the emergence of complex behaviors that deal with each state segment. This paper proposes an approach that handles both by combining several techniques, named ARKAQ-Learning (ART 2-A networks augmented with Kalman Filters and Q-Learning). The algorithm is online and has low space and computational complexity. It was run on several well-known partially observable Markov decision process (POMDP) problems. The World Model Generator could reveal the hidden states, mapping the non-Markovian model to a Markovian internal state space, and the Policy Generator could build the optimal policy on that internal Markovian state model.
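The abstract gives no implementation details, so the following is only a rough sketch of the general idea of pairing a state-space segmenter with Q-Learning, not the authors' algorithm. It assumes a simplified vigilance-based prototype clusterer in the spirit of ART-style networks and a plain tabular Q-Learning update; the Kalman filter stage is omitted entirely. The names PrototypeSegmenter, q_update, vigilance, learning_rate, alpha and gamma are all hypothetical and chosen for illustration.

```python
import math


class PrototypeSegmenter:
    """Hypothetical, simplified vigilance-based clusterer (not the paper's ART 2-A)."""

    def __init__(self, vigilance=0.9, learning_rate=0.1):
        self.vigilance = vigilance          # minimum cosine similarity to reuse a category
        self.learning_rate = learning_rate  # how far a prototype moves toward a new input
        self.prototypes = []                # one unit-length vector per internal state

    @staticmethod
    def _normalize(vector):
        norm = math.sqrt(sum(x * x for x in vector))
        return [x / norm for x in vector] if norm > 0 else list(vector)

    def assign(self, observation):
        """Return the index of the internal state for this observation."""
        x = self._normalize(observation)
        best_index, best_similarity = None, -1.0
        for index, prototype in enumerate(self.prototypes):
            similarity = sum(a * b for a, b in zip(x, prototype))
            if similarity > best_similarity:
                best_index, best_similarity = index, similarity
        # No sufficiently similar prototype: open a new state segment.
        if best_index is None or best_similarity < self.vigilance:
            self.prototypes.append(x)
            return len(self.prototypes) - 1
        # Otherwise move the winning prototype slightly toward the input.
        lr = self.learning_rate
        blended = [(1 - lr) * w + lr * xi
                   for w, xi in zip(self.prototypes[best_index], x)]
        self.prototypes[best_index] = self._normalize(blended)
        return best_index


def q_update(q_table, state, action, reward, next_state, n_actions,
             alpha=0.1, gamma=0.95):
    """Standard tabular Q-Learning update over internal state indices."""
    row = q_table.setdefault(state, [0.0] * n_actions)
    next_row = q_table.setdefault(next_state, [0.0] * n_actions)
    row[action] += alpha * (reward + gamma * max(next_row) - row[action])
    return row[action]


# Usage: segment two observations, then learn from one transition.
segmenter = PrototypeSegmenter(vigilance=0.9)
s0 = segmenter.assign([1.0, 0.0])
s1 = segmenter.assign([0.0, 1.0])
q_table = {}
q_update(q_table, s0, action=1, reward=1.0, next_state=s1, n_actions=2)
```

In this sketch the segmenter plays the role the abstract assigns to the World Model Generator (producing discrete internal states) and the Q-table update plays the role of the Policy Generator; how the paper actually couples the two, and how the Kalman filters enter, cannot be recovered from the abstract.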

Bibliographic record

Source: International Symposium on Computer and Information Sciences (ISCIS 2005), 26-28 October 2005, Istanbul (TR), 2005, pp. 512-523 (12 pages)
Conference location: Istanbul (TR)
Authors: Alp Sardag; H. Levent Akin
Affiliation: Bogazici University, Department of Computer Engineering, 34342 Bebek, Istanbul, Turkey
Original format: PDF
Language: English
Chinese Library Classification: Computer applications
