Sequential $Q$-Learning With Kalman Filtering for Multirobot Cooperative Transportation

Wang Y.; de Silva C. W.

Abstract

This paper presents a modified, distributed $Q$-learning algorithm, termed sequential $Q$-learning with Kalman filtering (SQKF), for decision making in multirobot cooperation. The SQKF algorithm has two defining characteristics. 1) The learning process is arranged sequentially (i.e., the robots do not make decisions simultaneously, but in a predefined sequence) so as to promote cooperation among the robots and reduce their $Q$-learning spaces. 2) A robot does not update its $Q$-values with the observed global reward. Instead, it employs a specific Kalman filter to extract its real local reward from the global reward and updates its $Q$-table with this local reward. The SQKF algorithm is intended to solve two problems in multirobot $Q$-learning: credit assignment and behavior conflicts. The detailed procedure of the SQKF algorithm is presented, and its application is illustrated using a prototype multirobot experimental system. The experimental results show that the algorithm performs better than the conventional single-agent $Q$-learning algorithm or the team $Q$-learning algorithm in the multirobot domain.
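The two characteristics described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not give the filter model or the conflict rule, so both are assumptions here: the global reward is modeled as g_t = r(s_t) + b_t, where b_t is a random-walk term standing in for the other robots' contribution, and a robot later in the sequence simply avoids actions already chosen by earlier robots. The names `LocalRewardFilter`, `choose_sequentially`, and `q_update` are hypothetical.

```python
import random


class LocalRewardFilter:
    """Kalman filter estimating per-state local rewards from a global reward.

    Assumed model (not from the abstract): g_t = r(s_t) + b_t, where b_t is a
    random walk with variance `drift_var` caused by the other robots.
    """

    def __init__(self, n_states, drift_var=0.1, obs_var=1e-3):
        self.n = n_states
        dim = n_states + 1                       # local rewards plus one bias term
        self.x = [0.0] * dim                     # estimate [r(0), ..., r(n-1), b]
        self.P = [[1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]
        self.drift_var = drift_var
        self.obs_var = obs_var

    def update(self, state, global_reward):
        b = self.n
        # Predict: only the bias term drifts, so only its variance grows.
        self.P[b][b] += self.drift_var
        # Observation row H has ones at `state` and at the bias index.
        PHt = [row[state] + row[b] for row in self.P]      # P @ H^T
        S = PHt[state] + PHt[b] + self.obs_var             # H P H^T + R
        K = [v / S for v in PHt]                           # Kalman gain
        innovation = global_reward - (self.x[state] + self.x[b])
        self.x = [xi + k * innovation for xi, k in zip(self.x, K)]
        # P <- P - K (H P); H P is PHt transposed because P is symmetric.
        dim = len(PHt)
        self.P = [[self.P[i][j] - K[i] * PHt[j] for j in range(dim)]
                  for i in range(dim)]
        return self.x[state]                               # local reward estimate


def choose_sequentially(q_tables, states, epsilon=0.1, rng=random):
    """Robots pick actions one at a time in a fixed, predefined order.

    Hypothetical conflict rule: a robot skips actions already taken by robots
    earlier in the sequence, falling back to all actions if none remain.
    """
    taken, chosen = set(), []
    for Q, s in zip(q_tables, states):
        all_actions = list(range(len(Q[s])))
        allowed = [a for a in all_actions if a not in taken] or all_actions
        if rng.random() < epsilon:
            action = rng.choice(allowed)                   # explore
        else:
            action = max(allowed, key=lambda a: Q[s][a])   # exploit
        taken.add(action)
        chosen.append(action)
    return chosen


def q_update(Q, s, a, r_local, s_next, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update, fed with the filtered local reward."""
    best_next = max(Q[s_next])
    Q[s][a] += alpha * (r_local + gamma * best_next - Q[s][a])
```

In one learning step under these assumptions, the robots would call `choose_sequentially`, act, observe the shared global reward, pass it through their own `LocalRewardFilter.update`, and feed the returned estimate to `q_update`. Under the assumed model the local rewards are identifiable only up to a common offset shared with the bias term, so the differences between state estimates are the meaningful quantity.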


Bibliographic details

Source: Mechatronics, IEEE/ASME Transactions on, 2010, no. 2, pp. 261-268 (8 pages)
Authors: Wang Y.; de Silva C. W.
Affiliation: Industrial Automation Laboratory, Department of Mechanical Engineering, The University of British Columbia, Vancouver, Canada
Original format: PDF
Language: English
Keywords: $Q$-learning; decision making; multirobot systems

