Learning Behavior Fusion Estimation from Demonstration

Abstract

A critical challenge in robot learning from demonstration is mapping the behavior of the trainer onto the robot's existing repertoire of basic/primitive capabilities. Following a behavior-based approach, we aim to express a teacher's demonstration as a linear combination (or fusion) of the robot's primitives. We treat this as a state estimation problem over the space of possible linear fusion weights. We consider this fusion state to be a model of the teacher's control policy expressed with respect to the robot's capabilities. Once estimated under various sensory preconditions, fusion state estimates are used as a coordination policy for online robot control to imitate the teacher's decision making. A particle filter is used to infer fusion state from the control commands demonstrated by the teacher and those predicted by each primitive. The particle filter allows for inference under ambiguity over a large space of likely fusion combinations and under dynamic changes to the teacher's policy over time. We present results of our approach in simulated and real-world environments with a Pioneer 3DX mobile robot.
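The estimation idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the motion model, the likelihood, or the weight range, so the random-walk diffusion, the Gaussian likelihood, the [0, 1] clipping, and all names (`fuse`, `init_particles`, `pf_step`, `estimate`) and numeric values below are assumptions chosen for illustration. Each particle is one hypothesis about the fusion weights, and it is scored by how closely the weighted sum of the primitives' predicted commands matches the teacher's demonstrated command.

```python
import math
import random


def fuse(weights, primitive_cmds):
    """Linear fusion: weighted sum of each primitive's predicted command."""
    dims = len(primitive_cmds[0])
    return [sum(w * cmd[d] for w, cmd in zip(weights, primitive_cmds))
            for d in range(dims)]


def init_particles(n_particles, n_primitives, rng):
    """Draw initial fusion-weight hypotheses uniformly from [0, 1] (assumed range)."""
    return [[rng.random() for _ in range(n_primitives)]
            for _ in range(n_particles)]


def pf_step(particles, primitive_cmds, teacher_cmd, rng,
            diffusion=0.05, obs_sigma=0.1):
    """One predict / weight / resample cycle over fusion-weight particles."""
    # Predict: assumed random-walk diffusion, clipped back into [0, 1].
    moved = [[min(1.0, max(0.0, w + rng.gauss(0.0, diffusion))) for w in p]
             for p in particles]
    # Weight: assumed Gaussian likelihood of the teacher's command
    # given the command produced by fusing the primitives.
    likelihoods = []
    for p in moved:
        pred = fuse(p, primitive_cmds)
        sq_err = sum((a - b) ** 2 for a, b in zip(pred, teacher_cmd))
        likelihoods.append(math.exp(-0.5 * sq_err / obs_sigma ** 2))
    if sum(likelihoods) == 0.0:
        # All likelihoods underflowed: fall back to uniform resampling.
        likelihoods = [1.0] * len(moved)
    # Resample with replacement in proportion to likelihood.
    return [list(p) for p in rng.choices(moved, weights=likelihoods, k=len(moved))]


def estimate(particles):
    """Point estimate of the fusion state: the mean over all particles."""
    n = len(particles)
    return [sum(p[k] for p in particles) / n for k in range(len(particles[0]))]


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical commands (speed, turn rate) predicted by two primitives.
    primitive_cmds = [[0.2, 0.8], [0.6, -0.3]]
    # Hypothetical teacher who blends the primitives with weights 0.3 and 0.7.
    teacher_cmd = fuse([0.3, 0.7], primitive_cmds)
    particles = init_particles(500, 2, rng)
    for _ in range(60):
        particles = pf_step(particles, primitive_cmds, teacher_cmd, rng)
    print("estimated fusion weights:", estimate(particles))
```

In this toy setup the two primitive commands are linearly independent, so a single weight vector explains the teacher's command. When several weight combinations explain the demonstration equally well, the particle set can keep multiple hypotheses alive, which is the kind of ambiguity the abstract refers to. The diffusion step is what lets the estimate follow a teacher whose policy changes over time.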

Bibliographic record

Source: International Symposium on Robot and Human Interactive Communication, 2006, 6 pages
Authors: Monica Nicolescu; Odest Chadwicke Jenkins; Adam Olenderski
Original format: PDF
Chinese Library Classification: TP11-53

Similar literature

1. Estimation of Tritium and Dust Source Term in European DEMOnstration Fusion Reactor During Accident Scenarios [J]. Guido Mazzini, Tadas Kaliatka, Maria Teresa Porfiri. Journal of Nuclear Engineering and Radiation Science, 2019, Issue 3.
2. Deep Behavioral Cloning for Traffic Control with Virtual Expert Demonstration Under a Parallel Learning Framework [J]. Xiaoshuang Li, Fenghua Zhu, Fei-Yue Wang. IFAC PapersOnLine, 2020, Issue 5.
3. Learning Physical Collaborative Robot Behaviors From Human Demonstrations [J]. Leonel Rozo, Sylvain Calinon, Darwin G. Caldwell. IEEE Transactions on Robotics, 2016, Issue 3.
4. Behavior Fusion Estimation for Robot Learning from Demonstration [C]. Nicolescu, M., Jenkins, . 2006.
5. Machine Learning-Based Fusion Studies of Rainfall Estimation from Spaceborne and Ground-Based Radars [D]. Tan, Haiming. 2019.
6. An Incremental Learning Framework to Enhance Teaching by Demonstration Based on Multimodal Sensor Fusion [O]. Jie Li, Junpei Zhong, Jingfeng Yang. 2020.
7. Behavior Fusion Estimation for Robot Learning from Demonstration [O]. Monica Nicolescu. 2008.
