International Journal of Robotics & Automation

SOFT ACTOR-CRITIC REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATOR WITH HINDSIGHT EXPERIENCE REPLAY


Abstract

The key challenges in applying reinforcement learning (RL) to complex robotic control tasks are its fragile convergence properties, very high sample complexity, and the need to shape a reward function. In this work, we present a soft actor-critic (SAC) style algorithm, an off-policy actor-critic RL method based on the maximum entropy RL framework, in which the actor's objective is to maximize the expected reward while also maximizing the entropy of its policy. This effectively improves the stability of the algorithm's performance and its robustness to modelling and estimation errors. Moreover, we combine SAC with a transition replay scheme called hindsight experience replay (HER) so as to make policy learning from sparse rewards more efficient. Finally, the effectiveness of the proposed method is verified on a range of manipulation tasks in a simulated environment.
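The maximum entropy objective referred to above can be written, in the standard SAC formulation (a sketch of the common notation, not necessarily the paper's exact statement), as

\[
J(\pi) = \sum_{t=0}^{T} \mathbb{E}_{(s_t, a_t) \sim \rho_\pi}\!\left[ r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \right],
\]

where \(\rho_\pi\) is the state-action distribution induced by the policy \(\pi\), \(\mathcal{H}\) denotes entropy, and the temperature \(\alpha\) trades off the reward term against the entropy bonus.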
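The hindsight relabelling idea can be illustrated with a minimal Python sketch, assuming a goal-conditioned environment with a sparse reward; the buffer layout and the compute_reward helper below are illustrative assumptions, not the paper's implementation, and the sketch uses the simple "final"-goal relabelling strategy:

import random
from collections import deque

class HindsightReplayBuffer:
    """Replay buffer with hindsight goal relabelling (the 'final' strategy)."""

    def __init__(self, capacity, compute_reward):
        # compute_reward(achieved_goal, desired_goal) -> sparse reward,
        # e.g. 0.0 if the goals match within a tolerance, else -1.0.
        self.buffer = deque(maxlen=capacity)
        self.compute_reward = compute_reward

    def store_episode(self, episode):
        """episode: list of (obs, action, achieved_goal, desired_goal, next_obs)."""
        # The goal actually achieved at the end of the episode.
        final_goal = episode[-1][2]
        for obs, action, achieved, desired, next_obs in episode:
            # Original transition with the intended goal.
            reward = self.compute_reward(achieved, desired)
            self.buffer.append((obs, desired, action, reward, next_obs))
            # Hindsight transition: pretend the final achieved goal was the
            # target, turning a failed episode into a useful success signal.
            h_reward = self.compute_reward(achieved, final_goal)
            self.buffer.append((obs, final_goal, action, h_reward, next_obs))

    def sample(self, batch_size):
        return random.sample(self.buffer, batch_size)

Because SAC is off-policy, training can then sample minibatches from this buffer exactly as from an ordinary replay buffer, with the relabelled transitions supplying non-trivial reward signal even when the original goals were never reached.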
