Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay

Zhaolong Gao; Rongyu Tang; Luyao Chen; Qiang Huang; Jiping He

首页> 外文期刊>International Journal of Advanced Robotic Systems >Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay

【24h】

Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay

机译：通过深度确定性政策梯度与后敏感体验重放的持续共享控制掌握任务

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Grasp using a prosthetic hand in real life can be a difficult task. The amputee users are often capable of planning the reaching trajectory and hand grasp location selection, however, failed in precise finger movements, such as adapting the fingers to the surface of the object without excessive force. It is much efficient to leave that part to the machine autonomy. In order to combine the intention and planning ability of users with robotic control, the shared control is introduced in which users’ inputs and robot control methods are combined to achieve a goal. The shared control problem can be formulated as a Partially Observable Markov Decision Process. To find the optimal control policy, we adopt an adaptive dynamic programming and reinforcement learning-based control algorithm-Deep Deterministic Policy Gradient combined with Hindsight Experience Replay. We proposed the algorithm with a prediction layer using the reparameterization technique. The system was tested in a modified simulation environment for the ability to follow the user’s intention and keep the contact force in boundary for safety.

机译：掌握现实生活中的假肢可能是一项艰巨的任务。截肢者用户通常能够规划到达轨迹和手动抓握位置选择，但是，在精确的手指运动中失败，例如将手指调整到物体的表面而没有过大的力。将该部分留给机器自主权是有效的。为了结合有机机器人控制的意图和规划能力，引入了共享控制，其中将用户输入和机器人控制方法组合以实现目标。共享控制问题可以作为部分观察到的马尔可夫决策过程制定。为了找到最佳控制策略，我们采用了自适应动态规划和基于强化学习的控制算法 - 深度确定性政策梯度与后敏感体验重放相结合。我们使用Reparameterization技术提出了具有预测层的算法。该系统在修改的模拟环境中进行了测试，以便能够遵循用户的意图，并保持安全的边界中的接触力。

著录项

来源
《International Journal of Advanced Robotic Systems》 |2020年第4期|共页
作者
Zhaolong Gao; Rongyu Tang; Luyao Chen; Qiang Huang; Jiping He;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
Shared controlreinforcement learningadaptive dynamic programming prosthetic handtelerobotics;

机译：共享ControlReInforilation LearningAveive动态编程假肢手台托管;

相似文献

外文文献
中文文献
专利

1. Efficient experience replay based deep deterministic policy gradient for AGC dispatch in integrated energy system [J] . Li Jiawen, Yu Tao, Zhang Xiaoshun, Applied Energy . 2021,第Mara1期

机译：基于EAGC调度的基于EAG COMPERING CONTIOMIC梯度的高效体验重播
2. Asynchronous Episodic Deep Deterministic Policy Gradient: Toward Continuous Control in Computationally Complex Environments [J] . Zhizheng Zhang, Jiale Chen, Zhibo Chen, Cybernetics, IEEE Transactions on . 2021,第2期

机译：异步epiSodic深度确定性政策梯度：在计算复杂环境中连续控制
3. Feedback Deep Deterministic Policy Gradient With Fuzzy Reward for Robotic Multiple Peg-in-Hole Assembly Tasks [J] . Xu Jing, Hou Zhimin, Wang Wei, IEEE transactions on industrial informatics . 2019,第3期

机译：机器人多重钉孔装配任务的带有模糊奖励的反馈确定性策略梯度
4. Duplicated Replay Buffer for Asynchronous Deep Deterministic Policy Gradient [C] . Seyed Mohammad Seyed Motehayeri, Vahid Baghi, Ehsan Maani Miandoab, International Computer Conference, Computer Society of Iran . 2021

机译：用于异步深度确定性策略梯度的重复重放缓冲区
5. Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay [O] . Evan Prianto, MyeongSeop Kim, Jae-Han Park, 2020

机译：使用深度加强学习的多臂操纵器的路径规划：软演员 - 与后敏感体验重播
6. Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay [O] . Zhaolong Gao, Rongyu Tang, Luyao Chen, 2020

机译：通过深度确定性政策梯度与后敏感体验重放的持续共享控制掌握任务

Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay

摘要

著录项

相似文献

相关主题

期刊订阅