Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay

Grasp using a prosthetic hand in real life can be a difficult task. The amputee users are often capable of planning the reaching trajectory and hand grasp location selection, however, failed in precise finger movements, such as adapting the fingers to the surface of the object without excessive forc...

Full description

Bibliographic Details
Main Authors:	Zhaolong Gao, Rongyu Tang, Luyao Chen, Qiang Huang, Jiping He
Format:	Article
Language:	English
Published:	SAGE Publishing 2020-08-01
Series:	International Journal of Advanced Robotic Systems
Online Access:	https://doi.org/10.1177/1729881420936851

id	doaj-2af17a2a2f534af8bdc90d0b86ab5da1
record_format	Article
spelling	doaj-2af17a2a2f534af8bdc90d0b86ab5da12020-11-25T03:57:24ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142020-08-011710.1177/1729881420936851Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience ReplayZhaolong Gao0Rongyu Tang1Luyao Chen2Qiang Huang3Jiping He4 School of Artificial Intelligence and Automation, , Wuhan, China Beijing Advanced Innovation Center for Intelligent Robot and System, , Beijing, China School of Optical and Electronic Information, , Wuhan, China Beijing Advanced Innovation Center for Intelligent Robot and System, , Beijing, China Beijing Advanced Innovation Center for Intelligent Robot and System, , Beijing, ChinaGrasp using a prosthetic hand in real life can be a difficult task. The amputee users are often capable of planning the reaching trajectory and hand grasp location selection, however, failed in precise finger movements, such as adapting the fingers to the surface of the object without excessive force. It is much efficient to leave that part to the machine autonomy. In order to combine the intention and planning ability of users with robotic control, the shared control is introduced in which users’ inputs and robot control methods are combined to achieve a goal. The shared control problem can be formulated as a Partially Observable Markov Decision Process. To find the optimal control policy, we adopt an adaptive dynamic programming and reinforcement learning-based control algorithm-Deep Deterministic Policy Gradient combined with Hindsight Experience Replay. We proposed the algorithm with a prediction layer using the reparameterization technique. The system was tested in a modified simulation environment for the ability to follow the user’s intention and keep the contact force in boundary for safety.https://doi.org/10.1177/1729881420936851
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zhaolong Gao Rongyu Tang Luyao Chen Qiang Huang Jiping He
spellingShingle	Zhaolong Gao Rongyu Tang Luyao Chen Qiang Huang Jiping He Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay International Journal of Advanced Robotic Systems
author_facet	Zhaolong Gao Rongyu Tang Luyao Chen Qiang Huang Jiping He
author_sort	Zhaolong Gao
title	Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay
title_short	Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay
title_full	Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay
title_fullStr	Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay
title_full_unstemmed	Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay
title_sort	continuous shared control in prosthetic hand grasp tasks by deep deterministic policy gradient with hindsight experience replay
publisher	SAGE Publishing
series	International Journal of Advanced Robotic Systems
issn	1729-8814
publishDate	2020-08-01
description	Grasp using a prosthetic hand in real life can be a difficult task. The amputee users are often capable of planning the reaching trajectory and hand grasp location selection, however, failed in precise finger movements, such as adapting the fingers to the surface of the object without excessive force. It is much efficient to leave that part to the machine autonomy. In order to combine the intention and planning ability of users with robotic control, the shared control is introduced in which users’ inputs and robot control methods are combined to achieve a goal. The shared control problem can be formulated as a Partially Observable Markov Decision Process. To find the optimal control policy, we adopt an adaptive dynamic programming and reinforcement learning-based control algorithm-Deep Deterministic Policy Gradient combined with Hindsight Experience Replay. We proposed the algorithm with a prediction layer using the reparameterization technique. The system was tested in a modified simulation environment for the ability to follow the user’s intention and keep the contact force in boundary for safety.
url	https://doi.org/10.1177/1729881420936851
work_keys_str_mv	AT zhaolonggao continuoussharedcontrolinprosthetichandgrasptasksbydeepdeterministicpolicygradientwithhindsightexperiencereplay AT rongyutang continuoussharedcontrolinprosthetichandgrasptasksbydeepdeterministicpolicygradientwithhindsightexperiencereplay AT luyaochen continuoussharedcontrolinprosthetichandgrasptasksbydeepdeterministicpolicygradientwithhindsightexperiencereplay AT qianghuang continuoussharedcontrolinprosthetichandgrasptasksbydeepdeterministicpolicygradientwithhindsightexperiencereplay AT jipinghe continuoussharedcontrolinprosthetichandgrasptasksbydeepdeterministicpolicygradientwithhindsightexperiencereplay
_version_	1724461094318112768

Continuous shared control in prosthetic hand grasp tasks by Deep Deterministic Policy Gradient with Hindsight Experience Replay

Similar Items