An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning

Deep reinforcement learning (DRL) has excellent performance in continuous control problems and it is widely used in path planning and other fields. An autonomous path planning model based on DRL is proposed to realize the intelligent path planning of unmanned ships in the unknown environment. The mo...

Full description

Bibliographic Details
Main Authors:	Siyu Guo, Xiuguo Zhang, Yisong Zheng, Yiquan Du
Format:	Article
Language:	English
Published:	MDPI AG 2020-01-01
Series:	Sensors
Subjects:	unmanned ships deep reinforcement learning ddpg autonomous path planning end-to-end collision avoidance
Online Access:	https://www.mdpi.com/1424-8220/20/2/426

id	doaj-210d2456f1f44c918a5b35be0b2bd6fc
record_format	Article
spelling	doaj-210d2456f1f44c918a5b35be0b2bd6fc2020-11-25T01:45:08ZengMDPI AGSensors1424-82202020-01-0120242610.3390/s20020426s20020426An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement LearningSiyu Guo0Xiuguo Zhang1Yisong Zheng2Yiquan Du3School of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaSchool of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaSchool of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaSchool of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaDeep reinforcement learning (DRL) has excellent performance in continuous control problems and it is widely used in path planning and other fields. An autonomous path planning model based on DRL is proposed to realize the intelligent path planning of unmanned ships in the unknown environment. The model utilizes the deep deterministic policy gradient (DDPG) algorithm, through the continuous interaction with the environment and the use of historical experience data; the agent learns the optimal action strategy in a simulation environment. The navigation rules and the ship’s encounter situation are transformed into a navigation restricted area, so as to achieve the purpose of planned path safety in order to ensure the validity and accuracy of the model. Ship data provided by ship automatic identification system (AIS) are used to train this path planning model. Subsequently, the improved DRL is obtained by combining DDPG with the artificial potential field. Finally, the path planning model is integrated into the electronic chart platform for experiments. Through the establishment of comparative experiments, the results show that the improved model can achieve autonomous path planning, and it has good convergence speed and stability.https://www.mdpi.com/1424-8220/20/2/426unmanned shipsdeep reinforcement learningddpgautonomous path planningend-to-endcollision avoidance
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Siyu Guo Xiuguo Zhang Yisong Zheng Yiquan Du
spellingShingle	Siyu Guo Xiuguo Zhang Yisong Zheng Yiquan Du An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning Sensors unmanned ships deep reinforcement learning ddpg autonomous path planning end-to-end collision avoidance
author_facet	Siyu Guo Xiuguo Zhang Yisong Zheng Yiquan Du
author_sort	Siyu Guo
title	An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
title_short	An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
title_full	An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
title_fullStr	An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
title_full_unstemmed	An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
title_sort	autonomous path planning model for unmanned ships based on deep reinforcement learning
publisher	MDPI AG
series	Sensors
issn	1424-8220
publishDate	2020-01-01
description	Deep reinforcement learning (DRL) has excellent performance in continuous control problems and it is widely used in path planning and other fields. An autonomous path planning model based on DRL is proposed to realize the intelligent path planning of unmanned ships in the unknown environment. The model utilizes the deep deterministic policy gradient (DDPG) algorithm, through the continuous interaction with the environment and the use of historical experience data; the agent learns the optimal action strategy in a simulation environment. The navigation rules and the ship’s encounter situation are transformed into a navigation restricted area, so as to achieve the purpose of planned path safety in order to ensure the validity and accuracy of the model. Ship data provided by ship automatic identification system (AIS) are used to train this path planning model. Subsequently, the improved DRL is obtained by combining DDPG with the artificial potential field. Finally, the path planning model is integrated into the electronic chart platform for experiments. Through the establishment of comparative experiments, the results show that the improved model can achieve autonomous path planning, and it has good convergence speed and stability.
topic	unmanned ships deep reinforcement learning ddpg autonomous path planning end-to-end collision avoidance
url	https://www.mdpi.com/1424-8220/20/2/426
work_keys_str_mv	AT siyuguo anautonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning AT xiuguozhang anautonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning AT yisongzheng anautonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning AT yiquandu anautonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning AT siyuguo autonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning AT xiuguozhang autonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning AT yisongzheng autonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning AT yiquandu autonomouspathplanningmodelforunmannedshipsbasedondeepreinforcementlearning
_version_	1725024919129948160

An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning

Similar Items