Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning

This paper proposes an efficient method, based on reinforcement learning, to be used as ship controller in fast-time simulators within restricted channels. The controller must operate the rudder in a realistic manner in both time and angle variation so as to approximate human piloting. The method is...

Full description

Bibliographic Details
Main Authors:	Jose Amendola, Lucas S. Miura, Anna H. Reali Costa, Fabio G. Cozman, Eduardo Aoun Tannuri
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Fast-time maneuvering simulations machine learning deep reinforcement learning ship path following control
Online Access:	https://ieeexplore.ieee.org/document/9163360/

id	doaj-870d1d9631a846418df19b04b157e81f
record_format	Article
spelling	doaj-870d1d9631a846418df19b04b157e81f2021-03-30T03:25:10ZengIEEEIEEE Access2169-35362020-01-01814919914921310.1109/ACCESS.2020.30156619163360Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement LearningJose Amendola0https://orcid.org/0000-0002-9374-4724Lucas S. Miura1Anna H. Reali Costa2https://orcid.org/0000-0001-7309-4528Fabio G. Cozman3https://orcid.org/0000-0003-4077-4935Eduardo Aoun Tannuri4https://orcid.org/0000-0001-7040-413XNumerical Offshore Tank Laboratory, University of São Paulo, São Paulo, BrazilNumerical Offshore Tank Laboratory, University of São Paulo, São Paulo, BrazilIntelligent Techniques Laboratory, University of São Paulo, São Paulo, BrazilDepartment of Mechatronics Engineering and Mechanical Systems, University of São Paulo, São Paulo, BrazilDepartment of Mechatronics Engineering and Mechanical Systems, University of São Paulo, São Paulo, BrazilThis paper proposes an efficient method, based on reinforcement learning, to be used as ship controller in fast-time simulators within restricted channels. The controller must operate the rudder in a realistic manner in both time and angle variation so as to approximate human piloting. The method is well suited to scenarios where no previous navigation data is available; it takes into account, during training, both the effect of environmental conditions and also curves in channels. We resort to an asynchronous distributed version of the reinforcement learning algorithm Deep Q Network (DQN), handling channel segments as separate episodes and including curvature information as context variables (thus moving away from most work in the literature). We tested our proposal in the channel of Porto Sudeste, in the southern Brazilian coast, with realistic environment scenarios where wind and current incidence varies along the channel. The method keeps a simple representation and can be applied to any port channel configuration that respects local technical regulations.https://ieeexplore.ieee.org/document/9163360/Fast-time maneuvering simulationsmachine learningdeep reinforcement learningship path following control
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jose Amendola Lucas S. Miura Anna H. Reali Costa Fabio G. Cozman Eduardo Aoun Tannuri
spellingShingle	Jose Amendola Lucas S. Miura Anna H. Reali Costa Fabio G. Cozman Eduardo Aoun Tannuri Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning IEEE Access Fast-time maneuvering simulations machine learning deep reinforcement learning ship path following control
author_facet	Jose Amendola Lucas S. Miura Anna H. Reali Costa Fabio G. Cozman Eduardo Aoun Tannuri
author_sort	Jose Amendola
title	Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning
title_short	Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning
title_full	Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning
title_fullStr	Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning
title_full_unstemmed	Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning
title_sort	navigation in restricted channels under environmental conditions: fast-time simulation by asynchronous deep reinforcement learning
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	This paper proposes an efficient method, based on reinforcement learning, to be used as ship controller in fast-time simulators within restricted channels. The controller must operate the rudder in a realistic manner in both time and angle variation so as to approximate human piloting. The method is well suited to scenarios where no previous navigation data is available; it takes into account, during training, both the effect of environmental conditions and also curves in channels. We resort to an asynchronous distributed version of the reinforcement learning algorithm Deep Q Network (DQN), handling channel segments as separate episodes and including curvature information as context variables (thus moving away from most work in the literature). We tested our proposal in the channel of Porto Sudeste, in the southern Brazilian coast, with realistic environment scenarios where wind and current incidence varies along the channel. The method keeps a simple representation and can be applied to any port channel configuration that respects local technical regulations.
topic	Fast-time maneuvering simulations machine learning deep reinforcement learning ship path following control
url	https://ieeexplore.ieee.org/document/9163360/
work_keys_str_mv	AT joseamendola navigationinrestrictedchannelsunderenvironmentalconditionsfasttimesimulationbyasynchronousdeepreinforcementlearning AT lucassmiura navigationinrestrictedchannelsunderenvironmentalconditionsfasttimesimulationbyasynchronousdeepreinforcementlearning AT annahrealicosta navigationinrestrictedchannelsunderenvironmentalconditionsfasttimesimulationbyasynchronousdeepreinforcementlearning AT fabiogcozman navigationinrestrictedchannelsunderenvironmentalconditionsfasttimesimulationbyasynchronousdeepreinforcementlearning AT eduardoaountannuri navigationinrestrictedchannelsunderenvironmentalconditionsfasttimesimulationbyasynchronousdeepreinforcementlearning
_version_	1724183536040148992

Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning

Similar Items