Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode

When autonomous underwater vehicle following the wall, a common problem is interference between sonars equipped in the autonomous underwater vehicle. A novel work mode with weighted polling (which can be also called “weighted round robin mode”) which can independently identify the environment, dynam...

Full description

Bibliographic Details
Main Authors: Xiangbin Wang, Guocheng Zhang, Yushan Sun, Lei Wan, Jian Cao
Format: Article
Language:English
Published: SAGE Publishing 2020-05-01
Series:International Journal of Advanced Robotic Systems
Online Access:https://doi.org/10.1177/1729881420925311
id doaj-0e8157c19d0c409b849dcd5b317a8af3
record_format Article
spelling doaj-0e8157c19d0c409b849dcd5b317a8af32020-11-25T03:42:26ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142020-05-011710.1177/1729881420925311Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin modeXiangbin WangGuocheng ZhangYushan SunLei WanJian CaoWhen autonomous underwater vehicle following the wall, a common problem is interference between sonars equipped in the autonomous underwater vehicle. A novel work mode with weighted polling (which can be also called “weighted round robin mode”) which can independently identify the environment, dynamically establish the environmental model, and switch the operating frequency of the sonar is proposed in this article. The dynamic weighted polling mode solves the problem of sonar interference. By dynamically switching the operating frequency of the sonar, the efficiency of following the wall is improved. Through the interpolation algorithm based on velocity interpolation, the data of different frequency ranging sonar are time registered to solve the asynchronous problem of multi-sonar and the system outputs according to the frequency of high-frequency sonar. With the reinforcement learning algorithm, autonomous underwater vehicle can follow the wall at a certain distance according to the distance obtained from the polling mode. At last, the tank test verified the effectiveness of the algorithm.https://doi.org/10.1177/1729881420925311
collection DOAJ
language English
format Article
sources DOAJ
author Xiangbin Wang
Guocheng Zhang
Yushan Sun
Lei Wan
Jian Cao
spellingShingle Xiangbin Wang
Guocheng Zhang
Yushan Sun
Lei Wan
Jian Cao
Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode
International Journal of Advanced Robotic Systems
author_facet Xiangbin Wang
Guocheng Zhang
Yushan Sun
Lei Wan
Jian Cao
author_sort Xiangbin Wang
title Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode
title_short Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode
title_full Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode
title_fullStr Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode
title_full_unstemmed Research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode
title_sort research on autonomous underwater vehicle wall following based on reinforcement learning and multi-sonar weighted round robin mode
publisher SAGE Publishing
series International Journal of Advanced Robotic Systems
issn 1729-8814
publishDate 2020-05-01
description When autonomous underwater vehicle following the wall, a common problem is interference between sonars equipped in the autonomous underwater vehicle. A novel work mode with weighted polling (which can be also called “weighted round robin mode”) which can independently identify the environment, dynamically establish the environmental model, and switch the operating frequency of the sonar is proposed in this article. The dynamic weighted polling mode solves the problem of sonar interference. By dynamically switching the operating frequency of the sonar, the efficiency of following the wall is improved. Through the interpolation algorithm based on velocity interpolation, the data of different frequency ranging sonar are time registered to solve the asynchronous problem of multi-sonar and the system outputs according to the frequency of high-frequency sonar. With the reinforcement learning algorithm, autonomous underwater vehicle can follow the wall at a certain distance according to the distance obtained from the polling mode. At last, the tank test verified the effectiveness of the algorithm.
url https://doi.org/10.1177/1729881420925311
work_keys_str_mv AT xiangbinwang researchonautonomousunderwatervehiclewallfollowingbasedonreinforcementlearningandmultisonarweightedroundrobinmode
AT guochengzhang researchonautonomousunderwatervehiclewallfollowingbasedonreinforcementlearningandmultisonarweightedroundrobinmode
AT yushansun researchonautonomousunderwatervehiclewallfollowingbasedonreinforcementlearningandmultisonarweightedroundrobinmode
AT leiwan researchonautonomousunderwatervehiclewallfollowingbasedonreinforcementlearningandmultisonarweightedroundrobinmode
AT jiancao researchonautonomousunderwatervehiclewallfollowingbasedonreinforcementlearningandmultisonarweightedroundrobinmode
_version_ 1724525076997472256