Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning

Due to the existence of unfavorable factors such as turbid water quality and target occlusion, it is difficult to obtain valid data of target features. Due to the repeated calculation of similar data, the real-time performance of the algorithm is poor. In view of the above problems, this paper propo...

Full description

Bibliographic Details
Main Authors: Lei Cai, Qiankun Sun, Tao Xu, Yukun Ma, Zhenxue Chen
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9007726/
id doaj-b8cc3e9a543d4cf0baa2fe56e5049022
record_format Article
spelling doaj-b8cc3e9a543d4cf0baa2fe56e50490222021-03-30T02:42:44ZengIEEEIEEE Access2169-35362020-01-018392733928410.1109/ACCESS.2020.29761219007726Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement LearningLei Cai0https://orcid.org/0000-0003-4811-5854Qiankun Sun1Tao Xu2Yukun Ma3https://orcid.org/0000-0002-4419-4287Zhenxue Chen4https://orcid.org/0000-0001-9637-5170School of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Information Engineering, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Control Science and Engineering, Shandong University, Jinan, ChinaDue to the existence of unfavorable factors such as turbid water quality and target occlusion, it is difficult to obtain valid data of target features. Due to the repeated calculation of similar data, the real-time performance of the algorithm is poor. In view of the above problems, this paper proposes a multi-AUV collaborative target recognition method based on transfer-reinforcement learning. The features of the target information which is collected by multi-AUV are fused based on wavelet transformation and affine invariance. The similarity of features is calculated by Mahalanobis distance and the learning model is selected autonomously based on the similarity threshold. Based on the Q-learning reinforcement learning model, the target information under the interference environment is trained intensively, and the effective features are extracted and stored in the source domain, which can reduce the impact of the environmental interference on the target recognition. The feature transfer learning model based on deep confidence network transfers the feature data of the source domain to the target domain, reducing the repeated calculation of similar data, and then ensuring the real-time performance of the algorithm. Simulation experiments are conducted in the SUN dataset under five underwater environments (turbid water, target occlusion, insufficient light, complex background, and overlapping targets), and the results demonstrate that the proposed model achieves better performance.https://ieeexplore.ieee.org/document/9007726/Small sampletarget recognitionmulti-AUV collaborationreinforcement learningtransfer learning
collection DOAJ
language English
format Article
sources DOAJ
author Lei Cai
Qiankun Sun
Tao Xu
Yukun Ma
Zhenxue Chen
spellingShingle Lei Cai
Qiankun Sun
Tao Xu
Yukun Ma
Zhenxue Chen
Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
IEEE Access
Small sample
target recognition
multi-AUV collaboration
reinforcement learning
transfer learning
author_facet Lei Cai
Qiankun Sun
Tao Xu
Yukun Ma
Zhenxue Chen
author_sort Lei Cai
title Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
title_short Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
title_full Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
title_fullStr Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
title_full_unstemmed Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
title_sort multi-auv collaborative target recognition based on transfer-reinforcement learning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Due to the existence of unfavorable factors such as turbid water quality and target occlusion, it is difficult to obtain valid data of target features. Due to the repeated calculation of similar data, the real-time performance of the algorithm is poor. In view of the above problems, this paper proposes a multi-AUV collaborative target recognition method based on transfer-reinforcement learning. The features of the target information which is collected by multi-AUV are fused based on wavelet transformation and affine invariance. The similarity of features is calculated by Mahalanobis distance and the learning model is selected autonomously based on the similarity threshold. Based on the Q-learning reinforcement learning model, the target information under the interference environment is trained intensively, and the effective features are extracted and stored in the source domain, which can reduce the impact of the environmental interference on the target recognition. The feature transfer learning model based on deep confidence network transfers the feature data of the source domain to the target domain, reducing the repeated calculation of similar data, and then ensuring the real-time performance of the algorithm. Simulation experiments are conducted in the SUN dataset under five underwater environments (turbid water, target occlusion, insufficient light, complex background, and overlapping targets), and the results demonstrate that the proposed model achieves better performance.
topic Small sample
target recognition
multi-AUV collaboration
reinforcement learning
transfer learning
url https://ieeexplore.ieee.org/document/9007726/
work_keys_str_mv AT leicai multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning
AT qiankunsun multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning
AT taoxu multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning
AT yukunma multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning
AT zhenxuechen multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning
_version_ 1724184773344100352