Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
Due to the existence of unfavorable factors such as turbid water quality and target occlusion, it is difficult to obtain valid data of target features. Due to the repeated calculation of similar data, the real-time performance of the algorithm is poor. In view of the above problems, this paper propo...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9007726/ |
id |
doaj-b8cc3e9a543d4cf0baa2fe56e5049022 |
---|---|
record_format |
Article |
spelling |
doaj-b8cc3e9a543d4cf0baa2fe56e50490222021-03-30T02:42:44ZengIEEEIEEE Access2169-35362020-01-018392733928410.1109/ACCESS.2020.29761219007726Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement LearningLei Cai0https://orcid.org/0000-0003-4811-5854Qiankun Sun1Tao Xu2Yukun Ma3https://orcid.org/0000-0002-4419-4287Zhenxue Chen4https://orcid.org/0000-0001-9637-5170School of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Information Engineering, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, ChinaSchool of Control Science and Engineering, Shandong University, Jinan, ChinaDue to the existence of unfavorable factors such as turbid water quality and target occlusion, it is difficult to obtain valid data of target features. Due to the repeated calculation of similar data, the real-time performance of the algorithm is poor. In view of the above problems, this paper proposes a multi-AUV collaborative target recognition method based on transfer-reinforcement learning. The features of the target information which is collected by multi-AUV are fused based on wavelet transformation and affine invariance. The similarity of features is calculated by Mahalanobis distance and the learning model is selected autonomously based on the similarity threshold. Based on the Q-learning reinforcement learning model, the target information under the interference environment is trained intensively, and the effective features are extracted and stored in the source domain, which can reduce the impact of the environmental interference on the target recognition. The feature transfer learning model based on deep confidence network transfers the feature data of the source domain to the target domain, reducing the repeated calculation of similar data, and then ensuring the real-time performance of the algorithm. Simulation experiments are conducted in the SUN dataset under five underwater environments (turbid water, target occlusion, insufficient light, complex background, and overlapping targets), and the results demonstrate that the proposed model achieves better performance.https://ieeexplore.ieee.org/document/9007726/Small sampletarget recognitionmulti-AUV collaborationreinforcement learningtransfer learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lei Cai Qiankun Sun Tao Xu Yukun Ma Zhenxue Chen |
spellingShingle |
Lei Cai Qiankun Sun Tao Xu Yukun Ma Zhenxue Chen Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning IEEE Access Small sample target recognition multi-AUV collaboration reinforcement learning transfer learning |
author_facet |
Lei Cai Qiankun Sun Tao Xu Yukun Ma Zhenxue Chen |
author_sort |
Lei Cai |
title |
Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning |
title_short |
Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning |
title_full |
Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning |
title_fullStr |
Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning |
title_full_unstemmed |
Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning |
title_sort |
multi-auv collaborative target recognition based on transfer-reinforcement learning |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Due to the existence of unfavorable factors such as turbid water quality and target occlusion, it is difficult to obtain valid data of target features. Due to the repeated calculation of similar data, the real-time performance of the algorithm is poor. In view of the above problems, this paper proposes a multi-AUV collaborative target recognition method based on transfer-reinforcement learning. The features of the target information which is collected by multi-AUV are fused based on wavelet transformation and affine invariance. The similarity of features is calculated by Mahalanobis distance and the learning model is selected autonomously based on the similarity threshold. Based on the Q-learning reinforcement learning model, the target information under the interference environment is trained intensively, and the effective features are extracted and stored in the source domain, which can reduce the impact of the environmental interference on the target recognition. The feature transfer learning model based on deep confidence network transfers the feature data of the source domain to the target domain, reducing the repeated calculation of similar data, and then ensuring the real-time performance of the algorithm. Simulation experiments are conducted in the SUN dataset under five underwater environments (turbid water, target occlusion, insufficient light, complex background, and overlapping targets), and the results demonstrate that the proposed model achieves better performance. |
topic |
Small sample target recognition multi-AUV collaboration reinforcement learning transfer learning |
url |
https://ieeexplore.ieee.org/document/9007726/ |
work_keys_str_mv |
AT leicai multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning AT qiankunsun multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning AT taoxu multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning AT yukunma multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning AT zhenxuechen multiauvcollaborativetargetrecognitionbasedontransferreinforcementlearning |
_version_ |
1724184773344100352 |