Improved Action-Decision Network for Visual Tracking With Meta-Learning

Visual tracking is a challenging problem since it usually faces adverse factors, such as object deformation, fast motion, occlusion, and background clutter in practical applications. Reinforcement learning based Action-Decision Network (ADNet) has shown great potential for object tracking. However,...

Full description

Bibliographic Details
Main Authors: Detian Huang, Lingke Kong, Jianqing Zhu, Lixin Zheng
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8808847/
id doaj-74ae806e0df1495982184e6b4969f4e6
record_format Article
spelling doaj-74ae806e0df1495982184e6b4969f4e62021-03-30T00:00:08ZengIEEEIEEE Access2169-35362019-01-01711720611721810.1109/ACCESS.2019.29365518808847Improved Action-Decision Network for Visual Tracking With Meta-LearningDetian Huang0https://orcid.org/0000-0002-8542-3728Lingke Kong1Jianqing Zhu2https://orcid.org/0000-0001-8840-3629Lixin Zheng3https://orcid.org/0000-0002-5146-8661Fujian Provincial Academic Engineering Research Centre in Industrial Intellectual Techniques and Systems, College of Engineering, Huaqiao University, Quanzhou, ChinaFujian Provincial Academic Engineering Research Centre in Industrial Intellectual Techniques and Systems, College of Engineering, Huaqiao University, Quanzhou, ChinaFujian Provincial Academic Engineering Research Centre in Industrial Intellectual Techniques and Systems, College of Engineering, Huaqiao University, Quanzhou, ChinaFujian Provincial Academic Engineering Research Centre in Industrial Intellectual Techniques and Systems, College of Engineering, Huaqiao University, Quanzhou, ChinaVisual tracking is a challenging problem since it usually faces adverse factors, such as object deformation, fast motion, occlusion, and background clutter in practical applications. Reinforcement learning based Action-Decision Network (ADNet) has shown great potential for object tracking. However, ADNet has some shortcomings in optimal action selection and action reward, and suffers from inefficient tracking. To this end, an improved ADNet is proposed to enhance the tracking accuracy and efficiency. Firstly, the multi-domain training is incorporated into ADNet to further improve the feature extraction ability of its convolution layers. Then, in the reinforcement learning based training phase, both the selection criteria for optimal action and the reward function are redesigned separately to explore more appropriate action and eliminate useless action. Finally, an effective online adaptive update strategy is proposed to adapt to the appearance changes or deformation of the object during actual tracking. Specifically, meta-learning is utilized to pursue the most appropriate parameters for the network so that the parameters are closer to the optimal ones in the subsequent tracking process. Experimental results demonstrate that the proposed tracker has advantages over ADNet in terms of accuracy and efficiency.https://ieeexplore.ieee.org/document/8808847/Image processingvisual trackingreinforcement learningmeta-learningmulti-domain training
collection DOAJ
language English
format Article
sources DOAJ
author Detian Huang
Lingke Kong
Jianqing Zhu
Lixin Zheng
spellingShingle Detian Huang
Lingke Kong
Jianqing Zhu
Lixin Zheng
Improved Action-Decision Network for Visual Tracking With Meta-Learning
IEEE Access
Image processing
visual tracking
reinforcement learning
meta-learning
multi-domain training
author_facet Detian Huang
Lingke Kong
Jianqing Zhu
Lixin Zheng
author_sort Detian Huang
title Improved Action-Decision Network for Visual Tracking With Meta-Learning
title_short Improved Action-Decision Network for Visual Tracking With Meta-Learning
title_full Improved Action-Decision Network for Visual Tracking With Meta-Learning
title_fullStr Improved Action-Decision Network for Visual Tracking With Meta-Learning
title_full_unstemmed Improved Action-Decision Network for Visual Tracking With Meta-Learning
title_sort improved action-decision network for visual tracking with meta-learning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Visual tracking is a challenging problem since it usually faces adverse factors, such as object deformation, fast motion, occlusion, and background clutter in practical applications. Reinforcement learning based Action-Decision Network (ADNet) has shown great potential for object tracking. However, ADNet has some shortcomings in optimal action selection and action reward, and suffers from inefficient tracking. To this end, an improved ADNet is proposed to enhance the tracking accuracy and efficiency. Firstly, the multi-domain training is incorporated into ADNet to further improve the feature extraction ability of its convolution layers. Then, in the reinforcement learning based training phase, both the selection criteria for optimal action and the reward function are redesigned separately to explore more appropriate action and eliminate useless action. Finally, an effective online adaptive update strategy is proposed to adapt to the appearance changes or deformation of the object during actual tracking. Specifically, meta-learning is utilized to pursue the most appropriate parameters for the network so that the parameters are closer to the optimal ones in the subsequent tracking process. Experimental results demonstrate that the proposed tracker has advantages over ADNet in terms of accuracy and efficiency.
topic Image processing
visual tracking
reinforcement learning
meta-learning
multi-domain training
url https://ieeexplore.ieee.org/document/8808847/
work_keys_str_mv AT detianhuang improvedactiondecisionnetworkforvisualtrackingwithmetalearning
AT lingkekong improvedactiondecisionnetworkforvisualtrackingwithmetalearning
AT jianqingzhu improvedactiondecisionnetworkforvisualtrackingwithmetalearning
AT lixinzheng improvedactiondecisionnetworkforvisualtrackingwithmetalearning
_version_ 1724188778349723648