STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object Tracking

Constructing a robust appearance model of the visual object is a crucial task for visual object tracking. Recently, more and more studies combine spatial feature with a temporal feature to improve the tracking performance. These methods successfully apply the features from spatial and temporal to ad...

Full description

Bibliographic Details
Main Authors: Zhengyu Zhu, Bing Liu, Yunbo Rao, Qiao Liu, Rui Zhang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8660630/
id doaj-0de4bc27df5b4b25ad43022e693d097e
record_format Article
spelling doaj-0de4bc27df5b4b25ad43022e693d097e2021-03-29T22:18:50ZengIEEEIEEE Access2169-35362019-01-017301423015610.1109/ACCESS.2019.29031618660630STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object TrackingZhengyu Zhu0Bing Liu1https://orcid.org/0000-0003-3643-2473Yunbo Rao2https://orcid.org/0000-0001-5433-7379Qiao Liu3Rui Zhang4College of Computer Science, Chongqing University, Chongqing, ChinaCollege of Computer Science, Chongqing University, Chongqing, ChinaSchool of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, ChinaCollege of Computer Science, Chongqing University, Chongqing, ChinaConstructing a robust appearance model of the visual object is a crucial task for visual object tracking. Recently, more and more studies combine spatial feature with a temporal feature to improve the tracking performance. These methods successfully apply the features from spatial and temporal to address the problem for tracking. This paper presents a novel method for visual object tracking based on spatiotemporal feature combined with correlation filters. In this paper, the visual features of a target object are extracted from a spatial-temporal residual network (STResNet) appearance model with two sub-networks. The STResNet appearance model learns separately spatial feature and temporal feature, respectively, so that we can effectively utilize spatial context around the surrounding of the target object in each frame and the temporal relationship between successive frames to refine the appearance representation of the target object. Finally, our spatiotemporal fusion feature from STResNet appearance model is incorporated into the correlation filter for robust visual object tracking. The experimental results show that our method achieves similar or better performance compared with the other tracking methods based on convolutional neural networks or correlation filter.https://ieeexplore.ieee.org/document/8660630/Spatiotemporal residual networkcorrelation filtervisual object trackingdeep learningconvolutional neural networks
collection DOAJ
language English
format Article
sources DOAJ
author Zhengyu Zhu
Bing Liu
Yunbo Rao
Qiao Liu
Rui Zhang
spellingShingle Zhengyu Zhu
Bing Liu
Yunbo Rao
Qiao Liu
Rui Zhang
STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object Tracking
IEEE Access
Spatiotemporal residual network
correlation filter
visual object tracking
deep learning
convolutional neural networks
author_facet Zhengyu Zhu
Bing Liu
Yunbo Rao
Qiao Liu
Rui Zhang
author_sort Zhengyu Zhu
title STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object Tracking
title_short STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object Tracking
title_full STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object Tracking
title_fullStr STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object Tracking
title_full_unstemmed STResNet_CF Tracker: The Deep Spatiotemporal Features Learning for Correlation Filter Based Robust Visual Object Tracking
title_sort stresnet_cf tracker: the deep spatiotemporal features learning for correlation filter based robust visual object tracking
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Constructing a robust appearance model of the visual object is a crucial task for visual object tracking. Recently, more and more studies combine spatial feature with a temporal feature to improve the tracking performance. These methods successfully apply the features from spatial and temporal to address the problem for tracking. This paper presents a novel method for visual object tracking based on spatiotemporal feature combined with correlation filters. In this paper, the visual features of a target object are extracted from a spatial-temporal residual network (STResNet) appearance model with two sub-networks. The STResNet appearance model learns separately spatial feature and temporal feature, respectively, so that we can effectively utilize spatial context around the surrounding of the target object in each frame and the temporal relationship between successive frames to refine the appearance representation of the target object. Finally, our spatiotemporal fusion feature from STResNet appearance model is incorporated into the correlation filter for robust visual object tracking. The experimental results show that our method achieves similar or better performance compared with the other tracking methods based on convolutional neural networks or correlation filter.
topic Spatiotemporal residual network
correlation filter
visual object tracking
deep learning
convolutional neural networks
url https://ieeexplore.ieee.org/document/8660630/
work_keys_str_mv AT zhengyuzhu stresnetx005fcftrackerthedeepspatiotemporalfeatureslearningforcorrelationfilterbasedrobustvisualobjecttracking
AT bingliu stresnetx005fcftrackerthedeepspatiotemporalfeatureslearningforcorrelationfilterbasedrobustvisualobjecttracking
AT yunborao stresnetx005fcftrackerthedeepspatiotemporalfeatureslearningforcorrelationfilterbasedrobustvisualobjecttracking
AT qiaoliu stresnetx005fcftrackerthedeepspatiotemporalfeatureslearningforcorrelationfilterbasedrobustvisualobjecttracking
AT ruizhang stresnetx005fcftrackerthedeepspatiotemporalfeatureslearningforcorrelationfilterbasedrobustvisualobjecttracking
_version_ 1724191809351974912