Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network

Region proposal network (RPN) based trackers employ the classification and regression block to generate the proposals, the proposal that contains the highest similarity score is formulated to be the groundtruth candidate of next frame. However, region proposal network based trackers cannot make the...

Full description

Bibliographic Details
Main Authors:	Ximing Zhang, Shujuan Luo, Xuewu Fan
Format:	Article
Language:	English
Published:	MDPI AG 2020-08-01
Series:	Sensors
Subjects:	visual tracking spatial cascaded networks shrinkage loss multi-cue proposals re-ranking region proposals networks
Online Access:	https://www.mdpi.com/1424-8220/20/17/4810

id	doaj-061d4911870349659c2cf27746dbd8c8
record_format	Article
spelling	doaj-061d4911870349659c2cf27746dbd8c82020-11-25T03:01:29ZengMDPI AGSensors1424-82202020-08-01204810481010.3390/s20174810Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal NetworkXiming Zhang0Shujuan Luo1Xuewu Fan2Faculty of Space, Xi’an Institute of Optics and Precision Mechanics of CAS, Xi’an 710119, ChinaSchool of Astronautics, Northwestern Polytechnical Universty, Xi’an 710072, ChinaFaculty of Space, Xi’an Institute of Optics and Precision Mechanics of CAS, Xi’an 710119, ChinaRegion proposal network (RPN) based trackers employ the classification and regression block to generate the proposals, the proposal that contains the highest similarity score is formulated to be the groundtruth candidate of next frame. However, region proposal network based trackers cannot make the best of the features from different convolutional layers, and the original loss function cannot alleviate the data imbalance issue of the training procedure. We propose the Spatial Cascaded Transformed RPN to combine the RPN and STN (spatial transformer network) together, in order to successfully obtain the proposals of high quality, which can simultaneously improves the robustness. The STN can transfer the spatial transformed features though different stages, which extends the spatial representation capability of such networks handling complex scenarios such as scale variation and affine transformation. We break the restriction though an easy samples penalization loss (shrinkage loss) instead of smooth L1 function. Moreover, we perform the multi-cue proposals re-ranking to guarantee the accuracy of the proposed tracker. We extensively prove the effectiveness of our proposed method on the ablation studies of the tracking datasets, which include OTB-2015 (Object Tracking Benchmark 2015), VOT-2018 (Visual Object Tracking 2018), LaSOT (Large Scale Single Object Tracking), TrackingNet (A Large-Scale Dataset and Benchmark for Object Tracking in the Wild) and UAV123 (UAV Tracking Dataset).https://www.mdpi.com/1424-8220/20/17/4810visual trackingspatial cascaded networksshrinkage lossmulti-cue proposals re-rankingregion proposals networks
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Ximing Zhang Shujuan Luo Xuewu Fan
spellingShingle	Ximing Zhang Shujuan Luo Xuewu Fan Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network Sensors visual tracking spatial cascaded networks shrinkage loss multi-cue proposals re-ranking region proposals networks
author_facet	Ximing Zhang Shujuan Luo Xuewu Fan
author_sort	Ximing Zhang
title	Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network
title_short	Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network
title_full	Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network
title_fullStr	Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network
title_full_unstemmed	Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network
title_sort	proposal-based visual tracking using spatial cascaded transformed region proposal network
publisher	MDPI AG
series	Sensors
issn	1424-8220
publishDate	2020-08-01
description	Region proposal network (RPN) based trackers employ the classification and regression block to generate the proposals, the proposal that contains the highest similarity score is formulated to be the groundtruth candidate of next frame. However, region proposal network based trackers cannot make the best of the features from different convolutional layers, and the original loss function cannot alleviate the data imbalance issue of the training procedure. We propose the Spatial Cascaded Transformed RPN to combine the RPN and STN (spatial transformer network) together, in order to successfully obtain the proposals of high quality, which can simultaneously improves the robustness. The STN can transfer the spatial transformed features though different stages, which extends the spatial representation capability of such networks handling complex scenarios such as scale variation and affine transformation. We break the restriction though an easy samples penalization loss (shrinkage loss) instead of smooth L1 function. Moreover, we perform the multi-cue proposals re-ranking to guarantee the accuracy of the proposed tracker. We extensively prove the effectiveness of our proposed method on the ablation studies of the tracking datasets, which include OTB-2015 (Object Tracking Benchmark 2015), VOT-2018 (Visual Object Tracking 2018), LaSOT (Large Scale Single Object Tracking), TrackingNet (A Large-Scale Dataset and Benchmark for Object Tracking in the Wild) and UAV123 (UAV Tracking Dataset).
topic	visual tracking spatial cascaded networks shrinkage loss multi-cue proposals re-ranking region proposals networks
url	https://www.mdpi.com/1424-8220/20/17/4810
work_keys_str_mv	AT ximingzhang proposalbasedvisualtrackingusingspatialcascadedtransformedregionproposalnetwork AT shujuanluo proposalbasedvisualtrackingusingspatialcascadedtransformedregionproposalnetwork AT xuewufan proposalbasedvisualtrackingusingspatialcascadedtransformedregionproposalnetwork
_version_	1724693543998455808

Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network

Similar Items