VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK

Despite having achieved good performance, visual tracking is still an open area of research, especially when target undergoes serious appearance changes which are not included in the model. So, in this paper, we replace the appearance model by a concept model which is learned from large-scale data...

Full description

Bibliographic Details
Main Authors: C. Xiao, A. Yilmaz, S. Lia
Format: Article
Language:English
Published: Copernicus Publications 2017-05-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-1-W1/125/2017/isprs-annals-IV-1-W1-125-2017.pdf
id doaj-e70cdbfd09b64e2eb8fb687e973bf0da
record_format Article
spelling doaj-e70cdbfd09b64e2eb8fb687e973bf0da2020-11-24T21:05:30ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502017-05-01IV-1-W112513210.5194/isprs-annals-IV-1-W1-125-2017VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORKC. Xiao0A. Yilmaz1S. Lia2S. Lia3Photogrammetric Computer Vision Laboratory, The Ohio State University, USAPhotogrammetric Computer Vision Laboratory, The Ohio State University, USAPhotogrammetric Computer Vision Laboratory, The Ohio State University, USAInstitute of Computing Technology, Chinese Academy of Sciences, Beijing, ChinaDespite having achieved good performance, visual tracking is still an open area of research, especially when target undergoes serious appearance changes which are not included in the model. So, in this paper, we replace the appearance model by a concept model which is learned from large-scale datasets using a deep learning network. The concept model is a combination of high-level semantic information that is learned from myriads of objects with various appearances. In our tracking method, we generate the target’s concept by combining the learned object concepts from classification task. We also demonstrate that the last convolutional feature map can be used to generate a heat map to highlight the possible location of the given target in new frames. Finally, in the proposed tracking framework, we utilize the target image, the search image cropped from the new frame and their heat maps as input into a localization network to find the final target position. Compared to the other state-of-the-art trackers, the proposed method shows the comparable and at times better performance in real-time.http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-1-W1/125/2017/isprs-annals-IV-1-W1-125-2017.pdf
collection DOAJ
language English
format Article
sources DOAJ
author C. Xiao
A. Yilmaz
S. Lia
S. Lia
spellingShingle C. Xiao
A. Yilmaz
S. Lia
S. Lia
VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
author_facet C. Xiao
A. Yilmaz
S. Lia
S. Lia
author_sort C. Xiao
title VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK
title_short VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK
title_full VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK
title_fullStr VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK
title_full_unstemmed VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK
title_sort visual tracking utilizing object concept from deep learning network
publisher Copernicus Publications
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
issn 2194-9042
2194-9050
publishDate 2017-05-01
description Despite having achieved good performance, visual tracking is still an open area of research, especially when target undergoes serious appearance changes which are not included in the model. So, in this paper, we replace the appearance model by a concept model which is learned from large-scale datasets using a deep learning network. The concept model is a combination of high-level semantic information that is learned from myriads of objects with various appearances. In our tracking method, we generate the target’s concept by combining the learned object concepts from classification task. We also demonstrate that the last convolutional feature map can be used to generate a heat map to highlight the possible location of the given target in new frames. Finally, in the proposed tracking framework, we utilize the target image, the search image cropped from the new frame and their heat maps as input into a localization network to find the final target position. Compared to the other state-of-the-art trackers, the proposed method shows the comparable and at times better performance in real-time.
url http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-1-W1/125/2017/isprs-annals-IV-1-W1-125-2017.pdf
work_keys_str_mv AT cxiao visualtrackingutilizingobjectconceptfromdeeplearningnetwork
AT ayilmaz visualtrackingutilizingobjectconceptfromdeeplearningnetwork
AT slia visualtrackingutilizingobjectconceptfromdeeplearningnetwork
AT slia visualtrackingutilizingobjectconceptfromdeeplearningnetwork
_version_ 1716768544589348864