Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes

碩士 === 國立清華大學 === 資訊工程學系所 === 105 === Computer vision is important for autonomous cars to detect or track the object nearby such as people, vehicles or animals. However, there are many problems in visual tracking including illumination variation, deformation and occlusion, etc. To deal with these co...

Full description

Bibliographic Details
Main Authors: Lu, Ming-En, 呂明恩
Other Authors: Wang, Jia-Shung
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/rcmks6
id ndltd-TW-105NTHU5392085
record_format oai_dc
spelling ndltd-TW-105NTHU53920852019-05-15T23:53:47Z http://ndltd.ncl.edu.tw/handle/rcmks6 Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes 以Edge-boxes為基礎的卷積神經網路方法應用於交通場景多目標追蹤 Lu, Ming-En 呂明恩 碩士 國立清華大學 資訊工程學系所 105 Computer vision is important for autonomous cars to detect or track the object nearby such as people, vehicles or animals. However, there are many problems in visual tracking including illumination variation, deformation and occlusion, etc. To deal with these complicated problems, the appearance model is utilized to describe the target, and the discriminative model is adopted to classify a candidate whether it is the target or the background. In this thesis, we propose a tracking method based on Edge-boxes, which provides a small but high-quality set of proposals based on edges. In addition, a pre-trained Convolutional Neural Network (CNN) is used to extract the feature from an image patch, and then compute the cost functions of the appearance, motion and size. With these costs, online Support Vector Machine (SVM) is adopted to be a classifier instead of simple computation of the sum of the costs. Finally, we maintain our tracker by updating templates, predicted state, and SVM. The experimental results demonstrate that the proposed method performs well in videos existing many challenges. Since the pre-trained CNN extracts general features of targets, the illumination variation and deformation problems can be easily solved, and the success rate can be up to 96% under the overlap threshold 0.5. The tracking failure is caused by occlusion in the videos can be also avoided due to high-quality proposals generated by Edge-boxes and templates stored in previous frames, and the MOTA can be up to 95.238%. Wang, Jia-Shung 王家祥 2017 學位論文 ; thesis 42 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系所 === 105 === Computer vision is important for autonomous cars to detect or track the object nearby such as people, vehicles or animals. However, there are many problems in visual tracking including illumination variation, deformation and occlusion, etc. To deal with these complicated problems, the appearance model is utilized to describe the target, and the discriminative model is adopted to classify a candidate whether it is the target or the background. In this thesis, we propose a tracking method based on Edge-boxes, which provides a small but high-quality set of proposals based on edges. In addition, a pre-trained Convolutional Neural Network (CNN) is used to extract the feature from an image patch, and then compute the cost functions of the appearance, motion and size. With these costs, online Support Vector Machine (SVM) is adopted to be a classifier instead of simple computation of the sum of the costs. Finally, we maintain our tracker by updating templates, predicted state, and SVM. The experimental results demonstrate that the proposed method performs well in videos existing many challenges. Since the pre-trained CNN extracts general features of targets, the illumination variation and deformation problems can be easily solved, and the success rate can be up to 96% under the overlap threshold 0.5. The tracking failure is caused by occlusion in the videos can be also avoided due to high-quality proposals generated by Edge-boxes and templates stored in previous frames, and the MOTA can be up to 95.238%.
author2 Wang, Jia-Shung
author_facet Wang, Jia-Shung
Lu, Ming-En
呂明恩
author Lu, Ming-En
呂明恩
spellingShingle Lu, Ming-En
呂明恩
Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes
author_sort Lu, Ming-En
title Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes
title_short Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes
title_full Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes
title_fullStr Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes
title_full_unstemmed Edge-boxes based Convolutional Neural Networks Approach for Multi-Target Tracking in Traffic Scenes
title_sort edge-boxes based convolutional neural networks approach for multi-target tracking in traffic scenes
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/rcmks6
work_keys_str_mv AT lumingen edgeboxesbasedconvolutionalneuralnetworksapproachformultitargettrackingintrafficscenes
AT lǚmíngēn edgeboxesbasedconvolutionalneuralnetworksapproachformultitargettrackingintrafficscenes
AT lumingen yǐedgeboxeswèijīchǔdejuǎnjīshénjīngwǎnglùfāngfǎyīngyòngyújiāotōngchǎngjǐngduōmùbiāozhuīzōng
AT lǚmíngēn yǐedgeboxeswèijīchǔdejuǎnjīshénjīngwǎnglùfāngfǎyīngyòngyújiāotōngchǎngjǐngduōmùbiāozhuīzōng
_version_ 1719157861329666048