Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance

碩士 === 國立臺灣大學 === 電信工程學研究所 === 105 === In recent years, Convolutional Neural Networks(CNNs) have gained a lot of popularity in computer vision. In this work, we will use convolutional neural networks for object detection. To start with, we use Single Shot Multibox Detector(SSD) [7] as our basic fram...

Full description

Bibliographic Details
Main Authors: Yao-Ren Chang, 張耀仁
Other Authors: 丁建均
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/tkguz5
id ndltd-TW-105NTU05435059
record_format oai_dc
spelling ndltd-TW-105NTU054350592019-05-15T23:39:38Z http://ndltd.ncl.edu.tw/handle/tkguz5 Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance 改進單次多框偵測器架構與後處理 Yao-Ren Chang 張耀仁 碩士 國立臺灣大學 電信工程學研究所 105 In recent years, Convolutional Neural Networks(CNNs) have gained a lot of popularity in computer vision. In this work, we will use convolutional neural networks for object detection. To start with, we use Single Shot Multibox Detector(SSD) [7] as our basic framework. We impose Feature Pyramid Networks on SSD to combine local and global information. We also adjust postprocessing with bounding box voting for better localization. For comparison, we test our model on Pascal VOC 2007 test dataset. During evaluation, we calculate Average Precision(AP) for each model and class. Then, we average each AP to get mean Average Precision(mAP) as our final evaluation metric. With original SSD, we can have 77.21% mAP in Pascal VOC 2007 test dataset. In this thesis, our simulation results show that, the proposed method outperforms the original SSD and has better performance for object detection. Our final model can achieve 77.85% mAP. 丁建均 2017 學位論文 ; thesis 63 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 電信工程學研究所 === 105 === In recent years, Convolutional Neural Networks(CNNs) have gained a lot of popularity in computer vision. In this work, we will use convolutional neural networks for object detection. To start with, we use Single Shot Multibox Detector(SSD) [7] as our basic framework. We impose Feature Pyramid Networks on SSD to combine local and global information. We also adjust postprocessing with bounding box voting for better localization. For comparison, we test our model on Pascal VOC 2007 test dataset. During evaluation, we calculate Average Precision(AP) for each model and class. Then, we average each AP to get mean Average Precision(mAP) as our final evaluation metric. With original SSD, we can have 77.21% mAP in Pascal VOC 2007 test dataset. In this thesis, our simulation results show that, the proposed method outperforms the original SSD and has better performance for object detection. Our final model can achieve 77.85% mAP.
author2 丁建均
author_facet 丁建均
Yao-Ren Chang
張耀仁
author Yao-Ren Chang
張耀仁
spellingShingle Yao-Ren Chang
張耀仁
Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance
author_sort Yao-Ren Chang
title Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance
title_short Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance
title_full Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance
title_fullStr Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance
title_full_unstemmed Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance
title_sort improving single shot multibox detector with feature pyramid networks structure and postprocessing for better object detection performance
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/tkguz5
work_keys_str_mv AT yaorenchang improvingsingleshotmultiboxdetectorwithfeaturepyramidnetworksstructureandpostprocessingforbetterobjectdetectionperformance
AT zhāngyàorén improvingsingleshotmultiboxdetectorwithfeaturepyramidnetworksstructureandpostprocessingforbetterobjectdetectionperformance
AT yaorenchang gǎijìndāncìduōkuāngzhēncèqìjiàgòuyǔhòuchùlǐ
AT zhāngyàorén gǎijìndāncìduōkuāngzhēncèqìjiàgòuyǔhòuchùlǐ
_version_ 1719151851670077440