Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance
碩士 === 國立臺灣大學 === 電信工程學研究所 === 105 === In recent years, Convolutional Neural Networks(CNNs) have gained a lot of popularity in computer vision. In this work, we will use convolutional neural networks for object detection. To start with, we use Single Shot Multibox Detector(SSD) [7] as our basic fram...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/tkguz5 |
id |
ndltd-TW-105NTU05435059 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-105NTU054350592019-05-15T23:39:38Z http://ndltd.ncl.edu.tw/handle/tkguz5 Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance 改進單次多框偵測器架構與後處理 Yao-Ren Chang 張耀仁 碩士 國立臺灣大學 電信工程學研究所 105 In recent years, Convolutional Neural Networks(CNNs) have gained a lot of popularity in computer vision. In this work, we will use convolutional neural networks for object detection. To start with, we use Single Shot Multibox Detector(SSD) [7] as our basic framework. We impose Feature Pyramid Networks on SSD to combine local and global information. We also adjust postprocessing with bounding box voting for better localization. For comparison, we test our model on Pascal VOC 2007 test dataset. During evaluation, we calculate Average Precision(AP) for each model and class. Then, we average each AP to get mean Average Precision(mAP) as our final evaluation metric. With original SSD, we can have 77.21% mAP in Pascal VOC 2007 test dataset. In this thesis, our simulation results show that, the proposed method outperforms the original SSD and has better performance for object detection. Our final model can achieve 77.85% mAP. 丁建均 2017 學位論文 ; thesis 63 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 電信工程學研究所 === 105 === In recent years, Convolutional Neural Networks(CNNs) have gained a lot of popularity in computer vision. In this work, we will use convolutional neural networks for object detection. To start with, we use Single Shot Multibox Detector(SSD) [7] as our basic framework. We impose Feature Pyramid Networks on SSD to combine local and global information. We also adjust postprocessing with bounding box voting for better localization. For comparison, we test our model on Pascal VOC 2007 test dataset. During evaluation, we calculate Average Precision(AP) for each model and class. Then, we average each AP to get mean Average Precision(mAP) as our final evaluation metric. With original SSD, we can have 77.21% mAP in Pascal VOC 2007 test dataset. In this thesis, our simulation results show that, the proposed method outperforms the original SSD and has better performance for object detection. Our final model can achieve 77.85% mAP.
|
author2 |
丁建均 |
author_facet |
丁建均 Yao-Ren Chang 張耀仁 |
author |
Yao-Ren Chang 張耀仁 |
spellingShingle |
Yao-Ren Chang 張耀仁 Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance |
author_sort |
Yao-Ren Chang |
title |
Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance |
title_short |
Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance |
title_full |
Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance |
title_fullStr |
Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance |
title_full_unstemmed |
Improving Single Shot Multibox Detector with Feature Pyramid Networks Structure and Postprocessing for Better Object Detection Performance |
title_sort |
improving single shot multibox detector with feature pyramid networks structure and postprocessing for better object detection performance |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/tkguz5 |
work_keys_str_mv |
AT yaorenchang improvingsingleshotmultiboxdetectorwithfeaturepyramidnetworksstructureandpostprocessingforbetterobjectdetectionperformance AT zhāngyàorén improvingsingleshotmultiboxdetectorwithfeaturepyramidnetworksstructureandpostprocessingforbetterobjectdetectionperformance AT yaorenchang gǎijìndāncìduōkuāngzhēncèqìjiàgòuyǔhòuchùlǐ AT zhāngyàorén gǎijìndāncìduōkuāngzhēncèqìjiàgòuyǔhòuchùlǐ |
_version_ |
1719151851670077440 |