Summary: | 碩士 === 國立臺灣大學 === 電信工程學研究所 === 105 === In recent years, Convolutional Neural Networks(CNNs) have gained a lot of popularity in computer vision. In this work, we will use convolutional neural networks for object detection. To start with, we use Single Shot Multibox Detector(SSD) [7] as our basic framework. We impose Feature Pyramid Networks on SSD to combine local and global information. We also adjust postprocessing with bounding box voting for better localization. For comparison, we test our model on Pascal VOC 2007 test dataset. During evaluation, we calculate Average Precision(AP) for each model and class. Then, we average each AP to get mean Average Precision(mAP) as our final evaluation metric. With original SSD, we can have 77.21% mAP in Pascal VOC 2007 test dataset. In this thesis, our simulation results show that, the proposed method outperforms the original SSD and has better performance for object detection. Our final model can achieve 77.85% mAP.
|