Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification

碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === Person Re-identification is a technique that uses computer vision techniques to determine whether a particular pedestrian is present in images or video sequence. It is widely believed to be a sub-question for image retrieval, given a camera of pedestrian images,...

Full description

Bibliographic Details
Main Authors: Cing-Han Chou, 周青翰
Other Authors: Shun-Feng Su
Format: Others
Language:en_US
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/m3au2d
id ndltd-TW-107NTUS5442112
record_format oai_dc
spelling ndltd-TW-107NTUS54421122019-10-24T05:20:28Z http://ndltd.ncl.edu.tw/handle/m3au2d Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification 行人重識別之人體部位金字塔多尺度池化特徵融合監督網路 Cing-Han Chou 周青翰 碩士 國立臺灣科技大學 電機工程系 107 Person Re-identification is a technique that uses computer vision techniques to determine whether a particular pedestrian is present in images or video sequence. It is widely believed to be a sub-question for image retrieval, given a camera of pedestrian images, the image of the pedestrian across the device is retrieved. In order to obtain pedestrian features with multi-scale and discriminative characteristics, this study proposes a Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network (PFMSNet). The multi-scale feature of the pedestrian part is extracted by the pyramid pooling module, and the multi-scale features are concatenated to make the network have a larger receptive field for the classification of body parts. However, the upsampling operation before the concatenate will result in the generation of noise. Therefore, using the SE (Squeeze-and-Excitation) Block network structure, the feature map is weighted by the channel attention, so that the noise and redundant features are filtered out and the important information is retained. Finally, the multi-scale feature independent classification task is added to make the network a bi-branch classification task model, which can further supervise multi-scale features and add more semantic information. The neural network model proposed in this study is trained and tested in the two datasets of Market1501 and DukeMTMC-reID. In the two datasets, Rank-1 reaches 94.7% and 87.7%, and mAP reaches 85.1% and 75.6%. Shun-Feng Su 蘇順豐 2019 學位論文 ; thesis 64 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === Person Re-identification is a technique that uses computer vision techniques to determine whether a particular pedestrian is present in images or video sequence. It is widely believed to be a sub-question for image retrieval, given a camera of pedestrian images, the image of the pedestrian across the device is retrieved. In order to obtain pedestrian features with multi-scale and discriminative characteristics, this study proposes a Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network (PFMSNet). The multi-scale feature of the pedestrian part is extracted by the pyramid pooling module, and the multi-scale features are concatenated to make the network have a larger receptive field for the classification of body parts. However, the upsampling operation before the concatenate will result in the generation of noise. Therefore, using the SE (Squeeze-and-Excitation) Block network structure, the feature map is weighted by the channel attention, so that the noise and redundant features are filtered out and the important information is retained. Finally, the multi-scale feature independent classification task is added to make the network a bi-branch classification task model, which can further supervise multi-scale features and add more semantic information. The neural network model proposed in this study is trained and tested in the two datasets of Market1501 and DukeMTMC-reID. In the two datasets, Rank-1 reaches 94.7% and 87.7%, and mAP reaches 85.1% and 75.6%.
author2 Shun-Feng Su
author_facet Shun-Feng Su
Cing-Han Chou
周青翰
author Cing-Han Chou
周青翰
spellingShingle Cing-Han Chou
周青翰
Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification
author_sort Cing-Han Chou
title Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification
title_short Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification
title_full Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification
title_fullStr Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification
title_full_unstemmed Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification
title_sort part-based pyramid pooling feature fusion in multi-scale supervised network for person re-identification
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/m3au2d
work_keys_str_mv AT cinghanchou partbasedpyramidpoolingfeaturefusioninmultiscalesupervisednetworkforpersonreidentification
AT zhōuqīnghàn partbasedpyramidpoolingfeaturefusioninmultiscalesupervisednetworkforpersonreidentification
AT cinghanchou xíngrénzhòngshíbiézhīréntǐbùwèijīnzìtǎduōchǐdùchíhuàtèzhēngrónghéjiāndūwǎnglù
AT zhōuqīnghàn xíngrénzhòngshíbiézhīréntǐbùwèijīnzìtǎduōchǐdùchíhuàtèzhēngrónghéjiāndūwǎnglù
_version_ 1719277128575352832