Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification
碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === Person Re-identification is a technique that uses computer vision techniques to determine whether a particular pedestrian is present in images or video sequence. It is widely believed to be a sub-question for image retrieval, given a camera of pedestrian images,...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/m3au2d |
id |
ndltd-TW-107NTUS5442112 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NTUS54421122019-10-24T05:20:28Z http://ndltd.ncl.edu.tw/handle/m3au2d Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification 行人重識別之人體部位金字塔多尺度池化特徵融合監督網路 Cing-Han Chou 周青翰 碩士 國立臺灣科技大學 電機工程系 107 Person Re-identification is a technique that uses computer vision techniques to determine whether a particular pedestrian is present in images or video sequence. It is widely believed to be a sub-question for image retrieval, given a camera of pedestrian images, the image of the pedestrian across the device is retrieved. In order to obtain pedestrian features with multi-scale and discriminative characteristics, this study proposes a Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network (PFMSNet). The multi-scale feature of the pedestrian part is extracted by the pyramid pooling module, and the multi-scale features are concatenated to make the network have a larger receptive field for the classification of body parts. However, the upsampling operation before the concatenate will result in the generation of noise. Therefore, using the SE (Squeeze-and-Excitation) Block network structure, the feature map is weighted by the channel attention, so that the noise and redundant features are filtered out and the important information is retained. Finally, the multi-scale feature independent classification task is added to make the network a bi-branch classification task model, which can further supervise multi-scale features and add more semantic information. The neural network model proposed in this study is trained and tested in the two datasets of Market1501 and DukeMTMC-reID. In the two datasets, Rank-1 reaches 94.7% and 87.7%, and mAP reaches 85.1% and 75.6%. Shun-Feng Su 蘇順豐 2019 學位論文 ; thesis 64 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === Person Re-identification is a technique that uses computer vision techniques to determine whether a particular pedestrian is present in images or video sequence. It is widely believed to be a sub-question for image retrieval, given a camera of pedestrian images, the image of the pedestrian across the device is retrieved. In order to obtain pedestrian features with multi-scale and discriminative characteristics, this study proposes a Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network (PFMSNet). The multi-scale feature of the pedestrian part is extracted by the pyramid pooling module, and the multi-scale features are concatenated to make the network have a larger receptive field for the classification of body parts. However, the upsampling operation before the concatenate will result in the generation of noise. Therefore, using the SE (Squeeze-and-Excitation) Block network structure, the feature map is weighted by the channel attention, so that the noise and redundant features are filtered out and the important information is retained. Finally, the multi-scale feature independent classification task is added to make the network a bi-branch classification task model, which can further supervise multi-scale features and add more semantic information. The neural network model proposed in this study is trained and tested in the two datasets of Market1501 and DukeMTMC-reID. In the two datasets, Rank-1 reaches 94.7% and 87.7%, and mAP reaches 85.1% and 75.6%.
|
author2 |
Shun-Feng Su |
author_facet |
Shun-Feng Su Cing-Han Chou 周青翰 |
author |
Cing-Han Chou 周青翰 |
spellingShingle |
Cing-Han Chou 周青翰 Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification |
author_sort |
Cing-Han Chou |
title |
Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification |
title_short |
Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification |
title_full |
Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification |
title_fullStr |
Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification |
title_full_unstemmed |
Part-based Pyramid Pooling Feature Fusion in Multi-Scale Supervised Network for Person Re-Identification |
title_sort |
part-based pyramid pooling feature fusion in multi-scale supervised network for person re-identification |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/m3au2d |
work_keys_str_mv |
AT cinghanchou partbasedpyramidpoolingfeaturefusioninmultiscalesupervisednetworkforpersonreidentification AT zhōuqīnghàn partbasedpyramidpoolingfeaturefusioninmultiscalesupervisednetworkforpersonreidentification AT cinghanchou xíngrénzhòngshíbiézhīréntǐbùwèijīnzìtǎduōchǐdùchíhuàtèzhēngrónghéjiāndūwǎnglù AT zhōuqīnghàn xíngrénzhòngshíbiézhīréntǐbùwèijīnzìtǎduōchǐdùchíhuàtèzhēngrónghéjiāndūwǎnglù |
_version_ |
1719277128575352832 |