Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration
碩士 === 國立清華大學 === 資訊工程學系 === 104 === The goal of product recognition is to retrieve database images similar to the query image and then determine the product of the query based on the retrieved images. In product recognition, there are two issues, inter-product similarity and intra-product diversity...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/85024293032152018215 |
id |
ndltd-TW-104NTHU5392123 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NTHU53921232017-07-16T04:29:26Z http://ndltd.ncl.edu.tw/handle/85024293032152018215 Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration 藉由深度學習預測商品類別再結合細部外觀以及局部組態之商品辨識 Huang, Chia Wan 黃家琬 碩士 國立清華大學 資訊工程學系 104 The goal of product recognition is to retrieve database images similar to the query image and then determine the product of the query based on the retrieved images. In product recognition, there are two issues, inter-product similarity and intra-product diversity, concerning the recognition performance. To address these two issues, we first introduce an intuitive multi-stage method which consists of three convolutional neural networks (CNN) and the proposed similarity measurement. Since the similarity estimation with all images in the database costs a great amount of time, we further propose a two-stage method which estimates similarity with the images under the predicted category of the query. In this scenario, we need to offline cluster products into categories. However, because traditional clustering methods extract features and cluster images separately without coordination, they usually end up with improper cluster assignment. To tackle the weakness of these methods, we propose two clustering schemes that iteratively refine the last few layers of Faster RCNN toward the best category assignment and feature representation. Next, to achieve an efficient recognition process, we execute the repetitive steps of part configuration comparison as well as feature extraction once in the two-stage method. For validation, we conduct experiments on SHORT and our own collected dataset, PRODUCT-100, taken under different variations. The experimental results show that the proposed two-stage method demonstrates promising performance with regard to both accuracy and efficiency. Hsu, Chiou Ting 許秋婷 2016 學位論文 ; thesis 52 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系 === 104 === The goal of product recognition is to retrieve database images similar to the query
image and then determine the product of the query based on the retrieved images. In
product recognition, there are two issues, inter-product similarity and intra-product
diversity, concerning the recognition performance. To address these two issues, we
first introduce an intuitive multi-stage method which consists of three convolutional
neural networks (CNN) and the proposed similarity measurement. Since the similarity
estimation with all images in the database costs a great amount of time, we further
propose a two-stage method which estimates similarity with the images under the
predicted category of the query. In this scenario, we need to offline cluster products
into categories. However, because traditional clustering methods extract features and
cluster images separately without coordination, they usually end up with improper
cluster assignment. To tackle the weakness of these methods, we propose two
clustering schemes that iteratively refine the last few layers of Faster RCNN toward
the best category assignment and feature representation. Next, to achieve an efficient
recognition process, we execute the repetitive steps of part configuration comparison
as well as feature extraction once in the two-stage method. For validation, we conduct
experiments on SHORT and our own collected dataset, PRODUCT-100, taken under
different variations. The experimental results show that the proposed two-stage
method demonstrates promising performance with regard to both accuracy and
efficiency.
|
author2 |
Hsu, Chiou Ting |
author_facet |
Hsu, Chiou Ting Huang, Chia Wan 黃家琬 |
author |
Huang, Chia Wan 黃家琬 |
spellingShingle |
Huang, Chia Wan 黃家琬 Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration |
author_sort |
Huang, Chia Wan |
title |
Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration |
title_short |
Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration |
title_full |
Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration |
title_fullStr |
Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration |
title_full_unstemmed |
Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration |
title_sort |
mobile product recognition with deep learning of category selection, fine-grained appearance and part configuration |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/85024293032152018215 |
work_keys_str_mv |
AT huangchiawan mobileproductrecognitionwithdeeplearningofcategoryselectionfinegrainedappearanceandpartconfiguration AT huángjiāwǎn mobileproductrecognitionwithdeeplearningofcategoryselectionfinegrainedappearanceandpartconfiguration AT huangchiawan jíyóushēndùxuéxíyùcèshāngpǐnlèibiézàijiéhéxìbùwàiguānyǐjíjúbùzǔtàizhīshāngpǐnbiànshí AT huángjiāwǎn jíyóushēndùxuéxíyùcèshāngpǐnlèibiézàijiéhéxìbùwàiguānyǐjíjúbùzǔtàizhīshāngpǐnbiànshí |
_version_ |
1718496806807011328 |