Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration

碩士 === 國立清華大學 === 資訊工程學系 === 104 === The goal of product recognition is to retrieve database images similar to the query image and then determine the product of the query based on the retrieved images. In product recognition, there are two issues, inter-product similarity and intra-product diversity...

Full description

Bibliographic Details
Main Authors: Huang, Chia Wan, 黃家琬
Other Authors: Hsu, Chiou Ting
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/85024293032152018215
id ndltd-TW-104NTHU5392123
record_format oai_dc
spelling ndltd-TW-104NTHU53921232017-07-16T04:29:26Z http://ndltd.ncl.edu.tw/handle/85024293032152018215 Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration 藉由深度學習預測商品類別再結合細部外觀以及局部組態之商品辨識 Huang, Chia Wan 黃家琬 碩士 國立清華大學 資訊工程學系 104 The goal of product recognition is to retrieve database images similar to the query image and then determine the product of the query based on the retrieved images. In product recognition, there are two issues, inter-product similarity and intra-product diversity, concerning the recognition performance. To address these two issues, we first introduce an intuitive multi-stage method which consists of three convolutional neural networks (CNN) and the proposed similarity measurement. Since the similarity estimation with all images in the database costs a great amount of time, we further propose a two-stage method which estimates similarity with the images under the predicted category of the query. In this scenario, we need to offline cluster products into categories. However, because traditional clustering methods extract features and cluster images separately without coordination, they usually end up with improper cluster assignment. To tackle the weakness of these methods, we propose two clustering schemes that iteratively refine the last few layers of Faster RCNN toward the best category assignment and feature representation. Next, to achieve an efficient recognition process, we execute the repetitive steps of part configuration comparison as well as feature extraction once in the two-stage method. For validation, we conduct experiments on SHORT and our own collected dataset, PRODUCT-100, taken under different variations. The experimental results show that the proposed two-stage method demonstrates promising performance with regard to both accuracy and efficiency. Hsu, Chiou Ting 許秋婷 2016 學位論文 ; thesis 52 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 104 === The goal of product recognition is to retrieve database images similar to the query image and then determine the product of the query based on the retrieved images. In product recognition, there are two issues, inter-product similarity and intra-product diversity, concerning the recognition performance. To address these two issues, we first introduce an intuitive multi-stage method which consists of three convolutional neural networks (CNN) and the proposed similarity measurement. Since the similarity estimation with all images in the database costs a great amount of time, we further propose a two-stage method which estimates similarity with the images under the predicted category of the query. In this scenario, we need to offline cluster products into categories. However, because traditional clustering methods extract features and cluster images separately without coordination, they usually end up with improper cluster assignment. To tackle the weakness of these methods, we propose two clustering schemes that iteratively refine the last few layers of Faster RCNN toward the best category assignment and feature representation. Next, to achieve an efficient recognition process, we execute the repetitive steps of part configuration comparison as well as feature extraction once in the two-stage method. For validation, we conduct experiments on SHORT and our own collected dataset, PRODUCT-100, taken under different variations. The experimental results show that the proposed two-stage method demonstrates promising performance with regard to both accuracy and efficiency.
author2 Hsu, Chiou Ting
author_facet Hsu, Chiou Ting
Huang, Chia Wan
黃家琬
author Huang, Chia Wan
黃家琬
spellingShingle Huang, Chia Wan
黃家琬
Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration
author_sort Huang, Chia Wan
title Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration
title_short Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration
title_full Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration
title_fullStr Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration
title_full_unstemmed Mobile Product Recognition with Deep Learning of Category Selection, Fine-grained Appearance and Part Configuration
title_sort mobile product recognition with deep learning of category selection, fine-grained appearance and part configuration
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/85024293032152018215
work_keys_str_mv AT huangchiawan mobileproductrecognitionwithdeeplearningofcategoryselectionfinegrainedappearanceandpartconfiguration
AT huángjiāwǎn mobileproductrecognitionwithdeeplearningofcategoryselectionfinegrainedappearanceandpartconfiguration
AT huangchiawan jíyóushēndùxuéxíyùcèshāngpǐnlèibiézàijiéhéxìbùwàiguānyǐjíjúbùzǔtàizhīshāngpǐnbiànshí
AT huángjiāwǎn jíyóushēndùxuéxíyùcèshāngpǐnlèibiézàijiéhéxìbùwàiguānyǐjíjúbùzǔtàizhīshāngpǐnbiànshí
_version_ 1718496806807011328