DiffNet: A Learning to Compare Deep Network for Product Recognition

The paper focuses on the identification of different objects in a pair of images taken from the same environment, which is challenging and has wide application. We propose a single deep convolutional neural network termed as DiffNet to solve this problem. DiffNet takes a pair of images as the input...

Full description

Bibliographic Details
Main Authors:	Bin Hu, Nuoya Zhou, Qiang Zhou, Xinggang Wang, Wenyu Liu
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Product recognition object detection DiffNet Siamese network
Online Access:	https://ieeexplore.ieee.org/document/8962053/

id	doaj-670cdf849a374010bd26e66298491633
record_format	Article
spelling	doaj-670cdf849a374010bd26e662984916332021-03-30T02:52:01ZengIEEEIEEE Access2169-35362020-01-018193361934410.1109/ACCESS.2020.29670908962053DiffNet: A Learning to Compare Deep Network for Product RecognitionBin Hu0Nuoya Zhou1Qiang Zhou2Xinggang Wang3Wenyu Liu4https://orcid.org/0000-0002-4582-7488School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaThe paper focuses on the identification of different objects in a pair of images taken from the same environment, which is challenging and has wide application. We propose a single deep convolutional neural network termed as DiffNet to solve this problem. DiffNet takes a pair of images as the input and directly regresses the bounding boxes of different objects. To train DiffNet, we only need to label the different objects, rather than all objects in input images, which significantly reduces human labeling efforts. Experiments are performed on an image dataset collected from unmanned containers. DiffNet obtains a very high product detection accuracy of 95.56% mAP at the speed of 143 fps measured on an NVIDIA TITAN Xp GPU.https://ieeexplore.ieee.org/document/8962053/Product recognitionobject detectionDiffNetSiamese network
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Bin Hu Nuoya Zhou Qiang Zhou Xinggang Wang Wenyu Liu
spellingShingle	Bin Hu Nuoya Zhou Qiang Zhou Xinggang Wang Wenyu Liu DiffNet: A Learning to Compare Deep Network for Product Recognition IEEE Access Product recognition object detection DiffNet Siamese network
author_facet	Bin Hu Nuoya Zhou Qiang Zhou Xinggang Wang Wenyu Liu
author_sort	Bin Hu
title	DiffNet: A Learning to Compare Deep Network for Product Recognition
title_short	DiffNet: A Learning to Compare Deep Network for Product Recognition
title_full	DiffNet: A Learning to Compare Deep Network for Product Recognition
title_fullStr	DiffNet: A Learning to Compare Deep Network for Product Recognition
title_full_unstemmed	DiffNet: A Learning to Compare Deep Network for Product Recognition
title_sort	diffnet: a learning to compare deep network for product recognition
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	The paper focuses on the identification of different objects in a pair of images taken from the same environment, which is challenging and has wide application. We propose a single deep convolutional neural network termed as DiffNet to solve this problem. DiffNet takes a pair of images as the input and directly regresses the bounding boxes of different objects. To train DiffNet, we only need to label the different objects, rather than all objects in input images, which significantly reduces human labeling efforts. Experiments are performed on an image dataset collected from unmanned containers. DiffNet obtains a very high product detection accuracy of 95.56% mAP at the speed of 143 fps measured on an NVIDIA TITAN Xp GPU.
topic	Product recognition object detection DiffNet Siamese network
url	https://ieeexplore.ieee.org/document/8962053/
work_keys_str_mv	AT binhu diffnetalearningtocomparedeepnetworkforproductrecognition AT nuoyazhou diffnetalearningtocomparedeepnetworkforproductrecognition AT qiangzhou diffnetalearningtocomparedeepnetworkforproductrecognition AT xinggangwang diffnetalearningtocomparedeepnetworkforproductrecognition AT wenyuliu diffnetalearningtocomparedeepnetworkforproductrecognition
_version_	1724184416783171584

DiffNet: A Learning to Compare Deep Network for Product Recognition

Similar Items