DiffNet: A Learning to Compare Deep Network for Product Recognition
The paper focuses on the identification of different objects in a pair of images taken from the same environment, which is challenging and has wide application. We propose a single deep convolutional neural network termed as DiffNet to solve this problem. DiffNet takes a pair of images as the input...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8962053/ |
id |
doaj-670cdf849a374010bd26e66298491633 |
---|---|
record_format |
Article |
spelling |
doaj-670cdf849a374010bd26e662984916332021-03-30T02:52:01ZengIEEEIEEE Access2169-35362020-01-018193361934410.1109/ACCESS.2020.29670908962053DiffNet: A Learning to Compare Deep Network for Product RecognitionBin Hu0Nuoya Zhou1Qiang Zhou2Xinggang Wang3Wenyu Liu4https://orcid.org/0000-0002-4582-7488School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaSchool of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, ChinaThe paper focuses on the identification of different objects in a pair of images taken from the same environment, which is challenging and has wide application. We propose a single deep convolutional neural network termed as DiffNet to solve this problem. DiffNet takes a pair of images as the input and directly regresses the bounding boxes of different objects. To train DiffNet, we only need to label the different objects, rather than all objects in input images, which significantly reduces human labeling efforts. Experiments are performed on an image dataset collected from unmanned containers. DiffNet obtains a very high product detection accuracy of 95.56% mAP at the speed of 143 fps measured on an NVIDIA TITAN Xp GPU.https://ieeexplore.ieee.org/document/8962053/Product recognitionobject detectionDiffNetSiamese network |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Bin Hu Nuoya Zhou Qiang Zhou Xinggang Wang Wenyu Liu |
spellingShingle |
Bin Hu Nuoya Zhou Qiang Zhou Xinggang Wang Wenyu Liu DiffNet: A Learning to Compare Deep Network for Product Recognition IEEE Access Product recognition object detection DiffNet Siamese network |
author_facet |
Bin Hu Nuoya Zhou Qiang Zhou Xinggang Wang Wenyu Liu |
author_sort |
Bin Hu |
title |
DiffNet: A Learning to Compare Deep Network for Product Recognition |
title_short |
DiffNet: A Learning to Compare Deep Network for Product Recognition |
title_full |
DiffNet: A Learning to Compare Deep Network for Product Recognition |
title_fullStr |
DiffNet: A Learning to Compare Deep Network for Product Recognition |
title_full_unstemmed |
DiffNet: A Learning to Compare Deep Network for Product Recognition |
title_sort |
diffnet: a learning to compare deep network for product recognition |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
The paper focuses on the identification of different objects in a pair of images taken from the same environment, which is challenging and has wide application. We propose a single deep convolutional neural network termed as DiffNet to solve this problem. DiffNet takes a pair of images as the input and directly regresses the bounding boxes of different objects. To train DiffNet, we only need to label the different objects, rather than all objects in input images, which significantly reduces human labeling efforts. Experiments are performed on an image dataset collected from unmanned containers. DiffNet obtains a very high product detection accuracy of 95.56% mAP at the speed of 143 fps measured on an NVIDIA TITAN Xp GPU. |
topic |
Product recognition object detection DiffNet Siamese network |
url |
https://ieeexplore.ieee.org/document/8962053/ |
work_keys_str_mv |
AT binhu diffnetalearningtocomparedeepnetworkforproductrecognition AT nuoyazhou diffnetalearningtocomparedeepnetworkforproductrecognition AT qiangzhou diffnetalearningtocomparedeepnetworkforproductrecognition AT xinggangwang diffnetalearningtocomparedeepnetworkforproductrecognition AT wenyuliu diffnetalearningtocomparedeepnetworkforproductrecognition |
_version_ |
1724184416783171584 |