Geometric property-based convolutional neural network for indoor object detection
Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometr...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2021-02-01
|
Series: | International Journal of Advanced Robotic Systems |
Online Access: | https://doi.org/10.1177/1729881421993323 |
id |
doaj-21ae5f12fadc4d988b67a6278829a0fb |
---|---|
record_format |
Article |
spelling |
doaj-21ae5f12fadc4d988b67a6278829a0fb2021-02-17T02:34:25ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142021-02-011810.1177/1729881421993323Geometric property-based convolutional neural network for indoor object detectionXintao Ding0Boquan Li1Jinbao Wang2 Anhui Province Key Laboratory of Network and Information Security, Wuhu, China School of Mathematics and Statistics, , Wuhu, China Anhui Province Key Laboratory of Network and Information Security, Wuhu, ChinaIndoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision.https://doi.org/10.1177/1729881421993323 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xintao Ding Boquan Li Jinbao Wang |
spellingShingle |
Xintao Ding Boquan Li Jinbao Wang Geometric property-based convolutional neural network for indoor object detection International Journal of Advanced Robotic Systems |
author_facet |
Xintao Ding Boquan Li Jinbao Wang |
author_sort |
Xintao Ding |
title |
Geometric property-based convolutional neural network for indoor object detection |
title_short |
Geometric property-based convolutional neural network for indoor object detection |
title_full |
Geometric property-based convolutional neural network for indoor object detection |
title_fullStr |
Geometric property-based convolutional neural network for indoor object detection |
title_full_unstemmed |
Geometric property-based convolutional neural network for indoor object detection |
title_sort |
geometric property-based convolutional neural network for indoor object detection |
publisher |
SAGE Publishing |
series |
International Journal of Advanced Robotic Systems |
issn |
1729-8814 |
publishDate |
2021-02-01 |
description |
Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision. |
url |
https://doi.org/10.1177/1729881421993323 |
work_keys_str_mv |
AT xintaoding geometricpropertybasedconvolutionalneuralnetworkforindoorobjectdetection AT boquanli geometricpropertybasedconvolutionalneuralnetworkforindoorobjectdetection AT jinbaowang geometricpropertybasedconvolutionalneuralnetworkforindoorobjectdetection |
_version_ |
1724265646834843648 |