Geometric property-based convolutional neural network for indoor object detection

Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometr...

Full description

Bibliographic Details
Main Authors: Xintao Ding, Boquan Li, Jinbao Wang
Format: Article
Language:English
Published: SAGE Publishing 2021-02-01
Series:International Journal of Advanced Robotic Systems
Online Access:https://doi.org/10.1177/1729881421993323
id doaj-21ae5f12fadc4d988b67a6278829a0fb
record_format Article
spelling doaj-21ae5f12fadc4d988b67a6278829a0fb2021-02-17T02:34:25ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142021-02-011810.1177/1729881421993323Geometric property-based convolutional neural network for indoor object detectionXintao Ding0Boquan Li1Jinbao Wang2 Anhui Province Key Laboratory of Network and Information Security, Wuhu, China School of Mathematics and Statistics, , Wuhu, China Anhui Province Key Laboratory of Network and Information Security, Wuhu, ChinaIndoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision.https://doi.org/10.1177/1729881421993323
collection DOAJ
language English
format Article
sources DOAJ
author Xintao Ding
Boquan Li
Jinbao Wang
spellingShingle Xintao Ding
Boquan Li
Jinbao Wang
Geometric property-based convolutional neural network for indoor object detection
International Journal of Advanced Robotic Systems
author_facet Xintao Ding
Boquan Li
Jinbao Wang
author_sort Xintao Ding
title Geometric property-based convolutional neural network for indoor object detection
title_short Geometric property-based convolutional neural network for indoor object detection
title_full Geometric property-based convolutional neural network for indoor object detection
title_fullStr Geometric property-based convolutional neural network for indoor object detection
title_full_unstemmed Geometric property-based convolutional neural network for indoor object detection
title_sort geometric property-based convolutional neural network for indoor object detection
publisher SAGE Publishing
series International Journal of Advanced Robotic Systems
issn 1729-8814
publishDate 2021-02-01
description Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision.
url https://doi.org/10.1177/1729881421993323
work_keys_str_mv AT xintaoding geometricpropertybasedconvolutionalneuralnetworkforindoorobjectdetection
AT boquanli geometricpropertybasedconvolutionalneuralnetworkforindoorobjectdetection
AT jinbaowang geometricpropertybasedconvolutionalneuralnetworkforindoorobjectdetection
_version_ 1724265646834843648