Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture

Many pieces of information are included in the front region of a vehicle, especially in windshield and bumper regions. Thus, windshield or bumper region detection is making sense to extract useful information. But the existing windshield and bumper detection methods based on traditional artificial f...

Full description

Bibliographic Details
Main Authors: Jingbo Chen, Shengyong Chen, Linjie Bian
Format: Article
Language:English
Published: SAGE Publishing 2020-08-01
Series:International Journal of Advanced Robotic Systems
Online Access:https://doi.org/10.1177/1729881420944343
id doaj-bbe648a0407d47efa440f1c9be7bfe82
record_format Article
spelling doaj-bbe648a0407d47efa440f1c9be7bfe822020-11-25T03:55:03ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142020-08-011710.1177/1729881420944343Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architectureJingbo ChenShengyong ChenLinjie BianMany pieces of information are included in the front region of a vehicle, especially in windshield and bumper regions. Thus, windshield or bumper region detection is making sense to extract useful information. But the existing windshield and bumper detection methods based on traditional artificial features are not robust enough. Those features may become invalid in many real situations (e.g. occlude, illumination change, viewpoint change.). In this article, we propose a multi-attribute-guided vehicle discriminately region detection method based on convolutional neural network and not rely on bounding box regression. We separate the net into two branches, respectively, for identification (ID) and Model attributes training. Therefore, the feature spaces of different attributes become more independent. Additionally, we embed a self-attention block into our framework to improve the performance of local region detection. We train our model on PKU_VD data set which has a huge number of images inside. Furthermore, we labeled the handcrafted bounding boxes on 5000 randomly picked testing images, and 1020 of them are used for evaluation and 3980 as the training data for YOLOv3 . We use Intersection over Union for quantitative evaluation. Experiments were conducted in three different latest convolutional neural network trunks to illustrate the detection performance of the proposed method. Simultaneously, in terms of quantitative evaluation, the performance of our method is close to YOLOv3 even without handcrafted bounding boxes.https://doi.org/10.1177/1729881420944343
collection DOAJ
language English
format Article
sources DOAJ
author Jingbo Chen
Shengyong Chen
Linjie Bian
spellingShingle Jingbo Chen
Shengyong Chen
Linjie Bian
Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
International Journal of Advanced Robotic Systems
author_facet Jingbo Chen
Shengyong Chen
Linjie Bian
author_sort Jingbo Chen
title Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
title_short Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
title_full Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
title_fullStr Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
title_full_unstemmed Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
title_sort multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
publisher SAGE Publishing
series International Journal of Advanced Robotic Systems
issn 1729-8814
publishDate 2020-08-01
description Many pieces of information are included in the front region of a vehicle, especially in windshield and bumper regions. Thus, windshield or bumper region detection is making sense to extract useful information. But the existing windshield and bumper detection methods based on traditional artificial features are not robust enough. Those features may become invalid in many real situations (e.g. occlude, illumination change, viewpoint change.). In this article, we propose a multi-attribute-guided vehicle discriminately region detection method based on convolutional neural network and not rely on bounding box regression. We separate the net into two branches, respectively, for identification (ID) and Model attributes training. Therefore, the feature spaces of different attributes become more independent. Additionally, we embed a self-attention block into our framework to improve the performance of local region detection. We train our model on PKU_VD data set which has a huge number of images inside. Furthermore, we labeled the handcrafted bounding boxes on 5000 randomly picked testing images, and 1020 of them are used for evaluation and 3980 as the training data for YOLOv3 . We use Intersection over Union for quantitative evaluation. Experiments were conducted in three different latest convolutional neural network trunks to illustrate the detection performance of the proposed method. Simultaneously, in terms of quantitative evaluation, the performance of our method is close to YOLOv3 even without handcrafted bounding boxes.
url https://doi.org/10.1177/1729881420944343
work_keys_str_mv AT jingbochen multiattributeselfattentionguidedvehiclelocalregiondetectionbasedonconvolutionalneuralnetworkarchitecture
AT shengyongchen multiattributeselfattentionguidedvehiclelocalregiondetectionbasedonconvolutionalneuralnetworkarchitecture
AT linjiebian multiattributeselfattentionguidedvehiclelocalregiondetectionbasedonconvolutionalneuralnetworkarchitecture
_version_ 1724471016461172736