A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images

Object detection is one of the core technologies in aerial image processing and analysis. Although existing aerial image object detection methods based on deep learning have made some progress, there are still some problems remained: (1) Most existing methods fail to simultaneously consider multi-sc...

Full description

Bibliographic Details
Main Authors: Wei Guo, Weihong Li, Zhenghao Li, Weiguo Gong, Jinkai Cui, Xinran Wang
Format: Article
Language:English
Published: MDPI AG 2020-11-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/22/3750
id doaj-3d71f63458e647288c657b047d6235c4
record_format Article
spelling doaj-3d71f63458e647288c657b047d6235c42020-11-25T03:59:37ZengMDPI AGRemote Sensing2072-42922020-11-01123750375010.3390/rs12223750A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial ImagesWei Guo0Weihong Li1Zhenghao Li2Weiguo Gong3Jinkai Cui4Xinran Wang5Key Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaChongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaObject detection is one of the core technologies in aerial image processing and analysis. Although existing aerial image object detection methods based on deep learning have made some progress, there are still some problems remained: (1) Most existing methods fail to simultaneously consider multi-scale and multi-shape object characteristics in aerial images, which may lead to some missing or false detections; (2) high precision detection generally requires a large and complex network structure, which usually makes it difficult to achieve the high detection efficiency and deploy the network on resource-constrained devices for practical applications. To solve these problems, we propose a slimmer network for more efficient object detection in aerial images. Firstly, we design a polymorphic module (PM) for simultaneously learning the multi-scale and multi-shape object features, so as to better detect the hugely different objects in aerial images. Then, we design a group attention module (GAM) for better utilizing the diversiform concatenation features in the network. By designing multiple detection headers with adaptive anchors and the above-mentioned two modules, we propose a one-stage network called PG-YOLO for realizing the higher detection accuracy. Based on the proposed network, we further propose a more efficient channel pruning method, which can slim the network parameters from 63.7 million (M) to 3.3M that decreases the parameter size by 94.8%, so it can significantly improve the detection efficiency for real-time detection. Finally, we execute the comparative experiments on three public aerial datasets, and the experimental results show that the proposed method outperforms the state-of-the-art methods.https://www.mdpi.com/2072-4292/12/22/3750aerial imagesobject detectionchannel pruningpolymorphic module (PM)group attention module (GAM)
collection DOAJ
language English
format Article
sources DOAJ
author Wei Guo
Weihong Li
Zhenghao Li
Weiguo Gong
Jinkai Cui
Xinran Wang
spellingShingle Wei Guo
Weihong Li
Zhenghao Li
Weiguo Gong
Jinkai Cui
Xinran Wang
A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images
Remote Sensing
aerial images
object detection
channel pruning
polymorphic module (PM)
group attention module (GAM)
author_facet Wei Guo
Weihong Li
Zhenghao Li
Weiguo Gong
Jinkai Cui
Xinran Wang
author_sort Wei Guo
title A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images
title_short A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images
title_full A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images
title_fullStr A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images
title_full_unstemmed A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images
title_sort slimmer network with polymorphic and group attention modules for more efficient object detection in aerial images
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-11-01
description Object detection is one of the core technologies in aerial image processing and analysis. Although existing aerial image object detection methods based on deep learning have made some progress, there are still some problems remained: (1) Most existing methods fail to simultaneously consider multi-scale and multi-shape object characteristics in aerial images, which may lead to some missing or false detections; (2) high precision detection generally requires a large and complex network structure, which usually makes it difficult to achieve the high detection efficiency and deploy the network on resource-constrained devices for practical applications. To solve these problems, we propose a slimmer network for more efficient object detection in aerial images. Firstly, we design a polymorphic module (PM) for simultaneously learning the multi-scale and multi-shape object features, so as to better detect the hugely different objects in aerial images. Then, we design a group attention module (GAM) for better utilizing the diversiform concatenation features in the network. By designing multiple detection headers with adaptive anchors and the above-mentioned two modules, we propose a one-stage network called PG-YOLO for realizing the higher detection accuracy. Based on the proposed network, we further propose a more efficient channel pruning method, which can slim the network parameters from 63.7 million (M) to 3.3M that decreases the parameter size by 94.8%, so it can significantly improve the detection efficiency for real-time detection. Finally, we execute the comparative experiments on three public aerial datasets, and the experimental results show that the proposed method outperforms the state-of-the-art methods.
topic aerial images
object detection
channel pruning
polymorphic module (PM)
group attention module (GAM)
url https://www.mdpi.com/2072-4292/12/22/3750
work_keys_str_mv AT weiguo aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT weihongli aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT zhenghaoli aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT weiguogong aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT jinkaicui aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT xinranwang aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT weiguo slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT weihongli slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT zhenghaoli slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT weiguogong slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT jinkaicui slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
AT xinranwang slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages
_version_ 1724453793816379392