A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images
Object detection is one of the core technologies in aerial image processing and analysis. Although existing aerial image object detection methods based on deep learning have made some progress, there are still some problems remained: (1) Most existing methods fail to simultaneously consider multi-sc...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-11-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/12/22/3750 |
id |
doaj-3d71f63458e647288c657b047d6235c4 |
---|---|
record_format |
Article |
spelling |
doaj-3d71f63458e647288c657b047d6235c42020-11-25T03:59:37ZengMDPI AGRemote Sensing2072-42922020-11-01123750375010.3390/rs12223750A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial ImagesWei Guo0Weihong Li1Zhenghao Li2Weiguo Gong3Jinkai Cui4Xinran Wang5Key Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaChongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaKey Lab of Optoelectronic Technology and Systems Ministry of Education, College of Optoelectronic Engineering, Chongqing University, Chongqing 400044, ChinaObject detection is one of the core technologies in aerial image processing and analysis. Although existing aerial image object detection methods based on deep learning have made some progress, there are still some problems remained: (1) Most existing methods fail to simultaneously consider multi-scale and multi-shape object characteristics in aerial images, which may lead to some missing or false detections; (2) high precision detection generally requires a large and complex network structure, which usually makes it difficult to achieve the high detection efficiency and deploy the network on resource-constrained devices for practical applications. To solve these problems, we propose a slimmer network for more efficient object detection in aerial images. Firstly, we design a polymorphic module (PM) for simultaneously learning the multi-scale and multi-shape object features, so as to better detect the hugely different objects in aerial images. Then, we design a group attention module (GAM) for better utilizing the diversiform concatenation features in the network. By designing multiple detection headers with adaptive anchors and the above-mentioned two modules, we propose a one-stage network called PG-YOLO for realizing the higher detection accuracy. Based on the proposed network, we further propose a more efficient channel pruning method, which can slim the network parameters from 63.7 million (M) to 3.3M that decreases the parameter size by 94.8%, so it can significantly improve the detection efficiency for real-time detection. Finally, we execute the comparative experiments on three public aerial datasets, and the experimental results show that the proposed method outperforms the state-of-the-art methods.https://www.mdpi.com/2072-4292/12/22/3750aerial imagesobject detectionchannel pruningpolymorphic module (PM)group attention module (GAM) |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wei Guo Weihong Li Zhenghao Li Weiguo Gong Jinkai Cui Xinran Wang |
spellingShingle |
Wei Guo Weihong Li Zhenghao Li Weiguo Gong Jinkai Cui Xinran Wang A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images Remote Sensing aerial images object detection channel pruning polymorphic module (PM) group attention module (GAM) |
author_facet |
Wei Guo Weihong Li Zhenghao Li Weiguo Gong Jinkai Cui Xinran Wang |
author_sort |
Wei Guo |
title |
A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images |
title_short |
A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images |
title_full |
A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images |
title_fullStr |
A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images |
title_full_unstemmed |
A Slimmer Network with Polymorphic and Group Attention Modules for More Efficient Object Detection in Aerial Images |
title_sort |
slimmer network with polymorphic and group attention modules for more efficient object detection in aerial images |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2020-11-01 |
description |
Object detection is one of the core technologies in aerial image processing and analysis. Although existing aerial image object detection methods based on deep learning have made some progress, there are still some problems remained: (1) Most existing methods fail to simultaneously consider multi-scale and multi-shape object characteristics in aerial images, which may lead to some missing or false detections; (2) high precision detection generally requires a large and complex network structure, which usually makes it difficult to achieve the high detection efficiency and deploy the network on resource-constrained devices for practical applications. To solve these problems, we propose a slimmer network for more efficient object detection in aerial images. Firstly, we design a polymorphic module (PM) for simultaneously learning the multi-scale and multi-shape object features, so as to better detect the hugely different objects in aerial images. Then, we design a group attention module (GAM) for better utilizing the diversiform concatenation features in the network. By designing multiple detection headers with adaptive anchors and the above-mentioned two modules, we propose a one-stage network called PG-YOLO for realizing the higher detection accuracy. Based on the proposed network, we further propose a more efficient channel pruning method, which can slim the network parameters from 63.7 million (M) to 3.3M that decreases the parameter size by 94.8%, so it can significantly improve the detection efficiency for real-time detection. Finally, we execute the comparative experiments on three public aerial datasets, and the experimental results show that the proposed method outperforms the state-of-the-art methods. |
topic |
aerial images object detection channel pruning polymorphic module (PM) group attention module (GAM) |
url |
https://www.mdpi.com/2072-4292/12/22/3750 |
work_keys_str_mv |
AT weiguo aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT weihongli aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT zhenghaoli aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT weiguogong aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT jinkaicui aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT xinranwang aslimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT weiguo slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT weihongli slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT zhenghaoli slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT weiguogong slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT jinkaicui slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages AT xinranwang slimmernetworkwithpolymorphicandgroupattentionmodulesformoreefficientobjectdetectioninaerialimages |
_version_ |
1724453793816379392 |