Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks

Automatic building segmentation from aerial imagery is an important and challenging task because of the variety of backgrounds, building textures and imaging conditions. Currently, research using variant types of fully convolutional networks (FCNs) has largely improved the performance of this task....

Full description

Bibliographic Details
Main Authors: Guangming Wu, Xiaowei Shao, Zhiling Guo, Qi Chen, Wei Yuan, Xiaodan Shi, Yongwei Xu, Ryosuke Shibasaki
Format: Article
Language:English
Published: MDPI AG 2018-03-01
Series:Remote Sensing
Subjects:
Online Access:http://www.mdpi.com/2072-4292/10/3/407
id doaj-d9dd779dcdf34c0f8d1fd628814445ca
record_format Article
spelling doaj-d9dd779dcdf34c0f8d1fd628814445ca2020-11-25T00:54:27ZengMDPI AGRemote Sensing2072-42922018-03-0110340710.3390/rs10030407rs10030407Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional NetworksGuangming Wu0Xiaowei Shao1Zhiling Guo2Qi Chen3Wei Yuan4Xiaodan Shi5Yongwei Xu6Ryosuke Shibasaki7Center for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanCenter for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanCenter for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanCenter for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanCenter for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanCenter for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanCenter for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanCenter for Spatial Information Science, University of Tokyo, Kashiwa 277-8568, JapanAutomatic building segmentation from aerial imagery is an important and challenging task because of the variety of backgrounds, building textures and imaging conditions. Currently, research using variant types of fully convolutional networks (FCNs) has largely improved the performance of this task. However, pursuing more accurate segmentation results is still critical for further applications such as automatic mapping. In this study, a multi-constraint fully convolutional network (MC–FCN) model is proposed to perform end-to-end building segmentation. Our MC–FCN model consists of a bottom-up/top-down fully convolutional architecture and multi-constraints that are computed between the binary cross entropy of prediction and the corresponding ground truth. Since more constraints are applied to optimize the parameters of the intermediate layers, the multi-scale feature representation of the model is further enhanced, and hence higher performance can be achieved. The experiments on a very-high-resolution aerial image dataset covering 18 km 2 and more than 17,000 buildings indicate that our method performs well in the building segmentation task. The proposed MC–FCN method significantly outperforms the classic FCN method and the adaptive boosting method using features extracted by the histogram of oriented gradients. Compared with the state-of-the-art U–Net model, MC–FCN gains 3.2% (0.833 vs. 0.807) and 2.2% (0.893 vs. 0.874) relative improvements of Jaccard index and kappa coefficient with the cost of only 1.8% increment of the model-training time. In addition, the sensitivity analysis demonstrates that constraints at different positions have inconsistent impact on the performance of the MC–FCN.http://www.mdpi.com/2072-4292/10/3/407aerial imagerybuilding detectionconvolutional neural networkmulti-constraint fully convolutional networksfeature pyramid
collection DOAJ
language English
format Article
sources DOAJ
author Guangming Wu
Xiaowei Shao
Zhiling Guo
Qi Chen
Wei Yuan
Xiaodan Shi
Yongwei Xu
Ryosuke Shibasaki
spellingShingle Guangming Wu
Xiaowei Shao
Zhiling Guo
Qi Chen
Wei Yuan
Xiaodan Shi
Yongwei Xu
Ryosuke Shibasaki
Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks
Remote Sensing
aerial imagery
building detection
convolutional neural network
multi-constraint fully convolutional networks
feature pyramid
author_facet Guangming Wu
Xiaowei Shao
Zhiling Guo
Qi Chen
Wei Yuan
Xiaodan Shi
Yongwei Xu
Ryosuke Shibasaki
author_sort Guangming Wu
title Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks
title_short Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks
title_full Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks
title_fullStr Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks
title_full_unstemmed Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks
title_sort automatic building segmentation of aerial imagery using multi-constraint fully convolutional networks
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2018-03-01
description Automatic building segmentation from aerial imagery is an important and challenging task because of the variety of backgrounds, building textures and imaging conditions. Currently, research using variant types of fully convolutional networks (FCNs) has largely improved the performance of this task. However, pursuing more accurate segmentation results is still critical for further applications such as automatic mapping. In this study, a multi-constraint fully convolutional network (MC–FCN) model is proposed to perform end-to-end building segmentation. Our MC–FCN model consists of a bottom-up/top-down fully convolutional architecture and multi-constraints that are computed between the binary cross entropy of prediction and the corresponding ground truth. Since more constraints are applied to optimize the parameters of the intermediate layers, the multi-scale feature representation of the model is further enhanced, and hence higher performance can be achieved. The experiments on a very-high-resolution aerial image dataset covering 18 km 2 and more than 17,000 buildings indicate that our method performs well in the building segmentation task. The proposed MC–FCN method significantly outperforms the classic FCN method and the adaptive boosting method using features extracted by the histogram of oriented gradients. Compared with the state-of-the-art U–Net model, MC–FCN gains 3.2% (0.833 vs. 0.807) and 2.2% (0.893 vs. 0.874) relative improvements of Jaccard index and kappa coefficient with the cost of only 1.8% increment of the model-training time. In addition, the sensitivity analysis demonstrates that constraints at different positions have inconsistent impact on the performance of the MC–FCN.
topic aerial imagery
building detection
convolutional neural network
multi-constraint fully convolutional networks
feature pyramid
url http://www.mdpi.com/2072-4292/10/3/407
work_keys_str_mv AT guangmingwu automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
AT xiaoweishao automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
AT zhilingguo automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
AT qichen automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
AT weiyuan automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
AT xiaodanshi automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
AT yongweixu automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
AT ryosukeshibasaki automaticbuildingsegmentationofaerialimageryusingmulticonstraintfullyconvolutionalnetworks
_version_ 1725234384286515200