Building Extraction Based on U-Net with an Attention Block and Multiple Losses
Semantic segmentation of high-resolution remote sensing images plays an important role in applications for building extraction. However, the current algorithms have some semantic information extraction limitations, and these can lead to poor segmentation results. To extract buildings with high accur...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-04-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/12/9/1400 |
id |
doaj-f9072bfb66c04af2a7100782a69b9019 |
---|---|
record_format |
Article |
spelling |
doaj-f9072bfb66c04af2a7100782a69b90192020-11-25T02:22:56ZengMDPI AGRemote Sensing2072-42922020-04-01121400140010.3390/rs12091400Building Extraction Based on U-Net with an Attention Block and Multiple LossesMingqiang Guo0Heng Liu1Yongyang Xu2Ying Huang3School of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, ChinaSchool of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, ChinaSchool of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, ChinaWuhan Zondy Cyber Technology Co. Ltd., Wuhan 430074, ChinaSemantic segmentation of high-resolution remote sensing images plays an important role in applications for building extraction. However, the current algorithms have some semantic information extraction limitations, and these can lead to poor segmentation results. To extract buildings with high accuracy, we propose a multiloss neural network based on attention. The designed network, based on U-Net, can improve the sensitivity of the model by the attention block and suppress the background influence of irrelevant feature areas. To improve the ability of the model, a multiloss approach is proposed during training the network. The experimental results show that the proposed model offers great improvement over other state-of-the-art methods. For the public Inria Aerial Image Labeling dataset, the F1 score reached 76.96% and showed good performance on the Aerial Imagery for Roof Segmentation dataset.https://www.mdpi.com/2072-4292/12/9/1400building extractionattention blockmultiple lossessemantic segmentationremote sensing images |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mingqiang Guo Heng Liu Yongyang Xu Ying Huang |
spellingShingle |
Mingqiang Guo Heng Liu Yongyang Xu Ying Huang Building Extraction Based on U-Net with an Attention Block and Multiple Losses Remote Sensing building extraction attention block multiple losses semantic segmentation remote sensing images |
author_facet |
Mingqiang Guo Heng Liu Yongyang Xu Ying Huang |
author_sort |
Mingqiang Guo |
title |
Building Extraction Based on U-Net with an Attention Block and Multiple Losses |
title_short |
Building Extraction Based on U-Net with an Attention Block and Multiple Losses |
title_full |
Building Extraction Based on U-Net with an Attention Block and Multiple Losses |
title_fullStr |
Building Extraction Based on U-Net with an Attention Block and Multiple Losses |
title_full_unstemmed |
Building Extraction Based on U-Net with an Attention Block and Multiple Losses |
title_sort |
building extraction based on u-net with an attention block and multiple losses |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2020-04-01 |
description |
Semantic segmentation of high-resolution remote sensing images plays an important role in applications for building extraction. However, the current algorithms have some semantic information extraction limitations, and these can lead to poor segmentation results. To extract buildings with high accuracy, we propose a multiloss neural network based on attention. The designed network, based on U-Net, can improve the sensitivity of the model by the attention block and suppress the background influence of irrelevant feature areas. To improve the ability of the model, a multiloss approach is proposed during training the network. The experimental results show that the proposed model offers great improvement over other state-of-the-art methods. For the public Inria Aerial Image Labeling dataset, the F1 score reached 76.96% and showed good performance on the Aerial Imagery for Roof Segmentation dataset. |
topic |
building extraction attention block multiple losses semantic segmentation remote sensing images |
url |
https://www.mdpi.com/2072-4292/12/9/1400 |
work_keys_str_mv |
AT mingqiangguo buildingextractionbasedonunetwithanattentionblockandmultiplelosses AT hengliu buildingextractionbasedonunetwithanattentionblockandmultiplelosses AT yongyangxu buildingextractionbasedonunetwithanattentionblockandmultiplelosses AT yinghuang buildingextractionbasedonunetwithanattentionblockandmultiplelosses |
_version_ |
1724860950675193856 |