Building Extraction Based on U-Net with an Attention Block and Multiple Losses

Semantic segmentation of high-resolution remote sensing images plays an important role in applications for building extraction. However, the current algorithms have some semantic information extraction limitations, and these can lead to poor segmentation results. To extract buildings with high accur...

Full description

Bibliographic Details
Main Authors: Mingqiang Guo, Heng Liu, Yongyang Xu, Ying Huang
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/9/1400
id doaj-f9072bfb66c04af2a7100782a69b9019
record_format Article
spelling doaj-f9072bfb66c04af2a7100782a69b90192020-11-25T02:22:56ZengMDPI AGRemote Sensing2072-42922020-04-01121400140010.3390/rs12091400Building Extraction Based on U-Net with an Attention Block and Multiple LossesMingqiang Guo0Heng Liu1Yongyang Xu2Ying Huang3School of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, ChinaSchool of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, ChinaSchool of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, ChinaWuhan Zondy Cyber Technology Co. Ltd., Wuhan 430074, ChinaSemantic segmentation of high-resolution remote sensing images plays an important role in applications for building extraction. However, the current algorithms have some semantic information extraction limitations, and these can lead to poor segmentation results. To extract buildings with high accuracy, we propose a multiloss neural network based on attention. The designed network, based on U-Net, can improve the sensitivity of the model by the attention block and suppress the background influence of irrelevant feature areas. To improve the ability of the model, a multiloss approach is proposed during training the network. The experimental results show that the proposed model offers great improvement over other state-of-the-art methods. For the public Inria Aerial Image Labeling dataset, the F1 score reached 76.96% and showed good performance on the Aerial Imagery for Roof Segmentation dataset.https://www.mdpi.com/2072-4292/12/9/1400building extractionattention blockmultiple lossessemantic segmentationremote sensing images
collection DOAJ
language English
format Article
sources DOAJ
author Mingqiang Guo
Heng Liu
Yongyang Xu
Ying Huang
spellingShingle Mingqiang Guo
Heng Liu
Yongyang Xu
Ying Huang
Building Extraction Based on U-Net with an Attention Block and Multiple Losses
Remote Sensing
building extraction
attention block
multiple losses
semantic segmentation
remote sensing images
author_facet Mingqiang Guo
Heng Liu
Yongyang Xu
Ying Huang
author_sort Mingqiang Guo
title Building Extraction Based on U-Net with an Attention Block and Multiple Losses
title_short Building Extraction Based on U-Net with an Attention Block and Multiple Losses
title_full Building Extraction Based on U-Net with an Attention Block and Multiple Losses
title_fullStr Building Extraction Based on U-Net with an Attention Block and Multiple Losses
title_full_unstemmed Building Extraction Based on U-Net with an Attention Block and Multiple Losses
title_sort building extraction based on u-net with an attention block and multiple losses
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-04-01
description Semantic segmentation of high-resolution remote sensing images plays an important role in applications for building extraction. However, the current algorithms have some semantic information extraction limitations, and these can lead to poor segmentation results. To extract buildings with high accuracy, we propose a multiloss neural network based on attention. The designed network, based on U-Net, can improve the sensitivity of the model by the attention block and suppress the background influence of irrelevant feature areas. To improve the ability of the model, a multiloss approach is proposed during training the network. The experimental results show that the proposed model offers great improvement over other state-of-the-art methods. For the public Inria Aerial Image Labeling dataset, the F1 score reached 76.96% and showed good performance on the Aerial Imagery for Roof Segmentation dataset.
topic building extraction
attention block
multiple losses
semantic segmentation
remote sensing images
url https://www.mdpi.com/2072-4292/12/9/1400
work_keys_str_mv AT mingqiangguo buildingextractionbasedonunetwithanattentionblockandmultiplelosses
AT hengliu buildingextractionbasedonunetwithanattentionblockandmultiplelosses
AT yongyangxu buildingextractionbasedonunetwithanattentionblockandmultiplelosses
AT yinghuang buildingextractionbasedonunetwithanattentionblockandmultiplelosses
_version_ 1724860950675193856