Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection
The automated detection of buildings in remote sensing images enables understanding the distribution information of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring and population estimation. The performance of deep learning in...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-06-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/13/11/2187 |
id |
doaj-efd000cbc7c64352b4eccb6ebf96aacc |
---|---|
record_format |
Article |
spelling |
doaj-efd000cbc7c64352b4eccb6ebf96aacc2021-06-30T23:14:53ZengMDPI AGRemote Sensing2072-42922021-06-01132187218710.3390/rs13112187Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge DetectionLiegang Xia0Xiongbo Zhang1Junxia Zhang2Haiping Yang3Tingting Chen4College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaThe automated detection of buildings in remote sensing images enables understanding the distribution information of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring and population estimation. The performance of deep learning in images often depends on a large number of manually labeled samples, the production of which is time-consuming and expensive. Thus, this study focuses on reducing the number of labeled samples used and proposing a semi-supervised deep learning approach based on an edge detection network (SDLED), which is the first to introduce semi-supervised learning to the edge detection neural network for extracting building roof boundaries from high-resolution remote sensing images. This approach uses a small number of labeled samples and abundant unlabeled images for joint training. An expert-level semantic edge segmentation model is trained based on labeled samples, which guides unlabeled images to generate pseudo-labels automatically. The inaccurate label sets and manually labeled samples are used to update the semantic edge model together. Particularly, we modified the semantic segmentation network D-LinkNet to obtain high-quality pseudo-labels. Specifically, the main network architecture of D-LinkNet is retained while the multi-scale fusion is added in its second half to improve its performance on edge detection. The SDLED was tested on high-spatial-resolution remote sensing images taken from Google Earth. Results show that the SDLED performs better than the fully supervised method. Moreover, when the trained models were used to predict buildings in the neighboring counties, our approach was superior to the supervised way, with line IoU improvement of at least 6.47% and F1 score improvement of at least 7.49%.https://www.mdpi.com/2072-4292/13/11/2187semi-supervisedsemantic edge detectionbuilding extractiondeep learningvery-high-resolution image |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liegang Xia Xiongbo Zhang Junxia Zhang Haiping Yang Tingting Chen |
spellingShingle |
Liegang Xia Xiongbo Zhang Junxia Zhang Haiping Yang Tingting Chen Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection Remote Sensing semi-supervised semantic edge detection building extraction deep learning very-high-resolution image |
author_facet |
Liegang Xia Xiongbo Zhang Junxia Zhang Haiping Yang Tingting Chen |
author_sort |
Liegang Xia |
title |
Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection |
title_short |
Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection |
title_full |
Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection |
title_fullStr |
Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection |
title_full_unstemmed |
Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection |
title_sort |
building extraction from very-high-resolution remote sensing images using semi-supervised semantic edge detection |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2021-06-01 |
description |
The automated detection of buildings in remote sensing images enables understanding the distribution information of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring and population estimation. The performance of deep learning in images often depends on a large number of manually labeled samples, the production of which is time-consuming and expensive. Thus, this study focuses on reducing the number of labeled samples used and proposing a semi-supervised deep learning approach based on an edge detection network (SDLED), which is the first to introduce semi-supervised learning to the edge detection neural network for extracting building roof boundaries from high-resolution remote sensing images. This approach uses a small number of labeled samples and abundant unlabeled images for joint training. An expert-level semantic edge segmentation model is trained based on labeled samples, which guides unlabeled images to generate pseudo-labels automatically. The inaccurate label sets and manually labeled samples are used to update the semantic edge model together. Particularly, we modified the semantic segmentation network D-LinkNet to obtain high-quality pseudo-labels. Specifically, the main network architecture of D-LinkNet is retained while the multi-scale fusion is added in its second half to improve its performance on edge detection. The SDLED was tested on high-spatial-resolution remote sensing images taken from Google Earth. Results show that the SDLED performs better than the fully supervised method. Moreover, when the trained models were used to predict buildings in the neighboring counties, our approach was superior to the supervised way, with line IoU improvement of at least 6.47% and F1 score improvement of at least 7.49%. |
topic |
semi-supervised semantic edge detection building extraction deep learning very-high-resolution image |
url |
https://www.mdpi.com/2072-4292/13/11/2187 |
work_keys_str_mv |
AT liegangxia buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection AT xiongbozhang buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection AT junxiazhang buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection AT haipingyang buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection AT tingtingchen buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection |
_version_ |
1721351843541417984 |