Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection

The automated detection of buildings in remote sensing images enables understanding the distribution information of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring and population estimation. The performance of deep learning in...

Full description

Bibliographic Details
Main Authors: Liegang Xia, Xiongbo Zhang, Junxia Zhang, Haiping Yang, Tingting Chen
Format: Article
Language:English
Published: MDPI AG 2021-06-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/13/11/2187
id doaj-efd000cbc7c64352b4eccb6ebf96aacc
record_format Article
spelling doaj-efd000cbc7c64352b4eccb6ebf96aacc2021-06-30T23:14:53ZengMDPI AGRemote Sensing2072-42922021-06-01132187218710.3390/rs13112187Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge DetectionLiegang Xia0Xiongbo Zhang1Junxia Zhang2Haiping Yang3Tingting Chen4College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaCollege of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, ChinaThe automated detection of buildings in remote sensing images enables understanding the distribution information of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring and population estimation. The performance of deep learning in images often depends on a large number of manually labeled samples, the production of which is time-consuming and expensive. Thus, this study focuses on reducing the number of labeled samples used and proposing a semi-supervised deep learning approach based on an edge detection network (SDLED), which is the first to introduce semi-supervised learning to the edge detection neural network for extracting building roof boundaries from high-resolution remote sensing images. This approach uses a small number of labeled samples and abundant unlabeled images for joint training. An expert-level semantic edge segmentation model is trained based on labeled samples, which guides unlabeled images to generate pseudo-labels automatically. The inaccurate label sets and manually labeled samples are used to update the semantic edge model together. Particularly, we modified the semantic segmentation network D-LinkNet to obtain high-quality pseudo-labels. Specifically, the main network architecture of D-LinkNet is retained while the multi-scale fusion is added in its second half to improve its performance on edge detection. The SDLED was tested on high-spatial-resolution remote sensing images taken from Google Earth. Results show that the SDLED performs better than the fully supervised method. Moreover, when the trained models were used to predict buildings in the neighboring counties, our approach was superior to the supervised way, with line IoU improvement of at least 6.47% and F1 score improvement of at least 7.49%.https://www.mdpi.com/2072-4292/13/11/2187semi-supervisedsemantic edge detectionbuilding extractiondeep learningvery-high-resolution image
collection DOAJ
language English
format Article
sources DOAJ
author Liegang Xia
Xiongbo Zhang
Junxia Zhang
Haiping Yang
Tingting Chen
spellingShingle Liegang Xia
Xiongbo Zhang
Junxia Zhang
Haiping Yang
Tingting Chen
Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection
Remote Sensing
semi-supervised
semantic edge detection
building extraction
deep learning
very-high-resolution image
author_facet Liegang Xia
Xiongbo Zhang
Junxia Zhang
Haiping Yang
Tingting Chen
author_sort Liegang Xia
title Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection
title_short Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection
title_full Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection
title_fullStr Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection
title_full_unstemmed Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection
title_sort building extraction from very-high-resolution remote sensing images using semi-supervised semantic edge detection
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2021-06-01
description The automated detection of buildings in remote sensing images enables understanding the distribution information of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring and population estimation. The performance of deep learning in images often depends on a large number of manually labeled samples, the production of which is time-consuming and expensive. Thus, this study focuses on reducing the number of labeled samples used and proposing a semi-supervised deep learning approach based on an edge detection network (SDLED), which is the first to introduce semi-supervised learning to the edge detection neural network for extracting building roof boundaries from high-resolution remote sensing images. This approach uses a small number of labeled samples and abundant unlabeled images for joint training. An expert-level semantic edge segmentation model is trained based on labeled samples, which guides unlabeled images to generate pseudo-labels automatically. The inaccurate label sets and manually labeled samples are used to update the semantic edge model together. Particularly, we modified the semantic segmentation network D-LinkNet to obtain high-quality pseudo-labels. Specifically, the main network architecture of D-LinkNet is retained while the multi-scale fusion is added in its second half to improve its performance on edge detection. The SDLED was tested on high-spatial-resolution remote sensing images taken from Google Earth. Results show that the SDLED performs better than the fully supervised method. Moreover, when the trained models were used to predict buildings in the neighboring counties, our approach was superior to the supervised way, with line IoU improvement of at least 6.47% and F1 score improvement of at least 7.49%.
topic semi-supervised
semantic edge detection
building extraction
deep learning
very-high-resolution image
url https://www.mdpi.com/2072-4292/13/11/2187
work_keys_str_mv AT liegangxia buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection
AT xiongbozhang buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection
AT junxiazhang buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection
AT haipingyang buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection
AT tingtingchen buildingextractionfromveryhighresolutionremotesensingimagesusingsemisupervisedsemanticedgedetection
_version_ 1721351843541417984