A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery

The detection of building footprints and road networks has many useful applications including the monitoring of urban development, real-time navigation, etc. Taking into account that a great deal of human attention is required by these remote sensing tasks, a lot of effort has been made to automate...

Full description

Bibliographic Details
Main Authors: Christian Ayala, Rubén Sesma, Carlos Aranda, Mikel Galar
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/13/16/3135
id doaj-a48863681d2545759cb12187528626d9
record_format Article
spelling doaj-a48863681d2545759cb12187528626d92021-08-26T14:17:24ZengMDPI AGRemote Sensing2072-42922021-08-01133135313510.3390/rs13163135A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite ImageryChristian Ayala0Rubén Sesma1Carlos Aranda2Mikel Galar3Tracasa Instrumental, Calle Cabárceno 6, 31621 Sarriguren, SpainTracasa Instrumental, Calle Cabárceno 6, 31621 Sarriguren, SpainTracasa Instrumental, Calle Cabárceno 6, 31621 Sarriguren, SpainInstitute of Smart Cities (ISC), Public University of Navarre (UPNA), Arrosadia Campus, 31006 Pamplona, SpainThe detection of building footprints and road networks has many useful applications including the monitoring of urban development, real-time navigation, etc. Taking into account that a great deal of human attention is required by these remote sensing tasks, a lot of effort has been made to automate them. However, the vast majority of the approaches rely on very high-resolution satellite imagery (<2.5 m) whose costs are not yet affordable for maintaining up-to-date maps. Working with the limited spatial resolution provided by high-resolution satellite imagery such as Sentinel-1 and Sentinel-2 (10 m) makes it hard to detect buildings and roads, since these labels may coexist within the same pixel. This paper focuses on this problem and presents a novel methodology capable of detecting building and roads with sub-pixel width by increasing the resolution of the output masks. This methodology consists of fusing Sentinel-1 and Sentinel-2 data (at 10 m) together with OpenStreetMap to train deep learning models for building and road detection at 2.5 m. This becomes possible thanks to the usage of OpenStreetMap vector data, which can be rasterized to any desired resolution. Accordingly, a few simple yet effective modifications of the U-Net architecture are proposed to not only semantically segment the input image, but also to learn how to enhance the resolution of the output masks. As a result, generated mappings quadruplicate the input spatial resolution, closing the gap between satellite and aerial imagery for building and road detection. To properly evaluate the generalization capabilities of the proposed methodology, a data-set composed of 44 cities across the Spanish territory have been considered and divided into training and testing cities. Both quantitative and qualitative results show that high-resolution satellite imagery can be used for sub-pixel width building and road detection following the proper methodology.https://www.mdpi.com/2072-4292/13/16/3135Sentinel-1Sentinel-2remote sensingbuilding detectionroad detectiondeep learning
collection DOAJ
language English
format Article
sources DOAJ
author Christian Ayala
Rubén Sesma
Carlos Aranda
Mikel Galar
spellingShingle Christian Ayala
Rubén Sesma
Carlos Aranda
Mikel Galar
A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery
Remote Sensing
Sentinel-1
Sentinel-2
remote sensing
building detection
road detection
deep learning
author_facet Christian Ayala
Rubén Sesma
Carlos Aranda
Mikel Galar
author_sort Christian Ayala
title A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery
title_short A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery
title_full A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery
title_fullStr A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery
title_full_unstemmed A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery
title_sort deep learning approach to an enhanced building footprint and road detection in high-resolution satellite imagery
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2021-08-01
description The detection of building footprints and road networks has many useful applications including the monitoring of urban development, real-time navigation, etc. Taking into account that a great deal of human attention is required by these remote sensing tasks, a lot of effort has been made to automate them. However, the vast majority of the approaches rely on very high-resolution satellite imagery (<2.5 m) whose costs are not yet affordable for maintaining up-to-date maps. Working with the limited spatial resolution provided by high-resolution satellite imagery such as Sentinel-1 and Sentinel-2 (10 m) makes it hard to detect buildings and roads, since these labels may coexist within the same pixel. This paper focuses on this problem and presents a novel methodology capable of detecting building and roads with sub-pixel width by increasing the resolution of the output masks. This methodology consists of fusing Sentinel-1 and Sentinel-2 data (at 10 m) together with OpenStreetMap to train deep learning models for building and road detection at 2.5 m. This becomes possible thanks to the usage of OpenStreetMap vector data, which can be rasterized to any desired resolution. Accordingly, a few simple yet effective modifications of the U-Net architecture are proposed to not only semantically segment the input image, but also to learn how to enhance the resolution of the output masks. As a result, generated mappings quadruplicate the input spatial resolution, closing the gap between satellite and aerial imagery for building and road detection. To properly evaluate the generalization capabilities of the proposed methodology, a data-set composed of 44 cities across the Spanish territory have been considered and divided into training and testing cities. Both quantitative and qualitative results show that high-resolution satellite imagery can be used for sub-pixel width building and road detection following the proper methodology.
topic Sentinel-1
Sentinel-2
remote sensing
building detection
road detection
deep learning
url https://www.mdpi.com/2072-4292/13/16/3135
work_keys_str_mv AT christianayala adeeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
AT rubensesma adeeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
AT carlosaranda adeeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
AT mikelgalar adeeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
AT christianayala deeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
AT rubensesma deeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
AT carlosaranda deeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
AT mikelgalar deeplearningapproachtoanenhancedbuildingfootprintandroaddetectioninhighresolutionsatelliteimagery
_version_ 1721190297862406144