SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?

We analyze the effects of additional height data for semantic segmentation of aerial images with a convolutional encoder-decoder network. Besides a merely image-based semantic segmentation, we trained the same network with height as additional input and furthermore, we defined a multi-task model, wh...

Full description

Bibliographic Details
Main Authors: M. Schmitz, W. Brandenburger, H. Mayer
Format: Article
Language:English
Published: Copernicus Publications 2019-09-01
Series:The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W16/195/2019/isprs-archives-XLII-2-W16-195-2019.pdf
id doaj-4304d4d3d3744a9186e536119ec38173
record_format Article
spelling doaj-4304d4d3d3744a9186e536119ec381732020-11-24T20:44:18ZengCopernicus PublicationsThe International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences1682-17502194-90342019-09-01XLII-2-W1619520010.5194/isprs-archives-XLII-2-W16-195-2019SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?M. Schmitz0W. Brandenburger1H. Mayer2Institute for Applied Computer Science, Bundeswehr University Munich, Neubiberg, GermanyInstitute for Applied Computer Science, Bundeswehr University Munich, Neubiberg, GermanyInstitute for Applied Computer Science, Bundeswehr University Munich, Neubiberg, GermanyWe analyze the effects of additional height data for semantic segmentation of aerial images with a convolutional encoder-decoder network. Besides a merely image-based semantic segmentation, we trained the same network with height as additional input and furthermore, we defined a multi-task model, where we trained the network to estimate the relative height of objects in parallel to semantic segmentation on the image data only. Our findings are, that excellent results are possible for image data only and additional height information has no significant effect – neither when employed as extra input nor when used for multi-task training, even with differently weighted losses. Based on our results, we, thus, hypothesize that a strong encoder-decoder network implicitly learns the correlation of object categories and relative heights.https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W16/195/2019/isprs-archives-XLII-2-W16-195-2019.pdf
collection DOAJ
language English
format Article
sources DOAJ
author M. Schmitz
W. Brandenburger
H. Mayer
spellingShingle M. Schmitz
W. Brandenburger
H. Mayer
SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
author_facet M. Schmitz
W. Brandenburger
H. Mayer
author_sort M. Schmitz
title SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?
title_short SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?
title_full SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?
title_fullStr SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?
title_full_unstemmed SEMANTIC SEGMENTATION OF AIRBORNE IMAGES AND CORRESPONDING DIGITAL SURFACE MODELS – ADDITIONAL INPUT DATA OR ADDITIONAL TASK?
title_sort semantic segmentation of airborne images and corresponding digital surface models – additional input data or additional task?
publisher Copernicus Publications
series The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
issn 1682-1750
2194-9034
publishDate 2019-09-01
description We analyze the effects of additional height data for semantic segmentation of aerial images with a convolutional encoder-decoder network. Besides a merely image-based semantic segmentation, we trained the same network with height as additional input and furthermore, we defined a multi-task model, where we trained the network to estimate the relative height of objects in parallel to semantic segmentation on the image data only. Our findings are, that excellent results are possible for image data only and additional height information has no significant effect – neither when employed as extra input nor when used for multi-task training, even with differently weighted losses. Based on our results, we, thus, hypothesize that a strong encoder-decoder network implicitly learns the correlation of object categories and relative heights.
url https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W16/195/2019/isprs-archives-XLII-2-W16-195-2019.pdf
work_keys_str_mv AT mschmitz semanticsegmentationofairborneimagesandcorrespondingdigitalsurfacemodelsadditionalinputdataoradditionaltask
AT wbrandenburger semanticsegmentationofairborneimagesandcorrespondingdigitalsurfacemodelsadditionalinputdataoradditionaltask
AT hmayer semanticsegmentationofairborneimagesandcorrespondingdigitalsurfacemodelsadditionalinputdataoradditionaltask
_version_ 1716817788984623104