SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELS

We explore the use of semantic segmentation in Digital Terrain Models (DTMS) for detecting manmade landscape structures in archaeological sites. DTM data are stored and processed as large matrices of depth 1 as opposed to depth 3 in RGB images. The matrices usually contain continuous real-valued inf...

Full description

Bibliographic Details
Main Authors: B. Kazimi, F. Thiemann, M. Sester
Format: Article
Language:English
Published: Copernicus Publications 2019-09-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-2-W7/87/2019/isprs-annals-IV-2-W7-87-2019.pdf
id doaj-0add1a285c434f2dab21be5f2b39e93e
record_format Article
spelling doaj-0add1a285c434f2dab21be5f2b39e93e2020-11-25T01:38:21ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502019-09-01IV-2-W7879410.5194/isprs-annals-IV-2-W7-87-2019SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELSB. Kazimi0F. Thiemann1M. Sester2Institute of Cartography and Geoinformatics, Leibniz Universität Hannover, GermanyInstitute of Cartography and Geoinformatics, Leibniz Universität Hannover, GermanyInstitute of Cartography and Geoinformatics, Leibniz Universität Hannover, GermanyWe explore the use of semantic segmentation in Digital Terrain Models (DTMS) for detecting manmade landscape structures in archaeological sites. DTM data are stored and processed as large matrices of depth 1 as opposed to depth 3 in RGB images. The matrices usually contain continuous real-valued information upper bound of which is not fixed, such as distance or height from a reference surface. This is different from RGB images that contain integer values in a fixed range of 0 to 255. Additionally, RGB images are usually stored in smaller multidimensional matrices, and are more suitable as inputs for a neural network while the large DTMs are necessary to be split into smaller sub-matrices to be used by neural networks. Thus, while the spatial information of pixels in RGB images are important only locally within a single image, for DTM data, they are important locally, within a single sub-matrix processed for neural network, and also globally, in relation to the neighboring sub-matrices. To cope with the two differences, we apply min-max normalization to each input matrix fed to the neural network, and use a slightly modified version of DeepLabv3+ model for semantic segmentation. We show that with the architecture change, and the preprocessing, better results are achieved.https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-2-W7/87/2019/isprs-annals-IV-2-W7-87-2019.pdf
collection DOAJ
language English
format Article
sources DOAJ
author B. Kazimi
F. Thiemann
M. Sester
spellingShingle B. Kazimi
F. Thiemann
M. Sester
SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELS
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
author_facet B. Kazimi
F. Thiemann
M. Sester
author_sort B. Kazimi
title SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELS
title_short SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELS
title_full SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELS
title_fullStr SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELS
title_full_unstemmed SEMANTIC SEGMENTATION OF MANMADE LANDSCAPE STRUCTURES IN DIGITAL TERRAIN MODELS
title_sort semantic segmentation of manmade landscape structures in digital terrain models
publisher Copernicus Publications
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
issn 2194-9042
2194-9050
publishDate 2019-09-01
description We explore the use of semantic segmentation in Digital Terrain Models (DTMS) for detecting manmade landscape structures in archaeological sites. DTM data are stored and processed as large matrices of depth 1 as opposed to depth 3 in RGB images. The matrices usually contain continuous real-valued information upper bound of which is not fixed, such as distance or height from a reference surface. This is different from RGB images that contain integer values in a fixed range of 0 to 255. Additionally, RGB images are usually stored in smaller multidimensional matrices, and are more suitable as inputs for a neural network while the large DTMs are necessary to be split into smaller sub-matrices to be used by neural networks. Thus, while the spatial information of pixels in RGB images are important only locally within a single image, for DTM data, they are important locally, within a single sub-matrix processed for neural network, and also globally, in relation to the neighboring sub-matrices. To cope with the two differences, we apply min-max normalization to each input matrix fed to the neural network, and use a slightly modified version of DeepLabv3+ model for semantic segmentation. We show that with the architecture change, and the preprocessing, better results are achieved.
url https://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/IV-2-W7/87/2019/isprs-annals-IV-2-W7-87-2019.pdf
work_keys_str_mv AT bkazimi semanticsegmentationofmanmadelandscapestructuresindigitalterrainmodels
AT fthiemann semanticsegmentationofmanmadelandscapestructuresindigitalterrainmodels
AT msester semanticsegmentationofmanmadelandscapestructuresindigitalterrainmodels
_version_ 1725054340688773120