3D Reconstruction from Satellite Imagery Using Deep Learning

Learning-based multi-view stereo (MVS) has shown promising results in the domain of general 3D reconstruction. However, no work before this thesis has applied learning-based MVS to urban 3D reconstruction from satellite images. In this thesis, learning-based MVS is used to infer depth maps from sate...

Full description

Bibliographic Details
Main Author: Yngesjö, Tim
Format: Others
Language:English
Published: Linköpings universitet, Datorseende 2021
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176622
id ndltd-UPSALLA1-oai-DiVA.org-liu-176622
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-liu-1766222021-06-18T05:30:45Z3D Reconstruction from Satellite Imagery Using Deep LearningengYngesjö, TimLinköpings universitet, Datorseende20213D reconstructionDeep learningSignal ProcessingSignalbehandlingLearning-based multi-view stereo (MVS) has shown promising results in the domain of general 3D reconstruction. However, no work before this thesis has applied learning-based MVS to urban 3D reconstruction from satellite images. In this thesis, learning-based MVS is used to infer depth maps from satellite images. Models are trained on both synthetic and real satellite images from Las Vegas with ground truth data from a high-resolution aerial-based 3D model. This thesis also evaluates different methods for reconstructing digital surface models (DSM) and compares them to existing satellite-based 3D models at Maxar Technologies. The DSMs are created by either post-processing point clouds obtained from predicted depth maps or by an end-to-end approach where the depth map for an orthographic satellite image is predicted.  This thesis concludes that learning-based MVS can be used to predict accurate depth maps. Models trained on synthetic data yielded relatively good results, but not nearly as good as for models trained on real satellite images. The trained models also generalize relatively well to cities not present in training. This thesis also concludes that the reconstructed DSMs achieve better quantitative results than the existing 3D model in Las Vegas and similar results for the test sets from other cities. Compared to ground truth, the best-performing method achieved an L1 and L2 error of 14 % and 29 % lower than Maxar's current 3D model, respectively. The method that uses a point cloud as an intermediate step achieves better quantitative results compared to the end-to-end system. Very promising qualitative results are achieved with the proposed methods, especially when utilizing an end-to-end approach.  Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176622LiTH-ISY-Ex ; 21/5393–SEapplication/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic 3D reconstruction
Deep learning
Signal Processing
Signalbehandling
spellingShingle 3D reconstruction
Deep learning
Signal Processing
Signalbehandling
Yngesjö, Tim
3D Reconstruction from Satellite Imagery Using Deep Learning
description Learning-based multi-view stereo (MVS) has shown promising results in the domain of general 3D reconstruction. However, no work before this thesis has applied learning-based MVS to urban 3D reconstruction from satellite images. In this thesis, learning-based MVS is used to infer depth maps from satellite images. Models are trained on both synthetic and real satellite images from Las Vegas with ground truth data from a high-resolution aerial-based 3D model. This thesis also evaluates different methods for reconstructing digital surface models (DSM) and compares them to existing satellite-based 3D models at Maxar Technologies. The DSMs are created by either post-processing point clouds obtained from predicted depth maps or by an end-to-end approach where the depth map for an orthographic satellite image is predicted.  This thesis concludes that learning-based MVS can be used to predict accurate depth maps. Models trained on synthetic data yielded relatively good results, but not nearly as good as for models trained on real satellite images. The trained models also generalize relatively well to cities not present in training. This thesis also concludes that the reconstructed DSMs achieve better quantitative results than the existing 3D model in Las Vegas and similar results for the test sets from other cities. Compared to ground truth, the best-performing method achieved an L1 and L2 error of 14 % and 29 % lower than Maxar's current 3D model, respectively. The method that uses a point cloud as an intermediate step achieves better quantitative results compared to the end-to-end system. Very promising qualitative results are achieved with the proposed methods, especially when utilizing an end-to-end approach. 
author Yngesjö, Tim
author_facet Yngesjö, Tim
author_sort Yngesjö, Tim
title 3D Reconstruction from Satellite Imagery Using Deep Learning
title_short 3D Reconstruction from Satellite Imagery Using Deep Learning
title_full 3D Reconstruction from Satellite Imagery Using Deep Learning
title_fullStr 3D Reconstruction from Satellite Imagery Using Deep Learning
title_full_unstemmed 3D Reconstruction from Satellite Imagery Using Deep Learning
title_sort 3d reconstruction from satellite imagery using deep learning
publisher Linköpings universitet, Datorseende
publishDate 2021
url http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176622
work_keys_str_mv AT yngesjotim 3dreconstructionfromsatelliteimageryusingdeeplearning
_version_ 1719411049643376640