Facade Segmentation in the Wild

Facade parsing is a fundamental problem in urban modeling that forms the back- bone of a variety of tasks including procedural modeling, architectural analysis, urban reconstruction and quite often relies on semantic segmentation as the first step. With the shift to deep learning based approaches, e...

Full description

Bibliographic Details
Main Author: Para, Wamiq Reyaz
Other Authors: Wonka, Peter
Language:en
Published: 2019
Subjects:
Online Access:Para, W. R. (2019). Facade Segmentation in the Wild. KAUST Research Repository. https://doi.org/10.25781/KAUST-IYP0X
http://hdl.handle.net/10754/656528
id ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-656528
record_format oai_dc
spelling ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-6565282021-02-21T05:08:27Z Facade Segmentation in the Wild Para, Wamiq Reyaz Wonka, Peter Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division Alouini, Mohamed-Slim Thabet, Ali Kassem computer vison semantic segmentation Deep learning urban reconstruction Facade parsing is a fundamental problem in urban modeling that forms the back- bone of a variety of tasks including procedural modeling, architectural analysis, urban reconstruction and quite often relies on semantic segmentation as the first step. With the shift to deep learning based approaches, existing small-scale datasets are the bot- tleneck for making further progress in fa ̧cade segmentation and consequently fa ̧cade parsing. In this thesis, we propose a new fa ̧cade image dataset for semantic segmenta- tion called PSV-22, which is the largest such dataset. We show that PSV-22 captures semantics of fa ̧cades better than existing datasets. Additionally, we propose three architectural modifications to current state of the art deep-learning based semantic segmentation architectures and show that these modifications improve performance on our dataset and already existing datasets. Our modifications are generalizable to a large variety of semantic segmentation nets, but are fa ̧cade-specific and employ heuris- tics which arise from the regular grid-like nature of fac ̧ades. Furthermore, results show that our proposed architecture modifications improve the performance compared to baseline models as well as specialized segmentation approaches on fa ̧cade datasets and are either close in, or improve performance on existing datasets. We show that deep models trained on existing data have a substantial performance reduction on our data, whereas models trained only on our data actually improve when evaluated on existing datasets. We intend to release the dataset publically in the future. 2019-08-19T13:57:31Z 2019-08-19T13:57:31Z 2019-08-19 Thesis Para, W. R. (2019). Facade Segmentation in the Wild. KAUST Research Repository. https://doi.org/10.25781/KAUST-IYP0X 10.25781/KAUST-IYP0X http://hdl.handle.net/10754/656528 en
collection NDLTD
language en
sources NDLTD
topic computer vison
semantic segmentation
Deep learning
urban reconstruction
spellingShingle computer vison
semantic segmentation
Deep learning
urban reconstruction
Para, Wamiq Reyaz
Facade Segmentation in the Wild
description Facade parsing is a fundamental problem in urban modeling that forms the back- bone of a variety of tasks including procedural modeling, architectural analysis, urban reconstruction and quite often relies on semantic segmentation as the first step. With the shift to deep learning based approaches, existing small-scale datasets are the bot- tleneck for making further progress in fa ̧cade segmentation and consequently fa ̧cade parsing. In this thesis, we propose a new fa ̧cade image dataset for semantic segmenta- tion called PSV-22, which is the largest such dataset. We show that PSV-22 captures semantics of fa ̧cades better than existing datasets. Additionally, we propose three architectural modifications to current state of the art deep-learning based semantic segmentation architectures and show that these modifications improve performance on our dataset and already existing datasets. Our modifications are generalizable to a large variety of semantic segmentation nets, but are fa ̧cade-specific and employ heuris- tics which arise from the regular grid-like nature of fac ̧ades. Furthermore, results show that our proposed architecture modifications improve the performance compared to baseline models as well as specialized segmentation approaches on fa ̧cade datasets and are either close in, or improve performance on existing datasets. We show that deep models trained on existing data have a substantial performance reduction on our data, whereas models trained only on our data actually improve when evaluated on existing datasets. We intend to release the dataset publically in the future.
author2 Wonka, Peter
author_facet Wonka, Peter
Para, Wamiq Reyaz
author Para, Wamiq Reyaz
author_sort Para, Wamiq Reyaz
title Facade Segmentation in the Wild
title_short Facade Segmentation in the Wild
title_full Facade Segmentation in the Wild
title_fullStr Facade Segmentation in the Wild
title_full_unstemmed Facade Segmentation in the Wild
title_sort facade segmentation in the wild
publishDate 2019
url Para, W. R. (2019). Facade Segmentation in the Wild. KAUST Research Repository. https://doi.org/10.25781/KAUST-IYP0X
http://hdl.handle.net/10754/656528
work_keys_str_mv AT parawamiqreyaz facadesegmentationinthewild
_version_ 1719378105396625408