Facade Segmentation in the Wild
Facade parsing is a fundamental problem in urban modeling that forms the backbone of a variety of tasks, including procedural modeling, architectural analysis, and urban reconstruction, and it quite often relies on semantic segmentation as the first step. With the shift to deep-learning-based approaches, e...
Main Author: | Para, Wamiq Reyaz |
---|---|
Other Authors: | Wonka, Peter |
Language: | en |
Published: | 2019 |
Subjects: | computer vision; semantic segmentation; deep learning; urban reconstruction |
Online Access: | Para, W. R. (2019). Facade Segmentation in the Wild. KAUST Research Repository. https://doi.org/10.25781/KAUST-IYP0X http://hdl.handle.net/10754/656528 |
id | ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-656528 |
---|---|
record_format | oai_dc |
spelling | ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-656528 2021-02-21T05:08:27Z Facade Segmentation in the Wild Para, Wamiq Reyaz Wonka, Peter Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division Alouini, Mohamed-Slim Thabet, Ali Kassem computer vision semantic segmentation Deep learning urban reconstruction Facade parsing is a fundamental problem in urban modeling that forms the backbone of a variety of tasks, including procedural modeling, architectural analysis, and urban reconstruction, and it quite often relies on semantic segmentation as the first step. With the shift to deep-learning-based approaches, existing small-scale datasets are the bottleneck for making further progress in façade segmentation and, consequently, façade parsing. In this thesis, we propose a new façade image dataset for semantic segmentation called PSV-22, which is the largest such dataset. We show that PSV-22 captures the semantics of façades better than existing datasets. Additionally, we propose three architectural modifications to current state-of-the-art deep-learning-based semantic segmentation architectures and show that these modifications improve performance on our dataset and on existing datasets. Our modifications generalize to a large variety of semantic segmentation networks, but they are façade-specific and employ heuristics that arise from the regular, grid-like nature of façades. Furthermore, results show that our proposed architecture modifications improve performance over baseline models as well as specialized segmentation approaches on façade datasets, and are either close in performance or better on existing datasets. We show that deep models trained on existing data suffer a substantial performance reduction on our data, whereas models trained only on our data actually improve when evaluated on existing datasets. We intend to release the dataset publicly in the future. 2019-08-19T13:57:31Z 2019-08-19T13:57:31Z 2019-08-19 Thesis Para, W. R. (2019). Facade Segmentation in the Wild. KAUST Research Repository. https://doi.org/10.25781/KAUST-IYP0X 10.25781/KAUST-IYP0X http://hdl.handle.net/10754/656528 en |
collection | NDLTD |
language | en |
sources | NDLTD |
topic | computer vision semantic segmentation Deep learning urban reconstruction |
spellingShingle | computer vision semantic segmentation Deep learning urban reconstruction Para, Wamiq Reyaz Facade Segmentation in the Wild |
description | Facade parsing is a fundamental problem in urban modeling that forms the backbone of a variety of tasks, including procedural modeling, architectural analysis, and urban reconstruction, and it quite often relies on semantic segmentation as the first step. With the shift to deep-learning-based approaches, existing small-scale datasets are the bottleneck for making further progress in façade segmentation and, consequently, façade parsing. In this thesis, we propose a new façade image dataset for semantic segmentation called PSV-22, which is the largest such dataset. We show that PSV-22 captures the semantics of façades better than existing datasets. Additionally, we propose three architectural modifications to current state-of-the-art deep-learning-based semantic segmentation architectures and show that these modifications improve performance on our dataset and on existing datasets. Our modifications generalize to a large variety of semantic segmentation networks, but they are façade-specific and employ heuristics that arise from the regular, grid-like nature of façades. Furthermore, results show that our proposed architecture modifications improve performance over baseline models as well as specialized segmentation approaches on façade datasets, and are either close in performance or better on existing datasets. We show that deep models trained on existing data suffer a substantial performance reduction on our data, whereas models trained only on our data actually improve when evaluated on existing datasets. We intend to release the dataset publicly in the future. |
author2 | Wonka, Peter |
author_facet | Wonka, Peter Para, Wamiq Reyaz |
author | Para, Wamiq Reyaz |
author_sort | Para, Wamiq Reyaz |
title | Facade Segmentation in the Wild |
title_short | Facade Segmentation in the Wild |
title_full | Facade Segmentation in the Wild |
title_fullStr | Facade Segmentation in the Wild |
title_full_unstemmed | Facade Segmentation in the Wild |
title_sort | facade segmentation in the wild |
publishDate | 2019 |
url | Para, W. R. (2019). Facade Segmentation in the Wild. KAUST Research Repository. https://doi.org/10.25781/KAUST-IYP0X http://hdl.handle.net/10754/656528 |
work_keys_str_mv | AT parawamiqreyaz facadesegmentationinthewild |
_version_ | 1719378105396625408 |