Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion

This paper reports on a building detection approach based on deep learning (DL) using the fusion of Light Detection and Ranging (LiDAR) data and orthophotos. The proposed method utilized object-based analysis to create objects, a feature-level fusion, an autoencoder-based dimensionality reduction to...

Full description

Bibliographic Details
Main Authors: Faten Hamed Nahhas, Helmi Z. M. Shafri, Maher Ibrahim Sameen, Biswajeet Pradhan, Shattri Mansor
Format: Article
Language:English
Published: Hindawi Limited 2018-01-01
Series:Journal of Sensors
Online Access:http://dx.doi.org/10.1155/2018/7212307
id doaj-ac0a5c00f45c4069a903b78d35ff40ad
record_format Article
spelling doaj-ac0a5c00f45c4069a903b78d35ff40ad2020-11-25T01:27:11ZengHindawi LimitedJournal of Sensors1687-725X1687-72682018-01-01201810.1155/2018/72123077212307Deep Learning Approach for Building Detection Using LiDAR–Orthophoto FusionFaten Hamed Nahhas0Helmi Z. M. Shafri1Maher Ibrahim Sameen2Biswajeet Pradhan3Shattri Mansor4Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400 Serdang, Selangor, MalaysiaDepartment of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400 Serdang, Selangor, MalaysiaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW, AustraliaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW, AustraliaDepartment of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400 Serdang, Selangor, MalaysiaThis paper reports on a building detection approach based on deep learning (DL) using the fusion of Light Detection and Ranging (LiDAR) data and orthophotos. The proposed method utilized object-based analysis to create objects, a feature-level fusion, an autoencoder-based dimensionality reduction to transform low-level features into compressed features, and a convolutional neural network (CNN) to transform compressed features into high-level features, which were used to classify objects into buildings and background. The proposed architecture was optimized for the grid search method, and its sensitivity to hyperparameters was analyzed and discussed. The proposed model was evaluated on two datasets selected from an urban area with different building types. Results show that the dimensionality reduction by the autoencoder approach from 21 features to 10 features can improve detection accuracy from 86.06% to 86.19% in the working area and from 77.92% to 78.26% in the testing area. The sensitivity analysis also shows that the selection of the hyperparameter values of the model significantly affects detection accuracy. The best hyperparameters of the model are 128 filters in the CNN model, the Adamax optimizer, 10 units in the fully connected layer of the CNN model, a batch size of 8, and a dropout of 0.2. These hyperparameters are critical to improving the generalization capacity of the model. Furthermore, comparison experiments with the support vector machine (SVM) show that the proposed model with or without dimensionality reduction outperforms the SVM models in the working area. However, the SVM model achieves better accuracy in the testing area than the proposed model without dimensionality reduction. This study generally shows that the use of an autoencoder in DL models can improve the accuracy of building recognition in fused LiDAR–orthophoto data.http://dx.doi.org/10.1155/2018/7212307
collection DOAJ
language English
format Article
sources DOAJ
author Faten Hamed Nahhas
Helmi Z. M. Shafri
Maher Ibrahim Sameen
Biswajeet Pradhan
Shattri Mansor
spellingShingle Faten Hamed Nahhas
Helmi Z. M. Shafri
Maher Ibrahim Sameen
Biswajeet Pradhan
Shattri Mansor
Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion
Journal of Sensors
author_facet Faten Hamed Nahhas
Helmi Z. M. Shafri
Maher Ibrahim Sameen
Biswajeet Pradhan
Shattri Mansor
author_sort Faten Hamed Nahhas
title Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion
title_short Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion
title_full Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion
title_fullStr Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion
title_full_unstemmed Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion
title_sort deep learning approach for building detection using lidar–orthophoto fusion
publisher Hindawi Limited
series Journal of Sensors
issn 1687-725X
1687-7268
publishDate 2018-01-01
description This paper reports on a building detection approach based on deep learning (DL) using the fusion of Light Detection and Ranging (LiDAR) data and orthophotos. The proposed method utilized object-based analysis to create objects, a feature-level fusion, an autoencoder-based dimensionality reduction to transform low-level features into compressed features, and a convolutional neural network (CNN) to transform compressed features into high-level features, which were used to classify objects into buildings and background. The proposed architecture was optimized for the grid search method, and its sensitivity to hyperparameters was analyzed and discussed. The proposed model was evaluated on two datasets selected from an urban area with different building types. Results show that the dimensionality reduction by the autoencoder approach from 21 features to 10 features can improve detection accuracy from 86.06% to 86.19% in the working area and from 77.92% to 78.26% in the testing area. The sensitivity analysis also shows that the selection of the hyperparameter values of the model significantly affects detection accuracy. The best hyperparameters of the model are 128 filters in the CNN model, the Adamax optimizer, 10 units in the fully connected layer of the CNN model, a batch size of 8, and a dropout of 0.2. These hyperparameters are critical to improving the generalization capacity of the model. Furthermore, comparison experiments with the support vector machine (SVM) show that the proposed model with or without dimensionality reduction outperforms the SVM models in the working area. However, the SVM model achieves better accuracy in the testing area than the proposed model without dimensionality reduction. This study generally shows that the use of an autoencoder in DL models can improve the accuracy of building recognition in fused LiDAR–orthophoto data.
url http://dx.doi.org/10.1155/2018/7212307
work_keys_str_mv AT fatenhamednahhas deeplearningapproachforbuildingdetectionusinglidarorthophotofusion
AT helmizmshafri deeplearningapproachforbuildingdetectionusinglidarorthophotofusion
AT maheribrahimsameen deeplearningapproachforbuildingdetectionusinglidarorthophotofusion
AT biswajeetpradhan deeplearningapproachforbuildingdetectionusinglidarorthophotofusion
AT shattrimansor deeplearningapproachforbuildingdetectionusinglidarorthophotofusion
_version_ 1725106439193624576