Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion
This paper reports on a building detection approach based on deep learning (DL) using the fusion of Light Detection and Ranging (LiDAR) data and orthophotos. The proposed method utilized object-based analysis to create objects, a feature-level fusion, an autoencoder-based dimensionality reduction to...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2018-01-01
|
Series: | Journal of Sensors |
Online Access: | http://dx.doi.org/10.1155/2018/7212307 |
id |
doaj-ac0a5c00f45c4069a903b78d35ff40ad |
---|---|
record_format |
Article |
spelling |
doaj-ac0a5c00f45c4069a903b78d35ff40ad2020-11-25T01:27:11ZengHindawi LimitedJournal of Sensors1687-725X1687-72682018-01-01201810.1155/2018/72123077212307Deep Learning Approach for Building Detection Using LiDAR–Orthophoto FusionFaten Hamed Nahhas0Helmi Z. M. Shafri1Maher Ibrahim Sameen2Biswajeet Pradhan3Shattri Mansor4Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400 Serdang, Selangor, MalaysiaDepartment of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400 Serdang, Selangor, MalaysiaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW, AustraliaCentre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Sydney, NSW, AustraliaDepartment of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), 43400 Serdang, Selangor, MalaysiaThis paper reports on a building detection approach based on deep learning (DL) using the fusion of Light Detection and Ranging (LiDAR) data and orthophotos. The proposed method utilized object-based analysis to create objects, a feature-level fusion, an autoencoder-based dimensionality reduction to transform low-level features into compressed features, and a convolutional neural network (CNN) to transform compressed features into high-level features, which were used to classify objects into buildings and background. The proposed architecture was optimized for the grid search method, and its sensitivity to hyperparameters was analyzed and discussed. The proposed model was evaluated on two datasets selected from an urban area with different building types. Results show that the dimensionality reduction by the autoencoder approach from 21 features to 10 features can improve detection accuracy from 86.06% to 86.19% in the working area and from 77.92% to 78.26% in the testing area. The sensitivity analysis also shows that the selection of the hyperparameter values of the model significantly affects detection accuracy. The best hyperparameters of the model are 128 filters in the CNN model, the Adamax optimizer, 10 units in the fully connected layer of the CNN model, a batch size of 8, and a dropout of 0.2. These hyperparameters are critical to improving the generalization capacity of the model. Furthermore, comparison experiments with the support vector machine (SVM) show that the proposed model with or without dimensionality reduction outperforms the SVM models in the working area. However, the SVM model achieves better accuracy in the testing area than the proposed model without dimensionality reduction. This study generally shows that the use of an autoencoder in DL models can improve the accuracy of building recognition in fused LiDAR–orthophoto data.http://dx.doi.org/10.1155/2018/7212307 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Faten Hamed Nahhas Helmi Z. M. Shafri Maher Ibrahim Sameen Biswajeet Pradhan Shattri Mansor |
spellingShingle |
Faten Hamed Nahhas Helmi Z. M. Shafri Maher Ibrahim Sameen Biswajeet Pradhan Shattri Mansor Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion Journal of Sensors |
author_facet |
Faten Hamed Nahhas Helmi Z. M. Shafri Maher Ibrahim Sameen Biswajeet Pradhan Shattri Mansor |
author_sort |
Faten Hamed Nahhas |
title |
Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion |
title_short |
Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion |
title_full |
Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion |
title_fullStr |
Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion |
title_full_unstemmed |
Deep Learning Approach for Building Detection Using LiDAR–Orthophoto Fusion |
title_sort |
deep learning approach for building detection using lidar–orthophoto fusion |
publisher |
Hindawi Limited |
series |
Journal of Sensors |
issn |
1687-725X 1687-7268 |
publishDate |
2018-01-01 |
description |
This paper reports on a building detection approach based on deep learning (DL) using the fusion of Light Detection and Ranging (LiDAR) data and orthophotos. The proposed method utilized object-based analysis to create objects, a feature-level fusion, an autoencoder-based dimensionality reduction to transform low-level features into compressed features, and a convolutional neural network (CNN) to transform compressed features into high-level features, which were used to classify objects into buildings and background. The proposed architecture was optimized for the grid search method, and its sensitivity to hyperparameters was analyzed and discussed. The proposed model was evaluated on two datasets selected from an urban area with different building types. Results show that the dimensionality reduction by the autoencoder approach from 21 features to 10 features can improve detection accuracy from 86.06% to 86.19% in the working area and from 77.92% to 78.26% in the testing area. The sensitivity analysis also shows that the selection of the hyperparameter values of the model significantly affects detection accuracy. The best hyperparameters of the model are 128 filters in the CNN model, the Adamax optimizer, 10 units in the fully connected layer of the CNN model, a batch size of 8, and a dropout of 0.2. These hyperparameters are critical to improving the generalization capacity of the model. Furthermore, comparison experiments with the support vector machine (SVM) show that the proposed model with or without dimensionality reduction outperforms the SVM models in the working area. However, the SVM model achieves better accuracy in the testing area than the proposed model without dimensionality reduction. This study generally shows that the use of an autoencoder in DL models can improve the accuracy of building recognition in fused LiDAR–orthophoto data. |
url |
http://dx.doi.org/10.1155/2018/7212307 |
work_keys_str_mv |
AT fatenhamednahhas deeplearningapproachforbuildingdetectionusinglidarorthophotofusion AT helmizmshafri deeplearningapproachforbuildingdetectionusinglidarorthophotofusion AT maheribrahimsameen deeplearningapproachforbuildingdetectionusinglidarorthophotofusion AT biswajeetpradhan deeplearningapproachforbuildingdetectionusinglidarorthophotofusion AT shattrimansor deeplearningapproachforbuildingdetectionusinglidarorthophotofusion |
_version_ |
1725106439193624576 |