Unsupervised Monocular Depth Estimation Based on Residual Neural Network of Coarse–Refined Feature Extractions for Drone

To take full advantage of the information of images captured by drones and given that most existing monocular depth estimation methods based on supervised learning require vast quantities of corresponding ground truth depth data for training, the model of unsupervised monocular depth estimation base...

Bibliographic Details
Main Authors:	Tao Huang, Shuanfeng Zhao, Longlong Geng, Qian Xu
Format:	Article
Language:	English
Published:	MDPI AG 2019-10-01
Series:	Electronics
Subjects:	unsupervised; residual neural network; image reconstruction; monocular depth estimation
Online Access:	https://www.mdpi.com/2079-9292/8/10/1179

id	doaj-74408b0c838e4889a5bf0514dd299d5a
record_format	Article
spelling	doaj-74408b0c838e4889a5bf0514dd299d5a; 2020-11-25T01:18:41Z; eng; MDPI AG; Electronics; ISSN 2079-9292; 2019-10-01; vol. 8, no. 10, art. 1179; DOI 10.3390/electronics8101179; Tao Huang, Shuanfeng Zhao, Longlong Geng, Qian Xu (all: School of Mechanical Engineering, Xi’an University of Science and Technology, Xi’an 710054, China); https://www.mdpi.com/2079-9292/8/10/1179; unsupervised; residual neural network; image reconstruction; monocular depth estimation
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Tao Huang, Shuanfeng Zhao, Longlong Geng, Qian Xu
title	Unsupervised Monocular Depth Estimation Based on Residual Neural Network of Coarse–Refined Feature Extractions for Drone
publisher	MDPI AG
series	Electronics
issn	2079-9292
publishDate	2019-10-01
description	Most existing monocular depth estimation methods rely on supervised learning and require vast quantities of ground truth depth data for training. To make full use of the images captured by drones without such data, this paper proposes an unsupervised monocular depth estimation model based on a residual neural network of coarse–refined feature extractions. Inspired by the principle of binocular depth estimation, the model introduces a virtual camera through a deep residual convolutional neural network, which turns unsupervised monocular depth estimation into an image reconstruction problem. Three innovations are proposed to improve the model's performance. First, pyramid processing of the input image builds a topological relationship between the resolution and the depth of the input image, which improves sensitivity to depth information in a single image and reduces the impact of input resolution on depth estimation. Second, a residual neural network of coarse–refined feature extractions for corresponding image reconstruction is designed to improve the accuracy of feature extraction and to resolve the trade-off between computation time and the number of network layers. In addition, to predict highly detailed depth maps, long skip connections link corresponding layers of the coarse feature extraction network and the deconvolution network for refined feature extraction. Third, a corresponding image reconstruction loss based on the structural similarity index (SSIM), an approximate disparity smoothness loss, and a depth map loss are combined into a novel training loss. Experimental results show that the model outperforms state-of-the-art monocular depth estimation methods on the KITTI dataset, which consists of corresponding left and right views, and on the Make3D dataset, which consists of images with corresponding ground truth depth maps. When trained on KITTI, the model basically meets the requirements for depth information from images captured by drones.
topic	unsupervised; residual neural network; image reconstruction; monocular depth estimation
url	https://www.mdpi.com/2079-9292/8/10/1179

Similar Items