Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network

An advanced driving assistant system is one of the most popular topics nowadays, and depth estimation is an important cue for advanced driving assistant system. Depth prediction is a key problem in understanding the geometry of a road scene for advanced driving assistant system. In comparison to oth...

Full description

Bibliographic Details
Main Authors: Haixia Wang, Yehao Sun, Zhiguo Zhang, Xiao Lu, Chunyang Sheng
Format: Article
Language:English
Published: SAGE Publishing 2020-05-01
Series:International Journal of Advanced Robotic Systems
Online Access:https://doi.org/10.1177/1729881420925305
id doaj-0fc43c2afcce4dada9b4ce5c0d8dab58
record_format Article
spelling doaj-0fc43c2afcce4dada9b4ce5c0d8dab582020-11-25T03:20:17ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142020-05-011710.1177/1729881420925305Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural networkHaixia WangYehao SunZhiguo ZhangXiao LuChunyang ShengAn advanced driving assistant system is one of the most popular topics nowadays, and depth estimation is an important cue for advanced driving assistant system. Depth prediction is a key problem in understanding the geometry of a road scene for advanced driving assistant system. In comparison to other depth estimation methods using stereo depth perception, determining depth relation using a monocular camera is considerably challenging. In this article, a fully convolutional neural network with skip connection based on a monocular video sequence is proposed. With the integration framework that combines skip connection, fully convolutional network and the consistency between consecutive frames of the input sequence, high-resolution depth maps are obtained with lightweight network training and fewer computations. The proposed method models depth estimation as a regression problem and trains the proposed network using a scale invariance optimization based on L2 loss function, which measures the relationships between points in the consecutive frames. The proposed method can be used for depth estimation of a road scene without the need for any extra information or geometric priors. Experiments on road scene data sets demonstrate that the proposed approach outperforms previous methods for monocular depth estimation in dynamic scenes. Compared with the currently proposed method, our method has achieved good results when using the Eigen split evaluation method. The obvious prominent one is that the linear root mean squared error result is 3.462 and the δ < 1.25 result is 0.892.https://doi.org/10.1177/1729881420925305
collection DOAJ
language English
format Article
sources DOAJ
author Haixia Wang
Yehao Sun
Zhiguo Zhang
Xiao Lu
Chunyang Sheng
spellingShingle Haixia Wang
Yehao Sun
Zhiguo Zhang
Xiao Lu
Chunyang Sheng
Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
International Journal of Advanced Robotic Systems
author_facet Haixia Wang
Yehao Sun
Zhiguo Zhang
Xiao Lu
Chunyang Sheng
author_sort Haixia Wang
title Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
title_short Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
title_full Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
title_fullStr Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
title_full_unstemmed Depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
title_sort depth estimation for a road scene using a monocular image sequence based on fully convolutional neural network
publisher SAGE Publishing
series International Journal of Advanced Robotic Systems
issn 1729-8814
publishDate 2020-05-01
description An advanced driving assistant system is one of the most popular topics nowadays, and depth estimation is an important cue for advanced driving assistant system. Depth prediction is a key problem in understanding the geometry of a road scene for advanced driving assistant system. In comparison to other depth estimation methods using stereo depth perception, determining depth relation using a monocular camera is considerably challenging. In this article, a fully convolutional neural network with skip connection based on a monocular video sequence is proposed. With the integration framework that combines skip connection, fully convolutional network and the consistency between consecutive frames of the input sequence, high-resolution depth maps are obtained with lightweight network training and fewer computations. The proposed method models depth estimation as a regression problem and trains the proposed network using a scale invariance optimization based on L2 loss function, which measures the relationships between points in the consecutive frames. The proposed method can be used for depth estimation of a road scene without the need for any extra information or geometric priors. Experiments on road scene data sets demonstrate that the proposed approach outperforms previous methods for monocular depth estimation in dynamic scenes. Compared with the currently proposed method, our method has achieved good results when using the Eigen split evaluation method. The obvious prominent one is that the linear root mean squared error result is 3.462 and the δ < 1.25 result is 0.892.
url https://doi.org/10.1177/1729881420925305
work_keys_str_mv AT haixiawang depthestimationforaroadsceneusingamonocularimagesequencebasedonfullyconvolutionalneuralnetwork
AT yehaosun depthestimationforaroadsceneusingamonocularimagesequencebasedonfullyconvolutionalneuralnetwork
AT zhiguozhang depthestimationforaroadsceneusingamonocularimagesequencebasedonfullyconvolutionalneuralnetwork
AT xiaolu depthestimationforaroadsceneusingamonocularimagesequencebasedonfullyconvolutionalneuralnetwork
AT chunyangsheng depthestimationforaroadsceneusingamonocularimagesequencebasedonfullyconvolutionalneuralnetwork
_version_ 1724618423441293312