Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction

碩士 === 國立清華大學 === 資訊工程學系 === 104 === We introduce a method for understanding road scenes and simultaneously predicting the hazard levels of three categories of objects in road scene images by using a fully convolutional network (FCN) architecture. In our approach, with a single input image, the mult...

Full description

Bibliographic Details
Main Authors: Kung, Wen Yao, 龔芠瑤
Other Authors: Chen, Hwann Tzong
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/60494150880821921387
id ndltd-TW-104NTHU5392061
record_format oai_dc
spelling ndltd-TW-104NTHU53920612017-08-27T04:30:16Z http://ndltd.ncl.edu.tw/handle/60494150880821921387 Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction 根據語意式切割與物體危險等級預測的道路場景理解 Kung, Wen Yao 龔芠瑤 碩士 國立清華大學 資訊工程學系 104 We introduce a method for understanding road scenes and simultaneously predicting the hazard levels of three categories of objects in road scene images by using a fully convolutional network (FCN) architecture. In our approach, with a single input image, the multi-task model produces a _ne segmentation result and a prediction of hazard levels in a form of heatmap. The model can be divided into three parts: shared net, segmentation net, and hazard level net. The shared net and segmentation net use the encoder-decoder architecture provided by Badrinarayanan et al . [2]. The hazard level net is a fully convolution network estimating hazard level of a segment with a coarse segmentation result. We also provide a dataset with the object segmentation ground truth and the hazard levels for training and evaluating the proposed deep networks. To prove that our network can learn highly semantic attributes of objects, we use two measurements to evaluate the performance of our method, and compare our method with a saliency-based method to show the difference between predicting hazard levels and estimating human eyes fixations. Chen, Hwann Tzong 陳煥宗 2016 學位論文 ; thesis 35 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 104 === We introduce a method for understanding road scenes and simultaneously predicting the hazard levels of three categories of objects in road scene images by using a fully convolutional network (FCN) architecture. In our approach, with a single input image, the multi-task model produces a _ne segmentation result and a prediction of hazard levels in a form of heatmap. The model can be divided into three parts: shared net, segmentation net, and hazard level net. The shared net and segmentation net use the encoder-decoder architecture provided by Badrinarayanan et al . [2]. The hazard level net is a fully convolution network estimating hazard level of a segment with a coarse segmentation result. We also provide a dataset with the object segmentation ground truth and the hazard levels for training and evaluating the proposed deep networks. To prove that our network can learn highly semantic attributes of objects, we use two measurements to evaluate the performance of our method, and compare our method with a saliency-based method to show the difference between predicting hazard levels and estimating human eyes fixations.
author2 Chen, Hwann Tzong
author_facet Chen, Hwann Tzong
Kung, Wen Yao
龔芠瑤
author Kung, Wen Yao
龔芠瑤
spellingShingle Kung, Wen Yao
龔芠瑤
Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction
author_sort Kung, Wen Yao
title Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction
title_short Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction
title_full Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction
title_fullStr Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction
title_full_unstemmed Road Scene Understanding with Semantic Segmentation and Object Hazard Level Prediction
title_sort road scene understanding with semantic segmentation and object hazard level prediction
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/60494150880821921387
work_keys_str_mv AT kungwenyao roadsceneunderstandingwithsemanticsegmentationandobjecthazardlevelprediction
AT gōngwényáo roadsceneunderstandingwithsemanticsegmentationandobjecthazardlevelprediction
AT kungwenyao gēnjùyǔyìshìqiègēyǔwùtǐwēixiǎnděngjíyùcèdedàolùchǎngjǐnglǐjiě
AT gōngwényáo gēnjùyǔyìshìqiègēyǔwùtǐwēixiǎnděngjíyùcèdedàolùchǎngjǐnglǐjiě
_version_ 1718519367515242496