Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation

碩士 === 國立清華大學 === 資訊工程學系 === 104 === Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN mod...

Full description

Bibliographic Details
Main Authors:	Shih, Tun-Huai, 史敦槐
Other Authors:	Hsu, Chiou-Ting
Format:	Others
Language:	en_US
Published:	2016
Online Access:	http://ndltd.ncl.edu.tw/handle/09714437432732089706

id	ndltd-TW-104NTHU5392122
record_format	oai_dc
spelling	ndltd-TW-104NTHU53921222017-08-27T04:30:35Z http://ndltd.ncl.edu.tw/handle/09714437432732089706 Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation 透過跨層併列與多尺度預測的完全卷積網路之語意分割 Shih, Tun-Huai 史敦槐碩士國立清華大學資訊工程學系 104 Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN models. As the result, they usually require larger training dataset and spend more time on both training and inference stages. In contrast to recent complex CNN-based approaches, we propose to simplify an existing CNN architecture, VGG-16, but do not compromise the segmentation performance. Firstly, we propose a basic model by replacing the original fully-connected layers with several convolutional and pooling layers for extracting hierarchical features. We then use the extracted hierarchical features to generate multi-scale predictions, and aggregate all predictions to derive one dense prediction result. Furthermore, we extend the basic model with cross-layer feature concatenation to jointly exploit the information from lower- and higher-level layers. Experimental results show that with only one-fourth the parameters of the original VGG and no post-processing or Conditional Random Field refinement, the proposed model achieves comparable results on three popular datasets: SIFT Flow, Pascal VOC 2012, and Pascal Context. Hsu, Chiou-Ting 許秋婷 2016 學位論文 ; thesis 30 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立清華大學 === 資訊工程學系 === 104 === Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN models. As the result, they usually require larger training dataset and spend more time on both training and inference stages. In contrast to recent complex CNN-based approaches, we propose to simplify an existing CNN architecture, VGG-16, but do not compromise the segmentation performance. Firstly, we propose a basic model by replacing the original fully-connected layers with several convolutional and pooling layers for extracting hierarchical features. We then use the extracted hierarchical features to generate multi-scale predictions, and aggregate all predictions to derive one dense prediction result. Furthermore, we extend the basic model with cross-layer feature concatenation to jointly exploit the information from lower- and higher-level layers. Experimental results show that with only one-fourth the parameters of the original VGG and no post-processing or Conditional Random Field refinement, the proposed model achieves comparable results on three popular datasets: SIFT Flow, Pascal VOC 2012, and Pascal Context.
author2	Hsu, Chiou-Ting
author_facet	Hsu, Chiou-Ting Shih, Tun-Huai 史敦槐
author	Shih, Tun-Huai 史敦槐
spellingShingle	Shih, Tun-Huai 史敦槐 Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
author_sort	Shih, Tun-Huai
title	Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_short	Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_full	Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_fullStr	Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_full_unstemmed	Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_sort	fully convolutional networks with cross-layer concatenation and multi-scale prediction for semantic segmentation
publishDate	2016
url	http://ndltd.ncl.edu.tw/handle/09714437432732089706
work_keys_str_mv	AT shihtunhuai fullyconvolutionalnetworkswithcrosslayerconcatenationandmultiscalepredictionforsemanticsegmentation AT shǐdūnhuái fullyconvolutionalnetworkswithcrosslayerconcatenationandmultiscalepredictionforsemanticsegmentation AT shihtunhuai tòuguòkuàcéngbìnglièyǔduōchǐdùyùcèdewánquánjuǎnjīwǎnglùzhīyǔyìfēngē AT shǐdūnhuái tòuguòkuàcéngbìnglièyǔduōchǐdùyùcèdewánquánjuǎnjīwǎnglùzhīyǔyìfēngē
_version_	1718520160328876032

Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation

Similar Items