Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation

Master's === National Tsing Hua University === Department of Computer Science === 104 === Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN mod...

Bibliographic Details
Main Authors: Shih, Tun-Huai, 史敦槐
Other Authors: Hsu, Chiou-Ting
Format: Others
Language: en_US
Published: 2016
Online Access: http://ndltd.ncl.edu.tw/handle/09714437432732089706
id ndltd-TW-104NTHU5392122
record_format oai_dc
spelling ndltd-TW-104NTHU5392122 2017-08-27T04:30:35Z http://ndltd.ncl.edu.tw/handle/09714437432732089706 Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation 透過跨層併列與多尺度預測的完全卷積網路之語意分割 Shih, Tun-Huai 史敦槐 Master's National Tsing Hua University Department of Computer Science 104 Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN models. As a result, they usually require larger training datasets and spend more time in both the training and inference stages. In contrast to recent complex CNN-based approaches, we propose to simplify an existing CNN architecture, VGG-16, without compromising segmentation performance. First, we propose a basic model that replaces the original fully-connected layers with several convolutional and pooling layers for extracting hierarchical features. We then use the extracted hierarchical features to generate multi-scale predictions, and aggregate all predictions to derive one dense prediction result. Furthermore, we extend the basic model with cross-layer feature concatenation to jointly exploit the information from lower- and higher-level layers. Experimental results show that, with only one-fourth of the parameters of the original VGG and no post-processing or Conditional Random Field refinement, the proposed model achieves comparable results on three popular datasets: SIFT Flow, Pascal VOC 2012, and Pascal Context. Hsu, Chiou-Ting 許秋婷 2016 學位論文 ; thesis 30 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's === National Tsing Hua University === Department of Computer Science === 104 === Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN models. As a result, they usually require larger training datasets and spend more time in both the training and inference stages. In contrast to recent complex CNN-based approaches, we propose to simplify an existing CNN architecture, VGG-16, without compromising segmentation performance. First, we propose a basic model that replaces the original fully-connected layers with several convolutional and pooling layers for extracting hierarchical features. We then use the extracted hierarchical features to generate multi-scale predictions, and aggregate all predictions to derive one dense prediction result. Furthermore, we extend the basic model with cross-layer feature concatenation to jointly exploit the information from lower- and higher-level layers. Experimental results show that, with only one-fourth of the parameters of the original VGG and no post-processing or Conditional Random Field refinement, the proposed model achieves comparable results on three popular datasets: SIFT Flow, Pascal VOC 2012, and Pascal Context.
author2 Hsu, Chiou-Ting
author_facet Hsu, Chiou-Ting
Shih, Tun-Huai
史敦槐
author Shih, Tun-Huai
史敦槐
spellingShingle Shih, Tun-Huai
史敦槐
Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
author_sort Shih, Tun-Huai
title Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_short Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_full Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_fullStr Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_full_unstemmed Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
title_sort fully convolutional networks with cross-layer concatenation and multi-scale prediction for semantic segmentation
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/09714437432732089706
work_keys_str_mv AT shihtunhuai fullyconvolutionalnetworkswithcrosslayerconcatenationandmultiscalepredictionforsemanticsegmentation
AT shǐdūnhuái fullyconvolutionalnetworkswithcrosslayerconcatenationandmultiscalepredictionforsemanticsegmentation
AT shihtunhuai tòuguòkuàcéngbìnglièyǔduōchǐdùyùcèdewánquánjuǎnjīwǎnglùzhīyǔyìfēngē
AT shǐdūnhuái tòuguòkuàcéngbìnglièyǔduōchǐdùyùcèdewánquánjuǎnjīwǎnglùzhīyǔyìfēngē
_version_ 1718520160328876032