Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN mod...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/09714437432732089706 |
id |
ndltd-TW-104NTHU5392122 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NTHU53921222017-08-27T04:30:35Z http://ndltd.ncl.edu.tw/handle/09714437432732089706 Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation 透過跨層併列與多尺度預測的完全卷積網路之語意分割 Shih, Tun-Huai 史敦槐 碩士 國立清華大學 資訊工程學系 104 Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN models. As the result, they usually require larger training dataset and spend more time on both training and inference stages. In contrast to recent complex CNN-based approaches, we propose to simplify an existing CNN architecture, VGG-16, but do not compromise the segmentation performance. Firstly, we propose a basic model by replacing the original fully-connected layers with several convolutional and pooling layers for extracting hierarchical features. We then use the extracted hierarchical features to generate multi-scale predictions, and aggregate all predictions to derive one dense prediction result. Furthermore, we extend the basic model with cross-layer feature concatenation to jointly exploit the information from lower- and higher-level layers. Experimental results show that with only one-fourth the parameters of the original VGG and no post-processing or Conditional Random Field refinement, the proposed model achieves comparable results on three popular datasets: SIFT Flow, Pascal VOC 2012, and Pascal Context. Hsu, Chiou-Ting 許秋婷 2016 學位論文 ; thesis 30 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Semantic image segmentation aims to assign a semantic label to each pixel in an image. Recent state-of-the-art approaches are mainly based on Convolutional Neural Networks. Although these approaches achieve outstanding performance, they adopt very complex CNN models. As the result, they usually require larger training dataset and spend more time on both training and inference stages. In contrast to recent complex CNN-based approaches, we propose to simplify an existing CNN architecture, VGG-16, but do not compromise the segmentation performance. Firstly, we propose a basic model by replacing the original fully-connected layers with several convolutional and pooling layers for extracting hierarchical features. We then use the extracted hierarchical features to generate multi-scale predictions, and aggregate all predictions to derive one dense prediction result. Furthermore, we extend the basic model with cross-layer feature concatenation to jointly exploit the information from lower- and higher-level layers. Experimental results show that with only one-fourth the parameters of the original VGG and no post-processing or Conditional Random Field refinement, the proposed model achieves comparable results on three popular datasets: SIFT Flow, Pascal VOC 2012, and Pascal Context.
|
author2 |
Hsu, Chiou-Ting |
author_facet |
Hsu, Chiou-Ting Shih, Tun-Huai 史敦槐 |
author |
Shih, Tun-Huai 史敦槐 |
spellingShingle |
Shih, Tun-Huai 史敦槐 Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation |
author_sort |
Shih, Tun-Huai |
title |
Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation |
title_short |
Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation |
title_full |
Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation |
title_fullStr |
Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation |
title_full_unstemmed |
Fully Convolutional Networks with Cross-layer Concatenation and Multi-Scale Prediction for Semantic Segmentation |
title_sort |
fully convolutional networks with cross-layer concatenation and multi-scale prediction for semantic segmentation |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/09714437432732089706 |
work_keys_str_mv |
AT shihtunhuai fullyconvolutionalnetworkswithcrosslayerconcatenationandmultiscalepredictionforsemanticsegmentation AT shǐdūnhuái fullyconvolutionalnetworkswithcrosslayerconcatenationandmultiscalepredictionforsemanticsegmentation AT shihtunhuai tòuguòkuàcéngbìnglièyǔduōchǐdùyùcèdewánquánjuǎnjīwǎnglùzhīyǔyìfēngē AT shǐdūnhuái tòuguòkuàcéngbìnglièyǔduōchǐdùyùcèdewánquánjuǎnjīwǎnglùzhīyǔyìfēngē |
_version_ |
1718520160328876032 |