Scene Understanding Based on High-Order Potentials and Generative Adversarial Networks
Scene understanding is to predict a class label at each pixel of an image. In this study, we propose a semantic segmentation framework based on classic generative adversarial nets (GAN) to train a fully convolutional semantic segmentation model along with an adversarial network. To improve the consi...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2018-01-01
|
Series: | Advances in Multimedia |
Online Access: | http://dx.doi.org/10.1155/2018/8207201 |
id |
doaj-c39e1f67b2034cb3a2d597d0774c6f49 |
---|---|
record_format |
Article |
spelling |
doaj-c39e1f67b2034cb3a2d597d0774c6f492020-11-25T02:51:30ZengHindawi LimitedAdvances in Multimedia1687-56801687-56992018-01-01201810.1155/2018/82072018207201Scene Understanding Based on High-Order Potentials and Generative Adversarial NetworksXiaoli Zhao0Guozhong Wang1Jiaqi Zhang2Xiang Zhang3School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, ChinaSchool of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, ChinaSchool of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, ChinaSchool of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, ChinaScene understanding is to predict a class label at each pixel of an image. In this study, we propose a semantic segmentation framework based on classic generative adversarial nets (GAN) to train a fully convolutional semantic segmentation model along with an adversarial network. To improve the consistency of the segmented image, the high-order potentials, instead of unary or pairwise potentials, are adopted. We realize the high-order potentials by substituting adversarial network for CRF model, which can continuously improve the consistency and details of the segmented semantic image until it cannot discriminate the segmented result from the ground truth. A number of experiments are conducted on PASCAL VOC 2012 and Cityscapes datasets, and the quantitative and qualitative assessments have shown the effectiveness of our proposed approach.http://dx.doi.org/10.1155/2018/8207201 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaoli Zhao Guozhong Wang Jiaqi Zhang Xiang Zhang |
spellingShingle |
Xiaoli Zhao Guozhong Wang Jiaqi Zhang Xiang Zhang Scene Understanding Based on High-Order Potentials and Generative Adversarial Networks Advances in Multimedia |
author_facet |
Xiaoli Zhao Guozhong Wang Jiaqi Zhang Xiang Zhang |
author_sort |
Xiaoli Zhao |
title |
Scene Understanding Based on High-Order Potentials and Generative Adversarial Networks |
title_short |
Scene Understanding Based on High-Order Potentials and Generative Adversarial Networks |
title_full |
Scene Understanding Based on High-Order Potentials and Generative Adversarial Networks |
title_fullStr |
Scene Understanding Based on High-Order Potentials and Generative Adversarial Networks |
title_full_unstemmed |
Scene Understanding Based on High-Order Potentials and Generative Adversarial Networks |
title_sort |
scene understanding based on high-order potentials and generative adversarial networks |
publisher |
Hindawi Limited |
series |
Advances in Multimedia |
issn |
1687-5680 1687-5699 |
publishDate |
2018-01-01 |
description |
Scene understanding is to predict a class label at each pixel of an image. In this study, we propose a semantic segmentation framework based on classic generative adversarial nets (GAN) to train a fully convolutional semantic segmentation model along with an adversarial network. To improve the consistency of the segmented image, the high-order potentials, instead of unary or pairwise potentials, are adopted. We realize the high-order potentials by substituting adversarial network for CRF model, which can continuously improve the consistency and details of the segmented semantic image until it cannot discriminate the segmented result from the ground truth. A number of experiments are conducted on PASCAL VOC 2012 and Cityscapes datasets, and the quantitative and qualitative assessments have shown the effectiveness of our proposed approach. |
url |
http://dx.doi.org/10.1155/2018/8207201 |
work_keys_str_mv |
AT xiaolizhao sceneunderstandingbasedonhighorderpotentialsandgenerativeadversarialnetworks AT guozhongwang sceneunderstandingbasedonhighorderpotentialsandgenerativeadversarialnetworks AT jiaqizhang sceneunderstandingbasedonhighorderpotentialsandgenerativeadversarialnetworks AT xiangzhang sceneunderstandingbasedonhighorderpotentialsandgenerativeadversarialnetworks |
_version_ |
1724734235630108672 |