JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images

Weakly supervised semantic segmentation in aerial images has attracted growing research attention due to the significant saving in annotation cost. Most of the current approaches are based on one specific pseudo label. These methods easily overfit the wrongly labeled pixels from noisy label and limi...

Full description

Bibliographic Details
Main Authors: Rongxin Guo, Xian Sun, Kaiqiang Chen, Xiao Zhou, Zhiyuan Yan, Wenhui Diao, Menglong Yan
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/19/3169
id doaj-1b1ab6b680c148fb83f087958a8c124d
record_format Article
spelling doaj-1b1ab6b680c148fb83f087958a8c124d2020-11-25T02:48:39ZengMDPI AGRemote Sensing2072-42922020-09-01123169316910.3390/rs12193169JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial ImagesRongxin Guo0Xian Sun1Kaiqiang Chen2Xiao Zhou3Zhiyuan Yan4Wenhui Diao5Menglong Yan6Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaWeakly supervised semantic segmentation in aerial images has attracted growing research attention due to the significant saving in annotation cost. Most of the current approaches are based on one specific pseudo label. These methods easily overfit the wrongly labeled pixels from noisy label and limit the performance and generalization of the segmentation model. To tackle these problems, we propose a novel joint multi-label learning network (JMLNet) to help the model learn common knowledge from multiple noisy labels and prevent the model from overfitting one specific label. Our combination strategy of multiple proposals is that we regard them all as ground truth and propose three new multi-label losses to use the multi-label guide segmentation model in the training process. JMLNet also contains two methods to generate high-quality proposals, which further improve the performance of the segmentation task. First we propose a detection-based GradCAM (GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula>) to generate segmentation proposals from object detectors. Then we use GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula> to adjust the GrabCut algorithm and generate segmentation proposals (GrabCut<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>C</mi></msup></semantics></math></inline-formula>). We report the state-of-the-art results on the semantic segmentation task of iSAID and mapping challenge dataset when training with bounding boxes annotations.https://www.mdpi.com/2072-4292/12/19/3169deep learningimage segmentationweak supervisionaerial imagemulti-label learning
collection DOAJ
language English
format Article
sources DOAJ
author Rongxin Guo
Xian Sun
Kaiqiang Chen
Xiao Zhou
Zhiyuan Yan
Wenhui Diao
Menglong Yan
spellingShingle Rongxin Guo
Xian Sun
Kaiqiang Chen
Xiao Zhou
Zhiyuan Yan
Wenhui Diao
Menglong Yan
JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
Remote Sensing
deep learning
image segmentation
weak supervision
aerial image
multi-label learning
author_facet Rongxin Guo
Xian Sun
Kaiqiang Chen
Xiao Zhou
Zhiyuan Yan
Wenhui Diao
Menglong Yan
author_sort Rongxin Guo
title JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
title_short JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
title_full JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
title_fullStr JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
title_full_unstemmed JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
title_sort jmlnet: joint multi-label learning network for weakly supervised semantic segmentation in aerial images
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-09-01
description Weakly supervised semantic segmentation in aerial images has attracted growing research attention due to the significant saving in annotation cost. Most of the current approaches are based on one specific pseudo label. These methods easily overfit the wrongly labeled pixels from noisy label and limit the performance and generalization of the segmentation model. To tackle these problems, we propose a novel joint multi-label learning network (JMLNet) to help the model learn common knowledge from multiple noisy labels and prevent the model from overfitting one specific label. Our combination strategy of multiple proposals is that we regard them all as ground truth and propose three new multi-label losses to use the multi-label guide segmentation model in the training process. JMLNet also contains two methods to generate high-quality proposals, which further improve the performance of the segmentation task. First we propose a detection-based GradCAM (GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula>) to generate segmentation proposals from object detectors. Then we use GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula> to adjust the GrabCut algorithm and generate segmentation proposals (GrabCut<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>C</mi></msup></semantics></math></inline-formula>). We report the state-of-the-art results on the semantic segmentation task of iSAID and mapping challenge dataset when training with bounding boxes annotations.
topic deep learning
image segmentation
weak supervision
aerial image
multi-label learning
url https://www.mdpi.com/2072-4292/12/19/3169
work_keys_str_mv AT rongxinguo jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages
AT xiansun jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages
AT kaiqiangchen jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages
AT xiaozhou jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages
AT zhiyuanyan jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages
AT wenhuidiao jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages
AT menglongyan jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages
_version_ 1724747250079367168