JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
Weakly supervised semantic segmentation in aerial images has attracted growing research attention due to the significant saving in annotation cost. Most of the current approaches are based on one specific pseudo label. These methods easily overfit the wrongly labeled pixels from noisy label and limi...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-09-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/12/19/3169 |
id |
doaj-1b1ab6b680c148fb83f087958a8c124d |
---|---|
record_format |
Article |
spelling |
doaj-1b1ab6b680c148fb83f087958a8c124d2020-11-25T02:48:39ZengMDPI AGRemote Sensing2072-42922020-09-01123169316910.3390/rs12193169JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial ImagesRongxin Guo0Xian Sun1Kaiqiang Chen2Xiao Zhou3Zhiyuan Yan4Wenhui Diao5Menglong Yan6Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, ChinaWeakly supervised semantic segmentation in aerial images has attracted growing research attention due to the significant saving in annotation cost. Most of the current approaches are based on one specific pseudo label. These methods easily overfit the wrongly labeled pixels from noisy label and limit the performance and generalization of the segmentation model. To tackle these problems, we propose a novel joint multi-label learning network (JMLNet) to help the model learn common knowledge from multiple noisy labels and prevent the model from overfitting one specific label. Our combination strategy of multiple proposals is that we regard them all as ground truth and propose three new multi-label losses to use the multi-label guide segmentation model in the training process. JMLNet also contains two methods to generate high-quality proposals, which further improve the performance of the segmentation task. First we propose a detection-based GradCAM (GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula>) to generate segmentation proposals from object detectors. Then we use GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula> to adjust the GrabCut algorithm and generate segmentation proposals (GrabCut<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>C</mi></msup></semantics></math></inline-formula>). We report the state-of-the-art results on the semantic segmentation task of iSAID and mapping challenge dataset when training with bounding boxes annotations.https://www.mdpi.com/2072-4292/12/19/3169deep learningimage segmentationweak supervisionaerial imagemulti-label learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rongxin Guo Xian Sun Kaiqiang Chen Xiao Zhou Zhiyuan Yan Wenhui Diao Menglong Yan |
spellingShingle |
Rongxin Guo Xian Sun Kaiqiang Chen Xiao Zhou Zhiyuan Yan Wenhui Diao Menglong Yan JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images Remote Sensing deep learning image segmentation weak supervision aerial image multi-label learning |
author_facet |
Rongxin Guo Xian Sun Kaiqiang Chen Xiao Zhou Zhiyuan Yan Wenhui Diao Menglong Yan |
author_sort |
Rongxin Guo |
title |
JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images |
title_short |
JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images |
title_full |
JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images |
title_fullStr |
JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images |
title_full_unstemmed |
JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images |
title_sort |
jmlnet: joint multi-label learning network for weakly supervised semantic segmentation in aerial images |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2020-09-01 |
description |
Weakly supervised semantic segmentation in aerial images has attracted growing research attention due to the significant saving in annotation cost. Most of the current approaches are based on one specific pseudo label. These methods easily overfit the wrongly labeled pixels from noisy label and limit the performance and generalization of the segmentation model. To tackle these problems, we propose a novel joint multi-label learning network (JMLNet) to help the model learn common knowledge from multiple noisy labels and prevent the model from overfitting one specific label. Our combination strategy of multiple proposals is that we regard them all as ground truth and propose three new multi-label losses to use the multi-label guide segmentation model in the training process. JMLNet also contains two methods to generate high-quality proposals, which further improve the performance of the segmentation task. First we propose a detection-based GradCAM (GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula>) to generate segmentation proposals from object detectors. Then we use GradCAM<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>D</mi></msup></semantics></math></inline-formula> to adjust the GrabCut algorithm and generate segmentation proposals (GrabCut<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mi>C</mi></msup></semantics></math></inline-formula>). We report the state-of-the-art results on the semantic segmentation task of iSAID and mapping challenge dataset when training with bounding boxes annotations. |
topic |
deep learning image segmentation weak supervision aerial image multi-label learning |
url |
https://www.mdpi.com/2072-4292/12/19/3169 |
work_keys_str_mv |
AT rongxinguo jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages AT xiansun jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages AT kaiqiangchen jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages AT xiaozhou jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages AT zhiyuanyan jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages AT wenhuidiao jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages AT menglongyan jmlnetjointmultilabellearningnetworkforweaklysupervisedsemanticsegmentationinaerialimages |
_version_ |
1724747250079367168 |