HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
Semantic segmentation in aerial images has become an indispensable part in remote sensing image understanding for its extensive application prospects. It is crucial to jointly reason the 2-D appearance along with 3-D information and acquire discriminative global context to achieve better segmentatio...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9528946/ |
id |
doaj-ee44af7964bf4ea4868539faa3aabb7b |
---|---|
record_format |
Article |
spelling |
doaj-ee44af7964bf4ea4868539faa3aabb7b2021-09-22T23:00:05ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01149117913110.1109/JSTARS.2021.31094399528946HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial ImagesWenjie Liu0Wenkai Zhang1https://orcid.org/0000-0002-8903-2708Xian Sun2https://orcid.org/0000-0002-0038-9816Zhi Guo3https://orcid.org/0000-0001-5083-3578Kun Fu4Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaSemantic segmentation in aerial images has become an indispensable part in remote sensing image understanding for its extensive application prospects. It is crucial to jointly reason the 2-D appearance along with 3-D information and acquire discriminative global context to achieve better segmentation. However, previous approaches require accurate elevation data (e.g., nDSM and Digital Surface Model (DSM)) as additional inputs to segment semantics, which sorely limits their applications. On the other hand, due to the various forms of objects in complex scenes, the global context is generally dominated by features of salient patterns (e.g., large objects) and tends to smooth inconspicuous patterns (e.g., small stuff and boundaries). In this article, a novel joint framework named height-embedding context reassembly network (HECR-Net) is proposed. First, considering the fact that the corresponding elevation data is insufficient while we still want to exploit the serviceable height information, to alleviate the above data constraint, our method simultaneously predicts semantic labels and height maps from single aerial images by distilling height-aware embeddings implicitly. Second, we introduce a novel context-aware reorganization module to generate a discriminative feature with global context appropriately assigned to each local position. It benefits from both the global context aggregation module for ambiguity eliminating and local feature redistribution module for detailed refinement. Third, we make full use of the learning height-aware embeddings to promote the performance of semantic segmentation via introducing a modality-affinitive propagation block. Finally, without bells and whistles, the segmentation results on ISPRS Vaihingen and Potsdam data set illustrate that the proposed HECR-Net achieves state-of-the-art performance.https://ieeexplore.ieee.org/document/9528946/Aerial imagerycontext-aware reorganizationheight-aware embeddingsmodality-affinitivesemantic segmentation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wenjie Liu Wenkai Zhang Xian Sun Zhi Guo Kun Fu |
spellingShingle |
Wenjie Liu Wenkai Zhang Xian Sun Zhi Guo Kun Fu HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Aerial imagery context-aware reorganization height-aware embeddings modality-affinitive semantic segmentation |
author_facet |
Wenjie Liu Wenkai Zhang Xian Sun Zhi Guo Kun Fu |
author_sort |
Wenjie Liu |
title |
HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images |
title_short |
HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images |
title_full |
HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images |
title_fullStr |
HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images |
title_full_unstemmed |
HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images |
title_sort |
hecr-net: height-embedding context reassembly network for semantic segmentation in aerial images |
publisher |
IEEE |
series |
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
issn |
2151-1535 |
publishDate |
2021-01-01 |
description |
Semantic segmentation in aerial images has become an indispensable part in remote sensing image understanding for its extensive application prospects. It is crucial to jointly reason the 2-D appearance along with 3-D information and acquire discriminative global context to achieve better segmentation. However, previous approaches require accurate elevation data (e.g., nDSM and Digital Surface Model (DSM)) as additional inputs to segment semantics, which sorely limits their applications. On the other hand, due to the various forms of objects in complex scenes, the global context is generally dominated by features of salient patterns (e.g., large objects) and tends to smooth inconspicuous patterns (e.g., small stuff and boundaries). In this article, a novel joint framework named height-embedding context reassembly network (HECR-Net) is proposed. First, considering the fact that the corresponding elevation data is insufficient while we still want to exploit the serviceable height information, to alleviate the above data constraint, our method simultaneously predicts semantic labels and height maps from single aerial images by distilling height-aware embeddings implicitly. Second, we introduce a novel context-aware reorganization module to generate a discriminative feature with global context appropriately assigned to each local position. It benefits from both the global context aggregation module for ambiguity eliminating and local feature redistribution module for detailed refinement. Third, we make full use of the learning height-aware embeddings to promote the performance of semantic segmentation via introducing a modality-affinitive propagation block. Finally, without bells and whistles, the segmentation results on ISPRS Vaihingen and Potsdam data set illustrate that the proposed HECR-Net achieves state-of-the-art performance. |
topic |
Aerial imagery context-aware reorganization height-aware embeddings modality-affinitive semantic segmentation |
url |
https://ieeexplore.ieee.org/document/9528946/ |
work_keys_str_mv |
AT wenjieliu hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT wenkaizhang hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT xiansun hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT zhiguo hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT kunfu hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages |
_version_ |
1717371268563468288 |