HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images

Semantic segmentation in aerial images has become an indispensable part of remote sensing image understanding because of its extensive application prospects. It is crucial to reason jointly about 2-D appearance and 3-D information and to acquire discriminative global context to achieve better segmentatio...

Bibliographic Details
Main Authors: Wenjie Liu, Wenkai Zhang, Xian Sun, Zhi Guo, Kun Fu
Format: Article
Language: English
Published: IEEE 2021-01-01
Series: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
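Subjects: Aerial imagery; context-aware reorganization; height-aware embeddings; modality-affinitive; semantic segmentation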
Online Access: https://ieeexplore.ieee.org/document/9528946/
id doaj-ee44af7964bf4ea4868539faa3aabb7b
record_format Article
spelling doaj-ee44af7964bf4ea4868539faa3aabb7b
2021-09-22T23:00:05Z
eng
IEEE
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
2151-1535
2021-01-01
14
9117
9131
10.1109/JSTARS.2021.3109439
9528946
HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
Wenjie Liu
Wenkai Zhang https://orcid.org/0000-0002-8903-2708
Xian Sun https://orcid.org/0000-0002-0038-9816
Zhi Guo https://orcid.org/0000-0001-5083-3578
Kun Fu
Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China
collection DOAJ
language English
format Article
sources DOAJ
author Wenjie Liu
Wenkai Zhang
Xian Sun
Zhi Guo
Kun Fu
title HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
publisher IEEE
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn 2151-1535
publishDate 2021-01-01
description Semantic segmentation in aerial images has become an indispensable part of remote sensing image understanding because of its extensive application prospects. It is crucial to reason jointly about 2-D appearance and 3-D information and to acquire discriminative global context to achieve better segmentation. However, previous approaches require accurate elevation data (e.g., a normalized Digital Surface Model (nDSM) or a Digital Surface Model (DSM)) as additional input for segmentation, which severely limits their applications. In addition, because objects in complex scenes take varied forms, the global context is generally dominated by features of salient patterns (e.g., large objects) and tends to smooth out inconspicuous patterns (e.g., small objects and boundaries). In this article, a novel joint framework named height-embedding context reassembly network (HECR-Net) is proposed. First, because the corresponding elevation data is often unavailable while height information remains useful, our method simultaneously predicts semantic labels and height maps from single aerial images, implicitly distilling height-aware embeddings. Second, we introduce a novel context-aware reorganization module to generate a discriminative feature with global context appropriately assigned to each local position. It benefits from both a global context aggregation module for ambiguity elimination and a local feature redistribution module for detail refinement. Third, we make full use of the learned height-aware embeddings to improve semantic segmentation performance by introducing a modality-affinitive propagation block. Finally, without bells and whistles, the segmentation results on the ISPRS Vaihingen and Potsdam datasets show that the proposed HECR-Net achieves state-of-the-art performance.
topic Aerial imagery
context-aware reorganization
height-aware embeddings
modality-affinitive
semantic segmentation
url https://ieeexplore.ieee.org/document/9528946/