HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images

Semantic segmentation in aerial images has become an indispensable part in remote sensing image understanding for its extensive application prospects. It is crucial to jointly reason the 2-D appearance along with 3-D information and acquire discriminative global context to achieve better segmentatio...

Full description

Bibliographic Details
Main Authors:	Wenjie Liu, Wenkai Zhang, Xian Sun, Zhi Guo, Kun Fu
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:	Aerial imagery context-aware reorganization height-aware embeddings modality-affinitive semantic segmentation
Online Access:	https://ieeexplore.ieee.org/document/9528946/

id	doaj-ee44af7964bf4ea4868539faa3aabb7b
record_format	Article
spelling	doaj-ee44af7964bf4ea4868539faa3aabb7b2021-09-22T23:00:05ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01149117913110.1109/JSTARS.2021.31094399528946HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial ImagesWenjie Liu0Wenkai Zhang1https://orcid.org/0000-0002-8903-2708Xian Sun2https://orcid.org/0000-0002-0038-9816Zhi Guo3https://orcid.org/0000-0001-5083-3578Kun Fu4Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaSemantic segmentation in aerial images has become an indispensable part in remote sensing image understanding for its extensive application prospects. It is crucial to jointly reason the 2-D appearance along with 3-D information and acquire discriminative global context to achieve better segmentation. However, previous approaches require accurate elevation data (e.g., nDSM and Digital Surface Model (DSM)) as additional inputs to segment semantics, which sorely limits their applications. On the other hand, due to the various forms of objects in complex scenes, the global context is generally dominated by features of salient patterns (e.g., large objects) and tends to smooth inconspicuous patterns (e.g., small stuff and boundaries). In this article, a novel joint framework named height-embedding context reassembly network (HECR-Net) is proposed. First, considering the fact that the corresponding elevation data is insufficient while we still want to exploit the serviceable height information, to alleviate the above data constraint, our method simultaneously predicts semantic labels and height maps from single aerial images by distilling height-aware embeddings implicitly. Second, we introduce a novel context-aware reorganization module to generate a discriminative feature with global context appropriately assigned to each local position. It benefits from both the global context aggregation module for ambiguity eliminating and local feature redistribution module for detailed refinement. Third, we make full use of the learning height-aware embeddings to promote the performance of semantic segmentation via introducing a modality-affinitive propagation block. Finally, without bells and whistles, the segmentation results on ISPRS Vaihingen and Potsdam data set illustrate that the proposed HECR-Net achieves state-of-the-art performance.https://ieeexplore.ieee.org/document/9528946/Aerial imagerycontext-aware reorganizationheight-aware embeddingsmodality-affinitivesemantic segmentation
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Wenjie Liu Wenkai Zhang Xian Sun Zhi Guo Kun Fu
spellingShingle	Wenjie Liu Wenkai Zhang Xian Sun Zhi Guo Kun Fu HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Aerial imagery context-aware reorganization height-aware embeddings modality-affinitive semantic segmentation
author_facet	Wenjie Liu Wenkai Zhang Xian Sun Zhi Guo Kun Fu
author_sort	Wenjie Liu
title	HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
title_short	HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
title_full	HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
title_fullStr	HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
title_full_unstemmed	HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images
title_sort	hecr-net: height-embedding context reassembly network for semantic segmentation in aerial images
publisher	IEEE
series	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn	2151-1535
publishDate	2021-01-01
description	Semantic segmentation in aerial images has become an indispensable part in remote sensing image understanding for its extensive application prospects. It is crucial to jointly reason the 2-D appearance along with 3-D information and acquire discriminative global context to achieve better segmentation. However, previous approaches require accurate elevation data (e.g., nDSM and Digital Surface Model (DSM)) as additional inputs to segment semantics, which sorely limits their applications. On the other hand, due to the various forms of objects in complex scenes, the global context is generally dominated by features of salient patterns (e.g., large objects) and tends to smooth inconspicuous patterns (e.g., small stuff and boundaries). In this article, a novel joint framework named height-embedding context reassembly network (HECR-Net) is proposed. First, considering the fact that the corresponding elevation data is insufficient while we still want to exploit the serviceable height information, to alleviate the above data constraint, our method simultaneously predicts semantic labels and height maps from single aerial images by distilling height-aware embeddings implicitly. Second, we introduce a novel context-aware reorganization module to generate a discriminative feature with global context appropriately assigned to each local position. It benefits from both the global context aggregation module for ambiguity eliminating and local feature redistribution module for detailed refinement. Third, we make full use of the learning height-aware embeddings to promote the performance of semantic segmentation via introducing a modality-affinitive propagation block. Finally, without bells and whistles, the segmentation results on ISPRS Vaihingen and Potsdam data set illustrate that the proposed HECR-Net achieves state-of-the-art performance.
topic	Aerial imagery context-aware reorganization height-aware embeddings modality-affinitive semantic segmentation
url	https://ieeexplore.ieee.org/document/9528946/
work_keys_str_mv	AT wenjieliu hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT wenkaizhang hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT xiansun hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT zhiguo hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages AT kunfu hecrnetheightembeddingcontextreassemblynetworkforsemanticsegmentationinaerialimages
_version_	1717371268563468288

HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images

Similar Items