Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects
As a result of the large semantic gap between the low-level features and the high-level semantics, scene understanding is a challenging task for high satellite resolution images. To achieve scene understanding, we need to know the contents of the scene. However, most of the existing scene classifica...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2017-10-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/9/10/1030 |
id |
doaj-93ea8930c3ef46f9b890eeeda0547771 |
---|---|
record_format |
Article |
spelling |
doaj-93ea8930c3ef46f9b890eeeda05477712020-11-24T21:54:11ZengMDPI AGRemote Sensing2072-42922017-10-01910103010.3390/rs9101030rs9101030Scene Semantic Understanding Based on the Spatial Context Relations of Multiple ObjectsYanfei Zhong0Siqi Wu1Bei Zhao2State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, ChinaState Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, ChinaDepartment of Geography & Resource Management, The Chinese University of Hong Kong, Hong Kong, ChinaAs a result of the large semantic gap between the low-level features and the high-level semantics, scene understanding is a challenging task for high satellite resolution images. To achieve scene understanding, we need to know the contents of the scene. However, most of the existing scene classification methods, such as the bag-of-visual-words model (BoVW), feature coding, topic models, and neural networks, can only classify the scene while ignoring the components and the semantic and spatial relations between these components. Therefore, in this paper, a bottom-up scene understanding framework based on the multi-object spatial context relationship model (MOSCRF) is proposed to combine the co-occurrence relations and position relations at the object level. In MOSCRF, the co-occurrence relation features are modeled by the fisher kernel coding of objects (oFK), while the position relation features are represented by the multi-object force histogram (MOFH). The MOFH is the evolution of the force histogram between pairwise objects. The MOFH not only has the property of being invariant to rotation and mirroring, but also acquires the spatial distribution of the scene by calculating the acting force between multiple land-cover objects. Due to the utilization of the prior knowledge of the objects’ information, MOSCRF can explain the objects and their relations to allow understanding of the scene. The experiments confirm that the proposed MOSCRF can reflect the layout mode of the scene both semantically and spatially, with a higher precision than the traditional methods.https://www.mdpi.com/2072-4292/9/10/1030scene understandingobject-oriented classificationco-occurrence relationsposition relationsmulti-object force histogram |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yanfei Zhong Siqi Wu Bei Zhao |
spellingShingle |
Yanfei Zhong Siqi Wu Bei Zhao Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects Remote Sensing scene understanding object-oriented classification co-occurrence relations position relations multi-object force histogram |
author_facet |
Yanfei Zhong Siqi Wu Bei Zhao |
author_sort |
Yanfei Zhong |
title |
Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects |
title_short |
Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects |
title_full |
Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects |
title_fullStr |
Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects |
title_full_unstemmed |
Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects |
title_sort |
scene semantic understanding based on the spatial context relations of multiple objects |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2017-10-01 |
description |
As a result of the large semantic gap between the low-level features and the high-level semantics, scene understanding is a challenging task for high satellite resolution images. To achieve scene understanding, we need to know the contents of the scene. However, most of the existing scene classification methods, such as the bag-of-visual-words model (BoVW), feature coding, topic models, and neural networks, can only classify the scene while ignoring the components and the semantic and spatial relations between these components. Therefore, in this paper, a bottom-up scene understanding framework based on the multi-object spatial context relationship model (MOSCRF) is proposed to combine the co-occurrence relations and position relations at the object level. In MOSCRF, the co-occurrence relation features are modeled by the fisher kernel coding of objects (oFK), while the position relation features are represented by the multi-object force histogram (MOFH). The MOFH is the evolution of the force histogram between pairwise objects. The MOFH not only has the property of being invariant to rotation and mirroring, but also acquires the spatial distribution of the scene by calculating the acting force between multiple land-cover objects. Due to the utilization of the prior knowledge of the objects’ information, MOSCRF can explain the objects and their relations to allow understanding of the scene. The experiments confirm that the proposed MOSCRF can reflect the layout mode of the scene both semantically and spatially, with a higher precision than the traditional methods. |
topic |
scene understanding object-oriented classification co-occurrence relations position relations multi-object force histogram |
url |
https://www.mdpi.com/2072-4292/9/10/1030 |
work_keys_str_mv |
AT yanfeizhong scenesemanticunderstandingbasedonthespatialcontextrelationsofmultipleobjects AT siqiwu scenesemanticunderstandingbasedonthespatialcontextrelationsofmultipleobjects AT beizhao scenesemanticunderstandingbasedonthespatialcontextrelationsofmultipleobjects |
_version_ |
1725868294915751936 |