Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects

As a result of the large semantic gap between the low-level features and the high-level semantics, scene understanding is a challenging task for high satellite resolution images. To achieve scene understanding, we need to know the contents of the scene. However, most of the existing scene classifica...

Full description

Bibliographic Details
Main Authors: Yanfei Zhong, Siqi Wu, Bei Zhao
Format: Article
Language:English
Published: MDPI AG 2017-10-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/9/10/1030
id doaj-93ea8930c3ef46f9b890eeeda0547771
record_format Article
spelling doaj-93ea8930c3ef46f9b890eeeda05477712020-11-24T21:54:11ZengMDPI AGRemote Sensing2072-42922017-10-01910103010.3390/rs9101030rs9101030Scene Semantic Understanding Based on the Spatial Context Relations of Multiple ObjectsYanfei Zhong0Siqi Wu1Bei Zhao2State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, ChinaState Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, ChinaDepartment of Geography & Resource Management, The Chinese University of Hong Kong, Hong Kong, ChinaAs a result of the large semantic gap between the low-level features and the high-level semantics, scene understanding is a challenging task for high satellite resolution images. To achieve scene understanding, we need to know the contents of the scene. However, most of the existing scene classification methods, such as the bag-of-visual-words model (BoVW), feature coding, topic models, and neural networks, can only classify the scene while ignoring the components and the semantic and spatial relations between these components. Therefore, in this paper, a bottom-up scene understanding framework based on the multi-object spatial context relationship model (MOSCRF) is proposed to combine the co-occurrence relations and position relations at the object level. In MOSCRF, the co-occurrence relation features are modeled by the fisher kernel coding of objects (oFK), while the position relation features are represented by the multi-object force histogram (MOFH). The MOFH is the evolution of the force histogram between pairwise objects. The MOFH not only has the property of being invariant to rotation and mirroring, but also acquires the spatial distribution of the scene by calculating the acting force between multiple land-cover objects. Due to the utilization of the prior knowledge of the objects’ information, MOSCRF can explain the objects and their relations to allow understanding of the scene. The experiments confirm that the proposed MOSCRF can reflect the layout mode of the scene both semantically and spatially, with a higher precision than the traditional methods.https://www.mdpi.com/2072-4292/9/10/1030scene understandingobject-oriented classificationco-occurrence relationsposition relationsmulti-object force histogram
collection DOAJ
language English
format Article
sources DOAJ
author Yanfei Zhong
Siqi Wu
Bei Zhao
spellingShingle Yanfei Zhong
Siqi Wu
Bei Zhao
Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects
Remote Sensing
scene understanding
object-oriented classification
co-occurrence relations
position relations
multi-object force histogram
author_facet Yanfei Zhong
Siqi Wu
Bei Zhao
author_sort Yanfei Zhong
title Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects
title_short Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects
title_full Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects
title_fullStr Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects
title_full_unstemmed Scene Semantic Understanding Based on the Spatial Context Relations of Multiple Objects
title_sort scene semantic understanding based on the spatial context relations of multiple objects
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2017-10-01
description As a result of the large semantic gap between the low-level features and the high-level semantics, scene understanding is a challenging task for high satellite resolution images. To achieve scene understanding, we need to know the contents of the scene. However, most of the existing scene classification methods, such as the bag-of-visual-words model (BoVW), feature coding, topic models, and neural networks, can only classify the scene while ignoring the components and the semantic and spatial relations between these components. Therefore, in this paper, a bottom-up scene understanding framework based on the multi-object spatial context relationship model (MOSCRF) is proposed to combine the co-occurrence relations and position relations at the object level. In MOSCRF, the co-occurrence relation features are modeled by the fisher kernel coding of objects (oFK), while the position relation features are represented by the multi-object force histogram (MOFH). The MOFH is the evolution of the force histogram between pairwise objects. The MOFH not only has the property of being invariant to rotation and mirroring, but also acquires the spatial distribution of the scene by calculating the acting force between multiple land-cover objects. Due to the utilization of the prior knowledge of the objects’ information, MOSCRF can explain the objects and their relations to allow understanding of the scene. The experiments confirm that the proposed MOSCRF can reflect the layout mode of the scene both semantically and spatially, with a higher precision than the traditional methods.
topic scene understanding
object-oriented classification
co-occurrence relations
position relations
multi-object force histogram
url https://www.mdpi.com/2072-4292/9/10/1030
work_keys_str_mv AT yanfeizhong scenesemanticunderstandingbasedonthespatialcontextrelationsofmultipleobjects
AT siqiwu scenesemanticunderstandingbasedonthespatialcontextrelationsofmultipleobjects
AT beizhao scenesemanticunderstandingbasedonthespatialcontextrelationsofmultipleobjects
_version_ 1725868294915751936