Remote sensing image description based on word embedding and end-to-end deep learning

Abstract This study proposes an end-to-end image description generation model based on word embedding technology to realise the classification and identification of Populus euphratica and Tamarix in complex remote sensing images by providing descriptions in precise and concise natural sentences. Fir...

Full description

Bibliographic Details
Main Authors: Yuan Wang, Hongbing Ma, Kuerban Alifu, Yalong Lv
Format: Article
Language:English
Published: Nature Publishing Group 2021-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-82704-4
Description
Summary:Abstract This study proposes an end-to-end image description generation model based on word embedding technology to realise the classification and identification of Populus euphratica and Tamarix in complex remote sensing images by providing descriptions in precise and concise natural sentences. First, category ambiguity over large-scale regions in remote sensing images is addressed by introducing the co-occurrence matrix and global vectors for word representation to generate the word vector features of the object to be identified. Second, a new multi-level end-to-end model is employed to further describe the content of remote sensing images and to better advance the description tasks for P. euphratica and Tamarix in remote sensing images. Experimental results reveal that the natural language sentences generated using this method can better describe P. euphratica and Tamarix in remote sensing images compared with conventional deep learning methods.
ISSN:2045-2322