ATT-BM-SOM: A Framework of Effectively Choosing Image Information and Optimizing Syntax for Image Captioning
The current challenges of image captioning technology are how to make the generated captions closely relate to the image information, and generated captions are highly syntactically readable. Therefore, we focus on two problems: 1) how to correctly choose semantic and visual information in an image,...
Main Authors: | Zhenyu Yang, Qiao Liu |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9035503/ |
Similar Items
-
Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map
by: Boeun Kim, et al.
Published: (2019-07-01) -
VSAM-Based Visual Keyword Generation for Image Caption
by: Suya Zhang, et al.
Published: (2021-01-01) -
Multilayer Dense Attention Model for Image Caption
by: Ke Wang, et al.
Published: (2019-01-01) -
Multi-Gate Attention Network for Image Captioning
by: Weitao Jiang, et al.
Published: (2021-01-01) -
Cascade Semantic Fusion for Image Captioning
by: Shiwei Wang, et al.
Published: (2019-01-01)