Captioning Transformer with Stacked Attention Modules

Image captioning is a challenging task, and it is important for helping machines better understand the meaning of an image. In recent years, image captioning models have usually used long short-term memory (LSTM) as the decoder to generate the sentence, and these models show excellent performance. A...

Bibliographic Details
Main Authors: Xinxin Zhu, Lixiang Li, Jing Liu, Haipeng Peng, Xinxin Niu
Format: Article
Language: English
Published: MDPI AG 2018-05-01
Series: Applied Sciences
Online Access: http://www.mdpi.com/2076-3417/8/5/739