Captioning Transformer with Stacked Attention Modules

Image captioning is a challenging task. Meanwhile, it is important for the machine to understand the meaning of an image better. In recent years, the image captioning usually use the long-short-term-memory (LSTM) as the decoder to generate the sentence, and these models show excellent performance. A...

Full description

Bibliographic Details
Main Authors:	Xinxin Zhu, Lixiang Li, Jing Liu, Haipeng Peng, Xinxin Niu
Format:	Article
Language:	English
Published:	MDPI AG 2018-05-01
Series:	Applied Sciences
Subjects:	image caption image understanding deep learning computer vision
Online Access:	http://www.mdpi.com/2076-3417/8/5/739

Internet

http://www.mdpi.com/2076-3417/8/5/739

Captioning Transformer with Stacked Attention Modules

Internet

Similar Items