Improving reasoning with contrastive visual information for visual question answering
Abstract: Visual Question Answering (VQA) aims to output a correct answer based on cross‐modality inputs, including the question and visual content. In the general pipeline, information reasoning plays a key role in producing a reasonable answer. However, visual information is commonly not fully employed in many pop...
Main Authors: | Yu Long, Pengjie Tang, Hanli Wang, Jian Yu |
---|---|
Format: | Article |
Language: | English |
Published: | Wiley 2021-09-01 |
Series: | Electronics Letters |
Online Access: | https://doi.org/10.1049/ell2.12255 |
Similar Items
- Compressive Visual Question Answering
  Published: (2017)
- Role of Premises in Visual Question Answering
  by: Mahendru, Aroma
  Published: (2017)
- Achieving Human Parity on Visual Question Answering
  by: Bi, B., et al.
  Published: (2023)
- Towards Supporting Visual Question and Answering Applications
  Published: (2017)
- Visual question answering with modules and language modeling
  by: Pahuja, Vardaan
  Published: (2019)