Hierarchical Question-Image Co-Attention for Visual Question Answering
Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh
Neural Information Processing Systems
Answering (VQA) that generate spatial maps highlighting image regions relevant to answering the question. In this paper, we argue that in addition to modeling "where
- Country:
- Europe > Spain
- Catalonia > Barcelona Province > Barcelona (0.04)
- North America > United States
- Virginia (0.04)