Reviews: Visual Question Answering with Question Representation Update (QRU)
–Neural Information Processing Systems
Strength: The technical contributions are a clever and simple extension/combination of existing ideas such as "Neural Reasoner" [B. Show, attend and tell: 307 Neural image caption generation with visual attention. The paper is well-written and easy to follow, especially the architecture of the model and the explanations for it are modular and simple (image understanding layer, question encoding layer, reasoning layer, and answering layer). Haven't yet encountered a VQA system that changes the question representation based on image. This novelty adds strength to this paper.
Neural Information Processing Systems
Jan-20-2025, 23:06:48 GMT
- Technology: