Learning Conditioned Graph Structures for Interpretable Visual Question Answering
Will Norcliffe-Brown, Stathis Vafeias, Sarah Parisot
Neural Information Processing Systems
Understanding both the question and the image, as well as modelling their interactions, requires us to combine Computer Vision and NLP techniques. The problem is generally framed as classification: the network learns to select an answer from a finite set of classes, which facilitates training and evaluation.
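To make the classification framing concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the finite class set is built from the most frequent training answers and that the network emits one score (logit) per class. The function names, the toy answers, and the logit values are all hypothetical, and softmax with cross-entropy is shown only as one common training choice.

```python
from collections import Counter
import math

def build_answer_vocab(train_answers, top_k):
    """Keep the top_k most frequent answers as the finite set of classes."""
    counts = Counter(train_answers)
    # Sort by descending frequency, breaking ties alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [answer for answer, _ in ranked[:top_k]]

def softmax(logits):
    """Convert per-class scores into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict_answer(logits, vocab):
    """Return the most probable answer class and its probability."""
    probs = softmax(logits)
    best = max(range(len(vocab)), key=lambda i: probs[i])
    return vocab[best], probs[best]

def cross_entropy(logits, target_index):
    """Classification loss for a single question with one correct class."""
    return -math.log(softmax(logits)[target_index])

# Toy data (hypothetical): answers observed in a training set.
train_answers = ["yes", "no", "yes", "2", "red", "yes", "no", "2"]
vocab = build_answer_vocab(train_answers, top_k=3)

# Hypothetical logits standing in for the network output on one
# (image, question) pair; one score per answer class in `vocab`.
logits = [0.1, 2.0, -1.0]
answer, confidence = predict_answer(logits, vocab)
loss = cross_entropy(logits, vocab.index("2"))
```

Because every prediction is one of a fixed list of answers, evaluation reduces to comparing the predicted class with the reference answer. Answers outside the retained vocabulary (here "red") cannot be produced, which is the usual trade-off of this framing.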