Debiased Visual Question Answering from Feature and Sample Perspectives

Dec-23-2025, 20:41:56 GMT–Neural Information Processing Systems

Visual question answering (VQA) is designed to examine the visual-textual reasoning ability of an intelligent agent. However, recent observations show that many VQA models may only capture the biases between questions and answers in a dataset rather than showing real reasoning abilities. For example, given a question, some VQA models tend to output the answer that occurs frequently in the dataset and ignore the images. To reduce this tendency, existing methods focus on weakening the language bias. Meanwhile, only a few works also consider vision bias implicitly.

electronic proceedings, feature and sample perspective, name change, (4 more...)

Neural Information Processing Systems

Dec-23-2025, 20:41:56 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.36)