Unexplored flaws in multiple-choice VQA evaluations

Open in new window