Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization

Open in new window