ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
