MindGYM: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions

Open in new window