VisPlay: Self-Evolving Vision-Language Models from Images

Open in new window