No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

Open in new window