R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model
