Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start
