Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models

Open in new window