GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Jun-11-2026, 08:35:04 GMT–Neural Information Processing Systems

To enhance the safety of vision-language models (VLMs), this paper introduces GuardReasoner-VL, a novel reasoning-based VLM guard model. The core idea is to incentivize the guard model to reason deliberately before making moderation decisions, using online reinforcement learning (RL). First, we construct GuardReasoner-VLTrain, a reasoning corpus with 123K samples and 631K reasoning steps, spanning text, image, and text-image inputs. We then use this corpus to cold-start the model's reasoning ability via supervised fine-tuning (SFT). Finally, we further enhance its moderation reasoning through online RL.
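For a rough sense of scale, the corpus figures quoted above imply an average of about five reasoning steps per sample. The short Python sketch below only does that arithmetic. It assumes the rounded counts "123K" and "631K" stand for 123,000 and 631,000, and the function name `average_steps_per_sample` is a hypothetical helper for illustration, not something from the paper.

```python
# Rounded figures quoted in the abstract ("123K" samples, "631K" steps).
# Assumption: the exact counts are not given, so the result is approximate.
NUM_SAMPLES = 123_000
NUM_REASONING_STEPS = 631_000


def average_steps_per_sample(total_steps: int, total_samples: int) -> float:
    """Return the mean number of reasoning steps per sample (hypothetical helper)."""
    if total_samples <= 0:
        raise ValueError("total_samples must be positive")
    return total_steps / total_samples


avg_steps = average_steps_per_sample(NUM_REASONING_STEPS, NUM_SAMPLES)
print(f"Average reasoning steps per sample: {avg_steps:.2f}")
```

Because both inputs are rounded to the nearest thousand, the result should be read as an approximate average rather than an exact statistic of GuardReasoner-VLTrain.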

artificial intelligence, guardreasoner-vl, proceedings

Technology:
- Information Technology > Artificial Intelligence (0.80)