VRPRM: Process Reward Modeling via Visual Reasoning
