Systematic Reward Gap Optimization for Mitigating VLMHallucinations

Open in new window