Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards

Jun-13-2026, 17:06:51 GMT–Neural Information Processing Systems

Chain of thought reasoning has demonstrated remarkable success in large language models, yet its adaptation to vision-language reasoning remains an open challenge with unclear best practices. Existing attempts typically employ reasoning chains at a coarse-grained level, which struggles to perform fine-grained structured reasoning and, more importantly, are difficult to evaluate the reward and quality of intermediate reasoning.

artificial intelligence, natural language, proceedings, (5 more...)

Neural Information Processing Systems

Jun-13-2026, 17:06:51 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language (0.69)