Visual Structures Help Visual Reasoning: Addressing the Binding Problem in LVLMs

Jun-14-2026, 07:10:17 GMT–Neural Information Processing Systems

Despite progress in Large Vision-Language Models (LVLMs), their capacity for visual reasoning is often limited by the binding problem: the failure to reliably associate perceptual features with their correct visual referents.

artificial intelligence, natural language, proceedings, (7 more...)

Neural Information Processing Systems

Jun-14-2026, 07:10:17 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Vision (0.62)
  - Natural Language (0.59)