Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models

Open in new window