Enhance Reasoning Ability of Visual-Language Models via Large Language Models

Open in new window