ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models

Open in new window