X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment

Open in new window