Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization

Open in new window