Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model
