Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of Few-Shot Learning

Open in new window