Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
