Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs