Cross-Modal Consistency in Multimodal Large Language Models