Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models