Beyond Words: Exploring Cultural Value Sensitivity in Multimodal Models