With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models