or Sound Symbolism in Vision and Language Models
Supplementary Material