Do Audio-Language Models Understand Linguistic Variations?
