Robustness assessment of large audio language models in multiple-choice evaluation

Open in new window