VGGSounder: Audio-Visual Evaluations for Foundation Models

Open in new window