Unveiling Visual Biases in Audio-Visual Localization Benchmarks
