On the Audio Hallucinations in Large Audio-Video Language Models

Open in new window