SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing

Open in new window