SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation

Open in new window