AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis

Bhosale, Swapnil, Yang, Haosen, Kanojia, Diptesh, Deng, Jiankang, Zhu, Xiatian

Jun-14-2024–arXiv.org Artificial Intelligence

Novel view acoustic synthesis (NVAS) aims to render binaural audio at any target viewpoint, given a mono audio emitted by a sound source at a 3D scene. Existing methods have proposed NeRF-based implicit models to exploit visual cues as a condition for synthesizing binaural audio. However, in addition to low efficiency originating from heavy NeRF rendering, these methods all have a limited ability of characterizing the entire scene environment such as room geometry, material properties, and the spatial relation between the listener and sound source. To address these issues, we propose a novel Audio-Visual Gaussian Splatting (AV-GS) model. To obtain a material-aware and geometry-aware condition for audio synthesis, we learn an explicit point-based scene representation with an audio-guidance parameter on locally initialized Gaussian points, taking into account the space relation from the listener and sound source. To make the visual scene model audio adaptive, we propose a point densification and pruning strategy to optimally distribute the Gaussian points, with the per-point contribution in sound propagation (e.g., more points needed for texture-less wall surfaces as they affect sound path diversion).

binaural audio, listener, representation, (14 more...)

arXiv.org Artificial Intelligence

Jun-14-2024

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom
  - England
    - Surrey (0.05)
    - Greater London > London (0.04)
- Asia > Japan
  - Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre:
- Research Report (0.50)
- Instructional Material > Course Syllabus & Notes (0.40)

Industry:
- Education (0.64)

Technology:
- Information Technology
  - Human Computer Interaction > Interfaces
    - Virtual Reality (0.46)
  - Artificial Intelligence
    - Vision (1.00)
    - Machine Learning > Neural Networks (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found