deepwave
DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging
We propose a recurrent neural-network for real-time reconstruction of acoustic camera spherical maps. The network, dubbed DeepWave, is both physically and algorithmically motivated: its recurrent architecture mimics iterative solvers from convex optimisation, and its parsimonious parametrisation is based on the natural structure of acoustic imaging problems. Each network layer applies successive filtering, biasing and activation steps to its input, which can be interpreted as generalised deblurring and sparsification steps. To comply with the irregular geometry of spherical maps, filtering operations are implemented efficiently by means of graph signal processing techniques. Unlike commonly-used imaging network architectures, DeepWave is moreover capable of directly processing the complex-valued raw microphone correlations, learning how to optimally back-project these into a spherical map. We propose moreover a smart physically-inspired initialisation scheme that attains much faster training and higher performance than random initialisation. Our real-data experiments show DeepWave has similar computational speed to the state-of-the-art delay-and-sum imager with vastly superior resolution. While developed primarily for acoustic cameras, DeepWave could easily be adapted to neighbouring signal processing fields, such as radio astronomy, radar and sonar.
- Europe > Switzerland > Vaud > Lausanne (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
- Europe > Switzerland > Vaud > Lausanne (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
Latent Acoustic Mapping for Direction of Arrival Estimation: A Self-Supervised Approach
Roman, Adrian S., Roman, Iran R., Bello, Juan P.
Acoustic mapping techniques have long been used in spatial audio processing for direction of arrival estimation (DoAE). Traditional beamforming methods for acoustic mapping, while interpretable, often rely on iterative solvers that can be computationally intensive and sensitive to acoustic variability. On the other hand, recent supervised deep learning approaches offer feedforward speed and robustness but require large labeled datasets and lack interpretability. Despite their strengths, both methods struggle to consistently generalize across diverse acoustic setups and array configurations, limiting their broader applicability. We introduce the Latent Acoustic Mapping (LAM) model, a self-supervised framework that bridges the interpretability of traditional methods with the adaptability and efficiency of deep learning methods. LAM generates high-resolution acoustic maps, adapts to varying acoustic conditions, and operates efficiently across different microphone arrays. We assess its robustness on DoAE using the LOCATA and STARSS benchmarks. LAM achieves comparable or superior localization performance to existing supervised methods. Additionally, we show that LAM's acoustic maps can serve as effective features for supervised models, further enhancing DoAE accuracy and underscoring its potential to advance adaptive, high-performance sound localization systems.
- Asia > Middle East > Iran (0.40)
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > New Jersey > Union County > Summit (0.04)
- (3 more...)
Reviews: DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging
Traditional acoustic camera methods had been advanced significantly with the advent of compressed sensing techniques, which reconstruct the original signals successfully by means of hand-crafted features or functions with nonlinear optimization, e.g., proximal gradient descent. However, the performance of the reconstruction process has been significantly slow due to nonlinear optimization steps. This paper proposes a new approach that substitutes the traditional nonlinear optimization approach with recurrent network architecture, i.e., by unrolling the iterative convex optimization algorithm in a form of neural network architecture. This paper takes a two-layered design, where a bias and back-projection gradient, and deblurring matrix are learned. As described in the paper, the recurrent architecture has been proposed to substitute the signal reconstruction problem for other field applications of compressive sensings, such as compressive imaging.
Reviews: DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging
The paper proposes a RNN LISTA architecture to the problem of real-time acoustic imaging and a novel parametrization leading to a number of parameters that grow linearly wrt resolution instead of quadratically, as well as a novel initialization scheme. Some Experiments comparing this approach to the state-of-art in the field validate the proposed model. While the paper may of of narrow interest for the ML community, it presents some interesting contributions.
DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging
We propose a recurrent neural-network for real-time reconstruction of acoustic camera spherical maps. The network, dubbed DeepWave, is both physically and algorithmically motivated: its recurrent architecture mimics iterative solvers from convex optimisation, and its parsimonious parametrisation is based on the natural structure of acoustic imaging problems. Each network layer applies successive filtering, biasing and activation steps to its input, which can be interpreted as generalised deblurring and sparsification steps. To comply with the irregular geometry of spherical maps, filtering operations are implemented efficiently by means of graph signal processing techniques. Unlike commonly-used imaging network architectures, DeepWave is moreover capable of directly processing the complex-valued raw microphone correlations, learning how to optimally back-project these into a spherical map.
DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging
SIMEONI, Matthieu, Kashani, Sepand, Hurley, Paul, Vetterli, Martin
We propose a recurrent neural-network for real-time reconstruction of acoustic camera spherical maps. The network, dubbed DeepWave, is both physically and algorithmically motivated: its recurrent architecture mimics iterative solvers from convex optimisation, and its parsimonious parametrisation is based on the natural structure of acoustic imaging problems. Each network layer applies successive filtering, biasing and activation steps to its input, which can be interpreted as generalised deblurring and sparsification steps. To comply with the irregular geometry of spherical maps, filtering operations are implemented efficiently by means of graph signal processing techniques. Unlike commonly-used imaging network architectures, DeepWave is moreover capable of directly processing the complex-valued raw microphone correlations, learning how to optimally back-project these into a spherical map.