Low-Rank Time-Frequency Synthesis

Mar-13-2024, 07:17:00 GMT–Neural Information Processing Systems

Many single-channel signal decomposition techniques rely on a low-rank factorization of a time-frequency transform. In particular, nonnegative matrix factorization (NMF) of the spectrogram - the (power) magnitude of the short-time Fourier transform (STFT) - has been considered in many audio applications. In this setting, NMF with the Itakura-Saito divergence was shown to underly a generative Gaussian composite model (GCM) of the STFT, a step forward from more empirical approaches based on ad-hoc transform and divergence specifications. Still, the GCM is not yet a generative model of the raw signal itself, but only of its STFT. The work presented in this paper fills in this ultimate gap by proposing a novel signal synthesis model with low-rank time-frequency structure. In particular, our new approach opens doors to multi-resolution representations, that were not possible in the traditional NMF setting. We describe two expectation-maximization algorithms for estimation in the new model and report audio signal processing results with music decomposition and speech enhancement.

algorithm, coefficient, estimation, (15 more...)

Neural Information Processing Systems

Mar-13-2024, 07:17:00 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.04)
- Europe
  - Austria > Vienna (0.04)
  - United Kingdom > England
    - Greater London > London (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
    - Alpes-Maritimes > Nice (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.47)