From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion

Feb-7-2026, 09:26:33 GMT – Neural Information Processing Systems

Deep generative models can generate high-fidelity audio conditioned on various types of representations (e.g., mel-spectrograms, Mel-frequency Cepstral Coefficients (MFCC)). Recently, such models have been used to synthesize audio waveforms conditioned on highly compressed representations.

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Europe
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - France > Grand Est
    - Meurthe-et-Moselle > Nancy (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.94)
