From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Neural Information Processing Systems
Deep generative models can generate high-fidelity audio conditioned on various types of representations (e.g., mel-spectrograms, Mel-frequency Cepstral Coefficients (MFCC)). Recently, such models have been used to synthesize audio waveforms conditioned on highly compressed representations.
Feb-7-2026, 09:26:33 GMT