Symbolic Music Generation with Diffusion Models

Mittal, Gautam, Engel, Jesse, Hawthorne, Curtis, Simon, Ian

Mar-30-2021–arXiv.org Machine Learning

Score-based generative models and diffusion probabilistic models have been successful at generating high-quality samples in continuous domains such as images and audio. However, due to their Langevin-inspired sampling mechanisms, their application to discrete and sequential data has been limited. In this work, we present a technique for training diffusion models on sequential data by parameterizing the discrete domain in the continuous latent space of a pre-trained variational autoencoder. Our method is non-autoregressive and learns to generate sequences of latent embeddings through the reverse process and offers parallel generation with a constant number of iterative refinement steps. We apply this technique to modeling symbolic music and show strong unconditional generation and post-hoc conditional infilling results compared to autoregressive language models operating over the same continuous embeddings.

diffusion model, sequence, symbolic music generation, (12 more...)

arXiv.org Machine Learning

Mar-30-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California
    - Alameda County > Berkeley (0.14)
    - Santa Clara County > Mountain View (0.04)
- Europe
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)

Genre:
- Research Report (0.40)

Industry:
- Media > Music (1.00)
- Leisure & Entertainment (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found