Jukebox
This has led to impressive results like producing Bach chorals,[ reference-5][ reference-6] polyphonic music with multiple instruments,[ reference-7][ reference-8][ reference-9] as well as minute long musical pieces.[ But symbolic generators have limitations--they cannot capture human voices or many of the more subtle timbres, dynamics, and expressivity that are essential to music. A different approach[ footnote-approach] is to model music directly as raw audio.[ For comparison, GPT-2 had 1,000 timesteps and OpenAI Five took tens of thousands of timesteps per game. Thus, to learn the high level semantics of music, a model would have to deal with extremely long-range dependencies.
Apr-6-2023, 16:27:42 GMT