Music to Dance as Language Translation using Sequence Models
Correia, André, Alexandre, Luís A.
–arXiv.org Artificial Intelligence
Synthesising appropriate choreographies from music remains an open problem. We introduce MDLT, a novel approach that frames the choreography generation problem as a translation task. Our method leverages an existing data set to learn to translate sequences of audio into corresponding dance poses. We present two variants of MDLT: one utilising the Transformer architecture and the other employing the Mamba architecture. We train our method on AIST++ and PhantomDance data sets to teach a robotic arm to dance, but our method can be applied to a full humanoid robot. Evaluation metrics, including Average Joint Error and Frechet Inception Distance, consistently demonstrate that, when given a piece of music, MDLT excels at producing realistic and high-quality choreography. The code can be found at github.com/meowatthemoon/MDLT.
arXiv.org Artificial Intelligence
Mar-22-2024
- Country:
- Africa > Mozambique
- Sofala Province > Beira (0.04)
- Asia > Japan
- Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- Europe > Portugal (0.04)
- Africa > Mozambique
- Genre:
- Research Report (0.84)
- Technology: