MixAssist: An Audio-Language Dataset for Co-Creative AI Assistance in Music Mixing
Clemens, Michael, Marasović, Ana
arXiv.org Artificial Intelligence
While AI presents significant potential for enhancing music mixing and mastering workflows, current research predominantly emphasizes end-to-end automation or generation, often overlooking the collaborative and instructional dimensions vital for co-creative processes. This gap leaves artists, particularly amateurs seeking to develop expertise, underserved. To bridge this, we introduce MixAssist, a novel audio-language dataset capturing the situated, multi-turn dialogue between expert and amateur music producers during collaborative mixing sessions. Comprising 431 audio-grounded conversational turns derived from 7 in-depth sessions involving 12 producers, MixAssist provides a unique resource for training and evaluating audio-language models that can comprehend and respond to the complexities of real-world music production dialogues. Our evaluations, including automated LLM-as-a-judge assessments and human expert comparisons, demonstrate that fine-tuning models such as Qwen-Audio on MixAssist can yield promising results, with Qwen significantly outperforming other tested models in generating helpful, contextually relevant mixing advice. By focusing on co-creative instruction grounded in audio context, MixAssist enables the development of intelligent AI assistants designed to support and augment the creative process in music mixing.
Jul-10-2025
- Country:
- Africa > Eswatini
- Europe
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- North America
- Puerto Rico > Peñuelas
- Peñuelas (0.04)
- United States
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- New Jersey (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Utah (0.04)
- Virginia (0.04)
- Genre:
- Questionnaire & Opinion Survey (1.00)
- Research Report > New Finding (0.92)
- Industry:
- Education (1.00)
- Leisure & Entertainment (1.00)
- Media > Music (1.00)
- Technology: