ABC-Eval: Benchmarking Large Language Models on Symbolic Music Understanding and Instruction Following

Open in new window