pitch spelling
EngravingGNN: A Hybrid Graph Neural Network for End-to-End Piano Score Engraving
Karystinaios, Emmanouil, Foscarin, Francesco, Widmer, Gerhard
This paper focuses on automatic music engraving, i.e., the creation of a humanly-readable musical score from musical content. This step is fundamental for all applications that include a human player, but it remains a mostly unexplored topic in symbolic music processing. In this work, we formalize the problem as a collection of interdependent subtasks, and propose a unified graph neural network (GNN) framework that targets the case of piano music and quantized symbolic input. Our method employs a multi-task GNN to jointly predict voice connections, staff assignments, pitch spelling, key signature, stem direction, octave shifts, and clef signs. A dedicated postprocessing pipeline generates print-ready MusicXML/MEI outputs. Comprehensive evaluation on two diverse piano corpora (J-Pop and DCML Romantic) demonstrates that our unified model achieves good accuracy across all subtasks, compared to existing systems that only specialize in specific subtasks. These results indicate that a shared GNN encoder with lightweight task-specific decoders in a multi-task setting offers a scalable and effective solution for automatic music engraving.
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- North America > Canada > Nova Scotia > Halifax Regional Municipality > Halifax (0.04)
- Europe > Austria > Upper Austria > Linz (0.04)
- Leisure & Entertainment (1.00)
- Media > Music (0.70)
GraphMuse: A Library for Symbolic Music Graph Processing
Karystinaios, Emmanouil, Widmer, Gerhard
Graph Neural Networks (GNNs) have recently gained traction in symbolic music tasks, yet a lack of a unified framework impedes progress. Addressing this gap, we present GraphMuse, a graph processing framework and library that facilitates efficient music graph processing and GNN training for symbolic music tasks. Central to our contribution is a new neighbor sampling technique specifically targeted toward meaningful behavior in musical scores. Additionally, GraphMuse integrates hierarchical modeling elements that augment the expressivity and capabilities of graph networks for musical tasks. Experiments with two specific musical prediction tasks -- pitch spelling and cadence detection -- demonstrate significant performance improvement over previous methods. Our hope is that GraphMuse will lead to a boost in, and standardization of, symbolic music processing based on graph representations. The library is available at https://github.com/manoskary/graphmuse
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Italy > Tuscany > Florence (0.04)
- Europe > Greece (0.04)
- Europe > Austria > Upper Austria > Linz (0.04)
- Media > Music (1.00)
- Leisure & Entertainment (1.00)