Automatic Music Transcription using Convolutional Neural Networks and Constant-Q transform
Telila, Yohannis, Cucinotta, Tommaso, Bacciu, Davide
arXiv.org Artificial Intelligence
Automatic music transcription (AMT) is the problem of analyzing an audio recording of a musical piece and detecting the notes being played. AMT is challenging, particularly for polyphonic music. The goal of AMT is to produce a score representation of a piece of music by analyzing a sound signal containing multiple notes played simultaneously. In this work, we design a processing pipeline that transforms classical piano audio files in .wav format into a music score representation. Features are extracted from the audio signals using the constant-Q transform, and the resulting coefficients are used as input to a convolutional neural network (CNN) model.
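The feature-extraction step can be illustrated with a minimal sketch of a constant-Q analysis of a single frame, written directly from the textbook definition (geometrically spaced center frequencies with a constant ratio of frequency to bandwidth). This is not the authors' implementation: the function name `cqt_frame`, the Hamming window, and the defaults (88 bins starting at A0 = 27.5 Hz, 12 bins per octave, chosen to match the piano range) are assumptions made for illustration only.

```python
import cmath
import math


def cqt_frame(signal, sr, f_min=27.5, n_bins=88, bins_per_octave=12):
    """Naive constant-Q magnitudes for one frame starting at signal[0].

    Hypothetical helper: the defaults are illustrative assumptions,
    not parameters taken from the paper.
    """
    # Quality factor: the same ratio of center frequency to bandwidth for every bin.
    q = 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)
    mags = []
    for k in range(n_bins):
        # Center frequencies are spaced geometrically (one semitone apart here).
        f_k = f_min * 2.0 ** (k / bins_per_octave)
        # Window length shrinks as frequency grows, keeping q cycles per window.
        n_k = math.ceil(q * sr / f_k)
        if n_k > len(signal):
            raise ValueError("signal too short for bin %d" % k)
        acc = 0j
        for n in range(n_k):
            # Hamming window (an assumed choice of window function).
            w = 0.54 - 0.46 * math.cos(2.0 * math.pi * n / (n_k - 1))
            acc += w * signal[n] * cmath.exp(-2j * math.pi * f_k * n / sr)
        mags.append(abs(acc) / n_k)
    return mags
```

With these assumed defaults, bin k corresponds to the k-th piano key counted from A0, so a pure 440 Hz tone (A4) should produce its largest magnitude at bin 48. In a pipeline like the one described, such magnitude vectors computed over successive frames would be stacked into a time-frequency matrix and passed to the CNN; a practical system would more likely use an optimized library routine than this quadratic-time loop.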
May-8-2025
- Country:
- North America > United States > New York > New York County > New York City (0.14)
- Genre:
- Research Report (0.83)
- Industry:
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
- Technology: