Fine-Tuning MIDI-to-Audio Alignment using a Neural Network on Piano Roll and CQT Representations