Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music
Kratimenos, Agelos, Avramidis, Kleanthis, Garoufis, Christos, Zlatintsi, Athanasia, Maragos, Petros
Instrument classification is one of the fields in Music Information Retrieval (MIR) that has attracted a lot of research interest. However, the majority of that is dealing with monophonic music, while efforts on polyphonic material mainly focus on predominant instrument recognition or multi-instrument recognition for entire tracks. We present an approach for instrument classification in polyphonic music using monophonic training data that involves mixing-augmentation methods. Specifically, we experiment with pitch and tempo-based synchronization, as well as mixes of tracks with similar music genres. Further, a custom CNN model is proposed, that uses the augmented training data efficiently and a plethora of suitable evaluation metrics are discussed as well. The tempo-sync and genre techniques stand out, achieving an 81% label ranking average precision accuracy, detecting up to 9 instruments in over 2300 testing tracks.
Nov-27-2019
- Country:
- Europe
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Greece (0.04)
- Spain
- Andalusia > Málaga Province
- Málaga (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Andalusia > Málaga Province
- France > Provence-Alpes-Côte d'Azur
- Europe
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment (1.00)
- Media > Music (1.00)
- Technology: