SSM-Net: feature learning for Music Structure Analysis using a Self-Similarity-Matrix based loss
Peeters, Geoffroy, Angulo, Florian
–arXiv.org Artificial Intelligence
In this paper, we propose a new paradigm to learn audio features for Music Structure Analysis (MSA). We train a deep encoder to learn features such that the Self-Similarity-Matrix (SSM) resulting from those approximates a ground-truth SSM. This is done by minimizing a loss between both SSMs. Since this loss is differentiable w.r.t. its input features we can train the encoder in a straightforward way. We successfully demonstrate the use of this training paradigm using the Area Under the Curve ROC (AUC) on the RWC-Pop dataset.
arXiv.org Artificial Intelligence
Nov-15-2022
- Country:
- Asia
- Europe
- France > Île-de-France
- Germany (0.04)
- Netherlands
- North Holland > Amsterdam (0.04)
- South Holland > Delft (0.04)
- Spain > Andalusia
- Málaga Province > Málaga (0.04)
- United Kingdom > England
- East Sussex > Brighton (0.04)
- North America
- Canada > British Columbia
- Vancouver Island > Capital Regional District > Victoria (0.04)
- United States (0.05)
- Canada > British Columbia
- Genre:
- Research Report (0.40)
- Industry:
- Leisure & Entertainment (0.88)
- Media > Music (0.88)
- Technology: