SSM-Net: feature learning for Music Structure Analysis using a Self-Similarity-Matrix based loss

Nov-15-2022–arXiv.org Artificial Intelligence

In this paper, we propose a new paradigm to learn audio features for Music Structure Analysis (MSA). We train a deep encoder to learn features such that the Self-Similarity-Matrix (SSM) resulting from those approximates a ground-truth SSM. This is done by minimizing a loss between both SSMs. Since this loss is differentiable w.r.t. its input features we can train the encoder in a straightforward way. We successfully demonstrate the use of this training paradigm using the Area Under the Curve ROC (AUC) on the RWC-Pop dataset.

artificial intelligence, latexit sha1, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Nov-15-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.05)
  - Canada > British Columbia
    - Vancouver Island > Capital Regional District > Victoria (0.04)
- Europe
  - Germany (0.04)
  - United Kingdom > England
    - East Sussex > Brighton (0.04)
  - Spain > Andalusia
    - Málaga Province > Málaga (0.04)
  - Netherlands
    - South Holland > Delft (0.04)
    - North Holland > Amsterdam (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
- Asia
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - India > Karnataka
    - Bengaluru (0.04)

Genre:
- Research Report (0.40)

Industry:
- Media > Music (0.88)
- Leisure & Entertainment (0.88)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found