Towards Leveraging Contrastively Pretrained Neural Audio Embeddings for Recommender Tasks

Grötschla, Florian, Strässle, Luca, Lanzendörfer, Luca A., Wattenhofer, Roger

Sep-13-2024, arXiv.org Artificial Intelligence

Music recommender systems frequently utilize network-based models to capture relationships between music pieces, artists, and users. Although these relationships provide valuable insights for predictions, new music pieces or artists often face the cold-start problem due to insufficient initial information. To address this, one can extract content-based information directly from the music to enhance collaborative-filtering-based methods. While previous approaches have relied on hand-crafted audio features for this purpose, we explore the use of contrastively pretrained neural audio embedding models, which offer a richer and more nuanced representation of music. Our experiments demonstrate that neural embeddings, particularly those generated with the Contrastive Language-Audio Pretraining (CLAP) model, present a promising approach to enhancing music recommendation tasks within graph-based frameworks.
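To make the cold-start idea in the abstract concrete, here is a minimal sketch of how content embeddings could score a track that has no interaction history yet. It is not the authors' graph-based method: the toy 3-dimensional vectors stand in for real audio embeddings (for example, ones produced by a CLAP model), and the names `cosine`, `score_cold_start_track`, `track_embeddings`, and `user_histories` are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def score_cold_start_track(new_embedding, track_embeddings, user_history):
    """Score a track without interactions for one user.

    The score is the mean cosine similarity between the new track's
    embedding and the embeddings of tracks the user already listened to.
    """
    sims = [cosine(new_embedding, track_embeddings[t]) for t in user_history]
    return sum(sims) / len(sims) if sims else 0.0


# Toy vectors standing in for audio embeddings (assumed values, not real data).
track_embeddings = {
    "track_a": [1.0, 0.0, 0.0],
    "track_b": [0.9, 0.1, 0.0],
    "track_c": [0.0, 0.0, 1.0],
}
user_histories = {"alice": ["track_a", "track_b"], "bob": ["track_c"]}

# A newly released track: no user has interacted with it yet.
new_track = [1.0, 0.1, 0.0]

scores = {
    user: score_cold_start_track(new_track, track_embeddings, history)
    for user, history in user_histories.items()
}
best_user = max(scores, key=scores.get)
```

In a graph-based recommender, embeddings like these would more typically be attached to track nodes as input features, so that a new node carries usable information before any edges to users exist; the similarity scoring above only illustrates why content information helps when interaction data is missing.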



Country:
- North America > Canada (0.04)
- Europe
  - Switzerland > Zürich
    - Zürich (0.14)
  - Netherlands > South Holland
    - Delft (0.04)
  - Italy > Apulia
    - Bari (0.04)

Genre:
- Research Report
  - New Finding (0.47)
  - Promising Solution (0.34)

Industry:
- Media > Music (1.00)
- Leisure & Entertainment (1.00)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
