Double Entendre: Robust Audio-Based AI-Generated Lyrics Detection via Multi-View Fusion
Frohmann, Markus, Meseguer-Brocal, Gabriel, Schedl, Markus, Epure, Elena V.
–arXiv.org Artificial Intelligence
The rapid advancement of AI-based music generation tools is revolutionizing the music industry but also posing challenges to artists, copyright holders, and providers alike. This necessitates reliable methods for detecting such AI-generated content. However, existing detectors, relying on either audio or lyrics, face key practical limitations: audio-based detectors fail to generalize to new or unseen generators and are vulnerable to audio perturbations; lyrics-based methods require cleanly formatted and accurate lyrics, unavailable in practice. To overcome these limitations, we propose a novel, practically grounded approach: a multimodal, modular late-fusion pipeline that combines automatically transcribed sung lyrics and speech features capturing lyrics-related information within the audio. By relying on lyrical aspects directly from audio, our method enhances robustness, mitigates susceptibility to low-level artifacts, and enables practical applicability. Experiments show that our method, DE-detect, outperforms existing lyrics-based detectors while also being more robust to audio perturbations. Thus, it offers an effective, robust solution for detecting AI-generated music in real-world scenarios. Our code is available at https://github.com/deezer/robust-AI-lyrics-detection.
arXiv.org Artificial Intelligence
Jul-1-2025
- Country:
- Asia
- China > Hong Kong (0.04)
- Middle East > Jordan (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Austria > Upper Austria
- Linz (0.04)
- France > Île-de-France
- Italy > Tuscany
- Florence (0.04)
- Austria > Upper Austria
- North America
- Dominican Republic (0.04)
- United States
- Florida > Miami-Dade County
- Miami (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- Florida > Miami-Dade County
- Asia
- Genre:
- Research Report (0.82)
- Industry:
- Leisure & Entertainment (1.00)
- Media > Music (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.69)
- Performance Analysis > Accuracy (1.00)
- Natural Language > Large Language Model (0.94)
- Speech (0.68)
- Machine Learning
- Information Technology > Artificial Intelligence