GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks Y u Zhang
–Neural Information Processing Systems
To tackle these problems, we present GTSinger, a large G lobal, multi-T echnique, free-to-use, high-quality singing corpus with realistic music scores, designed for all singing tasks, along with its benchmarks. Particularly, (1) we collect 80.59 hours of high-quality singing voices, forming the largest recorded singing dataset; (2) 20 professional singers across nine widely spoken languages offer diverse timbres and styles; (3) we provide controlled comparison and phoneme-level annotations of six commonly used singing techniques, helping technique modeling and control; (4) GTSinger offers realistic music scores, assisting real-world musical composition; (5) singing
Neural Information Processing Systems
Aug-7-2025, 00:25:42 GMT
- Genre:
- Research Report (0.95)
- Industry:
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Speech (1.00)
- Natural Language (0.93)
- Machine Learning > Neural Networks (0.46)
- Information Technology > Artificial Intelligence