YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus
–Neural Information Processing Systems
Machine learning for sign languages is bottlenecked by data. In this paper, we present YouTube-ASL, a large-scale, open-domain corpus of American Sign Language (ASL) videos and accompanying English captions drawn from YouTube. With 1000 hours of videos and 2500 unique signers, YouTube-ASL is 3x as large and has 10x as many unique signers as the largest prior ASL dataset. We train baseline models for ASL to English translation on YouTube-ASL and evaluate them on How2Sign, where we achieve a new fine-tuned state of the art of 12.397 BLEU and, for the first time, nontrivial zero-shot results.
american sign language-english parallel corpus, open-domain american sign language-english, youtube-asl, (2 more...)
Neural Information Processing Systems
Jan-18-2025, 15:28:30 GMT