Reconsidering Sentence-Level Sign Language Translation
Tanzer, Garrett; Shengelia, Maximus; Harrenstien, Ken; Uthus, David
arXiv.org Artificial Intelligence
Historically, sign language machine translation has been posed as a sentence-level task: datasets consisting of continuous narratives are chopped up and presented to the model as isolated clips. In this work, we explore the limitations of this task framing. First, we survey a number of linguistic phenomena in sign languages that depend on discourse-level context. Then, as a case study, we perform the first human baseline for sign language translation that actually substitutes a human into the machine learning task framing, rather than providing the human with the entire document as context. This human baseline, for ASL-to-English translation on the How2Sign dataset, shows that for 33% of sentences in our sample, our fluent Deaf signer annotators were only able to understand key parts of the clip in light of additional discourse-level context. These results underscore the importance of understanding and sanity-checking examples when adapting machine learning to new domains.
Jun-16-2024
- Country:
  - North America
    - Canada (0.04)
    - United States
      - Pennsylvania (0.04)
      - Massachusetts (0.04)
      - New York > New York County
        - New York City (0.04)
      - Maryland > Montgomery County
        - Silver Spring (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
  - Europe
    - Spain (0.04)
    - Belgium (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
    - Italy > Tuscany
      - Florence (0.04)
    - France > Provence-Alpes-Côte d'Azur
      - Bouches-du-Rhône > Marseille (0.04)
  - Asia
    - Singapore (0.04)
    - Middle East > UAE
      - Abu Dhabi Emirate > Abu Dhabi (0.04)
    - Japan > Kyūshū & Okinawa
      - Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
    - China > Beijing
      - Beijing (0.04)
- Genre:
  - Research Report > New Finding (0.48)
- Industry:
  - Education > Curriculum > Subject-Specific Education (1.00)
- Technology: