GATE: A Challenge Set for Gender-Ambiguous Translation Examples
Rarrick, Spencer, Naik, Ranjita, Mathur, Varun, Poudel, Sundar, Chowdhary, Vishal
–arXiv.org Artificial Intelligence
Although recent years have brought significant progress in improving translation of unambiguously gendered sentences, translation of ambiguously gendered input remains relatively unexplored. When source gender is ambiguous, machine translation models typically default to stereotypical gender roles, perpetuating harmful bias. Recent work has led to the development of "gender rewriters" that generate alternative gender translations on such ambiguous inputs, but such systems are plagued by poor linguistic coverage. To encourage better performance on this task we present and release GATE, a linguistically diverse corpus of gender-ambiguous source sentences along with multiple alternative target language translations. We also provide tools for evaluation and system analysis when using GATE and use them to evaluate our translation rewriter system.
arXiv.org Artificial Intelligence
Mar-7-2023
- Country:
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Pennsylvania (0.04)
- New York (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Wallonia
- Walloon Brabant > Louvain-la-Neuve (0.04)
- United Kingdom > England
- Asia > Middle East
- UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- Genre:
- Research Report (0.40)
- Technology: