NeuTral Rewriter: A Rule-Based and Neural Approach to Automatic Rewriting into Gender-Neutral Alternatives
Vanmassenhove, Eva, Emmery, Chris, Shterionov, Dimitar
–arXiv.org Artificial Intelligence
Recent years have seen an increasing need for gender-neutral and inclusive language. Within the field of NLP, there are various mono- and bilingual use cases where gender inclusive language is appropriate, if not preferred due to ambiguity or uncertainty in terms of the gender of referents. In this work, we present a rule-based and a neural approach to gender-neutral rewriting for English along with manually curated synthetic data (WinoBias+) and natural data (OpenSubtitles and Reddit) benchmarks. A detailed manual and automatic evaluation highlights how our NeuTral Rewriter, trained on data generated by the rule-based approach, obtains word error rates (WER) below 0.18% on synthetic, in-domain and out-domain test sets.
arXiv.org Artificial Intelligence
Sep-13-2021
- Country:
- North America > United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Minnesota > Hennepin County
- Europe
- Italy (0.04)
- Netherlands (0.04)
- Germany > Berlin (0.04)
- Spain > Valencian Community
- Alicante Province > Alicante (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Law Enforcement & Public Safety (0.47)
- Media > News (0.39)
- Technology: