It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
Tan, Samson, Joty, Shafiq, Kan, Min-Yen, Socher, Richard
–arXiv.org Artificial Intelligence
Training on only perfect Standard English corpora predisposes pre-trained neural networks to discriminate against minorities from non-standard linguistic backgrounds (e.g., African American Vernacular English, Colloquial Singapore English, etc.). We perturb the inflectional morphology of words to craft plausible and semantically similar adversarial examples that expose these biases in popular NLP models, e.g., BERT and Transformer, and show that adversarially fine-tuning them for a single epoch significantly improves robustness without sacrificing performance on clean data.
arXiv.org Artificial Intelligence
May-9-2020
- Country:
- Asia
- China > Guangdong Province
- Guangzhou (0.04)
- Middle East > Syria
- Latakia Governorate > Latakia (0.04)
- Singapore (0.25)
- China > Guangdong Province
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- France (0.14)
- Germany > Berlin (0.04)
- Italy > Tuscany
- Florence (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada
- United States
- California > San Diego County
- San Diego (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.15)
- Pennsylvania (0.04)
- Texas (0.04)
- California > San Diego County
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Victoria > Melbourne (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Government (0.46)
- Technology: