DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Faisal, Fahim, Ahia, Orevaoghene, Srivastava, Aarohi, Ahuja, Kabir, Chiang, David, Tsvetkov, Yulia, Anastasopoulos, Antonios
–arXiv.org Artificial Intelligence
Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever large-scale benchmark for NLP on varieties, which aggregates an extensive set of task-varied variety datasets (10 text-level tasks covering 281 varieties). This allows for a comprehensive evaluation of NLP system performance on different language varieties. We provide substantial evidence of performance disparities between standard and non-standard language varieties, and we also identify language clusters with large performance divergence across tasks. We believe DIALECTBENCH provides a comprehensive view of the current state of NLP for language varieties and one step towards advancing it further. Code/data: https://github.com/ffaisal93/DialectBench
arXiv.org Artificial Intelligence
Jul-7-2024
- Country:
- Africa
- Eritrea (0.04)
- Ethiopia (0.04)
- Kenya (0.04)
- Middle East > Morocco
- Casablanca-Settat Region > Casablanca (0.04)
- Nigeria (0.04)
- Tanzania (0.04)
- Asia
- Indonesia > Bali (0.04)
- North Korea (0.04)
- India > West Bengal (0.04)
- Japan
- Honshū > Kansai
- Kyoto Prefecture > Kyoto (0.04)
- Kyūshū & Okinawa > Kyūshū
- Miyazaki Prefecture > Miyazaki (0.04)
- Honshū > Kansai
- Middle East
- Bahrain (0.04)
- Jordan (0.04)
- Lebanon (0.04)
- Saudi Arabia > Riyadh Province
- Riyadh (0.04)
- Syria > Aleppo Governorate
- Aleppo (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Yemen (0.04)
- China (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Taiwan (0.04)
- Bangladesh > Dhaka Division
- Dhaka District > Dhaka (0.04)
- Europe
- United Kingdom
- England > Cambridgeshire
- Cambridge (0.04)
- Scotland (0.04)
- England > Cambridgeshire
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Belgium (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Greece (0.04)
- Middle East > Cyprus
- Italy
- Apulia (0.04)
- Lazio (0.04)
- Tuscany
- Florence (0.04)
- Pisa Province > Pisa (0.04)
- Umbria (0.04)
- Calabria (0.04)
- Abruzzo (0.04)
- Basilicata (0.04)
- Molise (0.04)
- Veneto (0.04)
- Trentino-Alto Adige/Südtirol > Trentino Province
- Trento (0.04)
- Emilia-Romagna (0.04)
- Friuli Venezia Giulia (0.04)
- Liguria (0.04)
- Campania (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Switzerland (0.04)
- Germany (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- United Kingdom
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Dominican Republic (0.04)
- United States
- Indiana > Boone County
- Lebanon (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Pennsylvania (0.04)
- Indiana > Boone County
- Canada
- Oceania
- Australia (0.04)
- Fiji (0.04)
- New Zealand (0.04)
- South America
- Africa
- Genre:
- Research Report (0.81)
- Industry:
- Technology: