Language Varieties of Italy: Technology Challenges and Opportunities
–arXiv.org Artificial Intelligence
Italy is characterized by a one-of-a-kind linguistic diversity landscape in Europe, which implicitly encodes local knowledge, cultural traditions, artistic expressions and history of its speakers. However, most local languages and dialects in Italy are at risk of disappearing within few generations. The NLP community has recently begun to engage with endangered languages, including those of Italy. Yet, most efforts assume that these varieties are under-resourced language monoliths with an established written form and homogeneous functions and needs, and thus highly interchangeable with each other and with high-resource, standardized languages. In this paper, we introduce the linguistic context of Italy and challenge the default machine-centric assumptions of NLP for Italy's language varieties. We advocate for a shift in the paradigm from machine-centric to speaker-centric NLP, and provide recommendations and opportunities for work that prioritizes languages and their speakers over technological advances. To facilitate the process, we finally propose building a local community towards responsible, participatory efforts aimed at supporting vitality of languages and dialects of Italy.
arXiv.org Artificial Intelligence
Nov-20-2023
- Country:
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- North America
- Dominican Republic (0.04)
- United States
- District of Columbia > Washington (0.04)
- Washington > King County
- Seattle (0.04)
- Texas > Dallas County
- Dallas (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Europe
- Spain (0.04)
- Switzerland (0.04)
- Romania (0.04)
- Czechia > Prague (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Germany
- Hamburg (0.04)
- Hesse > Darmstadt Region
- Wiesbaden (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- Greater London > London (0.04)
- Austria > Tyrol
- Innsbruck (0.04)
- France
- Île-de-France > Paris
- Paris (0.04)
- Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
- Marseille (0.04)
- Bourgogne-Franche-Comté > Doubs
- Besançon (0.04)
- Île-de-France > Paris
- Italy
- Molise (0.04)
- Sicily (0.04)
- Sardinia (0.04)
- Calabria (0.04)
- Aosta Valley > Aosta (0.04)
- Veneto (0.04)
- Friuli Venezia Giulia (0.04)
- Apulia > Bari (0.04)
- Lombardy > Milan (0.04)
- Trentino-Alto Adige/Südtirol
- Trentino Province > Trento (0.04)
- South Tyrol (0.04)
- Piedmont > Turin Province
- Turin (0.04)
- Tuscany
- Pisa Province > Pisa (0.04)
- Florence (0.04)
- Middle East
- Netherlands > South Holland
- Dordrecht (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Singapore (0.04)
- Indonesia > Bali (0.04)
- Middle East
- Israel (0.04)
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- China > Beijing
- Beijing (0.04)
- South America > Colombia
- Genre:
- Overview (0.68)
- Research Report (0.50)
- Technology: