Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features
Zhou, Li; Karamolegkou, Antonia; Chen, Wenyu; Hershcovich, Daniel
arXiv.org Artificial Intelligence
The increasing ubiquity of language technology necessitates a shift towards considering cultural diversity in the machine learning realm, particularly for subjective tasks that rely heavily on cultural nuances, such as Offensive Language Detection (OLD). Current understanding underscores that these tasks are substantially influenced by cultural values; however, a notable gap exists in determining whether cultural features can accurately predict the success of cross-cultural transfer learning for such subjective tasks. To address this gap, our study examines the relationship between cultural features and transfer learning effectiveness. The findings reveal that cultural value surveys do have predictive power for cross-cultural transfer learning success in OLD tasks, and that this predictive power can be further improved using offensive word distance. Based on these results, we advocate for the integration of cultural information into datasets. Additionally, we recommend leveraging data sources rich in cultural information, such as surveys, to enhance cultural adaptability. Our research is a step towards more inclusive, culturally sensitive language technologies.
Oct-10-2023
- Country:
- Oceania > Australia (0.04)
- South America
- North America
- Trinidad and Tobago (0.04)
- Panama (0.04)
- Costa Rica (0.04)
- Dominican Republic (0.04)
- Honduras (0.04)
- Guatemala (0.04)
- Mexico (0.04)
- Jamaica (0.04)
- El Salvador (0.04)
- Puerto Rico > Peñuelas
- Peñuelas (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Europe
- Germany (0.04)
- Poland (0.04)
- United Kingdom (0.04)
- Netherlands (0.04)
- Russia (0.04)
- Western Europe (0.04)
- Austria (0.04)
- Middle East (0.04)
- Czechia (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Italy > Tuscany
- Florence (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Sweden > Östergötland County
- Linköping (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Faroe Islands > Streymoy
- Tórshavn (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Estonia > Tartu County
- Tartu (0.04)
- Asia
- India (0.04)
- South Korea (0.04)
- East Asia (0.04)
- Japan (0.04)
- China > Hong Kong (0.04)
- Russia (0.04)
- Indonesia > Bali (0.04)
- Sri Lanka (0.04)
- Middle East
- Republic of Türkiye (0.04)
- Jordan (0.04)
- Iran (0.04)
- Saudi Arabia (0.04)
- Lebanon (0.04)
- Kuwait (0.04)
- Syria (0.04)
- Qatar (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.05)
- Africa
- Zambia (0.04)
- Sierra Leone (0.04)
- Senegal (0.04)
- Nigeria (0.04)
- Namibia (0.04)
- Ghana (0.04)
- Burkina Faso (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Law Enforcement & Public Safety (0.46)
- Information Technology (0.46)
- Technology: