EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian
Dementieva, Daryna, Babakov, Nikolay, Fraser, Alexander
–arXiv.org Artificial Intelligence
While Ukrainian NLP has seen progress in many texts processing tasks, emotion classification remains an underexplored area with no publicly available benchmark to date. In this work, we introduce EmoBench-UA, the first annotated dataset for emotion detection in Ukrainian texts. Our annotation schema is adapted from the previous English-centric works on emotion detection (Mohammad et al., 2018; Mohammad, 2022) guidelines. The dataset was created through crowdsourcing using the Toloka.ai platform ensuring high-quality of the annotation process. Then, we evaluate a range of approaches on the collected dataset, starting from linguistic-based baselines, synthetic data translated from English, to large language models (LLMs). Our findings highlight the challenges of emotion classification in non-mainstream languages like Ukrainian and emphasize the need for further development of Ukrainian-specific models and training resources.
arXiv.org Artificial Intelligence
Sep-29-2025
- Country:
- Asia
- Middle East
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- Republic of Türkiye > Istanbul Province
- Thailand > Bangkok
- Bangkok (0.04)
- Middle East
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Ukraine
- Kharkiv Oblast > Kharkiv (0.04)
- Kyiv Oblast > Kyiv (0.04)
- Lviv Oblast > Lviv (0.04)
- Spain
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Bulgaria (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Austria > Vienna (0.14)
- Ireland > Leinster
- North America
- Canada > Ontario
- Toronto (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Francisco County > San Francisco (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California
- Canada > Ontario
- South America > Chile
- Asia
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Information Technology (0.67)
- Technology: